Re: [tcpm] Fwd: I-D Action: draft-bagnulo-tcpm-tcp-low-rtt-00.txt
Karen Elisabeth Egede Nielsen <karen.nielsen@tieto.com> Thu, 18 August 2016 08:05 UTC
Return-Path: <karen.nielsen@tieto.com>
X-Original-To: tcpm@ietfa.amsl.com
Delivered-To: tcpm@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 1A4E212D0BD for <tcpm@ietfa.amsl.com>; Thu, 18 Aug 2016 01:05:52 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.701
X-Spam-Level:
X-Spam-Status: No, score=-2.701 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, RCVD_IN_DNSWL_LOW=-0.7, SPF_PASS=-0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (1024-bit key) header.d=tieto.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id E-tq-HS8wvzT for <tcpm@ietfa.amsl.com>; Thu, 18 Aug 2016 01:05:49 -0700 (PDT)
Received: from mail-it0-x234.google.com (mail-it0-x234.google.com [IPv6:2607:f8b0:4001:c0b::234]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 8C6DB12D0A1 for <tcpm@ietf.org>; Thu, 18 Aug 2016 01:05:47 -0700 (PDT)
Received: by mail-it0-x234.google.com with SMTP id n128so11650607ith.1 for <tcpm@ietf.org>; Thu, 18 Aug 2016 01:05:47 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=tieto.com; s=google; h=from:references:in-reply-to:mime-version:thread-index:date :message-id:subject:to:cc; bh=y6LJiuW44bFkTnJCT9ljMTLoVk22Vqf22OsO5lA1H0s=; b=ufKhuypXfDIsU8z+U70prs+7GkdrnsOlxcZRL0wVYN86SHJ2qVZUKMvn5k2g5q6EWx KvO/+HKxMZz2L6zDrMONy11STbjS+9zD1rLW+LxhoUcCq/oTCj8KmXCNWp2+fG0y3SvK K2RVny1VHGMvFIc2ks8q0qpqPeUSUEL/mKEV4=
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:from:references:in-reply-to:mime-version :thread-index:date:message-id:subject:to:cc; bh=y6LJiuW44bFkTnJCT9ljMTLoVk22Vqf22OsO5lA1H0s=; b=PiOX0fsMiInNaTt46pvfNnsGUnuKlWRMrz0hi4jaGHRmsO1HSBZlVd/V0YiU/F+kpE zFW0ahRo87pfY8s2D2b9KoULVicUsKn8eneCDpRD5Wdsxu3vCyyTL98nAHjLrRbCBpJB HIu+JmYV0LvCzaQKB9SF12tWPQyxcrmZ3MRzTJStikw1cjqd71rwROTAFVIJmotUfijQ VCntb69q2+DjUDDHFkqjVIiWuw63gJpsPaxDJvLWykom7mTEpOEWrpFcAmIdBhcZP0VQ JnFa0EiMn2FOLvuHXR7mFgP/slVIba7mCKT9KN5ISD32ovz2/iGKPCj3fiHyaDMhPRa3 GSXA==
X-Gm-Message-State: AEkoouslxtXN6o2kJftt1MSFKYQpQu0BxiBXWt4q0x5RJ29+5BDEMHHglO9XUQknZp8/jyWwcCJnWm1IDcCZvohU+efUU20VpGkt45uh0yk9DNnqeXawZrUQSGlMnG/KJ4okIOw=
X-Received: by 10.36.210.68 with SMTP id z65mr1760438itf.32.1471507547133; Thu, 18 Aug 2016 01:05:47 -0700 (PDT)
From: Karen Elisabeth Egede Nielsen <karen.nielsen@tieto.com>
References: <20160708211635.32095.77047.idtracker@ietfa.amsl.com> <2f56d1b8-7450-42de-4a24-53df9cf4c045@it.uc3m.es> <a1f456db6e656785da8e356b9f530717@mail.gmail.com> <CAK6E8=etaxFSgBvQt27s=rB03rRQxQ_u43qWRExTVXys9A82wg@mail.gmail.com> <623f46dedcc661ae74563f6bf08f46a5@mail.gmail.com> <CAK6E8=cYtP_KdvK9TEDov3j523OfcLfeBMgNSvaSueqzqwTC7Q@mail.gmail.com>
In-Reply-To: <CAK6E8=cYtP_KdvK9TEDov3j523OfcLfeBMgNSvaSueqzqwTC7Q@mail.gmail.com>
MIME-Version: 1.0
X-Mailer: Microsoft Outlook 15.0
Thread-Index: AQGd4OoGFyImcsqDlVk0oY8T3VfV3QH845XxAem+5TgCNV3HJwFbXVb9ANTgh9KgdCWb0A==
Date: Thu, 18 Aug 2016 10:05:45 +0200
Message-ID: <00594c32f56877bfc2fde20132250174@mail.gmail.com>
To: Yuchung Cheng <ycheng@google.com>
Content-Type: text/plain; charset="UTF-8"
X-DomainID: tieto.com
Archived-At: <https://mailarchive.ietf.org/arch/msg/tcpm/7tX_99sKTeFZ8_0E82eIB-S-6Fw>
Cc: tcpm IETF list <tcpm@ietf.org>, TCP Prague List <tcpPrague@ietf.org>
Subject: Re: [tcpm] Fwd: I-D Action: draft-bagnulo-tcpm-tcp-low-rtt-00.txt
X-BeenThere: tcpm@ietf.org
X-Mailman-Version: 2.1.17
Precedence: list
List-Id: TCP Maintenance and Minor Extensions Working Group <tcpm.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/tcpm>, <mailto:tcpm-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/tcpm/>
List-Post: <mailto:tcpm@ietf.org>
List-Help: <mailto:tcpm-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/tcpm>, <mailto:tcpm-request@ietf.org?subject=subscribe>
X-List-Received-Date: Thu, 18 Aug 2016 08:05:52 -0000
HI Yuchung, Marcello > -----Original Message----- > From: Yuchung Cheng [mailto:ycheng@google.com] > Sent: 15. august 2016 18:49 > To: Karen Elisabeth Egede Nielsen <karen.nielsen@tieto.com> > Cc: marcelo bagnulo braun <marcelo@it.uc3m.es>; TCP Prague List > <tcpPrague@ietf.org>; tcpm IETF list <tcpm@ietf.org> > Subject: Re: [tcpm] Fwd: I-D Action: draft-bagnulo-tcpm-tcp-low-rtt-00.txt > > On Mon, Aug 15, 2016 at 2:00 AM, Karen Elisabeth Egede Nielsen > <karen.nielsen@tieto.com> wrote: > > Hi Yuchung, > > > >> > > >> > An additional comment is that new approaches to retransmissions - > >> > like TCP TLP and TCP RACK (also SCTP TLR which however is not > >> > progressing at the moment) might fundamentally alter the picture. > >> > I.e., if retransmissions are sent pro-actively in tail loss > >> > situations then more conservative RTOs may be kept for situations > >> > where it is prudent to wait longer. Don't know if TCP TLP is so > >> > widely deployed that it is something that you should relate to even > >> > if it may be superseded by RACK. Just a thought. > >> TLP and RACK help reduce timeout cases in DC environment for sure. > >> But still it can not completely avoid timeouts. > > > > [Karen Elisabeth Egede Nielsen] Agreed. > > > > For SCTP TLR we still see RTO-timeouts when also > > probes/retransmissions are lost. > > I assume that it would be something like this also for Rack/TLP - > > though potentially depending on how many consecutive probes that are > allowed. > > The role of RTO-timeouts are different when RACK/TLP is enabled and it > > might be counterproductive to have RTO-timeouts be too aggressive as > > the function indeed with TLP/RACK becomes something much closer to the > > original intend > > (read: how I understand the original intend) - namely to introduce a > > pause for the network to recover when things are really (consistently) > > bad. > > > > If the RTO is lowered down to be comparable in order of magnitude with > > the RTT (with some delay_ack considerations) we are narrowing the > > situation where TLP/RACK probing is able to kick in before > > RTO-retransmissions starts anyway. > > > > I understand that RTOs should be adjusted to the network dynamics, but > > I do think that it is important to understand - for the DC environment > > - if the need for the low RTOs in DC's is motivated mainly by having > > RTO-retransmissions fix the tail loss recovery deficiencies of TCP, > > which RACK/TLP addresses, or more generally for a need for more timely > > probe and reactivation for/after network recovery. > > > > Or perhaps you see the TLP probe timeout and the RTO timeout > > eventually being driven by the same timer, the reactions (e.g. CC > > reactions) then possibly depending on state (probe already sent or > > not). > I agree with your thought here. > > More generally, I'd like to evolve the state of art recovery with these > principles: > 1. React with ack events as much as possible > > 2. Send tiny probes to trigger (1) after 2-3 RTTs > > 3. Have conservative RTOs that (exp-backoff) expire at an order of RTT to > repair the head for reliability. > > > On thing about RTO: RTO current reset cwnd to 1 upon firing no matter > what. > We should ONLY reset cwnd to 1 if no news (acks/data) for a long duration. > This is quite important for performance: today cwnd is reset to 1 even if > every packet but the first one has been recently delivered. This > unnecessarily conservative because the ack clock isn't lost yet. But the > collateral damage to performance is huge on large BDP networks. This > results > work-arounds to shorten and avoid RTOs. > > > I'd like to address these points in the upcoming RACK/TLP draft. > > [Karen Elisabeth Egede Nielsen] I agree completely with your view and approach. In respect to the draft draft-bagnulo-tcpm-tcp-low-rtt-00.txt and the given recommendations for low RTO is that they do not (likely) take 2. Into account - meaning that they are looking for the low RTO to solve 2 for tail recovery. The bad thing is that they then also get the CWND reductions (as you also say). I would like for that this aspect is taken into consideration/given thoughts in the draft draft-bagnulo-tcpm-tcp-low-rtt-00.txt and the general 4LS effort. If we end with very low recommendations for RTO in order to handle 2. we risk undermining the RTO-retransmission function as well as the TLP function for now and forever. It might be that one for TCPs, that do not support TLP(/RACK), need to operate with very low RTOs to handle tail loss's "fast enough", but the recommendation for "future standard ?" TCP with TLP always enabled might be different. In this context lowering the RTO to handle 2. should be considered as a "temporary hack" ? BR, Karen
- Re: [tcpm] Fwd: I-D Action: draft-bagnulo-tcpm-tc… Joe Touch
- [tcpm] Fwd: I-D Action: draft-bagnulo-tcpm-tcp-lo… marcelo bagnulo braun
- Re: [tcpm] Fwd: I-D Action: draft-bagnulo-tcpm-tc… Yuchung Cheng
- Re: [tcpm] Fwd: I-D Action: draft-bagnulo-tcpm-tc… Karen Elisabeth Egede Nielsen
- Re: [tcpm] Fwd: I-D Action: draft-bagnulo-tcpm-tc… Yuchung Cheng
- Re: [tcpm] Fwd: I-D Action: draft-bagnulo-tcpm-tc… Karen Elisabeth Egede Nielsen
- Re: [tcpm] Fwd: I-D Action: draft-bagnulo-tcpm-tc… Karen Elisabeth Egede Nielsen
- Re: [tcpm] Fwd: I-D Action: draft-bagnulo-tcpm-tc… Praveen Balasubramanian
- Re: [tcpm] Fwd: I-D Action: draft-bagnulo-tcpm-tc… Yuchung Cheng
- Re: [tcpm] Fwd: I-D Action: draft-bagnulo-tcpm-tc… marcelo bagnulo braun
- Re: [tcpm] Fwd: I-D Action: draft-bagnulo-tcpm-tc… Yuchung Cheng
- Re: [tcpm] Fwd: I-D Action: draft-bagnulo-tcpm-tc… Karen Elisabeth Egede Nielsen