Re: [tcpPrague] [tcpm] Fwd: I-D Action: draft-bagnulo-tcpm-tcp-low-rtt-00.txt

Karen Elisabeth Egede Nielsen <karen.nielsen@tieto.com> Thu, 18 August 2016 08:05 UTC

Return-Path: <karen.nielsen@tieto.com>
X-Original-To: tcpprague@ietfa.amsl.com
Delivered-To: tcpprague@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 1F4BC12D0CA for <tcpprague@ietfa.amsl.com>; Thu, 18 Aug 2016 01:05:52 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.701
X-Spam-Level:
X-Spam-Status: No, score=-2.701 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, RCVD_IN_DNSWL_LOW=-0.7, SPF_PASS=-0.001] autolearn=unavailable autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (1024-bit key) header.d=tieto.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id L6936so7vfNt for <tcpprague@ietfa.amsl.com>; Thu, 18 Aug 2016 01:05:49 -0700 (PDT)
Received: from mail-it0-x233.google.com (mail-it0-x233.google.com [IPv6:2607:f8b0:4001:c0b::233]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 8C84A12D0B5 for <tcpPrague@ietf.org>; Thu, 18 Aug 2016 01:05:47 -0700 (PDT)
Received: by mail-it0-x233.google.com with SMTP id n128so11650605ith.1 for <tcpPrague@ietf.org>; Thu, 18 Aug 2016 01:05:47 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=tieto.com; s=google; h=from:references:in-reply-to:mime-version:thread-index:date :message-id:subject:to:cc; bh=y6LJiuW44bFkTnJCT9ljMTLoVk22Vqf22OsO5lA1H0s=; b=ufKhuypXfDIsU8z+U70prs+7GkdrnsOlxcZRL0wVYN86SHJ2qVZUKMvn5k2g5q6EWx KvO/+HKxMZz2L6zDrMONy11STbjS+9zD1rLW+LxhoUcCq/oTCj8KmXCNWp2+fG0y3SvK K2RVny1VHGMvFIc2ks8q0qpqPeUSUEL/mKEV4=
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:from:references:in-reply-to:mime-version :thread-index:date:message-id:subject:to:cc; bh=y6LJiuW44bFkTnJCT9ljMTLoVk22Vqf22OsO5lA1H0s=; b=cEXqD5PeQtCDcznp5tsSF2x0TaUrFJ82JVZvk5Zr7YuMXePXB5CQeawZzSWW+9V6yy z+fnrrq36NU7KOH6IS/1C1CJk73+mYZxK2mvrgNTwhDXj0K6AK4WnWzvaUNtU2Xd6Hvb PrIPXJy86RdM2NQZT7b74CbxcmFspzTm78f0YZN812T7XQJW0rk5M1rnOWjVE9WDavca jje3GRQLyFnyKDQtDKaa71DgPml+peh/FOVT2R7wlaBLljVpADJByxUhQ/VQycIKtCJz ON5DrUFwpBkGM3uso3SXQYsK2xLrVH0l5z79N2C6pkeyOcTOpeXcPkiQzjZ0PPleowPD 5mIQ==
X-Gm-Message-State: AEkoouvoxKl2J3kzcXyr45k3C4VS3e0b7w0/OJYDRwU5DzR5lOqUmPgUILkBZ1sqIt4nE8sjOa2V5ldadta4FP+D/tT9MMvjcZaKx/laqV7q9OY5B4jceuIKuLnDD0jfLE0VWuUHnNeNeQ==
X-Received: by 10.36.210.68 with SMTP id z65mr1760438itf.32.1471507547133; Thu, 18 Aug 2016 01:05:47 -0700 (PDT)
From: Karen Elisabeth Egede Nielsen <karen.nielsen@tieto.com>
References: <20160708211635.32095.77047.idtracker@ietfa.amsl.com> <2f56d1b8-7450-42de-4a24-53df9cf4c045@it.uc3m.es> <a1f456db6e656785da8e356b9f530717@mail.gmail.com> <CAK6E8=etaxFSgBvQt27s=rB03rRQxQ_u43qWRExTVXys9A82wg@mail.gmail.com> <623f46dedcc661ae74563f6bf08f46a5@mail.gmail.com> <CAK6E8=cYtP_KdvK9TEDov3j523OfcLfeBMgNSvaSueqzqwTC7Q@mail.gmail.com>
In-Reply-To: <CAK6E8=cYtP_KdvK9TEDov3j523OfcLfeBMgNSvaSueqzqwTC7Q@mail.gmail.com>
MIME-Version: 1.0
X-Mailer: Microsoft Outlook 15.0
Thread-Index: AQGd4OoGFyImcsqDlVk0oY8T3VfV3QH845XxAem+5TgCNV3HJwFbXVb9ANTgh9KgdCWb0A==
Date: Thu, 18 Aug 2016 10:05:45 +0200
Message-ID: <00594c32f56877bfc2fde20132250174@mail.gmail.com>
To: Yuchung Cheng <ycheng@google.com>
Content-Type: text/plain; charset="UTF-8"
X-DomainID: tieto.com
Archived-At: <https://mailarchive.ietf.org/arch/msg/tcpprague/Bi2c6Tu5n0VL4hf0I74o5RDz55Y>
Cc: marcelo bagnulo braun <marcelo@it.uc3m.es>, tcpm IETF list <tcpm@ietf.org>, TCP Prague List <tcpPrague@ietf.org>
Subject: Re: [tcpPrague] [tcpm] Fwd: I-D Action: draft-bagnulo-tcpm-tcp-low-rtt-00.txt
X-BeenThere: tcpprague@ietf.org
X-Mailman-Version: 2.1.17
Precedence: list
List-Id: "To coordinate implementation and standardisation of TCP Prague across platforms. TCP Prague will be an evolution of DCTCP designed to live alongside other TCP variants and derivatives." <tcpprague.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/tcpprague>, <mailto:tcpprague-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/tcpprague/>
List-Post: <mailto:tcpprague@ietf.org>
List-Help: <mailto:tcpprague-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/tcpprague>, <mailto:tcpprague-request@ietf.org?subject=subscribe>
X-List-Received-Date: Thu, 18 Aug 2016 08:05:52 -0000

HI Yuchung, Marcello

> -----Original Message-----
> From: Yuchung Cheng [mailto:ycheng@google.com]
> Sent: 15. august 2016 18:49
> To: Karen Elisabeth Egede Nielsen <karen.nielsen@tieto.com>
> Cc: marcelo bagnulo braun <marcelo@it.uc3m.es>; TCP Prague List
> <tcpPrague@ietf.org>; tcpm IETF list <tcpm@ietf.org>
> Subject: Re: [tcpm] Fwd: I-D Action: draft-bagnulo-tcpm-tcp-low-rtt-00.txt
>
> On Mon, Aug 15, 2016 at 2:00 AM, Karen Elisabeth Egede Nielsen
> <karen.nielsen@tieto.com> wrote:
> > Hi Yuchung,
> >
> >> >
> >> > An additional comment is  that new approaches to retransmissions -
> >> > like TCP TLP and TCP RACK (also SCTP TLR which however is not
> >> > progressing at the moment) might fundamentally alter the picture.
> >> > I.e., if retransmissions are sent pro-actively in tail loss
> >> > situations then more conservative RTOs may be kept for situations
> >> > where it is prudent to wait longer. Don't know if TCP TLP is so
> >> > widely deployed that it is something that you should relate to even
> >> > if it may be superseded by RACK. Just a thought.
> >> TLP and RACK help reduce timeout cases in DC environment for sure.
> >> But still it can not completely avoid timeouts.
> >
> > [Karen Elisabeth Egede Nielsen] Agreed.
> >
> > For SCTP TLR we still see RTO-timeouts when also
> > probes/retransmissions are lost.
> > I assume that it would be something like this also for Rack/TLP -
> > though potentially depending on how many consecutive probes that are
> allowed.
> > The role of RTO-timeouts are different when RACK/TLP is enabled and it
> > might be counterproductive to have RTO-timeouts be too aggressive as
> > the function indeed with TLP/RACK becomes something much closer to the
> > original intend
> > (read: how I understand the original intend) - namely to introduce a
> > pause for the network to recover when things are really (consistently)
> > bad.
> >
> > If the RTO is lowered down to be comparable in order of magnitude with
> > the RTT (with some delay_ack considerations) we are narrowing the
> > situation where TLP/RACK probing is able to kick in before
> > RTO-retransmissions starts anyway.
> >
> > I understand that RTOs should be adjusted to the network dynamics, but
> > I do think that it is important to understand - for the DC environment
> > - if the need for the low RTOs in DC's is motivated mainly by having
> > RTO-retransmissions fix the tail loss recovery deficiencies of TCP,
> > which RACK/TLP addresses, or more generally for a need for more timely
> > probe and reactivation for/after network recovery.
> >
> > Or perhaps you see the TLP probe timeout and the RTO timeout
> > eventually being driven by the same timer, the reactions (e.g. CC
> > reactions)  then possibly depending on state (probe already sent or
> > not).
> I agree with your thought here.
>
> More generally, I'd like to evolve the state of art recovery with these
> principles:
> 1. React with ack events as much as possible
>
> 2. Send tiny probes to trigger (1) after 2-3 RTTs
>
> 3. Have conservative  RTOs that (exp-backoff) expire at an order of RTT to
> repair the head for reliability.
>
>
> On thing about RTO: RTO current reset cwnd to 1 upon firing no matter
> what.
> We should ONLY reset cwnd to 1 if no news (acks/data) for a long duration.
> This is quite important for performance: today cwnd is reset to 1 even if
> every packet but the first one has been recently delivered. This
> unnecessarily conservative because the ack clock isn't lost yet. But the
> collateral damage to performance is huge on large BDP networks. This
> results
> work-arounds to shorten and avoid RTOs.
>
>
> I'd like to address these points in the upcoming RACK/TLP draft.
>
>
[Karen Elisabeth Egede Nielsen] I agree completely with your view and
approach.

In respect to the draft draft-bagnulo-tcpm-tcp-low-rtt-00.txt and the given
recommendations for low RTO
is that they do not (likely) take 2. Into account - meaning that they are
looking for the low RTO to solve 2 for tail recovery.
The bad thing is that they then also get the CWND reductions (as you also
say).


I would like for that this aspect is taken into consideration/given thoughts
in the draft draft-bagnulo-tcpm-tcp-low-rtt-00.txt and the general 4LS
effort.
If we end with very low recommendations for RTO in order to handle 2. we
risk undermining the RTO-retransmission function as well as the TLP function
for now and forever.

It might be that one for TCPs, that do not support TLP(/RACK), need to
operate with very low RTOs to handle tail loss's "fast enough", but the
recommendation for
"future standard ?" TCP with TLP always enabled might be different. In this
context lowering the RTO to handle 2. should be considered as a "temporary
hack" ?

BR, Karen