Re: [tcpPrague] [tcpm] Fwd: I-D Action: draft-bagnulo-tcpm-tcp-low-rtt-00.txt

Yuchung Cheng <ycheng@google.com> Fri, 12 August 2016 19:51 UTC

Return-Path: <ycheng@google.com>
X-Original-To: tcpprague@ietfa.amsl.com
Delivered-To: tcpprague@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id D3F3912DA93 for <tcpprague@ietfa.amsl.com>; Fri, 12 Aug 2016 12:51:43 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -3.948
X-Spam-Level:
X-Spam-Status: No, score=-3.948 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, RCVD_IN_DNSWL_LOW=-0.7, RP_MATCHES_RCVD=-1.247, SPF_PASS=-0.001] autolearn=unavailable autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=google.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id at3LQLRLjXOU for <tcpprague@ietfa.amsl.com>; Fri, 12 Aug 2016 12:51:41 -0700 (PDT)
Received: from mail-io0-x235.google.com (mail-io0-x235.google.com [IPv6:2607:f8b0:4001:c06::235]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 1C70B12DA7F for <tcpPrague@ietf.org>; Fri, 12 Aug 2016 12:51:41 -0700 (PDT)
Received: by mail-io0-x235.google.com with SMTP id q83so34106906iod.1 for <tcpPrague@ietf.org>; Fri, 12 Aug 2016 12:51:41 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20120113; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :cc:content-transfer-encoding; bh=sWnygOL5qe+EkDikO78f4hdLR9aOs/np/u3uoJeefpM=; b=c3XQXGc74jL+1nQDhtxrJFd1OtoOcuUOtRrbPl9PIm+YNIfPVQ3sdnDMvsRo0OGQU8 Er0bSWR7tMusAkPNNnlDVd7vU1eoi7xyS+SEksjpdl28gwaq68/hoMxmh0yuL3/hX86i 9egD+5fLV9YEjeZbWEghZgfI/kmEYv9Z1IuNeuY1xqKIWzjMI8Pupb9MFOq/n76YEPgn 6G6pZ9XAxEykRIaqv9HQqzOrJH/GG5hk1u4sRdnZCKkApQ8p+vvQ2O7dgx3AVdcRVVXC SwgYNREWkOYzTMCg7w8bGTvStZ7JHX4AFc8ZwFftVWeIydDjlAFXvqyuuPhaK3tVQeZJ sCoQ==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:mime-version:in-reply-to:references:from:date :message-id:subject:to:cc:content-transfer-encoding; bh=sWnygOL5qe+EkDikO78f4hdLR9aOs/np/u3uoJeefpM=; b=UZBgFS8iHsGOGHJhaO67SyAEGmoNAAti0qOgRvLlyAEu3Bdy2EHZzART9lN5MpYBni IBvFxP4LMbGV6cLT1h1wBWWGsNBGoEFapISP2hKd62yXvj0vP2HiPVdp2e3zEU9fyksb dj8Lc+3ezVebXmxbgh1u0Llfz2HLePrq/KRM+8u1GVPFHQyEV8S9vFe7fXAHQD7s/Pm1 klpLqZUcWsJKROO0fNw0G4+YLlt+OmRFBYKMPuz0c/K3PbXIoaRad1SYOmWzXu61HiUs f+LhScqw3oojkPjebJV9Kh48RgV+vIoCrUB2G7+f4dWMJPdfiWOArlfI5J7qV7m3UWYq w66A==
X-Gm-Message-State: AEkoouunf71s/WMxB2HF3dtyGyf/b+163t6FQlbJkXe9aNbdc8ydMUQLF7dYkbDsCI9fdyQslbmWNeOvcPrKEMjJ
X-Received: by 10.107.201.135 with SMTP id z129mr22528293iof.114.1471031500135; Fri, 12 Aug 2016 12:51:40 -0700 (PDT)
MIME-Version: 1.0
Received: by 10.64.244.103 with HTTP; Fri, 12 Aug 2016 12:50:59 -0700 (PDT)
In-Reply-To: <5e5d0f96-9094-689e-8104-40ba48a6692b@it.uc3m.es>
References: <20160708211635.32095.77047.idtracker@ietfa.amsl.com> <2f56d1b8-7450-42de-4a24-53df9cf4c045@it.uc3m.es> <a1f456db6e656785da8e356b9f530717@mail.gmail.com> <CAK6E8=etaxFSgBvQt27s=rB03rRQxQ_u43qWRExTVXys9A82wg@mail.gmail.com> <5e5d0f96-9094-689e-8104-40ba48a6692b@it.uc3m.es>
From: Yuchung Cheng <ycheng@google.com>
Date: Fri, 12 Aug 2016 12:50:59 -0700
Message-ID: <CAK6E8=d9ixycwKmrwESWrZW_WMELUO8-3ztnXcOEKL89BLx4ZA@mail.gmail.com>
To: marcelo bagnulo braun <marcelo@it.uc3m.es>
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable
Archived-At: <https://mailarchive.ietf.org/arch/msg/tcpprague/Aiw25PYJVPMoYZoe3BGdDtkQ2aE>
Cc: Karen Elisabeth Egede Nielsen <karen.nielsen@tieto.com>, tcpm IETF list <tcpm@ietf.org>, TCP Prague List <tcpPrague@ietf.org>
Subject: Re: [tcpPrague] [tcpm] Fwd: I-D Action: draft-bagnulo-tcpm-tcp-low-rtt-00.txt
X-BeenThere: tcpprague@ietf.org
X-Mailman-Version: 2.1.17
Precedence: list
List-Id: "To coordinate implementation and standardisation of TCP Prague across platforms. TCP Prague will be an evolution of DCTCP designed to live alongside other TCP variants and derivatives." <tcpprague.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/tcpprague>, <mailto:tcpprague-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/tcpprague/>
List-Post: <mailto:tcpprague@ietf.org>
List-Help: <mailto:tcpprague-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/tcpprague>, <mailto:tcpprague-request@ietf.org?subject=subscribe>
X-List-Received-Date: Fri, 12 Aug 2016 19:51:44 -0000

On Fri, Aug 12, 2016 at 12:40 PM, marcelo bagnulo braun
<marcelo@it.uc3m.es> wrote:
> Karen, Yuchung,
> Thanks for the comments.
> Replies below.
>
>
> El 12/08/16 a las 21:16, Yuchung Cheng escribió:
>
>> On Fri, Aug 12, 2016 at 4:30 AM, Karen Elisabeth Egede Nielsen
>> <karen.nielsen@tieto.com> wrote:
>>>
>>> Hi Marcello,
>>>
>>> I very much appreciate your draft.
>>>
>>> Even more so as it addresses one of the things I find to be a bit
>>> "disturbing"  in the l4s drafts - Namely the claim that the finer saw
>>> tooth functionality of l4s/DCTCP alone solves the latency problems (while
>>> DCTCP indeed also is tuning the RTO timeout aggressively).
>>> E.g., the formulation in  section 1.1 of
>>>
>>> https://datatracker.ietf.org/doc/draft-briscoe-tsvwg-ecn-l4s-id/?include_t
>>> ext=1
>>>
>>> It turns out that a TCP algorithm like DCTCP that solves TCP's
>>>     scalability problem also solves the latency problem, because the
>>>     finer sawteeth cause very little queuing delay.  A supporting paper
>>>     [DCttH15] gives the full explanation of why the design solves both
>>>     the latency and the scaling problems, both in plain English and in
>>>     more precise mathematical form.  The explanation is summarised
>>>     without the maths in [I-D.briscoe-aqm-dualq-coupled].
>>>
>>> could benefit from a reference to your draft or the issue that you
>>> addresses here (at least for comprehensiveness).
>>>
>>> An additional comment is  that new approaches to retransmissions - like
>>> TCP TLP and TCP RACK (also SCTP TLR which however is not progressing at
>>> the moment) might fundamentally alter the picture. I.e., if
>>> retransmissions are sent pro-actively in tail loss situations then more
>>> conservative RTOs may be kept for situations where it is prudent to wait
>>> longer. Don't know if TCP TLP is so widely deployed that it is something
>>> that you should relate to even if it may be superseded by RACK. Just a
>>> thought.
>>
>> TLP and RACK help reduce timeout cases in DC environment for sure. But
>> still it can not completely avoid timeouts. So I agree we should keep
>> timeouts conservative, but the current RFCs are way too conservative:
>> min RTO of 1 second is 4-5 orders of magnitude of DC RTTs.
>>
>> The issue is, the "right" min-RTO depends on the actual DC and stacks:
>> they all have different RTTs and timer granularity. The draft cites
>> Glenn's paper, but Morgan Stanley's DC could be different than others.
>>
>
> Right.
>
> I guess we should could say that the min RTO should be in the order of the
> RTT of the considered network when there is zero queueng delay.
> But if the draft is aiming to be a reccomendation for imeplenters of DC
> network TCP stacks, then i guess it should provide a guidance that is
> independent of the DC network itself, since the implementation should work
> for a variety of DC networks, no?
>
>
> I guess the more general question is whether we need this document i.e. a
> document describing reccoemndation about how to set TCP for low RTT
> networks. If people think it is a good idea to have such a document, we then
> figure how if this is
my conundrum is that DC tcp stuff is usually not an inter-op issue for
the Internet so this draft is more of a "how to tune your DC tcp
stack". but I can be convinced either ways.

>
>> Other comments on the draft:
>>
>> 1. min ssthresh or cwnd: with very large incast ((tens of) thousands
>> of senders into one receiver), cwnd of 0.1 pkt can still be too big. the
>> only way is to pace the packets over N*RTT intervals.
>>
> Right, If i understood correctly Bob's paper (cited in the draft) this is
> exactly what they are proposing.
> I dont think this document is the right place to define the behaviour of
> CWND less than 1 MSS. I think this should be done in a separated document,
> which is then referred by the low-rtt draft.
>
> Are you aware of other documents (draft or papers others than the one from
> Bob) describing mechanisms to achieve this?
>
>> 2. delayed acks:
>> as mentioned the actual time depends a lot on the DC implementation,
>> which is really "vender/owner"-specific. instead of perhaps an option
>> during setup to inform the sender about the max delay in the ack works
>> better.
>>
> I am not sure if a follow.
> What is the benefit of allowing dynamic negotiation of the delay ack time?
> Thanks, marcelo
the receiver may decide to delay ack up to certain time. it could be
based on DC implementation or host limit. It'd be good for sender to
learn about that to adapt his RTO.

For example, say it's a cloud VM. VM of OS1 may use a max delay for
d-ack say 2ms. but VM of OS2 may use 40ms. The lack of such
information makes the sender to these VMs use something  conservative
of known max of all, otherwise he risks spurious RTOs.

>
>
>
>
>>
>>> BR, Karen
>>>
>>>> -----Original Message-----
>>>> From: tcpm [mailto:tcpm-bounces@ietf.org] On Behalf Of marcelo bagnulo
>>>> braun
>>>> Sent: 8. juli 2016 23:19
>>>> To: tcpm IETF list <tcpm@ietf.org>
>>>> Subject: [tcpm] Fwd: I-D Action: draft-bagnulo-tcpm-tcp-low-rtt-00.txt
>>>>
>>>> Hi,
>>>>
>>>> We just submitted this draft for consideration of the WG. Comments are
>>>> appreciated.
>>>>
>>>> Regards, marcelo
>>>>
>>>>
>>>>
>>>>
>>>> -------- Mensaje reenviado --------
>>>> Asunto:       I-D Action: draft-bagnulo-tcpm-tcp-low-rtt-00.txt
>>>> Fecha:        Fri, 08 Jul 2016 14:16:35 -0700
>>>> De:   internet-drafts@ietf.org
>>>> Responder a:  internet-drafts@ietf.org
>>>> Para:         i-d-announce@ietf.org
>>>>
>>>>
>>>>
>>>> A New Internet-Draft is available from the on-line Internet-Drafts
>>>
>>> directories.
>>>>
>>>>
>>>>           Title           : Recommendations for increasing TCP
>>>
>>> performance in low RTT
>>>>
>>>> networks.
>>>>           Authors         : Marcelo Bagnulo
>>>>                             Koen De Schepper
>>>>                             Glenn Judd
>>>>        Filename        : draft-bagnulo-tcpm-tcp-low-rtt-00.txt
>>>>        Pages           : 7
>>>>        Date            : 2016-07-08
>>>>
>>>> Abstract:
>>>>      This documents compiles a set of issues that negatively affect TCP
>>>>      performance in low RTT networks as well as the recommendations to
>>>>      overcome them.
>>>>
>>>>
>>>> The IETF datatracker status page for this draft is:
>>>> https://datatracker.ietf.org/doc/draft-bagnulo-tcpm-tcp-low-rtt/
>>>>
>>>> There's also a htmlized version available at:
>>>> https://tools.ietf.org/html/draft-bagnulo-tcpm-tcp-low-rtt-00
>>>>
>>>>
>>>> Please note that it may take a couple of minutes from the time of
>>>
>>> submission
>>>>
>>>> until the htmlized version and diff are available at tools.ietf.org.
>>>>
>>>> Internet-Drafts are also available by anonymous FTP at:
>>>> ftp://ftp.ietf.org/internet-drafts/
>>>>
>>>> _______________________________________________
>>>> I-D-Announce mailing list
>>>> I-D-Announce@ietf.org
>>>> https://www.ietf.org/mailman/listinfo/i-d-announce
>>>> Internet-Draft directories: http://www.ietf.org/shadow.html or
>>>> ftp://ftp.ietf.org/ietf/1shadow-sites.txt
>>>>
>>>>
>>>> _______________________________________________
>>>> tcpm mailing list
>>>> tcpm@ietf.org
>>>> https://www.ietf.org/mailman/listinfo/tcpm
>>
>>
>