Re: [tsvwg] [iccrg] IETF104 ICCRG TCP Prague presentation

Sebastian Moeller <moeller0@gmx.de> Thu, 04 April 2019 10:53 UTC

Return-Path: <moeller0@gmx.de>
X-Original-To: tsvwg@ietfa.amsl.com
Delivered-To: tsvwg@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id BFAD0120013; Thu, 4 Apr 2019 03:53:47 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.349
X-Spam-Level:
X-Spam-Status: No, score=-2.349 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, FREEMAIL_ENVFROM_END_DIGIT=0.25, FREEMAIL_FROM=0.001, RCVD_IN_DNSWL_LOW=-0.7, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (1024-bit key) header.d=gmx.net
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id XkPDhAj4PlaW; Thu, 4 Apr 2019 03:53:45 -0700 (PDT)
Received: from mout.gmx.net (mout.gmx.net [212.227.15.15]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 1EC9C12000E; Thu, 4 Apr 2019 03:53:44 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=gmx.net; s=badeba3b8450; t=1554375178; bh=+Kq7vPOLVw4u0HPC5DIBFV1j/PLmpCk6aU+bdkEaBHg=; h=X-UI-Sender-Class:Subject:From:In-Reply-To:Date:Cc:References:To; b=ImH+cUmv83NKEJZOlRZIc9MF4LEGOUueWHU6+QJPWuKZ3LuPFYH21nARKZG90B9pC 1gNi5yaPdh+TjIK9dK/teptG0VGnZ1yRa7GMjvb76HLQX0CWkoZENQXi9ML/QqeiGy uWrKR8yXk1nNE6/zr/5aiWnwALHWskKRVomb76bk=
X-UI-Sender-Class: 01bb95c1-4bf8-414a-932a-4f6e2808ef9c
Received: from [172.16.12.10] ([134.76.241.253]) by mail.gmx.com (mrgmx002 [212.227.17.190]) with ESMTPSA (Nemesis) id 0LjZEm-1ges9x3djU-00banl; Thu, 04 Apr 2019 12:52:57 +0200
Content-Type: text/plain; charset="utf-8"
Mime-Version: 1.0 (Mac OS X Mail 11.5 \(3445.9.1\))
From: Sebastian Moeller <moeller0@gmx.de>
In-Reply-To: <FD7FA874-3413-4F62-81CB-0F68B9038CC7@cablelabs.com>
Date: Thu, 04 Apr 2019 12:52:55 +0200
Cc: Jonathan Morton <chromatix99@gmail.com>, Bob Briscoe <in@bobbriscoe.net>, "iccrg@irtf.org" <iccrg@irtf.org>, tsvwg IETF list <tsvwg@ietf.org>
Content-Transfer-Encoding: quoted-printable
Message-Id: <851D37C7-2B52-4949-8957-818760909C20@gmx.de>
References: <623CA1C6-3425-46E6-AFD9-6FD7D0DBE422@gmx.de> <a7bf3c8d-bb9e-e7e8-3343-ccdbac795591@bobbriscoe.net> <82B40A22-F2C2-4D56-868A-F53D7AEA37D7@gmail.com> <4507BFFD-A68A-4E8A-8FFB-347F95E3F2E8@cablelabs.com> <2491A216-4522-4942-BDEB-301D61CC5D03@gmail.com> <FD7FA874-3413-4F62-81CB-0F68B9038CC7@cablelabs.com>
To: Greg White <g.white@CableLabs.com>
X-Mailer: Apple Mail (2.3445.9.1)
X-Provags-ID: V03:K1:J1M7Kn3i2SuiaeeyDFX+xmglSzSFtLcepIDdpfPRLG0rqx74tsy wwHzQ1O6DJeFDEX7CZnfL3usEHAJLOL4c5X/7kbuRQlIU0uZhh9+f0IFUFILvxKywTivLQl t02CGXvntGgFRL89aLMoLQuI3yQ+4P3rWYqiewxpMmWbrjvi1romfwl4fDGDjEzQZf0Z9qh 1FqZGPZ4ktqatysjochsQ==
X-UI-Out-Filterresults: notjunk:1;V03:K0:s21jByTzcmU=:KI0le84Ylv/8fAOuQefQGF RoUYHNckKOUQ86kAoFrQRPVNHaLED/JlNYm+9H2bnwftV3UAgnnTFdzNv3cS6x0fHalNAL7B0 8bCITYBStUAmbNsNdk7xfdXairzAoj35m05t+Clnhg9DUjQOsiYOzWDJyytIv6IcOuevfwAiR lwtWts3CdVImf5/GsWU2y0M3sMnLur6zfyUOQ44Anhkql9FIaHYF3CLF6ii/JsKz69dKJwjVC FqKUbjhNPxAfuvk/ql2XDUagi3Y7FjhpU0Uvvr28vKE0+G4a5U8ozZhW3p5zot/N5EM3eTuJl DWJnYRxWXIqjAmSK6ZLXG/7onIf+7gnK/O8LgQibqnhLeZSTLkCrZbogwt7D92y61V0M88U6N aNRpN2z91+BBbBGESUvyq9sqaxxqmXwaHDbDlsLK1MvAmStrAhN5SaDGCXlDWbLVwv1wiihEa GqQ45wiw1BHS+bu4FQxkZeoO8FYbDBVfmb+gN+ppAre5wqqyXCRSilZWd9pdn7JB9v9KZHsPz Cjr4S+uT41Cl0e0JPc+aeC9K3P2ezraacUlAtc4QGyjRdhb8zEJn/nkYVd48J0D3GsmZd79Cc j83d5YOTBeO0T7HwxsUop0rKssyry92ZthhomfEMMfgwQsA4hn21EP2amRbrbkPTuHVbBpc3W ElZhH9GIDDIcyS6Q3nmOa2OCKSU52O0WvxflbW3SbihdGQ4YXw23fNAC+r3pmLD4YNxmjBTk3 QT7KPo/t+KHycX7x7nnlbBMaKW0W6AMGN3g5ZbEcRik2taV9kbLa6mbjHgAoaY0qSh7Oqr0K0 J73Db8DOAXrDByAuJd64G0x6aFtqbBLZ8UhNyHqNrUr95jgFdW9CU298Ir/+jkBaZaV8/mH5U xkv7xRrsi6gyv+akBlk1L220laj1zj8tHBqmOeNBp+mUMHfGP9s1XOWDaHYuFufxhcu2qw+3F 3JK4gtlwJ3A==
Archived-At: <https://mailarchive.ietf.org/arch/msg/tsvwg/4S_5KbFmDxSj15D5c-Ep-XLtgpg>
Subject: Re: [tsvwg] [iccrg] IETF104 ICCRG TCP Prague presentation
X-BeenThere: tsvwg@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: Transport Area Working Group <tsvwg.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/tsvwg>, <mailto:tsvwg-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/tsvwg/>
List-Post: <mailto:tsvwg@ietf.org>
List-Help: <mailto:tsvwg-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/tsvwg>, <mailto:tsvwg-request@ietf.org?subject=subscribe>
X-List-Received-Date: Thu, 04 Apr 2019 10:53:48 -0000


> On Apr 4, 2019, at 11:40, Greg White <g.white@CableLabs.com> wrote:
> 
> 
> 
> On 4/4/19, 1:28 AM, "Jonathan Morton" <chromatix99@gmail.com> wrote:
> [...]    
>    The present out-of-tree version of Cake supports an L4S-style high-fidelity ECN marking method, simultaneously with conventional ECN marking.  It does so using the SCE signalling method, with ECT(1) being an output rather than an input.  This is, in fact, the running code we had at the time of our TSVWG presentation.
> 
> [GW] Has any progress been made on a variant of Cake that utilizes the L4S ECT(1) semantic?  It would be useful for comparison purposes.  
> 
> [...]    
>> Alternatively, if fq_codel/CAKE were to treat ECT(1) as non-ECT (or implement the L4S style ECN) this would again not be an issue, I believe.
> 
>    Perhaps, but I thought FQ AQMs weren't much of a problem to begin with - aside from the ingress-queuing problem?  
> 
> [GW] Well, a goal would be for FQ AQMs to provide appropriate high-fidelity congestion signals to L4S flows so that they can achieve all four Ls and the S.  Failing that, to not provide weaker congestion signals for L4S flows as compared to classic TCP.  It isn't a problem from a fairness perspective, but the goal is low latency after all.    This seems like a fairly simple change to FQ.
> 
>    I'm also reluctant to interpret ECT(1) or CE incompatibly with RFC-3168, unless given very good reason - and I currently have a good counter-argument.  If I may quote from that document, it does specifically contemplate use of the fourth codepoint to signal "mild congestion":
> 
>>   One possibility might have been for the data sender to use the fourth
>>   ECN codepoint to indicate an alternate semantics for ECN.  However,
>>   this seems to us more appropriate to be signaled using a
>>   differentiated services codepoint in the DS field.
>> 
>>   A second possible use for the fourth ECN codepoint would have been to
>>   give the router two separate codepoints for the indication of
>>   congestion, CE(0) and CE(1), for mild and severe congestion
>>   respectively.
> 
> [GW] I don't believe that the RFCs are on your side here.   RFC 3168 contemplates, and rejects, that definition of ECT(1),

[SM I believe this to be caused by not considering pulse modulation though, as the argument is that a single code-point might not be "rich" enough to signal graded congestion information, but IMHO DCTCP shows that this is solved. In any way RFC 3168 seems a bit dated for this discussion as all agree to go beyond it, no?


> so I'm not sure how you read it as supporting the use. RFC 3168 explicitly defines ECT(1) as being a transport sender signal that it is "ECN Capable Transport".  I don't see how you can claim adherence to RFC 3168, yet use ECT(1) to mean (S)CE.  Moreover, RFC 3168 has been updated by (Standards-Track) RFC 8311, which enables the L4S experiments, and which the SCE approach also violates.  
> 
> [...] 
>    I can't help noticing, meanwhile, that DualQ seems to be designed for use wherever a single-queue AQM would normally fit (though I have my doubts that it will work on much hardware already deployed at crucial locations).  
> 
> [GW] We have worked very closely with all of the DOCSIS equipment vendors to ensure that Low Latency DOCSIS (including dual-queue) is implementable via a firmware update to ALL existing DOCSIS 3.1 CMs and CMTS that are in the field today.  In some markets that is more than 90% of deployed CMTSs. I obviously can't speak to other access network technologies, but to the extent that forwarding architectures are similar, I would presume that doubling the number of queues is significantly simpler than multiplying by 1000.

	But why, in the end memory is linear and your proposed mandatory "Annex P Queue Protection Algorithm (Normative") seems to care all the costs of tracking flows already, so why is that considerably cheaper than using that information also for scheduling purposes?

> 
>    Yet at the same time, you claim such single-queue AQMs are presently (and will remain) so rare that you haven't even developed a method of reliably detecting them.  This seems like an important discrepancy.
> 
> [GW] The concern would be single-queue *ECN* AQMs. Single-queue packet drop AQMs do exist.

[SM] What about my hobby-horse "ingress-ECN-aware-fq-AQM"?


> 
>    Meanwhile, work continues.
> 
> [GW] Why?  It seems to me that the L4S approach can be implemented easily in fq_codel and CAKE, as well as in systems that can only realistically support dual-queue.

[SM] if that system has memory and a CPU it _can_ support fq as well, that is a _choice_ not an _impossibility_. It might be un-economically, sure. But again, why will such a device be able to do Queue Protection then?


>  It has been demonstrated to provide scalable throughput and ultra-low loss and queuing delay

Which sounds greater than it is since most applications care for end-to-end latency, and L4S basically moves (a part of) the queueing delay from the bottleneck AQM back into the sending application, it does not remove that latency completely.

> (which, I think, is the goal of SCE as well), and has quite a bit of support and momentum behind it. The philosophical debate as to which is better, dualq-coupled-aqm or FQ, can continue and implementers can choose whichever they like.  I don't doubt that the SCE signaling method could be used in a variant of DCTCP, but the *huge* downside is that it can only work on bottleneck links that are able (or willing) to support FQ,

[SM] Why, the AQM will simply supply all ECN enabled with the graded information about the queueing load of the AQM, it is up to the endpoints to make sense out of this.

> which will be precious few.  In 2-3 years' time, 10s of millions of actual bottleneck links in broadband networks will be supporting dual-queue.

[SM]	I guess most bottlenecks nowadays support a variant of RED but support dies not equal exercise it. But sure I like how the DOCSIS standards-process  can influence a large swath of critical bottleneck links, more power to you.

>  It seems that the only real argument in favor of the SCE signaling method is the fallback in presence of single-queue classic ECN (unless you think blocking implementers from supporting dual-queue is some kind of perverse benefit).  

[SM] I see the SCE proposal as the better more evolutionary approach forward, the L4S approach basically spells, the current state is unrecoverable and we need to start fresh, but then does no much of that sort (re-using PIE and DCTCP is also nicely evolutionary).

> Help analyzing (and addressing if need be) that situation would be the most effective way of achieving the shared goal of low latency scalable congestion control.  Continuing to push for (or worse, deploy) an incompatible solution just increases the risk that no one will be able to achieve that goal.

[SM] these wise words typically apply to both sides of a discussion and have just as much merits one way or he other. I note that in reality we often see something else, alternative approaches duking it out and driving their evolution exactly because they are not the only game in town....


Best Regards
	Sebastian