Re: [tsvwg] RTT-independence (was: L4S vs SCE)

Sebastian Moeller <moeller0@gmx.de> Wed, 18 December 2019 08:12 UTC

Return-Path: <moeller0@gmx.de>
X-Original-To: tsvwg@ietfa.amsl.com
Delivered-To: tsvwg@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id F162712093E; Wed, 18 Dec 2019 00:12:41 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.349
X-Spam-Level:
X-Spam-Status: No, score=-2.349 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, FREEMAIL_ENVFROM_END_DIGIT=0.25, FREEMAIL_FROM=0.001, RCVD_IN_DNSWL_LOW=-0.7, SPF_HELO_NONE=0.001, SPF_PASS=-0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (1024-bit key) header.d=gmx.net
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id d5ZUFf4CqRpm; Wed, 18 Dec 2019 00:12:38 -0800 (PST)
Received: from mout.gmx.net (mout.gmx.net [212.227.17.20]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id DAC40120913; Wed, 18 Dec 2019 00:12:37 -0800 (PST)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=gmx.net; s=badeba3b8450; t=1576656710; bh=J0Udkgo7m3q07od3XKRi7fN89gyL+ibk/SW3YADuD9s=; h=X-UI-Sender-Class:Subject:From:In-Reply-To:Date:Cc:References:To; b=fIR4DFRAEg24IQfS/vbm6LqFqCTFX2kRuK+urkNkvF+tc/NS34eqhVajNao7jbmsI W/hdyuMw/aKBkBs2OBZ7vofQwYGsdmLazwhqFATbGCmohohdVM6k3zu2HGicuuGYkQ RlMZ+v6qv1JK3oaZeAkZacv6z9e59ouNtvif6WDA=
X-UI-Sender-Class: 01bb95c1-4bf8-414a-932a-4f6e2808ef9c
Received: from [10.11.12.33] ([134.76.241.253]) by mail.gmx.com (mrgmx105 [212.227.17.168]) with ESMTPSA (Nemesis) id 1MWzfl-1iA9iZ31Mv-00XIKT; Wed, 18 Dec 2019 09:11:50 +0100
Content-Type: text/plain; charset=utf-8
Mime-Version: 1.0 (Mac OS X Mail 12.4 \(3445.104.11\))
From: Sebastian Moeller <moeller0@gmx.de>
In-Reply-To: <a55c3ec7-e1a7-2bfd-1db4-a3a2c3dfe79e@bobbriscoe.net>
Date: Wed, 18 Dec 2019 09:11:48 +0100
Cc: "De Schepper, Koen (Nokia - BE/Antwerp)" <koen.de_schepper@nokia-bell-labs.com>, G Fairhurst <gorry@erg.abdn.ac.uk>, Ingemar Johansson S <ingemar.s.johansson@ericsson.com>, "tsvwg-chairs@ietf.org" <tsvwg-chairs@ietf.org>, "tsvwg@ietf.org" <tsvwg@ietf.org>
Content-Transfer-Encoding: quoted-printable
Message-Id: <1BF0AB34-DAF4-4407-9E8B-ADE53CB3E64F@gmx.de>
References: <HE1PR07MB44250F3C4E6A744DDCC3DAFCC24C0@HE1PR07MB4425.eurprd07.prod.outlook.com> <ad7b763e-b3dd-36cf-a9c5-7de99476babb@mti-systems.com> <12ED7632-5E3E-4EB9-B65E-8A8324067C9A@akamai.com> <5DD4BB25.3060700@erg.abdn.ac.uk> <5658232C-07D5-4C89-B16A-58A928332FC6@gmx.de> <HE1PR07MB4425D989D4A266C73331FFA5C24F0@HE1PR07MB4425.eurprd07.prod.outlook.com> <CAJU8_nUK5cZLFE-0UBzf0a7T0hC7C+CpCsUy_+ZU_p4oxW9BmQ@mail.gmail.com> <HE1PR07MB442560D0715BC921AB9B7FE3C24F0@HE1PR07MB4425.eurprd07.prod.outlook.com> <AM4PR07MB345968E8C665304DFBD5B11FB94F0@AM4PR07MB3459.eurprd07.prod.outlook.com> <AAC23B5C-447F-428D-956F-850653A561F7@gmx.de> <a55c3ec7-e1a7-2bfd-1db4-a3a2c3dfe79e@bobbriscoe.net>
To: Bob Briscoe <in@bobbriscoe.net>
X-Mailer: Apple Mail (2.3445.104.11)
X-Provags-ID: V03:K1:HpDKOQsSkdFTgi4Q10Y81iOxnn/d8m+lXaaaNJinxcSjG2camsL k10LX5+FaoclNBxtAes6UX8ZGj2MQH7GbSi8kGQn2iG8sBr0iGB42d7vbgFV7AUH7bVwNXN AwBhzp8c3yqf3H3oCh0R+OpciltptphzAqXU2am8DQTgNBhM0vlkVSz9hJWODFCtGuW3o5b LGrybEFoxJw+YxriYQCHg==
X-UI-Out-Filterresults: notjunk:1;V03:K0:wELWgui4jJE=:yZVa4+7Fqs/V8yH2m2o27N u9jVhXDN7bW6fz/s69PXvaTmfh4PLml3FVwAKtYylcxs948UeRKw9qldmN70VN440DrVoOb8p Lhf2iLN0TxaSsVTL8KushwRCpV3zTo+nIDpp6VgaBK1VCiY+bnehd9kW3p0TupYJKYnBTQvrl 0EnlvF8SVJryJsHZUzqzD8/7P/vRjlqduI3vf20FiJr6Mcu+sy9vnCE4Eo4kl2C6Z46ZyGbEJ OSAw61ziGNtHSZrg7bDmBDKfWKnSy29LxKao1Yk173uLAStr4o1XNje1BqGIed9sLgchCEAZv GC+pMqxZgbkDxUUB8vHD/tMPTTt06q1kStLutE/zcTgMtWUXKXKdmS02dkdX7u4wkj4drnR1D REWhKxtpqqrVxFg2ufyaFtrsivFNrVXUVIOhNiap7DcqhUIDMkXbT3s/j5gJQkp2nubhy3IHb FJRcQZRnbDLQQfFbMotpGytFvDsna4K7skzvMVXZ1nEumcF3LU34ZSltuXTilQXuMyJwiZDKy QZ3qoHz1HiTEu6cFLuBb08ywg059dGLGhu79yOHU6yMONDOu1NkdFct68ZwJ890koWRENYXwt K0Fp3Tk2tcpcdHPlG1BLWFb/nKKH9UGw0zfe45uC7ILoiuV6nqmoBzB3nnZuzMpspIM7lXlKk KDjTDXYTOSrUdlGAHGR4KM7HlChTTOHmjfg1k65TN9udFFGphv0ECZ1xKwyyZZQ2tRzSfQ+OA YnJL2x8UhIrcPRU5Xbez0L1O6YDRYf1ddYdDNUETgQXMcuSMaIKhhJWCt+cbhvE4lZniHiV2+ ngptqok8z2D0IROLrxdXeaxUWlUGndnkb0Hv20rJ6ZuxNU3DycbtLtvpVSS4wogdn9Y1bo9AU wUI3bW8kIjWJjH3Rqc7Ow74chpLnRF9QFoka+arUjCBLLe/BwAAuekMgCD+7V+W+z+zwyBFhm 7RWVTTYA5KBLPhpoOnSne7oCq8FG4p0xiM4znLzPP+dhgmd4OBVg6ujiBN2sGacMKpGSU0Yax wKdxgZAgO+QGjqQvzNsAKrSkvcDRMdawj3yQMb1A2uyMhYFiyM3EEV24dXgdNnP0tlSkF2ug9 cuxwBc+obMwlfMoX4u5OBr/osrt7tmc6J/HgFaQttC7anIy7PzG3ZVbcK/e7AIX1x8gx9dFQ0 ON9z5cRBHsPIVWjTdnJx1G3BOvPWrdIzqnYS1HGqP6n4o/B5x/Ju63J9og4PKFgvsnQO8xAys yzqiY5h2EhiGMh7xeBVE8Q8VWBHUFjPqnIMg/0K9q/J+5gMYFwqEwBhQOZ80=
Archived-At: <https://mailarchive.ietf.org/arch/msg/tsvwg/m_Fb1VczNwEdv6lpPBjl5n3Jy1Y>
Subject: Re: [tsvwg] RTT-independence (was: L4S vs SCE)
X-BeenThere: tsvwg@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: Transport Area Working Group <tsvwg.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/tsvwg>, <mailto:tsvwg-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/tsvwg/>
List-Post: <mailto:tsvwg@ietf.org>
List-Help: <mailto:tsvwg-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/tsvwg>, <mailto:tsvwg-request@ietf.org?subject=subscribe>
X-List-Received-Date: Wed, 18 Dec 2019 08:12:45 -0000

Hi Bob,
 more below in-line.

> On Dec 18, 2019, at 01:23, Bob Briscoe <in@bobbriscoe.net> wrote:
> 
> Sebastian,
> 
> I wanted to pick up your point about RTT-independence that you recently said has never been responded to. 
> 
> I have changed the subject line, because this is not an L4S vs SCE issue. With a shared queue it is just as much an issue for SCE as for L4S. With an FQ scheduler, it is just as much a non-issue for L4S as for SCE. 
> 
> 
> On 20/11/2019 14:18, Sebastian Moeller wrote:
>> Dear Koen, dear all,
>> 
>> 
>> 
>>> On Nov 20, 2019, at 13:40, De Schepper, Koen (Nokia - BE/Antwerp) <koen.de_schepper@nokia-bell-labs.com>
>>>  wrote:
>>> 
>>> 
>>> One of these opportunities here is to make L4S_TCP less RTT dependent. There have been many working implementations for less RTT-dependent CCs in the past. One that is widely deployed is Cubic, which is doing this for getting more throughput for longer RTTs. The only reason why it didn’t fly for lower RTTs on the current Internet is that they would hurt themselves (get lower throughput, competing with Reno).
>>> 
>> 	[SM] Looking at the pfifo_fast results in Høiland-Jørgensen T, Hurtig P, Brunstrom A (2015) The Good, the Bad and the WiFi: Modern AQMs in a residential setting. Computer Networks 89:90–106. For Cubic/pfifo_fast (linux former default combination), I fail to see a strong indicator that cubic is RTT invariant or getting more thoughput at longer RTTs (except for the 10ms versus 50ms "hump"). What paper/data should I be looking at instead?
>> 
>> 
>>>  
>>> As we are able to define a new independent behavior and the RTT dependence in L4S is taken serious (some even call it a show stopper) this is even a must do opportunity for L4S. It is all a matter of perspective: show-stopper <-> opportunity.
>>> 
>> 	[SM] I believe that working on more RTT independence is worth-while but no replacement for fixing the isolation system as it it is that equitable isolation from existing traffic that basically gives you the slate clean-"green field" development opportunity, no? 
> 
> [BB] RTT-dependence is caused by the end-system, and should be solved in the end-system (whether or not FQ is also deployed to solve it in some places).

	[SM] I fully agree, and that is also a convincing argument, why increasing RTT independence can not be the solution for the observed isolation failure of the dual queue coupled AQM, glad that you agree.



> Importantly, RTT-dependence only needs to be addressed in the newly deployed scalable (L4S or SCE) end-systems, not pre-existing end-systems like Reno & Cubic, as I'll now explain. 
> 
> The problem is due to the tiny RTT you can get when you have both a tiny base RTT and a tiny queue. For example:
> 
> Base RTT/ms
> Queue/ms
> Total RTT/ms
> Close Scalable flow:	2.5
> 0.5
> 3
> Distant Scalable flow:	100
> 0.5
> 100.5
> Close Classic flow:	2.5
> 15
> 7.5

	[SM] Note, 15 + 2.5 = 17.5, a typo for sure. If I am wrong and you really ment 7.5, please elaborate why you only add 1/3 of the queueing delay, as I assume we are talking about saturating traffic here.


> Distant Classic flow:	100
> 15
> 105

	[SM] 15 + 105 = 120

> 
> If all the above flows were RTT-dependent, the lowest 3ms RTT ('Close Scalable') would be the potential hog. For example against 105ms ('Distant Classic') if they were both 'window fair' like Reno, the throughput ratio would be 105:3 = 35:1

	[SM] I am not convinced that "window fairness" is in reality a useful property, throughput fairness seems much more desirable.

> 
> Whereas the Close vs Distant Classic case is cushioned by the 15ms queue. RTT ratio = 105:7.5 = 14:1, which was the Internet status quo pre-L4S (or SCE).

	[SM] If I fill in the modified numbers from above I get 115 / 17.5  ~ 6.6:1, but your point still stands.


> 
> So why is it enough for only scalable flows to reduce their RTT dependence? Assuming they do, the lowest 3ms RTT ('Close Scalable') doesn't go much faster than 'Distant Scalable'. If it's no longer a hog in relation to Distant Scalable, it's no longer a hog in relation to any of the other flows. 

	[SM] That seem strictly depend on your definition of hog 105/120 is not equal still giving L4S an IMHO (hard to justify) edge over standard compliant traffic. But I am happy to be convinced by actual empirical numbers.


> So the worst case reverts to the Close Classic flow, which isn't so bad because the total RTT is cushioned by the queue. So we will be no worse off than we were before L4S or SCE.
> 
> 
> You will see we explained this at the end of Section V.B 'Congestion Control Roadmap' in the draft journal paper about L4S that we posted earlier this year:
> “`Data Centre to the Home’: Deployable Ultra-Low Latency for All”
> 
> But is RTT-independent TCP Prague real? No, not quite yet. At the time we designed this, we simulated it, but we're now working on it again, including implementing it in Linux, so you should see results soon. The approach to RTT-independence that we suggest is explained in section 3.2 of a paper we presented to the iccrg back in Jul 2017. 
> “Resolving Tensions Between Congestion Control Scaling Requirements“

	[SM] At this point I want to see real numbers from empirical testing, thanks for the references though.


> 
> 
> All the above papers are available via the L4S landing page: https://riteproject.eu/dctth/#papers
>>> I'm also not against the recent vibrant energy, interest and engagement of people, but I think we can better use this energy in making things work. As you noticed, we can use the help to speed things up on the open source part.
>>> 
>> 	[SM]... while at the same time requesting help for implementation and dealing with the consequences of said "code point and requirements".
> 
> If you recall, when I asked you not to keep repeating other people's arguments without adding anything, you said the RTT-dependence point was your contribution to the debate.

	[SM] Then we miscommunicated, my point is, that if you fix the dual queue couplked AQM than you can tackle the worthy goal of less RTT dependence at your own sweet time.

> 
> The L4S team identified this problem in early 2015 during testing with diverse RTTs.

	[SM] And then opted to keep pushing for internet-wide roll-out of L4S with this issue unsolved, color me not impressed.


> When the Prague requirements were first agreed, we (Koen in particular) insisted that this should be included as a basic safety requirement, even tho some people said it wasn't important.
> The WG has so far kept it as a MUST requirement, justified by the potentially large discrepancy in rates that can result, articulated here:
>     https://tools.ietf.org/html/draft-ietf-tsvwg-ecn-l4s-id-08#appendix-A.1.5
> 
> If I were as cynical as some on this list, I could point out that discovering a requirement by reading one of the L4S documents doesn't really count as a contribution. But I'll let that pass.


	[SM] Except, you just did not let it pass... I will accept this as a passive aggressive offense.

> 
> What I wanted to show by this email is that a significant amount of work has been done on this, the problem is well-understood and solutions are in progress. RTT-independent congestion controls have been produced before - not that I'm belittling how hard it is to resolve the tension between requirements identified in the 'Tensions' paper above. 

	[SM] To repeat myself, this all supports the thesis that making RTT-independence the mechanism by which L4S equitably shares with standards-compliant traffic at the current state is premature. I very much would like to see the dual queue coupled AQM fixed, and once RTT independence (for all users of the L4S queue) is functional then the AQM can be changed again. 


> 
> So I ask you to take a more constructive approach. Banging on your keyboard in the style of "ner ner ne ner ner, I found a problem with L4S" doesn't help anyone,

	[SM] Let me be blunt, without the deserved push-back on the lists, "peaceful coexistence with single queue RFC3168 AQMs" would not have been tackled, the gist of the presentatuins so far has always been to ask the room, but do we ned to care for this at all? So IMHO pointing out L4S current short comings is important as otherwise little progress on the difficult issues would happen at all.

> when the problem you've "found" exists with any low latency shared queue, including an SCE-based solution. 

	[SM] Well, the something called "dual queue" AQM certainly needs to at least behave as if it had two queues in my book. L4S versus SCE is red herring here, my current issue is that the current implementation of L4S's reference AQM does not work as it should and as a normal reader would understand from reading the dualQ ID, IMHO that needs to be fixed. I do not believe that the not yet solved RTT-independence goal is the best approach to solve the issue, but you are free to spend your time as you please. 


> Instead you could have read all the papers referenced from the L4S landing page, \\


	[SM} Regrettably, I have read quite a lot of that material, and I typically see too much hype and too little critical analysis for my taste.

> understood how much everyone already understands the problem,

	[SM] I have no beef with your understanding of the issues, my complaint is that you do not act upon long-standing well-documented show-stopper issues. Not every documented bug can be dressed as a feature.

Best Regards
	Sebastian


> how much has already been done to solve it, and then you could have tried to work out the details of a full solution.
> 
> 
> Regards
> 
> 
> 
> Bob
> 
> 
>> 
>> 
>> Best Regards
>> 	Sebastian
>> 
>> 
>> 
>>>  
>>> Regards,
>>> Koen.
>>>  
>>> From: Ingemar Johansson S 
>>> <ingemar.s.johansson@ericsson.com>
>>>  
>>> Sent: Wednesday, November 20, 2019 11:50 AM
>>> To: Kyle Rose 
>>> <krose@krose.org>rg>; Ingemar Johansson S <ingemar.s.johansson@ericsson.com>
>>> 
>>> Cc: Sebastian Moeller 
>>> <moeller0@gmx.de>de>; G Fairhurst <gorry@erg.abdn.ac.uk>uk>; tsvwg@ietf.org; tsvwg-chairs@ietf.org; De Schepper, Koen (Nokia - BE/Antwerp) <koen.de_schepper@nokia-bell-labs.com>om>; 4bone@gndrsh.dnsmgr.net; Ingemar Johansson S <ingemar.s.johansson@ericsson.com>
>>> 
>>> Subject: RE: [tsvwg] L4S vs SCE
>>>  
>>> Hi
>>>  
>>> So given the imagined outcome that L4S fails.. two scenarios
>>>  
>>> If other SDOs or developers don’t pick up L4S then things are quite simple I guess, just declare the L4S drafts as deprecated, or is there more to do ?
>>>  
>>> But, if e.g. 3GPP somehow thinks that this is a good idea and adopts it.. Will the IETF send a message (LS?) to 3GPP with the message “please stop using L4S”, is this even a reasonable scenario?. After all, the fact that it is picked up by other SDOs, speaks against a failure ?
>>>  
>>> /Ingemar
>>>  
>>> From: Kyle Rose 
>>> <krose@krose.org>
>>>  
>>> Sent: den 20 november 2019 11:14
>>> To: Ingemar Johansson S 
>>> <ingemar.s.johansson=40ericsson.com@dmarc.ietf.org>
>>> 
>>> Cc: Sebastian Moeller 
>>> <moeller0@gmx.de>de>; G Fairhurst <gorry@erg.abdn.ac.uk>uk>; tsvwg@ietf.org; Ingemar Johansson S <ingemar.s.johansson@ericsson.com>om>; tsvwg-chairs@ietf.org; De Schepper, Koen (Koen) <koen.de_schepper@nokia.com>om>; 4bone@gndrsh.dnsmgr.net
>>> 
>>> Subject: Re: [tsvwg] L4S vs SCE
>>>  
>>> On Wed, Nov 20, 2019 at 6:04 PM Ingemar Johansson S 
>>> <ingemar.s.johansson=40ericsson.com@dmarc.ietf.org>
>>>  wrote:
>>> 
>>>>       How do you expect an industry/commercial roll-out of L4S
>>>> technology to behave, if the L4S experiment should terminated without
>>>> adoption as a standard track RFC? Are they supposed to phase-out using
>>>> ECT(1) as well, or is it understood that deployed L4S instances continue using
>>>> L4S methods?
>>>> 
>>> [IJ] The premise would be that L4S is declared a failure. I doubt that anybody has a good enough crystal ball to predict what happens. First it is necessary to come to the conclusion that L4S is a failure and I would argue that we are not yet there and I don't currently see that coming. Before that possible event I don't see it meaningful to speculate.
>>>  
>>> I'm going to have to go ahead and disagree with you strongly here. Given the potential consequences of cleaning up after a failed experiment without a plan worked out beforehand, this blithe approach is simply not acceptable.
>>>  
>>> In lots of cases, experiments are easy to terminate in an obvious way: for example, in one typical case, a code point can simply be abandoned, or (even better) a pollutable experimental code point returned to the available pool after some time. If that were the case here, it would not be difficult to enumerate a sequence of steps required to do so. It doesn't appear that's the case, however, so all the more reason to make sure we address this as part of the experiment setup.
>>>  
>>> A launch escape system of the necessary complexity should be a requirement of any experimental deployment. In this case, that might circumscribe the scope of the experiment itself.
>>>  
>>> Kyle
>>> 
>> 
> 
> -- 
> ________________________________________________________________
> Bob Briscoe                               
> http://bobbriscoe.net/