Re: [tsvwg] Question regarding acceptable sharing behavior between "normal traffic" and L4S traffic --> "see These L4S issues reported are not show stoppers"

Sebastian Moeller <moeller0@gmx.de> Mon, 18 November 2019 14:22 UTC

Return-Path: <moeller0@gmx.de>
X-Original-To: tsvwg@ietfa.amsl.com
Delivered-To: tsvwg@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 28D4F120105 for <tsvwg@ietfa.amsl.com>; Mon, 18 Nov 2019 06:22:55 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.349
X-Spam-Level:
X-Spam-Status: No, score=-2.349 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, FREEMAIL_ENVFROM_END_DIGIT=0.25, FREEMAIL_FROM=0.001, RCVD_IN_DNSWL_LOW=-0.7, SPF_HELO_NONE=0.001, SPF_PASS=-0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (1024-bit key) header.d=gmx.net
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 439uOvZd8K51 for <tsvwg@ietfa.amsl.com>; Mon, 18 Nov 2019 06:22:53 -0800 (PST)
Received: from mout.gmx.net (mout.gmx.net [212.227.17.21]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id A5BB4120931 for <tsvwg@ietf.org>; Mon, 18 Nov 2019 06:22:52 -0800 (PST)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=gmx.net; s=badeba3b8450; t=1574086969; bh=yNDzL1rtTws5PdlAAzdD5pO9xINYg570GwdryUOOR9Y=; h=X-UI-Sender-Class:Subject:From:In-Reply-To:Date:Cc:References:To; b=ZI9NeKUCRWMquWT4CCeWW9CwJuBe1Dv9GL+XuBGQsMNGRENl5yMThaviyNKdrs5yw 96Vy4GAvPqLFDEfGSbWKLy0kUYKsyyI0PHq8yMWIrPy+JiCMnVuml8s6nTCB1lIXta wb5hs6CCdkYG2mb0PrlvGNj7wOaL63tnUSYv/KNM=
X-UI-Sender-Class: 01bb95c1-4bf8-414a-932a-4f6e2808ef9c
Received: from [10.11.12.33] ([134.76.241.253]) by mail.gmx.com (mrgmx104 [212.227.17.168]) with ESMTPSA (Nemesis) id 1MZTmY-1iK7Dl3EUI-00WRz7; Mon, 18 Nov 2019 15:17:43 +0100
Content-Type: text/plain; charset="us-ascii"
Mime-Version: 1.0 (Mac OS X Mail 12.4 \(3445.104.11\))
From: Sebastian Moeller <moeller0@gmx.de>
In-Reply-To: <AM4PR07MB3459F9002B330865D001F1A2B94D0@AM4PR07MB3459.eurprd07.prod.outlook.com>
Date: Mon, 18 Nov 2019 15:17:42 +0100
Cc: tsvwg IETF list <tsvwg@ietf.org>
Content-Transfer-Encoding: quoted-printable
Message-Id: <FF1F6E25-BDD1-4D52-A5E5-58CA15385BF8@gmx.de>
References: <AM4PR07MB3459F9002B330865D001F1A2B94D0@AM4PR07MB3459.eurprd07.prod.outlook.com>
To: "De Schepper, Koen (Nokia - BE/Antwerp)" <koen.de_schepper@nokia-bell-labs.com>
X-Mailer: Apple Mail (2.3445.104.11)
X-Provags-ID: V03:K1:/S/dgaOpsQoceHJyfppULAGl9JDurVmn+oqVugOzBwlMC6tzja1 vN6nomGlIos+7+X5c6prf1tHOi9tIETzJfgKAoZntXh1I+KWXXdMqcdsULxhcma3TIoQ4qW EFTFCTr2uqlqOJ4wttVLCXI24P2nlnE6hhRXyvkr6OYiYXOqlOFnAcmx+m/R0pLh1bfZ1gD W4vsVac52doA44KdusdWg==
X-UI-Out-Filterresults: notjunk:1;V03:K0:gc7UTZviYz0=:EPHM5aNn19jhqSwKf9v2P8 h8gfp1gvf9Np/nivqoN6MftbEp0d9eujgHeEKi+T7Wf1Kek7xRVGBqCyk08LzwJKNKU7xg6ij C1sI79yatoYE9gcfPi5uinYqIF8xGh2uXnhTo/DO8j05vDiHjnCAtaEj9IVnzOeVjhQ4R5I37 c14v6TpXE9Pt7gUoJeE9NK3E5GascrvT53s4giNg5cMfVNLjRuVdYSELPfZR3/XRJNz/lguQu dgs2E8/JvGYNJz6v1HRnueattpbVmoWUbozCEqQeHZpFQX0fb7sjqPoVMP0QC1CYviD1VPNgF 3SmPftBKmuVX6jvf++dMhyXTz6QASuFBXWLAzFo/NXHIgCi6ws50CjUxOfOuZJP0J5y2Z5wVg QT4dDrJXUAfaCz+cyHqj50Umxovtcl5XPWa29S8EtwiXKTioHynhZodWZDdk+fDWVTogP70Do cTiT8IM7pGraonJxg6Ftgq3NcdLErdIJEi5S2Qv5HA2EUzGBUyq6G+HwRQ+Vh4IO4/b9flEw5 1nGSVWjytcn/PErbpM6oDKxtaMgKjB3MCPrEqFTfoLKLjyTeCOpoVekudPTmSsBnJJKxsro4i 292eytWOvWvXdJ2wudXz+mPXJEJh1L1To9Q2hwZyKVyCGFxTsIJWn1N1CLh2H8yZNqb6wSSPx k5QbT45MZd6nU3o4lLqXA4Nv7cPO+K5SELen8uaHlIrMOHW+K9ebGYFHjPEkGjrjS4liO2pEw m6ByxDscZKvaf9+s0jau71DBguLfVfvEYUjTe3ZjCHll1iBmUIZljVLzF8OKmUndDponKrUOX R01KJYiK4Yhi/N43ezwUxr3hSWtDvy/Ftid0/vt4EU28BG4ET93ze8WYmHegwQJTNCiaCkNvI Kcgtr0O8kEsdh00/zzoCoQwHAOqFdYWY5qRiW1bnYK79qhlmMub2huj8VwxvzmUoAf2GG1JWn LrZzxdjz2bp0RXiEgXSlFJXvk+MZ5GSqa532cQx1JjZBcGY0QArYXaYNkwskk1TzWe4EMTFv2 8yd7Ky9bqKTPZILtJ1Oi6Vsj0KxFYqHNjJVz7OLYSP9GCwMvaA9YPpJsSAJVFterGdSq5Fz0l v7K06gXFtTHJAV7UMfjHnvs+HG9Sgc5fqNC2Qim5M8SlVEVh5lDYRaxTFndUg0sGhUZv3u48a 41fEV7fw8+Yjqmq5WBNMvL0SFUdWMNub8eJ4WxawTetb6qoqsk5coVMVZF21HKqgCgHeysCt1 PUFMflk3n3cVggt0b/wvkSTIriR7/SP1mifoMYGDolgp73WS/UyffV223Uyc=
Archived-At: <https://mailarchive.ietf.org/arch/msg/tsvwg/eqZQBiPowfNRdundDsCYkraknds>
Subject: Re: [tsvwg] Question regarding acceptable sharing behavior between "normal traffic" and L4S traffic --> "see These L4S issues reported are not show stoppers"
X-BeenThere: tsvwg@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: Transport Area Working Group <tsvwg.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/tsvwg>, <mailto:tsvwg-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/tsvwg/>
List-Post: <mailto:tsvwg@ietf.org>
List-Help: <mailto:tsvwg-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/tsvwg>, <mailto:tsvwg-request@ietf.org?subject=subscribe>
X-List-Received-Date: Mon, 18 Nov 2019 14:22:55 -0000

Hi Koen,


> On Nov 18, 2019, at 12:36, De Schepper, Koen (Nokia - BE/Antwerp) <koen.de_schepper@nokia-bell-labs.com> wrote:
> 
> Hi Sebastian,
> 
> See mail "These L4S issues reported are not show stoppers". In short: the TCP-Prague requirement "less RTT dependency" covers this. 
> This effect has been reported from the beginning. Not everyone is convinced this is even a problem, rather a feature, as it is the "Normal Internet behavior since 40 year".

	Koen, I believe you might want to re-read my email. I am not arguing about RTT dependence per se, but rather whether L4S meets the consensus "coexisting conditions" to merit roll-out into the internet, given that effectively it will build a low latency, higher bandwidth route between end hosts at the expense of both latency and bandwidth of non-participating flows. 
	Due to the unfortunate design decision to employ a loose coupling between the two queues instead of stricter isolation the artificially inflated RTT for a for normal flows (that dualq introduces under saturating load) puts them a distinct disadvantage in regards to L4S flows, the question hence is:

Does the TSVW community believe that this is acceptable, to declare a new design as "safely coexisting" and "ensur[ing] coexistence" if all that is guaranteed is not to starve the existing traffic?
 
IMHO neither the observed behavior, nor arguing the objections away instead of looking at the fundamental root cause that behavior strikes me as the most efficient/productive possible way forward.
I do want to add, that I even gave a pointer about how the issue might be ameliorated.
	But is it a correct representation of your position that demonstrated unequal sharing  is as designed and hence will not be reconsidered for improvements?


Best Regards
	Sebastian


> 
> Regards,
> Koen.
> 
> -----Original Message-----
> From: tsvwg <tsvwg-bounces@ietf.org> On Behalf Of Sebastian Moeller
> Sent: Monday, November 18, 2019 10:18 AM
> To: tsvwg IETF list <tsvwg@ietf.org>
> Subject: [tsvwg] Question regarding acceptable sharing behavior between "normal traffic" and L4S traffic
> 
> Dear All,
> 
> I have been making an argument repeatedly on this list in different threads to which I received inconsistent feed-back. So, I would very much like to raise this issue explicitly to the full list, as I believe this to be important to address in regards to L4S.
> 
> It has been observed by two independent measurements that the reference dual queue AQM demonstrates RTT-dependent unfairness between L4S and standard TCP traffic, the shorter the RTT the more of a bandwidth-advantage L4S gets.
> At a nominal RTT of 0ms (which as far as I can tell just means no additional netem delay, so a real RTT in the <1ms range) TCP Prague gets ~5-times more bandwidth than TCP cubic:
> The L4S team's measurement:
> https://l4s.cablelabs.com/l4s-testing/key_plots/batch-l4s-s1-2-cubic-vs-prague-50Mbit-0ms_var.png
> The SCE-team's measurement:
> https://www.heistp.net/downloads/sce-l4s-bakeoff/bakeoff-2019-11-11T090559-r2/l4s-s1-2/batch-l4s-s1-2-cubic-vs-prague-50Mbit-0ms_fixed.png
> 
> That leads to my questions:
> 
> A) Is/was everybody aware of this unequal sharing over a nominally equal path? 
> 
> I ask because the following two statements from the L4S-arch and the dualq-coupled I.D.'s seem to indicate a different kind of more equal sharing to me:
> 
> https://tools.ietf.org/html/draft-ietf-tsvwg-aqm-dualq-coupled-10 :
> "This specification defines `DualQ Coupled Active Queue Management
>  (AQM)', which enables these Scalable congestion controls to safely
>  co-exist with Classic Internet traffic.
> Analytical study and implementation testing of the Coupled AQM have
>   shown that Scalable and Classic flows competing under similar
>   conditions run at roughly the same rate."
> 
> and
> 
> https://tools.ietf.org/html/draft-ietf-tsvwg-l4s-arch-04 :
> "Further, the network part is simple to
>  deploy - incrementally with zero-config.  Both parts, sender and
>  network, ensure coexistence with other legacy traffic."
> 
> In the limit, the dual queue AQM will share traffic between the L4S queue and the normal queue approximately 15:1. Unless coexistence is considered to mean "does not starve" I fail to understand how claims and observed data match up.
> 
> B) Does everybody here think that this what is implied by the above stanzas and that this is an acceptable guarantee to merit allowing/endorsing the use of dual queue AQM over the internet?
> 
> 
> 
> 	Oliver Tilmans helpfully explained, that this behavior is draft-compatible but relies on a specific interpretation of "under similar conditions", once the queue under load is taken into consideration the RTT ration goes from 1:1 to 1:15 and hence the observed bandwidth sharing is considered acceptable due to conditions not being similar enough any more. 
> 
> I have two issues with that rationale:
> 
> 1) this does not seem to be the natural way to read that claim (Oliver mentioned that he had the same question/observation when he started to look into dualq, so my reading of the text is not completely outlandish)
> 
> 2) It is to a large degree a consequence of a design decision in the dual queue AQM that does not seem to have been sufficiently thoroughly considered.
> 	The point is that dual queue simply copies the recommendation to set the target for the "normal queue's" pie instance to 15 milliseconds (simply copying from QDELAY_REF in https://tools.ietf.org/html/rfc8033#appendix-B) without taking the consequence into consideration how this affects bandwidth sharing behavior between the two queues at short RTTs. According to codel's theory (https://tools.ietf.org/html/rfc8289#section-4.3) target should be set to ~5-10% of the interval, so instead of 15 ms, 5 ms probably should do to maintain bandwidth utilization while at the same time addressing part of the root cause for unequal sharing at low RTTs. It is well possible that 5ms will not wrk for a PIE derivative shaper like dualQ, but this should at least be explored before designing an undeserved advantage for L4S into the specifications.
> 
> 
> Best Regards
> 	Sebastian
> 
> 
> 
>