Re: [tsvwg] Question regarding acceptable sharing behavior between "normal traffic" and L4S traffic --> "see These L4S issues reported are not show stoppers"

"De Schepper, Koen (Nokia - BE/Antwerp)" <koen.de_schepper@nokia-bell-labs.com> Mon, 18 November 2019 15:33 UTC

From: "De Schepper, Koen (Nokia - BE/Antwerp)" <koen.de_schepper@nokia-bell-labs.com>
To: Sebastian Moeller <moeller0@gmx.de>
CC: tsvwg IETF list <tsvwg@ietf.org>
Thread-Topic: [tsvwg] Question regarding acceptable sharing behavior between "normal traffic" and L4S traffic --> "see These L4S issues reported are not show stoppers"
Thread-Index: AdWeA6ekYlrn2w+HQtyZh3WKDQhloAAF0jwAAAAx7VA=
Date: Mon, 18 Nov 2019 15:33:06 +0000
Message-ID: <AM4PR07MB3459620906A8857F95BA7DF8B94D0@AM4PR07MB3459.eurprd07.prod.outlook.com>
References: <AM4PR07MB3459F9002B330865D001F1A2B94D0@AM4PR07MB3459.eurprd07.prod.outlook.com> <FF1F6E25-BDD1-4D52-A5E5-58CA15385BF8@gmx.de>
In-Reply-To: <FF1F6E25-BDD1-4D52-A5E5-58CA15385BF8@gmx.de>
Accept-Language: nl-BE, en-US
Content-Language: en-US
received-spf: None (protection.outlook.com: nokia-bell-labs.com does not designate permitted sender hosts)
Content-Type: text/plain; charset="utf-8"
Content-Transfer-Encoding: base64
MIME-Version: 1.0
X-MS-Exchange-CrossTenant-Network-Message-Id: 42ef891c-712d-496d-b3e2-08d76c3c9bea
X-MS-Exchange-CrossTenant-originalarrivaltime: 18 Nov 2019 15:33:06.4842 (UTC)
X-MS-Exchange-CrossTenant-fromentityheader: Hosted
X-MS-Exchange-CrossTenant-id: 5d471751-9675-428d-917b-70f44f9630b0
X-MS-Exchange-CrossTenant-mailboxtype: HOSTED
X-MS-Exchange-CrossTenant-userprincipalname: ptBRprDBWf/tRQnKGpegSVzj+QWv0rDHS0eUbHYY4zpqhqHh+hrSXZfprOVZgyVNU46LfbVJGCzNsNvgF1oqSfjjJs/qlyjKHI9yf2rtww8nuAjRuGRthg1d+bPE4U1R
X-MS-Exchange-Transport-CrossTenantHeadersStamped: AM4PR07MB3458
Archived-At: <https://mailarchive.ietf.org/arch/msg/tsvwg/VOIHCK3sKl2EXLsAHngOwbgCmmM>
Subject: Re: [tsvwg] Question regarding acceptable sharing behavior between "normal traffic" and L4S traffic --> "see These L4S issues reported are not show stoppers"
X-BeenThere: tsvwg@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: Transport Area Working Group <tsvwg.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/tsvwg>, <mailto:tsvwg-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/tsvwg/>
List-Post: <mailto:tsvwg@ietf.org>
List-Help: <mailto:tsvwg-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/tsvwg>, <mailto:tsvwg-request@ietf.org?subject=subscribe>
X-List-Received-Date: Mon, 18 Nov 2019 15:33:25 -0000

Hi Sebastian,

>> But is it a correct representation of your position that demonstrated unequal sharing  is as designed and hence will not be reconsidered for improvements?

I did understand your worry, and to be clear this is exactly also my worry. So no, I'm not OK with this unequal sharing. My position is the same as yours. The only difference is that for L4S we need to solve this in the End-System.

>> I am not arguing about RTT dependence per se

Please read the following to understand the relevance:

A.1.5.  Reduce RTT dependence

   Description: A scalable congestion control needs to reduce or
   eliminate RTT bias over as wide a range of RTTs as possible, or at
   least over the typical range of RTTs that will interact in the
   intended deployment scenario.

   Motivation: Classic TCP's throughput is known to be inversely
   proportional to RTT, so one would expect flows over very low RTT
   paths to nearly starve flows over larger RTTs.  However, Classic TCP
   has never allowed a very low RTT path to exist because it induces a
   large queue.  For instance, consider two paths with base RTT 1ms and
   100ms.  If Classic TCP induces a 100ms queue, it turns these RTTs
   into 101ms and 200ms leading to a throughput ratio of about 2:1.
   Whereas if a Scalable TCP induces only a 1ms queue, the ratio is
   2:101, leading to a throughput ratio of about 50:1.

   Therefore, with very small queues, long RTT flows will essentially
   starve, unless scalable congestion controls comply with this
   requirement.

It is how the "Reduce RTT dependence" TCP-Prague requirement is described in the L4S-ID draft. Isn't it describing the exact test case that you also claim as an issue?

The source of this problem and (I'm happy to say) "our" worry, is RTT dependence, for which I mean that RTT is at the bottom part of the fraction in TCP's throughput equations. If flows get the same throughput independent of the RTT they experience, this issue disappears, right? This is the solution from an end-system perspective. Obviously we cannot make it as good as an FlowQueue-scheduler in the network (which is actually also not perfectly working under all circumstances). Also DualQ cannot change the behavior of Classic flows with a big RTT (which Cubic is solving partly).

>> given that effectively it will build a low latency, higher bandwidth route between end hosts at the expense of both latency and bandwidth of non-participating flows

This is indeed the problem that our experiments showed early, and as far as I remember this issue was discussed from the beginning even in meetings at the mike when I presented/warned for this issue.
My presentation pitch/insight was always that creating big buffers, is the current network's solution to solve the RTT-dependence of TCP congestion controls. By putting AQMs with smaller buffers we have to compromise on bigger RTT unfairness. The only reason why FQ-CoDel can get away with a 5ms threshold is indeed because it compensates the small buffer with FQ. PI2 could as well work with a target of 1ms wouldn't it be that this would also hurt extremely the longer RTT flows. If you want you can do an FQ_PI2 (a PI2 instance per flow-queue) with a target of 5ms or even 1ms. Never tried it, but I expect that it would work better than FQ_CoDel. Also I think you can setup an experiment with multiple bottlenecks where FQ_CoDel causes the same extreme effect. You need to make again combinations of FIFO and FQ_CoDel type nodes.

To solve this in congestion controls we should not expect perfection again. If you want (near) perfection then deploy FQ 😉. I'm happy with an approximation / good compromise. I think increasing RTT independence on the time the flow is getting frequent CE signals is a good goal. When a flow just starts, you want to benefit from the low RTT, to get fast up to speed and terminate quickly short flows. When you are at speed, you want to converge to the slower converging bigger RTT flows.

Regards,
Koen.

-----Original Message-----
From: Sebastian Moeller <moeller0@gmx.de> 
Sent: Monday, November 18, 2019 3:18 PM
To: De Schepper, Koen (Nokia - BE/Antwerp) <koen.de_schepper@nokia-bell-labs.com>
Cc: tsvwg IETF list <tsvwg@ietf.org>
Subject: Re: [tsvwg] Question regarding acceptable sharing behavior between "normal traffic" and L4S traffic --> "see These L4S issues reported are not show stoppers"

Hi Koen,

> On Nov 18, 2019, at 12:36, De Schepper, Koen (Nokia - BE/Antwerp) <koen.de_schepper@nokia-bell-labs.com> wrote:
> 
> Hi Sebastian,
> 
> See mail "These L4S issues reported are not show stoppers". In short: the TCP-Prague requirement "less RTT dependency" covers this. 
> This effect has been reported from the beginning. Not everyone is convinced this is even a problem, rather a feature, as it is the "Normal Internet behavior since 40 year".

	Koen, I believe you might want to re-read my email. I am not arguing about RTT dependence per se, but rather whether L4S meets the consensus "coexisting conditions" to merit roll-out into the internet, given that effectively it will build a low latency, higher bandwidth route between end hosts at the expense of both latency and bandwidth of non-participating flows. 
	Due to the unfortunate design decision to employ a loose coupling between the two queues instead of stricter isolation the artificially inflated RTT for a for normal flows (that dualq introduces under saturating load) puts them a distinct disadvantage in regards to L4S flows, the question hence is:

Does the TSVW community believe that this is acceptable, to declare a new design as "safely coexisting" and "ensur[ing] coexistence" if all that is guaranteed is not to starve the existing traffic?

IMHO neither the observed behavior, nor arguing the objections away instead of looking at the fundamental root cause that behavior strikes me as the most efficient/productive possible way forward.
I do want to add, that I even gave a pointer about how the issue might be ameliorated.
	But is it a correct representation of your position that demonstrated unequal sharing  is as designed and hence will not be reconsidered for improvements?

Best Regards
	Sebastian

> 
> Regards,
> Koen.
> 
> -----Original Message-----
> From: tsvwg <tsvwg-bounces@ietf.org> On Behalf Of Sebastian Moeller
> Sent: Monday, November 18, 2019 10:18 AM
> To: tsvwg IETF list <tsvwg@ietf.org>
> Subject: [tsvwg] Question regarding acceptable sharing behavior 
> between "normal traffic" and L4S traffic
> 
> Dear All,
> 
> I have been making an argument repeatedly on this list in different threads to which I received inconsistent feed-back. So, I would very much like to raise this issue explicitly to the full list, as I believe this to be important to address in regards to L4S.
> 
> It has been observed by two independent measurements that the reference dual queue AQM demonstrates RTT-dependent unfairness between L4S and standard TCP traffic, the shorter the RTT the more of a bandwidth-advantage L4S gets.
> At a nominal RTT of 0ms (which as far as I can tell just means no additional netem delay, so a real RTT in the <1ms range) TCP Prague gets ~5-times more bandwidth than TCP cubic:
> The L4S team's measurement:
> https://l4s.cablelabs.com/l4s-testing/key_plots/batch-l4s-s1-2-cubic-v
> s-prague-50Mbit-0ms_var.png
> The SCE-team's measurement:
> https://www.heistp.net/downloads/sce-l4s-bakeoff/bakeoff-2019-11-11T09
> 0559-r2/l4s-s1-2/batch-l4s-s1-2-cubic-vs-prague-50Mbit-0ms_fixed.png
> 
> That leads to my questions:
> 
> A) Is/was everybody aware of this unequal sharing over a nominally equal path? 
> 
> I ask because the following two statements from the L4S-arch and the dualq-coupled I.D.'s seem to indicate a different kind of more equal sharing to me:
> 
> https://tools.ietf.org/html/draft-ietf-tsvwg-aqm-dualq-coupled-10 :
> "This specification defines `DualQ Coupled Active Queue Management  
> (AQM)', which enables these Scalable congestion controls to safely  
> co-exist with Classic Internet traffic.
> Analytical study and implementation testing of the Coupled AQM have
>   shown that Scalable and Classic flows competing under similar
>   conditions run at roughly the same rate."
> 
> and
> 
> https://tools.ietf.org/html/draft-ietf-tsvwg-l4s-arch-04 :
> "Further, the network part is simple to  deploy - incrementally with 
> zero-config.  Both parts, sender and  network, ensure coexistence with 
> other legacy traffic."
> 
> In the limit, the dual queue AQM will share traffic between the L4S queue and the normal queue approximately 15:1. Unless coexistence is considered to mean "does not starve" I fail to understand how claims and observed data match up.
> 
> B) Does everybody here think that this what is implied by the above stanzas and that this is an acceptable guarantee to merit allowing/endorsing the use of dual queue AQM over the internet?
> 
> 
> 
> 	Oliver Tilmans helpfully explained, that this behavior is draft-compatible but relies on a specific interpretation of "under similar conditions", once the queue under load is taken into consideration the RTT ration goes from 1:1 to 1:15 and hence the observed bandwidth sharing is considered acceptable due to conditions not being similar enough any more. 
> 
> I have two issues with that rationale:
> 
> 1) this does not seem to be the natural way to read that claim (Oliver 
> mentioned that he had the same question/observation when he started to 
> look into dualq, so my reading of the text is not completely 
> outlandish)
> 
> 2) It is to a large degree a consequence of a design decision in the dual queue AQM that does not seem to have been sufficiently thoroughly considered.
> 	The point is that dual queue simply copies the recommendation to set the target for the "normal queue's" pie instance to 15 milliseconds (simply copying from QDELAY_REF in https://tools.ietf.org/html/rfc8033#appendix-B) without taking the consequence into consideration how this affects bandwidth sharing behavior between the two queues at short RTTs. According to codel's theory (https://tools.ietf.org/html/rfc8289#section-4.3) target should be set to ~5-10% of the interval, so instead of 15 ms, 5 ms probably should do to maintain bandwidth utilization while at the same time addressing part of the root cause for unequal sharing at low RTTs. It is well possible that 5ms will not wrk for a PIE derivative shaper like dualQ, but this should at least be explored before designing an undeserved advantage for L4S into the specifications.
> 
> 
> Best Regards
> 	Sebastian
> 
> 
> 
>

Re: [tsvwg] Question regarding acceptable sharing… De Schepper, Koen (Nokia - BE/Antwerp)
Re: [tsvwg] Question regarding acceptable sharing… Sebastian Moeller
Re: [tsvwg] Question regarding acceptable sharing… De Schepper, Koen (Nokia - BE/Antwerp)
Re: [tsvwg] Question regarding acceptable sharing… Sebastian Moeller