Re: [tsvwg] Related to "Non-L4S traffic abusing the L-queue" discussion during the interim

Sebastian Moeller <moeller0@gmx.de> Fri, 25 February 2022 22:32 UTC

Return-Path: <moeller0@gmx.de>
X-Original-To: tsvwg@ietfa.amsl.com
Delivered-To: tsvwg@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 076E43A07B7 for <tsvwg@ietfa.amsl.com>; Fri, 25 Feb 2022 14:32:43 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.649
X-Spam-Level:
X-Spam-Status: No, score=-1.649 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, FREEMAIL_ENVFROM_END_DIGIT=0.25, FREEMAIL_FROM=0.001, RCVD_IN_MSPIKE_H2=-0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=no autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (1024-bit key) header.d=gmx.net
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id rXtOEYtHyMHh for <tsvwg@ietfa.amsl.com>; Fri, 25 Feb 2022 14:32:38 -0800 (PST)
Received: from mout.gmx.net (mout.gmx.net [212.227.17.20]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 277E43A07B9 for <tsvwg@ietf.org>; Fri, 25 Feb 2022 14:32:37 -0800 (PST)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=gmx.net; s=badeba3b8450; t=1645828305; bh=edZs+jqc/3UH6a3TodKnBnnVWjRdsp8j4k/ca9Xs078=; h=X-UI-Sender-Class:Subject:From:In-Reply-To:Date:Cc:References:To; b=Rq+HjjU39eCJRiqtHEWN2beFoHtwIz6AHdlAo4+tYji8XNWLxHG/x33CevmgY5xlF TiQOKFTiK9YaaFTcieGsHYZyLi6tAGNLzS9iY0D1TgRdosBYysLsl9Lurfz2ViP4rB qpfoBLsGjxcuG7fFQ4NXLhcXNqfJT4mI3uj1dHUI=
X-UI-Sender-Class: 01bb95c1-4bf8-414a-932a-4f6e2808ef9c
Received: from smtpclient.apple ([95.116.211.112]) by mail.gmx.net (mrgmx104 [212.227.17.168]) with ESMTPSA (Nemesis) id 1M7b6b-1nHvpc07bB-0080CL; Fri, 25 Feb 2022 23:31:45 +0100
Content-Type: text/plain; charset="utf-8"
Mime-Version: 1.0 (Mac OS X Mail 14.0 \(3654.120.0.1.13\))
From: Sebastian Moeller <moeller0@gmx.de>
In-Reply-To: <AM9PR07MB731311A9E4532FD501B5D94CB93E9@AM9PR07MB7313.eurprd07.prod.outlook.com>
Date: Fri, 25 Feb 2022 23:31:38 +0100
Cc: "Black, David" <David.Black@dell.com>, Neal Cardwell <ncardwell@google.com>, tsvwg IETF list <tsvwg@ietf.org>, Bob Briscoe <in@bobbriscoe.net>
Content-Transfer-Encoding: quoted-printable
Message-Id: <AC61D119-FE97-4386-8FF0-A7783FA01522@gmx.de>
References: <AM9PR07MB7313D5AAF6B9D66C74CC35A1B9369@AM9PR07MB7313.eurprd07.prod.outlook.com> <AM9PR07MB7313F1401B14F6F2DB72A2B2B93E9@AM9PR07MB7313.eurprd07.prod.outlook.com> <MN2PR19MB40454F60DEE5735EAD428465833E9@MN2PR19MB4045.namprd19.prod.outlook.com> <CADVnQyk+uSX9GJtMBnsBhn9NzY+L3BKfhhUJ=yu4Aya98YEonw@mail.gmail.com> <MN2PR19MB40458624D266CDB54009AB19833E9@MN2PR19MB4045.namprd19.prod.outlook.com> <AM9PR07MB731311A9E4532FD501B5D94CB93E9@AM9PR07MB7313.eurprd07.prod.outlook.com>
To: "De Schepper, Koen (Nokia - BE/Antwerp)" <koen.de_schepper@nokia-bell-labs.com>
X-Mailer: Apple Mail (2.3654.120.0.1.13)
X-Provags-ID: V03:K1:WAJjSxlKs0Ey2zN26gDJ4vg0HBRfXaWEJDYqDw36e9TSM5Q4adF Dhalx40JIQgRwikEULXBiFZkhIBGoMfwAU1aNBGDyboXX9bzkcMHqYc9MDUpWrJA233XaY7 dILS5wEuLXBzKv3Mtd3njV/oqUyqA5W/Q2z1lCYRchBdyIoxUHhqSbsAtC9Q9IZXMWBBJiD uukcSpBphC+RdsgWW8Kjg==
X-UI-Out-Filterresults: notjunk:1;V03:K0:3oQJzWEo4D0=:t/5Zi/kx5jEFEkwr0i7YTO oU19wHQ9R80hbXdVrcOwRZUklWUcIL+1hlFcXNIEkGEbLeHn3B1kZpYzRll/MUQUYU5Vlhpvb Yuu18+KpgCR9W8qkAIHVK0CXyZJhr/mkpKnfIgHRgJkYDVE9J0QMkoD1Qi7RZKKAZEWEXWAWH 1bqgeKrhp9u3IXYaQY1bPGcAcPfspiSbQpFL3LXivX8RG4kl5sQ0k0tNsyls3gILsOqeeIRSs 8tdtSF8H4Os0tD3/qSjXyd7Hfo4TkrcTPlEE2bmdlI7oIOYD/teMrEwlIxYc3NZPnEddadHGw Dir77I4gkAF+LqyxTe9AkKN+PTlvt0U15s6lMHgE4JSwLOeyo8mPEPAnbAkDnzXAikay1HfJp G3/ACXX8RBA0qkVyBvmev5ch6ZNoOerx7jnCCRbFgg6jALSGdkThOCDy7aA92HdbNPxZ3ZPGA ClbGKW6KuNFWJSd5bW97hEnB9qiEwNCSgny/64nzsT0kOlF644cwemGrnobQIWiycKK+eSjVW 5bcVA0wZwX4MN9Tltyau1HyDlJaTA2aUxDY/LkJqAcd7UHAXjQt66hoUcX1DmZPSkbIC3u/Vn e4L5FVGopfRF6F5nuSNuV68wN052iOPrWfTSEIXjklJKTXJMvaeoFPGNH0R1tS8JEadBM4+Sb n/WWLPsEsON6+zEPhSDTYStB/FOO1ThXSiiES3E43c8SBlWweRsWeUcuc/nnzjiMG1ssuWsTt kD3q4pe0NC15YyhowJYJ7DNDPCIUIMvP009vmU3IhMcvGyrIKNxnkNFQWtgR6eG7dv2OhQi3j OGQgRomuHoytfvMFCJy/7QvNGZmIIcZ9g5ydApxEUKb+8JC8rDH+DpVGmwBuIcGy+jI4C1QDm kQ2cB3PCvOdAdl9upAjXc9+97XVmjCllGTCK1hr4jhn7iwJuOdjc/Zv/+zeMHvk6NPRXTrWvc Qi+rD+pSKs3gVDPRwc/cnQSYN2fGJEInF/02LIprhpL8OxjO4Vj/T4Kq2ufNUkidJYsrkzFwG zs8S7TadhSzUIJvcmoOEkpQGIo7oqso8Za8U2FHlKsuWlbh9InjSRE5K08Fk2vdy5RwKN7YZS ukP2caFjJEOz+o=
Archived-At: <https://mailarchive.ietf.org/arch/msg/tsvwg/OlKgM5yUtgPH2as3DfN_J6eQLZs>
Subject: Re: [tsvwg] Related to "Non-L4S traffic abusing the L-queue" discussion during the interim
X-BeenThere: tsvwg@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: Transport Area Working Group <tsvwg.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/tsvwg>, <mailto:tsvwg-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/tsvwg/>
List-Post: <mailto:tsvwg@ietf.org>
List-Help: <mailto:tsvwg-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/tsvwg>, <mailto:tsvwg-request@ietf.org?subject=subscribe>
X-List-Received-Date: Fri, 25 Feb 2022 22:32:44 -0000

Hi Koen,

from your link:

"Only when non-responsive traffic is below the link capacity it can fully use that share, making the responsive flows share the rest of the capacity (as usual for any AQM on the Internet on a shared queue)." 


This pretty much confirms what I predicted, except the drop plots look incomplete, the highly interesting drops for Not-ECT UDP (for CoDel) seem to be missing (generally the labeling for figure 8 is imprecise, e.g. showing Drops ECT(1)-UDP in the first set PIE with ECT(0)-UDP). My hunch is that comparing drops ECT(1)-UDP in DualPi2 with the missing drops Not-ECT UDP in Codel, that in the latter we see much more drops than in the former, et voila, exploitable way to gain more throughput by marking traffic ECT(1)... as long as that traffic is reasonably paced and does not exceed the capacity of the L4S AQM (not an unlikely scenario even for capacity seeking traffic, my measly 100Mbps access link will not saturate the BNG uplink of my ISP or the link to the DSLAM)

Regards
	Sebastian


> On Feb 25, 2022, at 19:30, De Schepper, Koen (Nokia - BE/Antwerp) <koen.de_schepper@nokia-bell-labs.com> wrote:
> 
> Hi David,
>  
> To be sure, we re-did the overload tests recently, confirming the previous overload results. These results are available at: Overload results caused by non-responsive UDP traffic for PIE, DualPI2 and CoDel AQMs | l4steam.github.io
>  
> Specifically look at figure 8 at the end which shows that L4S traffic gets marks, up to 100% and appropriate drop if it reaches and exceeds the link capacity.
>  
> The test case of Jonathan is approximated by the 70Mbps non-responsive ECT(1) UDP traffic on a 100Mbps link on a DualPI2 (Prague+Cubic) test case. In Jonathan’s case it was 40Mbps on a 50Mbps link. We also evaluated in extreme when sending at 100Mbps non-responsive ECT(1) UDP traffic on a 100Mbps link, and even exceeding at 140Mbps and 200Mbps. You will see the results are as if it is on a Single Q PIE AQM. Note also that CoDel which never drops ECT packets, causes actually close to starvation and high tail-drop delay results as shown in figure 1, even with ECT(0). So I guess all the concerns about FQ_CoDel and tunnels/Hash-collisions are equally severe and not related to L4S alone (can just be exploited by ECT(0) traffic today already!!).
>  
> Koen.
>  
> From: Black, David <David.Black@dell.com> 
> Sent: Friday, February 25, 2022 7:04 PM
> To: Neal Cardwell <ncardwell@google.com>
> Cc: De Schepper, Koen (Nokia - BE/Antwerp) <koen.de_schepper@nokia-bell-labs.com>; tsvwg IETF list <tsvwg@ietf.org>; Jonathan Morton <chromatix99@gmail.com>; Bob Briscoe <in@bobbriscoe.net>; Black, David <David.Black@dell.com>
> Subject: RE: [tsvwg] Related to "Non-L4S traffic abusing the L-queue" discussion during the interim
>  
> Hi Neal,
>  
> So, I saw that explanation – could someone check the "running code" to make sure that the coupling and marking occur even when the L queue is always empty?
>  
> Thanks, --David
>  
> From: Neal Cardwell <ncardwell@google.com> 
> Sent: Friday, February 25, 2022 12:58 PM
> To: Black, David
> Cc: De Schepper, Koen (Nokia - BE/Antwerp); tsvwg IETF list; Jonathan Morton; Bob Briscoe
> Subject: Re: [tsvwg] Related to "Non-L4S traffic abusing the L-queue" discussion during the interim
>  
> [EXTERNAL EMAIL] 
> 
>  
>  
> On Fri, Feb 25, 2022 at 11:56 AM Black, David <David.Black@dell.com> wrote:
> Koen,
>  
> I'll observe that "traffic that is not responding at all to CE marks" is not necessary to achieve the reported results if the experimental setup "prevents the L queue from seeing any
> 
> need to apply congestion signals, because it is always empty" as there would be no CE marks for the traffic in the L queue to respond to.
>  
> I think the key part here is "if". :-) The assertion "prevents the L queue from seeing any need to apply congestion signals, because it is always empty" is from:
>   https://sce.dnsmgr.net/downloads/L4S-WGLC2-objection-details.pdf [sce.dnsmgr.net]
> That assertion is inconsistent with the functioning of the Dual-Q algorithm, as described in:
>   https://www.ietf.org/id/draft-ietf-tsvwg-aqm-dualq-coupled-21.html [ietf.org]
>  
> As Bob noted: "in the scenario shown, although the L queue is indeed always empty, it will see a high level of congestion signals (~10% in this case) via the coupling."
> Here's Bob's e-mail for more context/details:
>   https://mailarchive.ietf.org/arch/msg/tsvwg/joFr3sfOrxxkYhWdYrO2rLlCNUw/ [mailarchive.ietf.org]
>  
> thanks,
> neal
>  
>  
>  
> Please give that further consideration.
>  
> Thanks, --David (as an individual)
>  
> From: tsvwg <tsvwg-bounces@ietf.org> On Behalf Of De Schepper, Koen (Nokia - BE/Antwerp)
> Sent: Friday, February 25, 2022 4:29 AM
> To: tsvwg IETF list; Jonathan Morton
> Subject: Re: [tsvwg] Related to "Non-L4S traffic abusing the L-queue" discussion during the interim
>  
> [EXTERNAL EMAIL] 
> 
> Hi Jonathan,
>  
> Can you confirm that this test is done with “Cubic” traffic that is not responding at all to CE marks? So it is just like any other non-responding traffic (like UDP CBR). We don’t see any other way to explain your results. 
>  
> If so, we can/should remove this “issue” from the shepherd’s write-up, as such unresponsive flows will get the same throughput on any single-Q bottleneck with or without AQM (taildrop/PI2/PIE/CoDel/STEP/RED/…) with a latency that matches the AQM strategy.
>  
> Koen.
>  
>  
> From: tsvwg <tsvwg-bounces@ietf.org> On Behalf Of De Schepper, Koen (Nokia - BE/Antwerp)
> Sent: Thursday, February 17, 2022 7:01 PM
> To: tsvwg IETF list <tsvwg@ietf.org>; Jonathan Morton <chromatix99@gmail.com>
> Subject: [tsvwg] Related to "Non-L4S traffic abusing the L-queue" discussion during the interim
>  
> Hi Jonathan,
>  
> It seems that the following open issue identified by the chairs:
>  
> Non-L4S traffic abusing the L-queue
> • ‘DualQ gives a large throughput bonus to L queue traffic, ie. a “fast lane”’
> • Is this a matter specific for DualQ that can be left for experimentation?
>  
> is based on the following experiment you performed:
>  
> >             simple two-flow competition test on a standard dumbbell topology,
> 
> >             with the bottleneck running a DualQ qdisc into a 50Mbps shaper.
> 
> >             Both flows were configured to use CUBIC congestion control with
> 
> >             ECN negotiated, but one was additionally tweaked to set ECT(1)
> 
> >             instead of ECT(0) on all data segments, and to pace its output at
> 
> >             40Mbps. This latter measure prevents the L queue from seeing any
> 
> >             need to apply congestion signals, because it is always empty.  These
> 
> >             tweaks allowed that flow to use 80% of the link capacity, gaining a
> 
> >             fourfold advantage over its competitor,
> 
>  
> If there is capacity seeking traffic in the Classic queue, then it is even desired that the L4S queue does not add extra marks. The L4S marks should come only from the Classic coupling.
> Before diving into details, can you first explain why in your experiment the coupling from the Classic Q has no effect on your paced and ECT(1) labeled Cubic flow?
>  
> I would expect that this ECT(1) labeled Cubic flow would get even less throughput than the Classic Cubic flow, as the first gets the doubled coupled CE marking probability (eg 2*10% = 20%) for L4S flows instead of the squared CE marking probability (10%^2 = 1%) which ECT(0) traffic would get.
>  
> Thanks,
> Koen.