Re: [tcpPrague] [tsvwg] ecn-l4s-id: Proposed Changed to Normative Classic ECN detection Text

"De Schepper, Koen (Nokia - BE/Antwerp)" <koen.de_schepper@nokia-bell-labs.com> Sun, 01 November 2020 21:53 UTC

Return-Path: <koen.de_schepper@nokia-bell-labs.com>
X-Original-To: tcpprague@ietfa.amsl.com
Delivered-To: tcpprague@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id D7C2E3A0994; Sun, 1 Nov 2020 13:53:49 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.901
X-Spam-Level:
X-Spam-Status: No, score=-1.901 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIMWL_WL_HIGH=-0.001, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, HTML_MESSAGE=0.001, RCVD_IN_MSPIKE_H2=-0.001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=unavailable autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (1024-bit key) header.d=nokia.onmicrosoft.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id m8MsKXcMwnmD; Sun, 1 Nov 2020 13:53:47 -0800 (PST)
Received: from EUR05-VI1-obe.outbound.protection.outlook.com (mail-vi1eur05on2126.outbound.protection.outlook.com [40.107.21.126]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id C486B3A098A; Sun, 1 Nov 2020 13:53:46 -0800 (PST)
ARC-Seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=W7tcnSGOmeSd0BegJta18fAhQl9uwPgFM88JCzylXvplAhiXsrOnwwzbt6KCMe5Gv7oh3H7N9XFQFaQXsWAK14e/2j4vshWEVB16q01Py8ODQr53tK5G7OukbxqlMdFfosx5mu+WdtBdfq7r1rvIepQyIvdGTSvVXkshwEu7CmIb/utkgTVzRPTTKOZrrQy7nx0AMrioNp/cHmuHSS8ByAf44SaHWL2qTlTj1Zd4L8IAIyuN6ejz1+1iYZkuH4MhRT1DjpfGDS/6fKJW7LHlb7P2r8xiZhzi2E2DYTOjpBqx3hPPqb7qewXsD6wQI8hxrUcHLMRtjR/3X02qse1Ezw==
ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=HrjYeGqKjuaurniTWFi3KSONoUWIbIzlf+Lzt9/MHJE=; b=bOYSVafyZXBLlgZNvfyI5D+d4kyZ6oJmRCWDpsOV0ZbH2aHZ/yHB7kkSM+zkYCMpI6inCGzdVfqjt2XR74ZNpIZlU1of25b6/8YJ/dgwWwYBAP1y6I10BFKB+6DvnmidGWBunfylh0DxUddFtznqOTZWnOloWZDPKU/HznhICUjBVqFaFYIefYgJ1pOH5U1SzOykKBX5L7Trk/3Hee+/nQxlh6/LWyui/om9pknXNOSJAAbpluUqzKOEshJhfF87WZxxOW+wxqT10RJOub+ljQ9j2T5wxrn2AUFeKN4j9ym2SzLMdUf7LXe+SHV4NZHJqc7x2rXKi0R14qR20AvIKw==
ARC-Authentication-Results: i=1; mx.microsoft.com 1; spf=pass smtp.mailfrom=nokia-bell-labs.com; dmarc=pass action=none header.from=nokia-bell-labs.com; dkim=pass header.d=nokia-bell-labs.com; arc=none
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=nokia.onmicrosoft.com; s=selector1-nokia-onmicrosoft-com; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=HrjYeGqKjuaurniTWFi3KSONoUWIbIzlf+Lzt9/MHJE=; b=jTJG3rzcNA/FBZd7q7HGxoprgMOHExZoJANCJsLQX3Hvyz2hSTEZg/UJLYRueVNQPI8wbfoxxWwXh5yW4/otLQsIYVv3EGJHiYcrZ5ycZGCPxhcAiaI/U6ysOeqPHq8rsTdSx3Q7oXiv96gXV5aMugJ9EUEzvGSgr75lwPyF/p8=
Received: from AM0PR07MB6114.eurprd07.prod.outlook.com (2603:10a6:208:113::33) by AM8PR07MB7345.eurprd07.prod.outlook.com (2603:10a6:20b:249::13) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.3541.10; Sun, 1 Nov 2020 21:53:43 +0000
Received: from AM0PR07MB6114.eurprd07.prod.outlook.com ([fe80::a9f4:9199:8226:d6c6]) by AM0PR07MB6114.eurprd07.prod.outlook.com ([fe80::a9f4:9199:8226:d6c6%6]) with mapi id 15.20.3541.010; Sun, 1 Nov 2020 21:53:43 +0000
From: "De Schepper, Koen (Nokia - BE/Antwerp)" <koen.de_schepper@nokia-bell-labs.com>
To: Christian Huitema <huitema@huitema.net>, Jonathan Morton <chromatix99@gmail.com>
CC: iccrg IRTF list <iccrg@irtf.org>, Bob Briscoe <ietf@bobbriscoe.net>, tsvwg IETF list <tsvwg@ietf.org>, TCP Prague List <tcpPrague@ietf.org>
Thread-Topic: [tcpPrague] [tsvwg] ecn-l4s-id: Proposed Changed to Normative Classic ECN detection Text
Thread-Index: AQHWsIfwC/FrblPY50a/ND1AVr6ZVKmzwIVw
Date: Sun, 01 Nov 2020 21:53:43 +0000
Message-ID: <AM0PR07MB6114083DB6F3FBA98AA2281FB9130@AM0PR07MB6114.eurprd07.prod.outlook.com>
References: <1b71a610-75ea-e1d4-e3ce-f0ae6a4c12f7@bobbriscoe.net> <28247e5f-5df3-1f75-50e6-b4a7e80d5ab0@huitema.net> <61B6E9DC-8786-4CC0-9AC0-7F74E3529A73@gmail.com> <40751956-daa0-5693-d880-c45e8ca262cc@huitema.net>
In-Reply-To: <40751956-daa0-5693-d880-c45e8ca262cc@huitema.net>
Accept-Language: nl-BE, en-US
Content-Language: en-US
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
authentication-results: huitema.net; dkim=none (message not signed) header.d=none;huitema.net; dmarc=none action=none header.from=nokia-bell-labs.com;
x-originating-ip: [81.82.56.234]
x-ms-publictraffictype: Email
x-ms-office365-filtering-ht: Tenant
x-ms-office365-filtering-correlation-id: 5e36002d-9c76-49dd-6a99-08d87eb09a04
x-ms-traffictypediagnostic: AM8PR07MB7345:
x-microsoft-antispam-prvs: <AM8PR07MB7345274E79D7265B5BE38C15B9130@AM8PR07MB7345.eurprd07.prod.outlook.com>
x-ms-oob-tlc-oobclassifiers: OLM:5236;
x-ms-exchange-senderadcheck: 1
x-microsoft-antispam: BCL:0;
x-microsoft-antispam-message-info: dNYAxAN3nWdS1D6ltpXUPqza0D+ThLn5WAIbtODPRwILh/VAlKsPZjAX52Ig329uSL9QJihFXAD3q8mwHrsGB01KjXWhQcZ552o+GIYMLkEIbzglIrnLKWsEpGXYJxeCxZLovXSDz+ifCXa6vZkNL8fOkvzVkyEmoAybeYMPbd50eHEUbmXEYAd0YHc9VWba1qh48YXCJsOMu2N8nALuMq+uUQrZuaVg6w3qx/5pyNXQqJ8AUNtQmFDAMLnbVY3qgBs8UE8dFMCPSxwqfzZGtaz0tgZHIzTXFPPyorBRiDC1kuid75C9HMFlqNfV4+5b9W9YoOLL866TqQ/JZhbEYQ==
x-forefront-antispam-report: CIP:255.255.255.255; CTRY:; LANG:en; SCL:1; SRV:; IPV:NLI; SFV:NSPM; H:AM0PR07MB6114.eurprd07.prod.outlook.com; PTR:; CAT:NONE; SFS:(4636009)(39860400002)(346002)(136003)(366004)(376002)(396003)(26005)(9686003)(110136005)(186003)(316002)(66574015)(83380400001)(4326008)(54906003)(6506007)(71200400001)(5660300002)(76116006)(478600001)(66476007)(86362001)(8936002)(9326002)(2906002)(7696005)(64756008)(33656002)(55016002)(52536014)(66946007)(53546011)(8676002)(66446008)(66556008); DIR:OUT; SFP:1102;
x-ms-exchange-antispam-messagedata: MWzaBL/29f6Ad00raRHg/r0UjeoyHmhMhj+hEOnz+XA715GFR4yM08LvU4bCRsyVoj1zgJvC7xwsZgTeOhRFbmJXgRpp7Kxi3mOgfQDjYZkIies1oRE/uIiY5sAhzH+5Zm+oVE3gkaATx65KG1YocVxJ67ZZ/Rpa2sUGOWyJmTglZ+vznGjb0BVLXnyFRRJ1rHpmrnX7k2n/3HqIcPWYjvMKaF2Fbe42g2tOIm/0BHr5kQN3ASHnrFkbp/UZdmc5mL8/+OD/YhDyripbGXodRZ4LXaeHT7ahn19eO5s0iQa+GGGZTIUB+lGIIESWlQeyDIYyY97+/cZdMSNGMzpJXFtt1waDw6v5IlrOuZPIfqcR0zLl9yM6jCDgJQiZdfrP5GTj5MPXX+c/aD7q198A6mT7ldPSrPBGGjbsWNddMF5QyyZVlVQcVsnGKpFpizbQUjFQMpNUU8KuagxoT1jPcoMj1r6UsE2hE83ABngoR3U+2uNvwQ1W1DpQyuDiIkzPZZbMPczPEIs0/a76VsjLcdLakIPx6p4T4yJhtFBuqnnhHY2wswx1NRZ15mF9eRlvhcHF7VtOvmeEZMwpPlCJ3oV+RTVxQNyqa1f381jJEMohFerkMGP5sgvBkcLHl0kNTzaSLDBinX8zG/EmYCHJGg==
x-ms-exchange-transport-forked: True
Content-Type: multipart/alternative; boundary="_000_AM0PR07MB6114083DB6F3FBA98AA2281FB9130AM0PR07MB6114eurp_"
MIME-Version: 1.0
X-OriginatorOrg: nokia-bell-labs.com
X-MS-Exchange-CrossTenant-AuthAs: Internal
X-MS-Exchange-CrossTenant-AuthSource: AM0PR07MB6114.eurprd07.prod.outlook.com
X-MS-Exchange-CrossTenant-Network-Message-Id: 5e36002d-9c76-49dd-6a99-08d87eb09a04
X-MS-Exchange-CrossTenant-originalarrivaltime: 01 Nov 2020 21:53:43.5200 (UTC)
X-MS-Exchange-CrossTenant-fromentityheader: Hosted
X-MS-Exchange-CrossTenant-id: 5d471751-9675-428d-917b-70f44f9630b0
X-MS-Exchange-CrossTenant-mailboxtype: HOSTED
X-MS-Exchange-CrossTenant-userprincipalname: S3WH6BNeOdccIyrfprWcEiSLm41ZwndRWE1UIB6ymJVWv9/OOWkaUt10ACyxXVLiO9ydrXg1jn/rAOUxo6oIiNUoUoX/gVqIj+XbZtn5x9JDQaLL/NyS9qlo9/j2ERMV
X-MS-Exchange-Transport-CrossTenantHeadersStamped: AM8PR07MB7345
Archived-At: <https://mailarchive.ietf.org/arch/msg/tcpprague/hAetMBSZq8WfM2SZDba3yCLT0ms>
Subject: Re: [tcpPrague] [tsvwg] ecn-l4s-id: Proposed Changed to Normative Classic ECN detection Text
X-BeenThere: tcpprague@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: "To coordinate implementation and standardisation of TCP Prague across platforms. TCP Prague will be an evolution of DCTCP designed to live alongside other TCP variants and derivatives." <tcpprague.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/tcpprague>, <mailto:tcpprague-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/tcpprague/>
List-Post: <mailto:tcpprague@ietf.org>
List-Help: <mailto:tcpprague-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/tcpprague>, <mailto:tcpprague-request@ietf.org?subject=subscribe>
X-List-Received-Date: Sun, 01 Nov 2020 21:53:50 -0000

Hi Christian,


>>By the way, I do think that L4S makes too many hypotheses about the CC. I wish the AQM was based on intrinsic properties, such as "this flow is using more than its fair share of bandwidth"

This last quote is exactly what the L4S marking is intended to do: the smoothed marking probability over a longer time interval (about 10 RTTs) indicates the fair target rate. Assuming we agree on 15ms being the reference RTT, then the average rate can be r=2/(p*0.015)=133/p packets per second. So you can evaluate yourself if you use more or less than the fair share. How you respond to this knowledge is up to your CC.

>>or "this link is congested and the transport should back off"

L4S marks also give a very accurate queue size/delay indication (the queue size/delay that the network prefers). So keeping track of the RTT of marked packets, especially when they change from unmarked to marked and v/v is a good indicator of the target RTT, without the need to assume a universal delay target.

>> rather than reasoning as if implementations were still using TCP-RENO, which neither Linux or Windows or Chrome does.
We only assume TCP-friendliness, which is typically based on the Reno/Cubic reference. I agree, there shouldn’t be any dependency on the implementation of the congestion control itself.

I would assume that BBR can more tightly follow the available BW if it receives CE marks, and that the BW probing increase/frequency/duration in that case can be adapted to trigger the right marking probability matching the current BW. If no (or insufficient) marks are received, the normal BBR BW probing can be used. If too many marks are received, the BW needs to be reduced and normal BW probing can be suppressed/reduced…

Regards,
Koen.




From: Christian Huitema <huitema@huitema.net>
Sent: Sunday, November 1, 2020 8:48 PM
To: Jonathan Morton <chromatix99@gmail.com>
Cc: iccrg IRTF list <iccrg@irtf.org>; Bob Briscoe <ietf@bobbriscoe.net>; tsvwg IETF list <tsvwg@ietf.org>; De Schepper, Koen (Nokia - BE/Antwerp) <koen.de_schepper@nokia-bell-labs.com>; TCP Prague List <tcpPrague@ietf.org>
Subject: Re: [tcpPrague] [tsvwg] ecn-l4s-id: Proposed Changed to Normative Classic ECN detection Text



On 11/1/2020 9:00 AM, Jonathan Morton wrote:

On 1 Nov, 2020, at 3:07 am, Christian Huitema <huitema@huitema.net><mailto:huitema@huitema.net> wrote:



I am reading the L4S ECN-AQM proposal with an eye on responding to it in an implementation of QUIC, and I have a couple of questions regarding use of ECN marking with QUIC.



The document does not mention QUIC, yet QUIC is already used in a large fraction of Internet traffic. QUIC does specify support for ECN, and QUIC acknowledgements may carry counts of each category of ECN marks received from the peer -- three counters for ECT(0), ECT(1) and CE. In theory, QUIC implementations could take advantage of L4S -- in fact, at least one implementation supports DC-TCP like CC already. Is there interest in specifying L4S for QUIC?



My next question regards the interaction of the proposed L4S ECN-AQM with CC algorithms like BBR that attempt to discover the bottleneck packet rate for the connection, and use pacing to send packets at that rate. I observed that BBR is never mentioned in the draft, yet BBR is used in a sizeable part of Internet traffic. Do we have data on how a non-L4S aware implementation of BBR interacts with the proposed L4S AQM?



My last question regards potential use of ECT(1) marking. Most current implementations set ECT(0), but setting ECT(1) instead is trivial. This should elicit an L4S compatible response in L4S-AQM, and the BBR implementation might be modified to use the signals as part of the bottleneck bandwidth tracking. But there is a small issue there. With BBR, QUIC packets are supposedly paced at just under the bottleneck rate, except during "probe" periods in which they probe for 1 RTT at a slightly higher rate. The L4S AGM might degenerate in a form of ON-OFF control -- no feedback at all most of the time, then a bunch of CE marks if the probe rate exceeds the bottleneck bandwidth. As anyone experimented with that?

In my view, the biggest question you should be asking is how QUIC will distinguish between CE marks applied by an L4S AQM on the path, and those applied by an RFC-3168 AQM on the path.  The latter will treat ECT(1) marked packets the same as ECT(0), and expects the same multiplicative-decrease congestion response, but L4S expects a qualitatively different linear response.

Yes, that's a significant issue. As it is, nodes can only work with L4S on specific paths that are somehow guaranteed to not mix the "classic" and "L4S" behavior. For now, the only way I would implement it is by adding an "l4s" option, that would only be turned on by default when experimenting with L4S behavior. If we want any large scale deployment, we have to get an IETF consensus on the evolution of ECN -- something prescriptive, not just the "may experiment" of RFC 8311. Currently, implementations can only treat CE marks as an indication that they are probably sending faster than the network allows.



You should probably not do any substantial work on integrating L4S into QUIC until you have a good answer to that question.  Alternative approaches to high-fidelity congestion control exist which resolve that particular conundrum easily.  In particular, the ECN field feedback mechanism in QUIC can also support SCE.

Yes, QUIC implementations could indeed support SCE. But they could not support both SCE and L4S, and that's a big bummer.

I wonder whether there is a way to unify the L4S and SCE behaviors. Transport protocol developers are unlikely to widely develop either option unless the IETF comes to a consensus. This is a shame, because ECN based congestion control does not have the ambiguity of loss-based CC: ECN marks are not ambiguous, they are clearly signals from the network, the application does not need to build logic to differentiate between congestion induced losses and random losses due to, for example, radio event. Failing that ECN unification CC algorithms end up relying a lot on bandwidth and delay measurements, but that is sub-optimal because some radio links do have variable bandwidth and variable delays.

By the way, I do think that L4S makes too many hypotheses about the CC. I wish the AQM was based on intrinsic properties, such as "this flow is using more than its fair share of bandwidth" or "this link is congested and the transport should back off" rather than reasoning as if implementations were still using TCP-RENO, which neither Linux or Windows or Chrome does.

-- Christian Huitema