Re: [tsvwg] plan for L4S issue #29

"De Schepper, Koen (Nokia - BE/Antwerp)" <koen.de_schepper@nokia-bell-labs.com> Wed, 23 September 2020 15:46 UTC

Return-Path: <koen.de_schepper@nokia-bell-labs.com>
X-Original-To: tsvwg@ietfa.amsl.com
Delivered-To: tsvwg@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id EEFE23A11AA for <tsvwg@ietfa.amsl.com>; Wed, 23 Sep 2020 08:46:44 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -3.596
X-Spam-Level:
X-Spam-Status: No, score=-3.596 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIMWL_WL_HIGH=-1.695, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, RCVD_IN_MSPIKE_H2=-0.001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (1024-bit key) header.d=nokia.onmicrosoft.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id TMHyQrYBEYZ5 for <tsvwg@ietfa.amsl.com>; Wed, 23 Sep 2020 08:46:43 -0700 (PDT)
Received: from EUR03-DB5-obe.outbound.protection.outlook.com (mail-eopbgr40118.outbound.protection.outlook.com [40.107.4.118]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 02A213A11A6 for <tsvwg@ietf.org>; Wed, 23 Sep 2020 08:46:42 -0700 (PDT)
ARC-Seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=ngOOU2DY3OM9dw8zKlPrpZtIfGHdMfSihUn4EIrZ5hHrT8AC1I6ydyGNroGsfipRScqSD19xqSXg7EGQkRe19SDcTmEBQG4FypLfm0+BMtBIjcNiiKPGTLv0tuvtqx7v8ahLKaqjv+Ru88ntJI8wgIk9KTbvIQRWPSwpZXI1o2i3KEUL7Zp+vE3biqAILbWC93nnlNRQnvfCNg1lMhM01wxUNfBEIWixuH07l4R11xVpFkA4PsZrNPoboGE/pXrBSvk2ynpg8FGr16tNYgocYmWa0G/rUrF3t/8Ff8GoCHldIA3qC+szqdWuq+bBxhennjNxRnXDeK8o9+IF2HoxDA==
ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=FITiI5lHJsfKRn3x/k5cEJNnDLh16bNp2T11apeeNp8=; b=MISw7rZgo6xe10JZZnwo/CW9wBVRoxQvHAdiVCXUcvfnKifLRCwVMk7jvrKc4ZjpsvsLudQp7N+Zny999jF93JS6IHbIRgnfOa5kS4wEnvHyjz5HW87P4asu5GgXDSl3WkNAHk247DPA4Hbip6d1IzdxFCnAVAdfg6vW7baEokfGL6Y++zMQJI1xHeaLve0gqmKtJBP595oWy0n84N+Thhtv3aKzSKj2uvM6jOwfLqy44KUE7stPWX829CpqJL88fZNJfrKxgsMI4SMUeBr/GLRrObkGJYiW9swGFIlIK0YMlj8z9fVbmoVWxu/DZTlAAepBrg3eYL6fixeETVlowQ==
ARC-Authentication-Results: i=1; mx.microsoft.com 1; spf=pass smtp.mailfrom=nokia-bell-labs.com; dmarc=pass action=none header.from=nokia-bell-labs.com; dkim=pass header.d=nokia-bell-labs.com; arc=none
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=nokia.onmicrosoft.com; s=selector1-nokia-onmicrosoft-com; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=FITiI5lHJsfKRn3x/k5cEJNnDLh16bNp2T11apeeNp8=; b=eEMn4O/oavlebnanuF/areCqERH9MltHAmNlC3PtAEeZ0Cnwpz8n3mWp9n+66s99hF4XchAM5WojjgmTL7jPArwpAShqx5cQEnJOQhBmP+wAeLPsWyFOwsQ8sTuMrYguMfe1cZh+bDuXi+lPMyjPBhGpHlKkP9t2O1hWe41YGPk=
Received: from AM0PR07MB6114.eurprd07.prod.outlook.com (2603:10a6:208:113::33) by AM0PR07MB4212.eurprd07.prod.outlook.com (2603:10a6:208:ba::22) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.3412.11; Wed, 23 Sep 2020 15:46:39 +0000
Received: from AM0PR07MB6114.eurprd07.prod.outlook.com ([fe80::a06d:c78b:2c84:9f9]) by AM0PR07MB6114.eurprd07.prod.outlook.com ([fe80::a06d:c78b:2c84:9f9%5]) with mapi id 15.20.3412.020; Wed, 23 Sep 2020 15:46:39 +0000
From: "De Schepper, Koen (Nokia - BE/Antwerp)" <koen.de_schepper@nokia-bell-labs.com>
To: Pete Heist <pete@heistp.net>, Wesley Eddy <wes@mti-systems.com>
CC: "tsvwg@ietf.org" <tsvwg@ietf.org>
Thread-Topic: [tsvwg] plan for L4S issue #29
Thread-Index: AQHWZ1Eiyyf8Tpcrk06jp3qQxH0lfKkiCPgAgEfDadCADKx2gIAALuQg
Date: Wed, 23 Sep 2020 15:46:39 +0000
Message-ID: <AM0PR07MB61140549F3BCAA65BBE6AD24B9380@AM0PR07MB6114.eurprd07.prod.outlook.com>
References: <ca8ede0e-53a2-f4ff-751d-f1065cf5e795@mti-systems.com> <D0D3EDCE-3633-4E37-A167-3F1E09148ED9@heistp.net> <AM0PR07MB6114EDA6F2E8DCCB3D86D082B9200@AM0PR07MB6114.eurprd07.prod.outlook.com> <92c056567b3ad7af08777829314673ed66f5a96b.camel@heistp.net>
In-Reply-To: <92c056567b3ad7af08777829314673ed66f5a96b.camel@heistp.net>
Accept-Language: nl-BE, en-US
Content-Language: en-US
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
authentication-results: heistp.net; dkim=none (message not signed) header.d=none;heistp.net; dmarc=none action=none header.from=nokia-bell-labs.com;
x-originating-ip: [2a02:1810:1e00:cb00:e1fe:6185:f17a:6215]
x-ms-publictraffictype: Email
x-ms-office365-filtering-ht: Tenant
x-ms-office365-filtering-correlation-id: 0608a57b-5514-42db-dfcc-08d85fd7dcc5
x-ms-traffictypediagnostic: AM0PR07MB4212:
x-microsoft-antispam-prvs: <AM0PR07MB4212F462616EC42D17411966B9380@AM0PR07MB4212.eurprd07.prod.outlook.com>
x-ms-oob-tlc-oobclassifiers: OLM:10000;
x-ms-exchange-senderadcheck: 1
x-microsoft-antispam: BCL:0;
x-microsoft-antispam-message-info: ZvjEjrBxCvpqC+CcSC3vooOx5Q/b4G6yAZBAcoEjmktwuq2dT0IF0B606jwf5GuS7L69OwpAdmICKE36LD3l1tkxzinL9ODVEgs7ylOTfN97GPsuAMEXKp53/SPwnu5XzFSAneHpCXo2rVDN0nydMQuAa+h9tAlyrAUm1MQVK5Ui7D9Tcc0yNAABJDEKI/AWHEdzDJPWNtss0LB0jDJLpZBfFX8388xMw4DnubxBjYvMHCcXhymibY+tQBl5ltGT8NPnYKW8t9Y/mENu8FEEgvqyw6SOmb2nTzp1FDXLoB2hrCVjOxDdGSjU6oEOfCPQ1LvVbbFyBOrSo4XCGQ0Oo8WiDDjZ7ILWH0FeGODKNyF1wL9eEeOHHtcyDrnCCCKI6TRhGzGajbrSKKJa77uMGw==
x-forefront-antispam-report: CIP:255.255.255.255; CTRY:; LANG:en; SCL:1; SRV:; IPV:NLI; SFV:NSPM; H:AM0PR07MB6114.eurprd07.prod.outlook.com; PTR:; CAT:NONE; SFS:(4636009)(39860400002)(136003)(346002)(376002)(396003)(366004)(316002)(86362001)(55016002)(66556008)(2906002)(71200400001)(64756008)(66446008)(83380400001)(9686003)(66476007)(478600001)(76116006)(966005)(110136005)(66946007)(186003)(33656002)(52536014)(5660300002)(7696005)(4326008)(8936002)(53546011)(8676002)(6506007)(66574015); DIR:OUT; SFP:1102;
x-ms-exchange-antispam-messagedata: PAvKGv+nT/5yYqE6qnHled+AiE2O8uhA81aifVazVbzXsl+hEmuefrcv/ntXfFSozIhbZySL1ZLN823w6VceeMx7dbGaFkgi+4N8jhApcj4IrfAu5J+0r6c33grO7W1W1/EJySyY9IXhFNK8Yt14JGy9l8h24byUTkRXnNWYrKqxoltmR6VUhDwO4dEAdQujg/NSIQzYGmm0ewDIMo81djABTFzXathkjPLpZ0CROE84P2I54O/uaX2K9G3Yvm/TTZrBkWrwEXbZhpCRDDa02Nr5UAOZBuSsBzjXQZ1b5nFOblWnsUG2XgZVjptNdG5GzYnq9jcgbUbSVfJMk8+Iv2qD1OwdUS+dM7vqG2fkZz9a4dbr7Tfd1r0O5O2KDcjiZrW9UoKQ2p5/x29HR+PUtuMPOOpUiC3xEcVZECVtZfZA46sgZS5K1p3o9EckcxTTZk2FOkuu1b32BFA5EIgf6p6WVK903UHIUWj0qxsKck5HrE7A5HdteU5W3WSoGf6aazW14QlOh6rYF6s0BnAkWbT9A4O+mBSSV2OuyB0i6IqlPPcKter9/TI7vzMdhDjGVMV4UrxuwITp5AWZJkDObzQwCatnosB9CdTnyDpfHu39Qr4aVgZzm7/V53hc4JVXMsH/bRSr3TQwilU7CDirTzZKof/2rBk6eJeho6ie92t8ZyhylDBWKiegwNpgBbDxVHIRZYK9f1WQJOI7EuMjzw==
x-ms-exchange-transport-forked: True
Content-Type: text/plain; charset="utf-8"
Content-Transfer-Encoding: base64
MIME-Version: 1.0
X-OriginatorOrg: nokia-bell-labs.com
X-MS-Exchange-CrossTenant-AuthAs: Internal
X-MS-Exchange-CrossTenant-AuthSource: AM0PR07MB6114.eurprd07.prod.outlook.com
X-MS-Exchange-CrossTenant-Network-Message-Id: 0608a57b-5514-42db-dfcc-08d85fd7dcc5
X-MS-Exchange-CrossTenant-originalarrivaltime: 23 Sep 2020 15:46:39.8478 (UTC)
X-MS-Exchange-CrossTenant-fromentityheader: Hosted
X-MS-Exchange-CrossTenant-id: 5d471751-9675-428d-917b-70f44f9630b0
X-MS-Exchange-CrossTenant-mailboxtype: HOSTED
X-MS-Exchange-CrossTenant-userprincipalname: UyOTzder/lD2R0bLbjs266jA/5BRhy9u/BzAf3YoYw83fb6cCDTHzKYK6LQcUj5OvZVnE3M0Pqt5KGn9WTPyO7eyg/ohpE+rEPFeptgCn4j5b18IF7SjdaPhc7Inu0g4
X-MS-Exchange-Transport-CrossTenantHeadersStamped: AM0PR07MB4212
Archived-At: <https://mailarchive.ietf.org/arch/msg/tsvwg/Qp9oLyrPMIcW8hKtehRfO2N_lvY>
Subject: Re: [tsvwg] plan for L4S issue #29
X-BeenThere: tsvwg@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: Transport Area Working Group <tsvwg.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/tsvwg>, <mailto:tsvwg-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/tsvwg/>
List-Post: <mailto:tsvwg@ietf.org>
List-Help: <mailto:tsvwg-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/tsvwg>, <mailto:tsvwg-request@ietf.org?subject=subscribe>
X-List-Received-Date: Wed, 23 Sep 2020 15:46:45 -0000

Hi Pete,

I don't think the goal is to wait for a perfect RFC3168 detection solution and we already have an implementation that works well under a wide variety of conditions without additional configuration. This also means that it can still be improved, which I think can be done more appropriate when facing real world issues. The goal of the operational guidelines is to provide a larger set of tools for solving problems with classic ECN bottlenecks, exactly to avoid the need to rely on perfect end-host detection mechanism only.

Related to the "false negatives" and "false positives" naming, I agree that it is very confusing. As the goal is to detect Classic ECN network AQM behavior, maybe better and shorter names could be "false-detect" and "false-non-detect"?

Koen.

-----Original Message-----
From: Pete Heist <pete@heistp.net> 
Sent: Wednesday, September 23, 2020 2:19 PM
To: De Schepper, Koen (Nokia - BE/Antwerp) <koen.de_schepper@nokia-bell-labs.com>; Wesley Eddy <wes@mti-systems.com>
Cc: tsvwg@ietf.org
Subject: Re: [tsvwg] plan for L4S issue #29

Hi Koen,

I can definitely understand the need to turn bottleneck detection on and off for testing, or for additional knobs during development.

Overall, I suspect that there will be more questions about potential problems if bottleneck detection is not a MUST for implementations in the draft, or not baked into the final implementation in a way that works well under a wide variety of conditions without additional configuration.

On an easier topic, I wonder if we shouldn't change the "false negatives" and "false positives" terminology to something clearer, like "mis-identification of RFC 3168 bottlenecks as L4S", or "mis- identification of L4S bottlenecks as RFC 3168", respectively. I might have opened up a can of worms there in trying to save a few words. :)

Pete

On Tue, 2020-09-15 at 12:43 +0000, De Schepper, Koen (Nokia -
BE/Antwerp) wrote:
> Hi Wes, Pete,
> 
> I think to make progress on avoiding both false negatives and false 
> positives, a good view on the conditions that cause problems is 
> needed. So we better have the means to detect the real life impact of 
> Classic-ECN-FIFO deployments. This means we need to be able to switch 
> off Classic ECN detection (under controlled or even known conditions).
> 
> Another point is that it would be useful also to have all control 
> variables of the existing implementation configurable for everyone 
> willing to further experiment (without necessarily needing to change 
> code). As I understood, the right tuning of these can bring a lot of 
> further improvement opportunities. Also depending on a typical 
> deployment, these parameters could be tuned for that specific targeted 
> case.
> 
> So the resolution of this issue is exactly to facilitate further 
> improving the detection algorithm (preferably via tuning), and being 
> able to disable it when conditions are controlled or safe to avoid 
> these false negatives.
> 
> I think these are topics that can be covered by the Operational 
> Guidelines draft.
> 
> Regards,
> Koen.
> 
> -----Original Message-----
> From: tsvwg <tsvwg-bounces@ietf.org> On Behalf Of Pete Heist
> Sent: Friday, July 31, 2020 8:53 PM
> To: Wesley Eddy <wes@mti-systems.com>
> Cc: tsvwg@ietf.org
> Subject: Re: [tsvwg] plan for L4S issue #29
> 
> Hi Wesley,
> 
> One thing I noticed during testing was that the current implementation 
> of TCP Prague in Linux allows disabling bottleneck detection through 
> the prague_ecn_fallback kernel module parameter ( 
> https://github.com/L4STeam/linux/blob/0e7cf8acb318873c3f61084453f8da15
> b2e398be/net/ipv4/tcp_prague.c , line 158). I don’t know if that was 
> left in only for testing.
> 
> In section 6.3.3 of l4s-arch, there is discussion around classic 
> bottleneck detection. Since I don’t see an explicit MUST that it 
> remain enabled (although I do see the text “an L4S sender will have to 
> fall back to…”), it’s not completely clear to me if it’s actually 
> required to be implemented and permanently enabled in all 
> implementations. If it is, I suppose the implementation should reflect 
> that also.
> 
> While I feel it best that detection identifies both types of queues 
> accurately, if bottleneck detection were both an explicit MUST in the 
> text *and* not possible to disable in any implementation, I think that 
> would make the misidentification of L4S queues as classic ECN queues 
> less of a safety concern, since it would be impossible to turn off. It 
> would remain an issue for the architecture overall though.
> 
> Hope that helps...
> 
> Pete
> 
> > On Jul 31, 2020, at 5:41 PM, Wesley Eddy <wes@mti-systems.com>
> > wrote:
> > 
> > Hello, ticket #29 for the L4S documents is about classic bottleneck 
> > detection misidentifying L4S queues as classic ECN queues.
> > 
> > https://trac.ietf.org/trac/tsvwg/ticket/29
> > 
> > In contrast to other issues, it doesn't seem like this should block 
> > a WGLC on the L4S drafts.
> > 
> > 	• It is specific to classic bottleneck detection algorithm, which 
> > is planned to be worked on in the Prague ICCRG draft.
> > 	• The result is sometimes failing to achieve the best possible L4S 
> > behavior, but doesn't seem to be an Internet safety issue.  This 
> > resulting in people turning off classic bottleneck detection would 
> > be a different issue, and something maybe the operator guidelines 
> > would address.
> > 	• It seems like it can be worked on further in the course of L4S 
> > experimentation, without negative effects to others.
> > So, I believe we should track this work in the ICCRG, and close the 
> > ticket here.  Please let me know in the next week if I've 
> > misunderstood any aspect of this and it should remain open.
> > 
> >