Re: [tsvwg] New rev of draft-ietf-tsvwg-ecn-l4s-id-17

"De Schepper, Koen (Nokia - BE/Antwerp)" <koen.de_schepper@nokia-bell-labs.com> Thu, 27 May 2021 08:46 UTC

Return-Path: <koen.de_schepper@nokia-bell-labs.com>
X-Original-To: tsvwg@ietfa.amsl.com
Delivered-To: tsvwg@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 5A30F3A1412 for <tsvwg@ietfa.amsl.com>; Thu, 27 May 2021 01:46:57 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.598
X-Spam-Level:
X-Spam-Status: No, score=-2.598 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIMWL_WL_HIGH=-0.698, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, HTML_MESSAGE=0.001, RCVD_IN_DNSWL_NONE=-0.0001, RCVD_IN_MSPIKE_H2=-0.001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (1024-bit key) header.d=nokia.onmicrosoft.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id Mi9y8Jn3Ph3N for <tsvwg@ietfa.amsl.com>; Thu, 27 May 2021 01:46:51 -0700 (PDT)
Received: from EUR03-VE1-obe.outbound.protection.outlook.com (mail-eopbgr50109.outbound.protection.outlook.com [40.107.5.109]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 64E7D3A1417 for <tsvwg@ietf.org>; Thu, 27 May 2021 01:46:51 -0700 (PDT)
ARC-Seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=Sj0FM2eTzmcPcZ/2Q5j22gRJ0aCKAO5uwzrDl9EcZuUXOVapepZ9p/E1KoNSByQR6ICofkXsBoeLrrlF5f2RSVHa8SZ6l8MdlulWKUzqCKhLinxseksacxX4WS0xPcC6XzHkA7AKiKnLiuzyAz010ZTki34mZ3ubHIl0iOvRnDzfEBGR3WLUjWPGhRY6e2nVLKutvn3ZAmfXYhcrvKZBgXZXSzNTbRMnRWx758m6+9s1Y1LtZxCzkaZ92/GUrawyhnPvsiXalO4b3YZvgfTOLkXUSiMct2iKCC1NdTkcy6DeYzukA0q/cNwyBghh4jGhg3z+BeJHyk1l03lrmbDqWw==
ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=aQxPecPdWS7EUSSPrRQhAFox8NrZvWKlRKz68J/u1Ng=; b=Y964UCQF0yxKZQq6v/e/fnI4MaadX3J+97SkO9MtBgivznugW09qXgbLfo44oqKHVCRYVA2zXV/zAeQPGXp+/X9HU+k/Z2zGlF68FLQjbu5L8jUERQJ3XAxrs3quMyClAR1u46a4Ix0uMJCmY47WhOMWBKVl9PFFoGcWK6a7vdLwJqNs934dsqOa47NM0HGfCZ0fOxIN1SsrdaWf8YZ7klY1umGLhDwJZAllcEulEzwI2BLvfizi38zJwmUwNkshV8hE5YrlH+j4aZ5DuZgvzVxUT90+st2IFdD72gjRaRBXXn3nVOovgu1JXTeb7R3IcONbYLZ4uGnGORqxu7UKeA==
ARC-Authentication-Results: i=1; mx.microsoft.com 1; spf=pass smtp.mailfrom=nokia-bell-labs.com; dmarc=pass action=none header.from=nokia-bell-labs.com; dkim=pass header.d=nokia-bell-labs.com; arc=none
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=nokia.onmicrosoft.com; s=selector1-nokia-onmicrosoft-com; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=aQxPecPdWS7EUSSPrRQhAFox8NrZvWKlRKz68J/u1Ng=; b=udl9kABIjs4TsoJPCW3hXF57K0ebrhnXaAqtuKnrQ+kNB2EyBdoXxoQpXmXktqBtrrZXgkufppb16CLBHl77wfTLTnlPYu7UxRMjPCG6SAzmqfpQr0/Qtusfv+glvNUVgHH9kexjI6zOBpSoDu4xqXFPNvDzEl5PnGZWGRQ8Dlo=
Received: from AM9PR07MB7313.eurprd07.prod.outlook.com (2603:10a6:20b:2c6::19) by AM0PR07MB4450.eurprd07.prod.outlook.com (2603:10a6:208:7a::11) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.4173.12; Thu, 27 May 2021 08:46:47 +0000
Received: from AM9PR07MB7313.eurprd07.prod.outlook.com ([fe80::6103:ea49:e70b:2823]) by AM9PR07MB7313.eurprd07.prod.outlook.com ([fe80::6103:ea49:e70b:2823%7]) with mapi id 15.20.4173.019; Thu, 27 May 2021 08:46:47 +0000
From: "De Schepper, Koen (Nokia - BE/Antwerp)" <koen.de_schepper@nokia-bell-labs.com>
To: "Black, David" <David.Black@dell.com>, Bob Briscoe <ietf@bobbriscoe.net>, Gorry Fairhurst <gorry@erg.abdn.ac.uk>, Wesley Eddy <wes@mti-systems.com>
CC: "tsvwg@ietf.org" <tsvwg@ietf.org>
Thread-Topic: [tsvwg] New rev of draft-ietf-tsvwg-ecn-l4s-id-17
Thread-Index: AQHXTiKvyj2WelD0+U2QfUvSFJfgeqruc7QAgAAJ44CAAyQfgIADCVmAgAFeXgCAACaDAIAA2K2w
Date: Thu, 27 May 2021 08:46:47 +0000
Message-ID: <AM9PR07MB73130BB0B4DA87986065C869B9239@AM9PR07MB7313.eurprd07.prod.outlook.com>
References: <162158815765.22731.15608328324211025925@ietfa.amsl.com> <f8ed1105-d1db-55ce-eb1f-00de8a83b0e8@bobbriscoe.net> <3F147A3D-BD68-4F0A-89FF-9A92284FF0A5@gmx.de> <MN2PR19MB404590F453EF73E681A7F17D83299@MN2PR19MB4045.namprd19.prod.outlook.com> <0b7edf59-5bce-3189-8745-324083c98ce4@bobbriscoe.net> <MN2PR19MB4045C6483D844FD602A543F083259@MN2PR19MB4045.namprd19.prod.outlook.com> <e5e965b1-31f9-3780-be6a-2003e2e8a0bc@bobbriscoe.net> <MN2PR19MB40455B5E0706806B3FCDE18C83249@MN2PR19MB4045.namprd19.prod.outlook.com>
In-Reply-To: <MN2PR19MB40455B5E0706806B3FCDE18C83249@MN2PR19MB4045.namprd19.prod.outlook.com>
Accept-Language: nl-BE, en-US
Content-Language: en-US
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
msip_labels: MSIP_Label_17cb76b2-10b8-4fe1-93d4-2202842406cd_Enabled=true; MSIP_Label_17cb76b2-10b8-4fe1-93d4-2202842406cd_SetDate=2021-05-26T19:15:36.4193357Z; MSIP_Label_17cb76b2-10b8-4fe1-93d4-2202842406cd_Method=Privileged; MSIP_Label_17cb76b2-10b8-4fe1-93d4-2202842406cd_Name=External Public; MSIP_Label_17cb76b2-10b8-4fe1-93d4-2202842406cd_SiteId=945c199a-83a2-4e80-9f8c-5a91be5752dd; MSIP_Label_17cb76b2-10b8-4fe1-93d4-2202842406cd_ActionId=bd97b601-7099-435e-8832-beac06717f27; MSIP_Label_17cb76b2-10b8-4fe1-93d4-2202842406cd_Owner=david.black@emc.com; MSIP_Label_17cb76b2-10b8-4fe1-93d4-2202842406cd_Application=Microsoft Azure Information Protection; MSIP_Label_17cb76b2-10b8-4fe1-93d4-2202842406cd_ContentBits=0
authentication-results: dell.com; dkim=none (message not signed) header.d=none;dell.com; dmarc=none action=none header.from=nokia-bell-labs.com;
x-originating-ip: [2a02:1810:1e00:cb00:3517:1f17:9711:e5e8]
x-ms-publictraffictype: Email
x-ms-office365-filtering-correlation-id: ef066976-bc4f-4897-ae6b-08d920ebf6bc
x-ms-traffictypediagnostic: AM0PR07MB4450:
x-microsoft-antispam-prvs: <AM0PR07MB44507BCDD60C27178D36F09AB9239@AM0PR07MB4450.eurprd07.prod.outlook.com>
x-ms-oob-tlc-oobclassifiers: OLM:7691;
x-ms-exchange-senderadcheck: 1
x-microsoft-antispam: BCL:0;
x-microsoft-antispam-message-info: aagu51x2DfA4WYaAHksdhmaSSAxzLN0L3t6B5VT8suo7L/5tOCBK8wGCyZMuPyqL5jfAEbv+FkhWga7bHF/rdlUa7wsQBoU1t+Pn6qOXYaXceSZZyShbg/Gv6khTIUiLFBMuWi4JZXo3UBGOXDZXy1O/cCswtjFGM1Xow09x+ROO4iduYPasfsynK9IflB5JHdb6MscpTtMy11yV/KHAjKtB8cl/rRwol/bTHqYeplLXptSRunq8tSvemA/PjWvJ7zfKhzXln2ko9bUtYujxxN+UkK4y+WOLBVXcWPo78BkyDyVq3bJXvmrJLAf1o53Vzg6G2shOQz0jk+KNyHtbwj+4X6TRqCodVc/aaTOPSc77ilfQG75mclD+D+z3XXUkQs1CQfxlkV0N/4xWbhwkD31r1h5dT1yDneywKgTbgy9rKhzpzQWvRF4XeUBtUURnAHp2n2HoGu7KO74mzLtEgWXylFCh6dRLOBw9WLORTVDrJpYsiXAfm7+AjnII/QmqdNztxPjL/itd9yGk8o2z3sofd9Bf2IPiMjbKWM0mdQTx4sFvzxL0JDLep/GOMSEoaWEYeSLWr+8P6WH8ybjWsZRflzNJ9IDY2QdgxMl4A2JV3ApDOxuKhi1REWmfjIuAwKr75G1Wz7QMR/ZDNuAuuNEF6E52GXJ6QAOoMRbMHH+2LGaYDufn3EYxGBVbe/CwJzMm0WxYZyix1UCCRRqw8sGi18+EGieKtUMNkd6aC9DrVlS5dXovmVcHpoFqC5GG
x-forefront-antispam-report: CIP:255.255.255.255; CTRY:; LANG:en; SCL:1; SRV:; IPV:NLI; SFV:NSPM; H:AM9PR07MB7313.eurprd07.prod.outlook.com; PTR:; CAT:NONE; SFS:(4636009)(136003)(39860400002)(366004)(376002)(396003)(346002)(166002)(38100700002)(55016002)(122000001)(66574015)(478600001)(966005)(83380400001)(30864003)(9686003)(7696005)(186003)(64756008)(66476007)(316002)(66446008)(86362001)(66946007)(76116006)(71200400001)(53546011)(52536014)(6506007)(33656002)(5660300002)(110136005)(8676002)(2906002)(66556008)(8936002)(4326008); DIR:OUT; SFP:1102;
x-ms-exchange-antispam-messagedata: KTh6b9j8Cr45wtsJEzcV5us4IHi3U61ldUA0PmblX/F0gOoEM9kBHp10d5FfEsrZ7OKxVMSgcemR0BMMPJUd5wb3wTks1/E6LEdMtIKOQGSEbHsjWKud2QEPwZOcxujHjTPkXqWL0g23Azv22/sMINB/WxsMEdLqf3EwGLLx6Updk1WFvf/2ox3QI17YQ8z4kK8aqzUE/AsWvc3YsTKb97mPSJresabu1Gkk4YcEmjG0WlT91LCxXxc1ez0pG1UOKHCOAH5p/aPcjBd5LP7/Z9/JRLbJE8P+l2CA7K52PaaRnegd3Oz0j+0UcDOjSYZhZ8eneQiSvcUk97xHckEeOI5aKVt+gGQj8tL2Z4+BqFvDF0gsbe6taUlpJcvPpYyanKvdsP6buNvEfOwFxzHRA2gTuSa4PIVAmUJq0UckuY5+zLKCQ5tt0aDzOWMAgF9AunAus7a12ELSDhk6Rvj0zLxJsa1mY/tvO1s9HOuRSSzOKk4P9ENIn/67NSsnBNNGaFWtTtWxNcFaCNfRsshggShcBnZ0L7Lg8eEA7ptjcGVOWTwtIrtOx+ckF5736bJ7+mOtcaCLSY7+OuoZd3oK7ZVhwVlozTf5bg21SAFiejfkGEdvsUQe93ezttP4L8xHLm/8vuWAKVoNd8DUitHhwAof5XlQmb3eYdNAbcxtCUZpqLWrZTV/M5tAPvvz+AdgeXX74L8EtoD21bj/7eDSq6fH2SLI0Zpcc0uJ9ZhLOuuYysXVlwNAsN5cI8wPd+Xb9enzN33zluV/EUu8umB0DJsy+2zbfqPXkI7NnO31AYy2vi8Ln5dPVpaZvF02zP0gR1bQrHwRLrQXeJSigGOYC839KoBKFf733AbgXNOCZbYSJQGXqb58l3jITIki5TvbDvsFfO1NApmpRWxxPkKSB6PIJKbgq++mVNYJNB+OHTtWSXdVYrWyJezBT864HXrUwAyXkTjB0UlArNmUrIb9uO55zqbA0UYZPmz0lcXo95RJSiAPsEzp8MfzJcUQ0BxtO+9oEF/tsXz6A7hT4ioQU2Iux8P+CjE83fet2m5Fo/iM+QYFPdACe4npIzjxdnZyVXFMZqhYaSumkst97jtMTh7d0ZyTO3u6Ax3NbLeHFZ4POZLJm1B6SJaUCWtZukWDYxUuZmBCO02mSBme5o2PutVG2B+T2DUHQTL+zuwmLYBmK11Y+1cfRI7bb3ukNHi2pSuRJ/arEA+0r9LR4zscruN4uGVFjYOKqy8x2aLG8tsVyBhnUaX/LU1/9DILC9rBdYoQzN9qEcATkJV/SWQz2pTeUtxeE5F4NQcUBWrf6X2vQspaZ622CkwGpJciNewRCTaaWpLJpC4NW1idLbt3XnEtpL3wzS8yfIpPLPNxIZXUykuYa/qvae1VvlLczOpi
x-ms-exchange-transport-forked: True
Content-Type: multipart/alternative; boundary="_000_AM9PR07MB73130BB0B4DA87986065C869B9239AM9PR07MB7313eurp_"
MIME-Version: 1.0
X-OriginatorOrg: nokia-bell-labs.com
X-MS-Exchange-CrossTenant-AuthAs: Internal
X-MS-Exchange-CrossTenant-AuthSource: AM9PR07MB7313.eurprd07.prod.outlook.com
X-MS-Exchange-CrossTenant-Network-Message-Id: ef066976-bc4f-4897-ae6b-08d920ebf6bc
X-MS-Exchange-CrossTenant-originalarrivaltime: 27 May 2021 08:46:47.7892 (UTC)
X-MS-Exchange-CrossTenant-fromentityheader: Hosted
X-MS-Exchange-CrossTenant-id: 5d471751-9675-428d-917b-70f44f9630b0
X-MS-Exchange-CrossTenant-mailboxtype: HOSTED
X-MS-Exchange-CrossTenant-userprincipalname: 2Xb3UeqW3OgDTKJ5jvguyjRFEVTgdGxXiosg1gvpw1/pVhKUUP8GnzCge6BJMCyauP2bRHWg08SYCyHeTjpKe7FvFu5epe63bSnEoxXlWQldZRQCzeksjJMHZC3cOYWe
X-MS-Exchange-Transport-CrossTenantHeadersStamped: AM0PR07MB4450
Archived-At: <https://mailarchive.ietf.org/arch/msg/tsvwg/lrBUYJBY9P7LY4iCQe3Aqvl_MfM>
Subject: Re: [tsvwg] New rev of draft-ietf-tsvwg-ecn-l4s-id-17
X-BeenThere: tsvwg@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: Transport Area Working Group <tsvwg.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/tsvwg>, <mailto:tsvwg-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/tsvwg/>
List-Post: <mailto:tsvwg@ietf.org>
List-Help: <mailto:tsvwg-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/tsvwg>, <mailto:tsvwg-request@ietf.org?subject=subscribe>
X-List-Received-Date: Thu, 27 May 2021 08:46:58 -0000

I agree that larger (adaptive) replay window is the main solution, and removes all other needs. The adaptive window could easily grow based on the gap that an early packet is creating, allowing the next packets to fill the gap (optionally shrinking the window afterwards back to gradually smaller values).

As an alternative, another smarter reordering-safe mechanism would help as well. I think a symmetric window that also drops (or “keeps aside without moving the window”) sporadic early packets, will perform better… It would drop (or delay) only the expedited CE(s).

This all avoids the need for different SA’s and/or DCSPs…

Koen.

From: tsvwg <tsvwg-bounces@ietf.org> On Behalf Of Black, David
Sent: Wednesday, May 26, 2021 9:34 PM
To: Bob Briscoe <ietf@bobbriscoe.net>; Gorry Fairhurst <gorry@erg.abdn.ac.uk>; Wesley Eddy <wes@mti-systems.com>
Cc: tsvwg@ietf.org
Subject: Re: [tsvwg] New rev of draft-ietf-tsvwg-ecn-l4s-id-17

Inline … --David

From: Bob Briscoe <ietf@bobbriscoe.net<mailto:ietf@bobbriscoe.net>>
Sent: Wednesday, May 26, 2021 1:16 PM
To: Black, David; Gorry Fairhurst; Wesley Eddy
Cc: tsvwg@ietf.org<mailto:tsvwg@ietf.org>
Subject: Re: [tsvwg] New rev of draft-ietf-tsvwg-ecn-l4s-id-17


[EXTERNAL EMAIL]
David,
On 25/05/2021 21:22, Black, David wrote:
Two comments:

[1] This VPN discussion appears to be assuming a remote access VPN, so it does not align well with site-to-site VPNs:

> This particular path divergence occurs iff an ECT0 flow is CE-marked /prior/ to the VPN ingress:
> * In general where the VPN ingress is on the end-system (transport mode) that doesn't happen.
> * There is however a chance that CE could be introduced before a 'bump in the wire' VPN ingress (tunnel mode).

For site-to-site VPNs, the last line above is a significant understatement.  A scenario that helps to understand this is ROBO, specifically a sizeable remote office or branch office such as a large retail store, where the remote or branch VPN gateway is a potential bottleneck node for outbound traffic courtesy of fan-in from multiple end systems and the LAN having more bandwidth than the (purchased) VPN bandwidth.

[BB] Wouldn't that particular bottleneck apply marking at the egress of the VPN gateway after encap?
[David>] That depends on the details of how the VPN is integrated with the network, e.g., if queue is upstream of encap.

[2] I think the whole approach of parallel SAs is not a good idea – it would be better to recommend larger replay windows, perhaps much larger, and/or use DSCPs to direct traffic to parallel SAs (which would disambiguate CE across L4S and non-L4S traffic).

[BB] You haven't given any reasons.

[David>] Sure:

  *   See RFC 7657 for general discussion about why variable QoS treatment within a single reliable transport protocol session is not a good idea – that discussion is couched in Diffserv terms, but it's applicable to L4S ECN-based service differentiation.
  *   Asking VPNs to classify traffic based on ECN field values is also asking for the possibility of discard of traffic marked with some of those values, which is not a good place to go given the desire for the ECN field to remain end-to-end.
  *   Using DSCPs is a "reduce to previous case" approach that doesn't involve modification of any security specs and aligns with existing VPN implementations that are cognizant of DSCPs.
  *   Larger replay windows are a lower-effort approach that ought to apply to deployed VPNs whose replay window is configurable.  In practice, this is the place to start if L4S causes VPN problems.


Bob


Thanks, --David

From: Bob Briscoe <ietf@bobbriscoe.net><mailto:ietf@bobbriscoe.net>
Sent: Sunday, May 23, 2021 6:00 PM
To: Black, David; Gorry Fairhurst; Wesley Eddy
Cc: tsvwg@ietf.org<mailto:tsvwg@ietf.org>
Subject: Re: [tsvwg] New rev of draft-ietf-tsvwg-ecn-l4s-id-17


[EXTERNAL EMAIL]
David,
See inline, tagged [BB]...
On 21/05/2021 23:02, Black, David wrote:

section 6.2 with its, "use two SAs, one for ECT(1) and one for the rest" seems a bit limited



The situation is more severe than that, and it's problematic if there's any non-L4S TCP traffic that uses ECN involved.  The two SAs are actually for 2 ECN values each:



   To avoid this limitation, a VPN ingress participating in the L4S

   experiment SHOULD map packets onto two parallel SAs indexed by the

   lowest significant bit of the IP-ECN field.



That indexing puts Not-ECT [00b] and ECT(0) [10b] packets into one SA, and puts ECT(1) [01b] and CE [11b] packets into another.  Non-L4S TCP uses ECT(0) and CE, hence a flow that carries any congestion indications is split across those two SAs.  ECMP based on SPI (deployed technology) that puts those parallel SAs on different network paths is not assured to preserve packet order across those SAs, and packet reordering is not a good thing to do to a non-L4S TCP flow.

[BB] Yes, this would be a potential problem.

This particular path divergence occurs iff an ECT0 flow is CE-marked /prior/ to the VPN ingress:
* In general where the VPN ingress is on the end-system (transport mode) that doesn't happen.
* There is however a chance that CE could be introduced before a 'bump in the wire' VPN ingress (tunnel mode).

For a VPN participating in the L4S experiment with two SAs, if there was any CE before the ingress, we did think carefully about which way to classify it:
* If CE goes with ECT0, then if a subsequent L4S node gives ECT0 more queue delay than CE, the VPN anti-replay function could discard some ECT0 packets.
* If CE goes with ECT1, it avoids anti-replay discard completely. We had tried to think of possible problems with splitting Classic flows across SAs,  but we hadn't thought of your point where ECMP on the SPI makes CE and ECT0 from the same flow taking different paths.

I believe it is still best to classify CE with ECT1, based on the following reasoning. I think the scenario would occur with very low probability {Note 1}, and if it did, the CE might either arrive a little earlier or a little later than data packets around them.
* If earlier, the benefit of ECN would not be lost, but there would be an extremely small chance (p_s * p_r * p_N) of an occasional spurious re-xmt {Note 2}.
* If later, the chance that a delayed CE would be mistaken for a drop and trigger a spurious re-xmt and a congestion response would probably be higher, but still very small (p_s * p_r * p_l) {Note 3}. That would lose the benefit of ECN but, at least for Classic flows, a single CE is meant to trigger a large congestion response as if it was a loss anyway.

I don't want to belittle the point you've made, because this is certainly a new problem. However, I do think the probabilities need to be put in perspective.

I suspect the ECMP problem with any ECN (3168 or L4S) is likely to be a greater concern. You might recall that João Taveira from Fastly pointed this out during a tsvwg discussion on ECN back in 2017. This link should jump to 14:25 mins into a 2017 NANOG talk from Lorenzo Saino of Fastly about how they had to disable ECN negotiation when Apple clients started to request ECN in 2015 - because a number of different ECMP vendors still hash on the ToS byte for ECMP and load balancing, so the data would have been routed to a different server from the handshake:
    https://youtu.be/ciClZdwHelU?t=805 [youtu.be]<https://urldefense.com/v3/__https:/youtu.be/ciClZdwHelU?t=805__;!!LpKI!y0AavGemYqNsy2Urub76LQ2eIuIinYWTadbh2ddv0r80YQabjQCK8HywtmUFhVbc$>

Regards



Bob

{Note 1}: Reasoning for scenario occurring with low probability (we'll call it p_s):
Usually, an institution provides a tunnel-mode VPN between physically secured networks (e.g. between their intranet and an employee's home laptop). AQM deployment would be unusual in the interior of corporate and institutional intranets, because the bottleneck is more likely elsewhere (e.g. in the access link between the site and the Internet). However, there are bound to be some cases of tunnel-mode VPNs where the traffic arriving could have already been ECN-marked (for instance Jonathan Morton's example of gamers providing a VPN end-point on the public Internet to hide their own IP address).
Then, to experience this problem, there also has to be some load balancing or ECMP after the VPN ingress but before the egress. I'm sure that will occur sometime somewhere (let's say with probability p_r). The likelihood of the scenario occurring will then be p_s * p_r.

{Note 2}:
* N packets in a row within a Classic flow have to be CE-marked for early CE packets to result in a spurious re-xmt, let's say that happens with probability p_N. As already explained in Appx B of ecn-l4s-id, N was traditionally 3, but RACK is making it larger. And the main sources of ECN marking for Classic flows are FQ_CoDel and CAKE, both of which take great care not to mark multiple packets in a row. So p_N is going to be tiny.
And the probability of a spurious re-xmt with early reordering will be the vanishingly small product (p_s * p_r * p_N).

{Note 3}:
* Let's say p_l (for late) is the probability that the reordering between the two SAs is sufficient to make each CE packet arrive late enough  to be deemed a loss. p_l is unlikely to be as small as p_N, but the overall probability of this occurring is still reduced because the probability that CE marking precedes that VPN (p_s), and that there's routing keyed on flow IDs and SPIs (p_r). So the overall probability of a spurious rexmt due to late reordering is (p_s * p_r * p_l).



Bob





Thanks, --David



-----Original Message-----

From: Sebastian Moeller <moeller0@gmx.de><mailto:moeller0@gmx.de>

Sent: Friday, May 21, 2021 5:27 PM

To: Bob Briscoe

Cc: Gorry Fairhurst; Black, David; Wesley Eddy; tsvwg@ietf.org<mailto:tsvwg@ietf.org>

Subject: Re: [tsvwg] New rev of draft-ietf-tsvwg-ecn-l4s-id-17





[EXTERNAL EMAIL]



Bob, chairs,



section 6.2 with its, "use two SAs, one for ECT(1) and one for the rest" seems a bit limited since it ignores that VPNs might propagate both DSCPs and ECN bits between the layers, so IMHO a better approach might be to recommend to treat DSCP+ECN bits as one aggregate byte (let's cal it TOS ;) ) as the extra ECT(1)-SA seems to be required for all SAs that already exist to deal with multiple supported DSCPs. So in a sense the recommendation would be to double the number of SAs.

[BB] Yes, we ought to reword it to say that the VPN ingress should use two SAs indexed on the LSB of the ECN field, and, if it was also classifying on DSCPs, it could also consider classifying any low latency DSCP(s) with the L4S packets. To avoid the anti-replay problem, there would only need to be one SA configured per each degree of queuing delay, not one for every ECN x DSCP combination.







Also:

"and the current draft of DTLS 1.3 says "The receiver

     SHOULD pick a window large enough to handle any plausible reordering,

     which depends on the data rate."  However, in practice, the size of

     the VPN's anti-replay window is not always scaled appropriately."



L4S on a 10 ms path under load can introduce re-ordering in the range of 50 ms (roughly twice the difference between the L- and C-queue delay targets), re-ordering tolerance 5 times of the path RTT seems to be a bit on the high side to expect, no?

[BB] The above text that I quoted from the DTLS spec. is reasonable, both practically (see below) and in terms of taking responsibility for the problem. Beyond its window, the anti-replay function presumes a packet is guilty of a replay attack with no evidence, purely because it chooses not to hold that amount of evidence. Therefore it's proper that it holds a sufficient window of evidence for any plausible reordering.

BTW, the C-queue target has never been 25ms. I noticed JM said that incorrectly as well recently.
* A default C queue delay target of 15ms has always been recommended in aqm-dualq-coupled. That results in PI2 Qdelay of about 25ms at the 99%ile or 30ms at the 99.9%ile. We have been considering whether to change the default target to 10ms for some time, but not done so yet.
* Low Latency DOCSIS specifies a default C queue delay target of 10ms.

So a replay window allowing for 30ms of packets at the interface rate would be sufficient.
At 1Gb/s (say) using 1500B packets, that's a replay window of 2500 packets.

Quoting Pete Heist's info here https://github.com/heistp/l4s-tests/#dropped-packets-for-tunnels-with-replay-protection-enabled [github.com]<https://urldefense.com/v3/__https:/github.com/heistp/l4s-tests/*dropped-packets-for-tunnels-with-replay-protection-enabled__;Iw!!LpKI!y0AavGemYqNsy2Urub76LQ2eIuIinYWTadbh2ddv0r80YQabjQCK8Hywtt-9Grg1$> :
"Modern Linux kernels have a default maximum replay window size of 4096 (XFRMA_REPLAY_ESN_MAX in xfrm.h [elixir.bootlin.com]<https://urldefense.com/v3/__https:/elixir.bootlin.com/linux/latest/source/include/uapi/linux/xfrm.h__;!!LpKI!y0AavGemYqNsy2Urub76LQ2eIuIinYWTadbh2ddv0r80YQabjQCK8Hywtu2jvbJy$>). Wireguard uses a hardcoded value of 8192 with no option for runtime configuration, increased from 2048 in May 2020 by this commit [git.zx2c4.com]<https://urldefense.com/v3/__https:/git.zx2c4.com/wireguard-linux/commit/drivers/net/wireguard?id=c78a0b4a78839d572d8a80f6a62221c0d7843135__;!!LpKI!y0AavGemYqNsy2Urub76LQ2eIuIinYWTadbh2ddv0r80YQabjQCK8HywtrpY39m8$>."
Regards



Bob













Regards

  Sebastian





On May 21, 2021, at 11:21, Bob Briscoe <ietf@bobbriscoe.net><mailto:ietf@bobbriscoe.net> wrote:



Chairs, list,



We've posted a new rev of draft-ietf-tsvwg-ecn-l4s-id-17 attempting to address all the discussion since the last posting just before the interim. In particular:

* review comments on a careful read from Gorry and the chairs

* the VPN anti-replay problem

* added an out-of-band test for an RFC3168 ECN AQM in a shared queue.



There are a couple of outstanding discussions, which I'm sure will continue on the list, e.g. the role of RFC4774 and whether to remove any of Appx C. But it was considered better to get the queued up changes out, to re-base the discussions.



This is quite an extensive set of changes, so pls check and pass any comments to the list.



Thanks for everyone who is contributing, and particularly to the chairs for continuing to referee this all. We've added appropriate thanks in the Acks section.





Bob





On 21/05/2021 10:09, internet-drafts@ietf.org<mailto:internet-drafts@ietf.org> wrote:

A New Internet-Draft is available from the on-line Internet-Drafts directories.

This draft is a work item of the Transport Area Working Group WG of the IETF.



        Title           : Explicit Congestion Notification (ECN) Protocol for Very Low Queuing Delay (L4S)

        Authors         : Koen De Schepper

                          Bob Briscoe

 Filename        : draft-ietf-tsvwg-ecn-l4s-id-17.txt

 Pages           : 57

 Date            : 2021-05-21



Abstract:

   This specification defines the protocol to be used for a new network

   service called low latency, low loss and scalable throughput (L4S).

   L4S uses an Explicit Congestion Notification (ECN) scheme at the IP

   layer that is similar to the original (or 'Classic') ECN approach,

   except as specified within.  L4S uses 'scalable' congestion control,

   which induces much more frequent control signals from the network and

   it responds to them with much more fine-grained adjustments, so that

   very low (typically sub-millisecond on average) and consistently low

   queuing delay becomes possible for L4S traffic without compromising

   link utilization.  Thus even capacity-seeking (TCP-like) traffic can

   have high bandwidth and very low delay at the same time, even during

   periods of high traffic load.



   The L4S identifier defined in this document distinguishes L4S from

   'Classic' (e.g. TCP-Reno-friendly) traffic.  It gives an incremental

   migration path so that suitably modified network bottlenecks can

   distinguish and isolate existing traffic that still follows the

   Classic behaviour, to prevent it degrading the low queuing delay and

   low loss of L4S traffic.  This specification defines the rules that

   L4S transports and network elements need to follow with the intention

   that L4S flows neither harm each other's performance nor that of

   Classic traffic.  Examples of new active queue management (AQM)

   marking algorithms and examples of new transports (whether TCP-like

   or real-time) are specified separately.





The IETF datatracker status page for this draft is:

https://urldefense.com/v3/__https://datatracker.ietf.org/doc/draft-ietf-tsvwg-ecn-l4s-id/__;!!LpKI!xeGDaVTRL33QRdX3Tos2AURXirtYtZXHcEP8W5a6OO_m4gWSHU68p06V9qj70HfR$<https://urldefense.com/v3/__https:/datatracker.ietf.org/doc/draft-ietf-tsvwg-ecn-l4s-id/__;!!LpKI!xeGDaVTRL33QRdX3Tos2AURXirtYtZXHcEP8W5a6OO_m4gWSHU68p06V9qj70HfR$> [datatracker[.]ietf[.]org]



There is also an htmlized version available at:

https://urldefense.com/v3/__https://datatracker.ietf.org/doc/html/draft-ietf-tsvwg-ecn-l4s-id-17__;!!LpKI!xeGDaVTRL33QRdX3Tos2AURXirtYtZXHcEP8W5a6OO_m4gWSHU68p06V9vC50L2Y$<https://urldefense.com/v3/__https:/datatracker.ietf.org/doc/html/draft-ietf-tsvwg-ecn-l4s-id-17__;!!LpKI!xeGDaVTRL33QRdX3Tos2AURXirtYtZXHcEP8W5a6OO_m4gWSHU68p06V9vC50L2Y$> [datatracker[.]ietf[.]org]



A diff from the previous version is available at:

https://urldefense.com/v3/__https://www.ietf.org/rfcdiff?url2=draft-ietf-tsvwg-ecn-l4s-id-17__;!!LpKI!xeGDaVTRL33QRdX3Tos2AURXirtYtZXHcEP8W5a6OO_m4gWSHU68p06V9oZoDRib$<https://urldefense.com/v3/__https:/www.ietf.org/rfcdiff?url2=draft-ietf-tsvwg-ecn-l4s-id-17__;!!LpKI!xeGDaVTRL33QRdX3Tos2AURXirtYtZXHcEP8W5a6OO_m4gWSHU68p06V9oZoDRib$> [ietf[.]org]





Internet-Drafts are also available by anonymous FTP at:

https://urldefense.com/v3/__ftp://ftp.ietf.org/internet-drafts/__;!!LpKI!xeGDaVTRL33QRdX3Tos2AURXirtYtZXHcEP8W5a6OO_m4gWSHU68p06V9rKXcwvA$<https://urldefense.com/v3/__ftp:/ftp.ietf.org/internet-drafts/__;!!LpKI!xeGDaVTRL33QRdX3Tos2AURXirtYtZXHcEP8W5a6OO_m4gWSHU68p06V9rKXcwvA$> [ftp[.]ietf[.]org]







--

________________________________________________________________

Bob Briscoe                               https://urldefense.com/v3/__http://bobbriscoe.net/__;!!LpKI!xeGDaVTRL33QRdX3Tos2AURXirtYtZXHcEP8W5a6OO_m4gWSHU68p06V9gj_uPXC$<https://urldefense.com/v3/__http:/bobbriscoe.net/__;!!LpKI!xeGDaVTRL33QRdX3Tos2AURXirtYtZXHcEP8W5a6OO_m4gWSHU68p06V9gj_uPXC$> [bobbriscoe[.]net]







--

________________________________________________________________

Bob Briscoe                               http://bobbriscoe.net/ [bobbriscoe.net]<https://urldefense.com/v3/__http:/bobbriscoe.net/__;!!LpKI!y0AavGemYqNsy2Urub76LQ2eIuIinYWTadbh2ddv0r80YQabjQCK8HywtoRgTac-$>


--

________________________________________________________________

Bob Briscoe                               http://bobbriscoe.net/ [bobbriscoe.net]<https://urldefense.com/v3/__http:/bobbriscoe.net/__;!!LpKI!z5o0nAykh9F3vkOXqwkB6EByr7rXTsxWZsk8cnefqdJrD1IFy5L0W8qh71SgIKm1$>