Re: [tsvwg] ECN encapsulation draft - proposed resolution

Jonathan Morton <chromatix99@gmail.com> Fri, 11 June 2021 12:13 UTC

Return-Path: <chromatix99@gmail.com>
X-Original-To: tsvwg@ietfa.amsl.com
Delivered-To: tsvwg@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id F39803A357B for <tsvwg@ietfa.amsl.com>; Fri, 11 Jun 2021 05:13:11 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.847
X-Spam-Level:
X-Spam-Status: No, score=-1.847 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, FREEMAIL_ENVFROM_END_DIGIT=0.25, FREEMAIL_FROM=0.001, RCVD_IN_DNSWL_BLOCKED=0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id jCdNJJHO1yXv for <tsvwg@ietfa.amsl.com>; Fri, 11 Jun 2021 05:13:07 -0700 (PDT)
Received: from mail-lf1-x12f.google.com (mail-lf1-x12f.google.com [IPv6:2a00:1450:4864:20::12f]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 6D50E3A3579 for <tsvwg@ietf.org>; Fri, 11 Jun 2021 05:13:07 -0700 (PDT)
Received: by mail-lf1-x12f.google.com with SMTP id i10so8251880lfj.2 for <tsvwg@ietf.org>; Fri, 11 Jun 2021 05:13:07 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:subject:from:in-reply-to:date:cc :content-transfer-encoding:message-id:references:to; bh=9flPSlzC9fhLBMCW8s8+5LmlEMLp3zYzepPbqhBR+qg=; b=kJa4UBmaJrrPUZajHJkoWYPTTF66+ETwz5MHUyZ+zrxEfoHPMVHJ46NVQcNrIvGGaF SydXIq/4MY7pie2lMjZji6KUFh1QJLHYsoJobPFCRqvVxrqjYmhMBzXyXd0pYAE+PB6j Y43/dpd4IndiMKe2fvFGE6soakGak3Gzs9CyPOprR0RKEzIBncnANw7+TtC1dNv++xgU auhqhZR5O1VjUoaxEvKZSzyQtUpcoiv+qnxDZfl29hm3G7eXaH6paOhEGm2G1FjXLbPl BsZYRwN0Fo9cpECA5cGdA77BsNfh3Zb9sy8ZzYvd0kF6ky3oKmbD/RKGqI3Uwc/EiyYn ruaw==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:subject:from:in-reply-to:date:cc :content-transfer-encoding:message-id:references:to; bh=9flPSlzC9fhLBMCW8s8+5LmlEMLp3zYzepPbqhBR+qg=; b=pKxoP/nH5gXOT/mGlxHjLq0eYV4aHS1UE8j5QslNKV0LtHLk5aRn8d8n39ZwxslJAX c2CzNfPij85ioddgEio1zNLoi8hUYHUUnK9z4dhWvu/pdgrJGTVWdtK7BE/3oZ0ReFKx YzdnVVGg5HfeSJR57JU2qcbksE/W1YTIQZicczI6r9FoUGgjJxVpBbuCFnIoFDmnoNnF UuaG2ZzMNqtSNz3pIKq2lowBsDXmiBsR1tc80mVrz7ueHS2EgBWB8fAO7vFbc99RRxbo h8N4GCdH74DShaLZYQtpUhD9Sw7O9BOOsYautDBV0hXh3D7kYYgbSRy1BEvdce5irCFw tQNQ==
X-Gm-Message-State: AOAM530F/EWupNOtxbBQZ8cjFSUPUi5M3Lo7hm1LHrJeYCaHgMW08dIV UwSHqeACK9UcRQjHJSIBQH4=
X-Google-Smtp-Source: ABdhPJw+fLQcxJgDLX9Tz4Ulb1f28CeCdbRcXU+I9jOKbQQKkLN8iKqZYh1KufuFLgwre5eeV5DlwA==
X-Received: by 2002:ac2:596a:: with SMTP id h10mr2376328lfp.360.1623413583649; Fri, 11 Jun 2021 05:13:03 -0700 (PDT)
Received: from jonathartonsmbp.lan (178-55-194-130.bb.dnainternet.fi. [178.55.194.130]) by smtp.gmail.com with ESMTPSA id d12sm575234lfs.286.2021.06.11.05.13.02 (version=TLS1_2 cipher=ECDHE-ECDSA-AES128-GCM-SHA256 bits=128/128); Fri, 11 Jun 2021 05:13:03 -0700 (PDT)
Content-Type: text/plain; charset="us-ascii"
Mime-Version: 1.0 (Mac OS X Mail 11.5 \(3445.9.7\))
From: Jonathan Morton <chromatix99@gmail.com>
In-Reply-To: <AM9PR07MB7313406CE46E1A167A473443B9349@AM9PR07MB7313.eurprd07.prod.outlook.com>
Date: Fri, 11 Jun 2021 15:13:01 +0300
Cc: Bob Briscoe <ietf@bobbriscoe.net>, "tsvwg@ietf.org" <tsvwg@ietf.org>, Markku Kojo <kojo@cs.helsinki.fi>
Content-Transfer-Encoding: quoted-printable
Message-Id: <9B8D5034-E580-4613-8DC2-6845F20FA9EA@gmail.com>
References: <MN2PR19MB40454BC50161943BC33AAAD783289@MN2PR19MB4045.namprd19.prod.outlook.com> <43e89761-d168-1eca-20ce-86aa574bd17a@bobbriscoe.net> <de8d355d-08b6-34fb-a6cc-56755c9a11ee@bobbriscoe.net> <MN2PR19MB4045DB9D2C45066AEB0762DB83259@MN2PR19MB4045.namprd19.prod.outlook.com> <alpine.DEB.2.21.2106021717300.4214@hp8x-60.cs.helsinki.fi> <290e1624-fa1e-21d7-95fb-90e284c27dd8@bobbriscoe.net> <C7509065-526C-4712-B6CD-E919910E280E@gmail.com> <AM9PR07MB7313E7797F850B210EC3A799B9369@AM9PR07MB7313.eurprd07.prod.outlook.com> <3009F41B-1D79-4B2D-BC16-8F2049EA4976@gmail.com> <AM9PR07MB731372180EBF5C815A684F3CB9359@AM9PR07MB7313.eurprd07.prod.outlook.com> <51F33C7C-3323-4ADA-98BB-16A83F763FCE@gmail.com> <AM9PR07MB7313DC2C4F922A90E2E602BFB9349@AM9PR07MB7313.eurprd07.prod.outlook.com> <272C3C12-C788-4080-9B57-C3E2DD7BC5B4@gmail.com> <AM9PR07MB7313406CE46E1A167A473443B9349@AM9PR07MB7313.eurprd07.prod.outlook.com>
To: "De Schepper, Koen (Nokia - BE/Antwerp)" <koen.de_schepper@nokia-bell-labs.com>
X-Mailer: Apple Mail (2.3445.9.7)
Archived-At: <https://mailarchive.ietf.org/arch/msg/tsvwg/n4de-ql24nrxj9rMbAylVvO_WG8>
Subject: Re: [tsvwg] ECN encapsulation draft - proposed resolution
X-BeenThere: tsvwg@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: Transport Area Working Group <tsvwg.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/tsvwg>, <mailto:tsvwg-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/tsvwg/>
List-Post: <mailto:tsvwg@ietf.org>
List-Help: <mailto:tsvwg-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/tsvwg>, <mailto:tsvwg-request@ietf.org?subject=subscribe>
X-List-Received-Date: Fri, 11 Jun 2021 12:13:12 -0000

> On 11 Jun, 2021, at 1:39 pm, De Schepper, Koen (Nokia - BE/Antwerp) <koen.de_schepper@nokia-bell-labs.com> wrote:
> 
>>> Let me turn that around and ask you what would happen if 3% of the ATM cells were *lost*.
> 
> So you are still limit your thinking in terms of CE == DROP, which I thought was accepted not to be the most optimal and useful way forward... We can and MUST use ECN in a much better way if you want ECN to get really deployed AND used...

Funnily enough, nearly all current ECN deployment is using conventional TCP congestion control (RFC-5681, RFC-8312), a time-domain marking strategy (RFC-8289) and the RFC-3168 reassembly rules, and it seems to work pretty well.  Since that is what's currently standardised in published RFCs, don't knock it until you've actually tried it.

>>> Stop thinking in terms of "probability per packet", and start thinking in bytes or seconds instead.  Everyone else is.
>>> There are two currently known AQM designs which implement this approach:
> 
> I think you are missing the point here: It is not the AQM that determines the marks. It is the end-system congestion control.

I agree, that is the correct way around to analyse the system.  Which is what I did in a previous post, using Dimensional Analysis.  That is precisely what I've been trying to get you to read and understand.

What I found, in brief, was that DCTCP has a steady state that is dimensioned in marks per second, while Reno and CUBIC have steady states dimensioned in byte-seconds per mark.  None of them are directly sensitive to packet or byte marking probability - that is just an emergent property of empirical measurement on an inappropriate assumed basis.

> If that uses the packet probability it will DRIVE the AQM to whatever it needs to converge. It is a closed loop, and the end system determines the rate/probability.

You're right that the AQM is effectively driven to the state needed to control the actual transport.  Which is why the AQM in my PPPoA example would never be driven to a 3% cell-marking probability except under extreme circumstances.  Instead, it would select a probability that resulted in a reasonable number of marks per second, or if you prefer, a reasonably large number of bytes between marks.  Codel simply operates directly in the time domain, neatly bypassing a few awkward steps on the way.

> We only can hope the AQM is responsive enough to quickly set that rate or probability. This is where CoDel clearly fails, so not a good example to use. We need to learn from our mistakes and forget "rules from the past" that are not useful...

I'll accept the Codel is not perfect.  However, it's the first AQM that matches with the problem domain well enough to deploy successfully with universal defaults, which I think accounts for its popularity.  If you continue to claim that it "fails", you're going to have to back that up with empirical data.

 - Jonathan Morton