Re: [tcpPrague] [tsvwg] ecn-l4s-id: Proposed Changed to Normative Classic ECN detection Text

"De Schepper, Koen (Nokia - BE/Antwerp)" <koen.de_schepper@nokia-bell-labs.com> Mon, 02 November 2020 12:33 UTC

Return-Path: <koen.de_schepper@nokia-bell-labs.com>
X-Original-To: tcpprague@ietfa.amsl.com
Delivered-To: tcpprague@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 5BD5B3A0E9F; Mon, 2 Nov 2020 04:33:08 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.901
X-Spam-Level:
X-Spam-Status: No, score=-1.901 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIMWL_WL_HIGH=-0.001, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, HTML_MESSAGE=0.001, RCVD_IN_MSPIKE_H2=-0.001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=unavailable autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (1024-bit key) header.d=nokia.onmicrosoft.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id xIB-2pO8WeGp; Mon, 2 Nov 2020 04:33:05 -0800 (PST)
Received: from EUR05-AM6-obe.outbound.protection.outlook.com (mail-am6eur05on2126.outbound.protection.outlook.com [40.107.22.126]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id E4CA53A0EB7; Mon, 2 Nov 2020 04:33:04 -0800 (PST)
ARC-Seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=FZi7fhfnrZC7088iDot1FkPpdV7hyfzXd3IFesBcYY4NcMqjcgk1SpTvPUQdiGI6WiuVs8yZQzSBeLlqdP3pYDQGlWIc+WOjwepdOme+gXMEk049br6mligp/Ku3JdAhTCRZfrSYA1qcAoMWWrWatmGyfrOvi/3lULk8zg7r6CawI0FkmIxsTwm0L+pN+X0AfyU5NTPGICXdiqJZgh49X6GQLB6b6UavUCcVyS2rOTP2gRfMy9GrtE3+3KA1W5cQSx3ijmw+CB/PIv5cTJkFAPpUeJuUl8KAa/GfzxpKGbJoQGC98tOheRsI5cNGCI3LkY74X5L6ZVeLZBk5zDDCOg==
ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=iYLZZHGYmhJBEXkmSEkZ0g6jr8AJevdcPBtPjYtzSjA=; b=VENqVWOyOlNteF1loOULTyO0gieVBhKPdgdTDLZaobjiLX1xvGemwFmnY9HRL6jyf7v0MMbEIel8vgFs/4ccRgUjW81ln/mlsudAsr50klaJWr0gUWi3wKsFohVwVVpEWKHqYcYKgMnudKuBGj2+bi5XFBDT8hzNkzukqLI1y12fzqFTHL7vtVHAk1sSiyP4UFCAYLZx5uZp+nNWwBOP7LsOEezoJTqNyftxnEpIvxDH+1n28Seg8EI6Dbn1TgCOnXTtAuAY1zsgmyt/h9LqpetwivsJ1d8WGtp1OjP60JuSTR5coEKmqnvjRcRb25IPGxuMt1JtMSPtEDXTbGF1Mw==
ARC-Authentication-Results: i=1; mx.microsoft.com 1; spf=pass smtp.mailfrom=nokia-bell-labs.com; dmarc=pass action=none header.from=nokia-bell-labs.com; dkim=pass header.d=nokia-bell-labs.com; arc=none
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=nokia.onmicrosoft.com; s=selector1-nokia-onmicrosoft-com; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=iYLZZHGYmhJBEXkmSEkZ0g6jr8AJevdcPBtPjYtzSjA=; b=mxRmQ37PEy7nsSl6/C+2BXXzzNo1s1c9DRsZojTecdgIs+2jLDDlJOFakEO9rEPyQSfyaz7Sh6iVmFNBKLiFNMYEBa+/TTkT205KmeWHUESBxn8P040FOGYaa1VOrwVPhhQ1hSsCg69Zfdg6oUDp4R7WpeDWRlfVadFlr5T39v8=
Received: from AM0PR07MB6114.eurprd07.prod.outlook.com (2603:10a6:208:113::33) by AM0PR07MB3940.eurprd07.prod.outlook.com (2603:10a6:208:46::20) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.3541.10; Mon, 2 Nov 2020 12:33:01 +0000
Received: from AM0PR07MB6114.eurprd07.prod.outlook.com ([fe80::a9f4:9199:8226:d6c6]) by AM0PR07MB6114.eurprd07.prod.outlook.com ([fe80::a9f4:9199:8226:d6c6%6]) with mapi id 15.20.3541.010; Mon, 2 Nov 2020 12:33:01 +0000
From: "De Schepper, Koen (Nokia - BE/Antwerp)" <koen.de_schepper@nokia-bell-labs.com>
To: Christian Huitema <huitema@huitema.net>, Jonathan Morton <chromatix99@gmail.com>
CC: iccrg IRTF list <iccrg@irtf.org>, Bob Briscoe <ietf@bobbriscoe.net>, tsvwg IETF list <tsvwg@ietf.org>, TCP Prague List <tcpPrague@ietf.org>
Thread-Topic: [tcpPrague] [tsvwg] ecn-l4s-id: Proposed Changed to Normative Classic ECN detection Text
Thread-Index: AQHWsIfwC/FrblPY50a/ND1AVr6ZVKmzwIVwgAAzdwCAAAQwgIAAuLTQ
Date: Mon, 2 Nov 2020 12:33:01 +0000
Message-ID: <AM0PR07MB6114D78B68E07B2F712E12E4B9100@AM0PR07MB6114.eurprd07.prod.outlook.com>
References: <1b71a610-75ea-e1d4-e3ce-f0ae6a4c12f7@bobbriscoe.net> <28247e5f-5df3-1f75-50e6-b4a7e80d5ab0@huitema.net> <61B6E9DC-8786-4CC0-9AC0-7F74E3529A73@gmail.com> <40751956-daa0-5693-d880-c45e8ca262cc@huitema.net> <AM0PR07MB6114083DB6F3FBA98AA2281FB9130@AM0PR07MB6114.eurprd07.prod.outlook.com> <5ba51c98-9600-a55f-5d9f-bff012eb39e4@huitema.net> <fc6f56f1-add4-fdc0-4718-012ee10102bf@huitema.net>
In-Reply-To: <fc6f56f1-add4-fdc0-4718-012ee10102bf@huitema.net>
Accept-Language: nl-BE, en-US
Content-Language: en-US
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
authentication-results: huitema.net; dkim=none (message not signed) header.d=none;huitema.net; dmarc=none action=none header.from=nokia-bell-labs.com;
x-originating-ip: [2a02:1810:1e00:cb00:106c:a742:663:cd5d]
x-ms-publictraffictype: Email
x-ms-office365-filtering-ht: Tenant
x-ms-office365-filtering-correlation-id: b9f847ca-e70a-4bf8-f867-08d87f2b7056
x-ms-traffictypediagnostic: AM0PR07MB3940:
x-microsoft-antispam-prvs: <AM0PR07MB3940E4CED7872FFBBCE69FE8B9100@AM0PR07MB3940.eurprd07.prod.outlook.com>
x-ms-oob-tlc-oobclassifiers: OLM:10000;
x-ms-exchange-senderadcheck: 1
x-microsoft-antispam: BCL:0;
x-microsoft-antispam-message-info: 4uSY6tAcv+zjuApLt2VJGUV783/AsU+iCV4NJFgWjJd2A9SvK0qdB4CwMyk0PcMQK6UJRNIIP5mhqNM5Ivofd/O8vtUc03qu/NRDpQsJXbhUyk+1u2ZAp0cFqAQuhAeNZfi7wUGqi+x2i/EhLTzYHOixJS9XDBpAv2zMB3krjhU6j3R6vZJpHStIMKI1PPSYJiwIQq6iRUktV9N6L50QeadN9pviiuoGkHxx/BrZHcJt1YswbsK+0CwvGeWF94Mxl7p/CFpZUroofGdr0pEvgdZoPmiU9JF/kSXBcJgzc3/O/Dv0ZUGJBjTLPSM7k3DmmlIornM9IL7j9wm0Gxr3tiRc5k1YiUG99dAyNgj+PRUjJ5TcYLBc2hEjAE5ysHh9g6znBFy6NrLbm2XtN9OLWg==
x-forefront-antispam-report: CIP:255.255.255.255; CTRY:; LANG:en; SCL:1; SRV:; IPV:NLI; SFV:NSPM; H:AM0PR07MB6114.eurprd07.prod.outlook.com; PTR:; CAT:NONE; SFS:(4636009)(366004)(136003)(346002)(376002)(39860400002)(396003)(66556008)(64756008)(66446008)(86362001)(52536014)(76116006)(4326008)(71200400001)(66946007)(8936002)(83380400001)(66476007)(186003)(7696005)(33656002)(110136005)(54906003)(316002)(8676002)(5660300002)(9686003)(166002)(2906002)(6506007)(966005)(53546011)(55016002)(9326002)(478600001); DIR:OUT; SFP:1102;
x-ms-exchange-antispam-messagedata: ATAjPNFHijQ/D1JtYdU2+ZQs2YaikgZNqqg4XNemGHwjmWf/3u3ctz9ce3L5bUoWjZc+HDr5D3qrtMrbs1mcLGRDfsFlgjGRqNKdFt7+mTu1KHRvVh4H0zVRALmBwGIvGOYy79KVmQUJqM7g3YtnUAinaOAJaTgKFU4vwg/ACoGSdwlFGwk1pRzoXRPQ7P00QsbzZM1ak7GPic71bCdtghZB5pv/+rGkZ6DU2/Ya+5Rg6YtLGrImIoyuog3yWI2FCT3QuC0JVQq0vRbhPbkPdxU5eCUutJ14NZlvVrg/qfRraF/x2tsL9lcURYKrVPm/cT7hQ6WiL+kdLQ0ABWaj/A25yGW1Cql/v2ht3aBYSsyrFXod8rBLzCC3g+TShjeqRjf6qnwDhw9h0/arDPMcfauR8B33DN4tNrXxoj5FQ+p1hxSWe77QhxSFh6ls3lejQmxJ+5N+jIWVnKjIeyYprSPKzpEorCp8B8ZbYtLS0nD9awaoBfI31WFKvN+MQ7jQwlPlnDcsuUILFwvKt+W390Hi6XEJejPQdpTZUb6Q++/RAyUKrn114jFwIJcZOYN3vzdRREr4u55vxuIG5fHu91GNQyXe9w6A27p4zyudvymmlNrbI8dPZW3zXLaB9AGQdPjpCw18rkqygwYS3Jpep7iucOGJmzHZhELznzMU2ixQlQW7xPQsetG3H2BfyJFAbjVc/kVmAfR0UtnHKodqwg==
x-ms-exchange-transport-forked: True
Content-Type: multipart/alternative; boundary="_000_AM0PR07MB6114D78B68E07B2F712E12E4B9100AM0PR07MB6114eurp_"
MIME-Version: 1.0
X-OriginatorOrg: nokia-bell-labs.com
X-MS-Exchange-CrossTenant-AuthAs: Internal
X-MS-Exchange-CrossTenant-AuthSource: AM0PR07MB6114.eurprd07.prod.outlook.com
X-MS-Exchange-CrossTenant-Network-Message-Id: b9f847ca-e70a-4bf8-f867-08d87f2b7056
X-MS-Exchange-CrossTenant-originalarrivaltime: 02 Nov 2020 12:33:01.6914 (UTC)
X-MS-Exchange-CrossTenant-fromentityheader: Hosted
X-MS-Exchange-CrossTenant-id: 5d471751-9675-428d-917b-70f44f9630b0
X-MS-Exchange-CrossTenant-mailboxtype: HOSTED
X-MS-Exchange-CrossTenant-userprincipalname: OG7un+pkTVfblW0dAWLq72e2d5zYeNOUUptV2G8rih5b8FVuel8RTOO9EZpMoGDNJvnALRKq7QQ5ZxachhprxWXcPhgfkCIV14Lvor4YVvjdjtWQPP65QVzxMFIANNof
X-MS-Exchange-Transport-CrossTenantHeadersStamped: AM0PR07MB3940
Archived-At: <https://mailarchive.ietf.org/arch/msg/tcpprague/Tav5vWvd1XopqXIfqDDELmx-3jE>
Subject: Re: [tcpPrague] [tsvwg] ecn-l4s-id: Proposed Changed to Normative Classic ECN detection Text
X-BeenThere: tcpprague@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: "To coordinate implementation and standardisation of TCP Prague across platforms. TCP Prague will be an evolution of DCTCP designed to live alongside other TCP variants and derivatives." <tcpprague.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/tcpprague>, <mailto:tcpprague-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/tcpprague/>
List-Post: <mailto:tcpprague@ietf.org>
List-Help: <mailto:tcpprague-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/tcpprague>, <mailto:tcpprague-request@ietf.org?subject=subscribe>
X-List-Received-Date: Mon, 02 Nov 2020 12:33:09 -0000

Hi Christian,

Indeed a very important discussion we need to have, and agree on.

>> If you give up the formula, you could end up with a significantly simpler event based congestion control, in which applications slow down if they see too many CE marks, accelerate if they only see a very low marking rate, and stay put if they see an infrequent marking rate.

This is exactly what I expect most CCs will do: work event based. The formula is only to tune the events so it converges to the formula (eventually). If there is a lot of dynamism, obviously shorter RTTs can respond faster and will benefit from that. They will have memory over a shorter time interval, and on average more throughput than a longer RTT flow. But if things are stable, it is good to converge to a known rate. I don’t see any better solution, do you? If we don’t foresee such a mechanism how do you expect bandwidth to be shared over shared infrastructure? Then the whole discussion of fairness can be dismissed, and CCs should not be evaluated for this neither.

>> The marking rate is actually 0.1%, about 2 packets per RTT

Indeed, this is still 2 marks per 15ms, much better than 1 drop/mark per 1000 RTTs (15 seconds) for your example when using Reno (Cubic will also be seconds). Only if the RTT is much smaller than 15ms, it will get less marks per RTT, but in our Linux Prague we have an option to gradually take the RTT dependence into account, depending on how long the flow is running application unlimited (or by extension how long things are stable).

Also such a formula is just a reference to aim at. It should not be a continuous hard requirement. There are reasons when a “flow” can deviate from it (Less than best effort can take less, a tunnel of multiple flows will take as many times more as there are flows inside, and a real-time application might temporary take more when needed).

I’m in favor of specifying explicitly such a formula as a target for L4S CC’s to share BW in steady state situations (long flows and stable throughput). Up to now I don’t think we have such an explicit requirement in the drafts. Anyone else that would support this? At least CC control developers then have a target to aim for and a clear evaluation criterium.

Regards,
Koen.


From: Christian Huitema <huitema@huitema.net>
Sent: Monday, November 2, 2020 1:11 AM
To: De Schepper, Koen (Nokia - BE/Antwerp) <koen.de_schepper@nokia-bell-labs.com>om>; Jonathan Morton <chromatix99@gmail.com>
Cc: iccrg IRTF list <iccrg@irtf.org>rg>; Bob Briscoe <ietf@bobbriscoe.net>et>; tsvwg IETF list <tsvwg@ietf.org>rg>; TCP Prague List <tcpPrague@ietf.org>
Subject: Re: [tcpPrague] [tsvwg] ecn-l4s-id: Proposed Changed to Normative Classic ECN detection Text



On 11/1/2020 3:55 PM, Christian Huitema wrote:


On 11/1/2020 1:53 PM, De Schepper, Koen (Nokia - BE/Antwerp) wrote:

>>By the way, I do think that L4S makes too many hypotheses about the CC. I wish the AQM was based on intrinsic properties, such as "this flow is using more than its fair share of bandwidth"

This last quote is exactly what the L4S marking is intended to do: the smoothed marking probability over a longer time interval (about 10 RTTs) indicates the fair target rate. Assuming we agree on 15ms being the reference RTT, then the average rate can be r=2/(p*0.015)=133/p packets per second. So you can evaluate yourself if you use more or less than the fair share. How you respond to this knowledge is up to your CC.

I think that part of the complexity in L4S AQM comes from trying to achieve fair sharing of resource without actually implementing fair queuing. You end up expecting that implementations apply a formula and derive their packet sending rate from a smoothed average of the packet marking rate. It may work well in controlled environments, but I am afraid that very few implementations are ever going to do that in practice.

Congestion Control algorithms are much more likely to treat packet losses, markings, or delay increases as signals that they are sending too fast, while considering their absence or lower frequency as signal that they might be able to send faster. Implementations also are likely to take into account the recent history, for example monitoring the min RTT (LEDBAT, BBR), or monitoring the recent available bandwidth (BBR). They also typically filter signals, e.g., consider only one packet loss or ECN mark per RTT (New Reno, Cubic, etc.). This is expected, because we do see packet losses arriving in batches, and can reasonably expect ECN marks to have the same behavior. But implementations are inherently selfish, do not really trust the signals from the network, and are very unlikely to end up applying the formula that you suggest.

In any case, you have a scaling issue. Let's consider a 1.5Gbps link, with 15 ms delay and 1500 bytes packets. The nominal sending rate is 125,000 packets per second. The marking rate with your formula shall be p = 2/(r*0.015), which is about 0.0008%. Over the last 10 RTT, the connection will on average see 0.14 marks -- that is, no mark over the last 10 RTT 86% of the time. This falls well short of the requirement to provide frequent feedback!

Sorry, bug in my spreadsheet. The marking rate is actually 0.1%, about 2 packets per RTT. Still not frequent enough for my taste, but much better than 0.0008%. Nevertheless, the inverse relation between marking rate and data rate is not great.

If you give up the formula, you could end up with a significantly simpler event based congestion control, in which applications slow down if they see too many CE marks, accelerate if they only see a very low marking rate, and stay put if they see an infrequent marking rate. I can see applications doing that, resulting in a fairly stable network and low delays. Of course, you will have to do some form of fair share enforcement to catch violators, but I am sure you can come up with something.

-- Christian Huitema







_______________________________________________

tcpPrague mailing list

tcpPrague@ietf.org<mailto:tcpPrague@ietf.org>

https://www.ietf.org/mailman/listinfo/tcpprague