Re: [tsvwg] Why L4S need a separate queue?

Sebastian Moeller <moeller0@gmx.de> Thu, 18 April 2024 08:28 UTC

Return-Path: <moeller0@gmx.de>
X-Original-To: tsvwg@ietfa.amsl.com
Delivered-To: tsvwg@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id D7571C14F6AD; Thu, 18 Apr 2024 01:28:53 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.547
X-Spam-Level:
X-Spam-Status: No, score=-2.547 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, FREEMAIL_ENVFROM_END_DIGIT=0.25, FREEMAIL_FROM=0.001, RCVD_IN_DNSWL_LOW=-0.7, RCVD_IN_MSPIKE_H2=-0.001, RCVD_IN_ZEN_BLOCKED_OPENDNS=0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, URIBL_DBL_BLOCKED_OPENDNS=0.001, URIBL_ZEN_BLOCKED_OPENDNS=0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=gmx.de
Received: from mail.ietf.org ([50.223.129.194]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id od5AyMCOjxGz; Thu, 18 Apr 2024 01:28:49 -0700 (PDT)
Received: from mout.gmx.net (mout.gmx.net [212.227.15.15]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange ECDHE (P-256) server-signature RSA-PSS (2048 bits) server-digest SHA256) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 29A70C14F6B3; Thu, 18 Apr 2024 01:28:48 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmx.de; s=s31663417; t=1713428927; x=1714033727; i=moeller0@gmx.de; bh=0m6w6AnK8s5gWa8XsC3MD9Z7RA9xZB0HvMqtqvbWRPs=; h=X-UI-Sender-Class:Content-Type:Mime-Version:Subject:From: In-Reply-To:Date:Cc:Content-Transfer-Encoding:Message-Id: References:To:cc:content-transfer-encoding:content-type:date:from: message-id:mime-version:reply-to:subject:to; b=KB2Bon1847q/aXJmMaDC5833XsjDUCss9MpMbnPWjn/mmVaf6GlplyYM/Aa+0LdD c3jh4H+4DB6cPwX0kr5Gwl/mfY5Kvet77uLZVwwW7UMWi9y9k3Ahq0GI6T1ujtpa3 z3rcDbF5yZN5h9xsYVsroYzsxGlAJBbXzkYPRMbOqFlboX1e5jNXrvJm5nf4Kkdue pkGwKAwNMw5G2kZQMvNMGMsOfn4zcQ/gbZy1UJoFETHi1mXh8/ed77DZVybTf35XM mlOEUUcQxm9agE08EBvd7dLRzKOMusnFXbcfHFRWVLGKrVZ/KbgT6Mda9hJYZP9bY VCC2hpyYNS6uweg0vA==
X-UI-Sender-Class: 724b4f7f-cbec-4199-ad4e-598c01a50d3a
Received: from smtpclient.apple ([134.76.241.253]) by mail.gmx.net (mrgmx004 [212.227.17.190]) with ESMTPSA (Nemesis) id 1MVNAr-1s84Fc0fmZ-00SQi9; Thu, 18 Apr 2024 10:28:47 +0200
Content-Type: text/plain; charset="utf-8"
Mime-Version: 1.0 (Mac OS X Mail 16.0 \(3774.500.171.1.1\))
From: Sebastian Moeller <moeller0@gmx.de>
In-Reply-To: <8a83c60e2dd841679f4a0b1848830b6f@huawei.com>
Date: Thu, 18 Apr 2024 10:28:36 +0200
Cc: "tsvwg@ietf.org" <tsvwg@ietf.org>
Content-Transfer-Encoding: quoted-printable
Message-Id: <C8959351-9C9A-457E-9E6F-CE3ABA195DAE@gmx.de>
References: <6284927dbd704a6abdec7fca7f6bade2@huawei.com> <8a83c60e2dd841679f4a0b1848830b6f@huawei.com>
To: Vasilenko Eduard <vasilenko.eduard=40huawei.com@dmarc.ietf.org>
X-Mailer: Apple Mail (2.3774.500.171.1.1)
X-Provags-ID: V03:K1:A3O0d5XSg8DGm8AUVznW+V/NG7lWfyIk7zuUaPJqJTpVspV5mO1 iVg1DrK0OlWjpcnurlvbS6wjYCavlEqfxr69T/J4GTGoaYy70rzftI+djUZZpJYT7UFnAL6 nJBN3KbmX4bY0as1kbjfexDzU9n0it19WYgRlQycWhEtEetQTC4sOjdqwC61vhZ/LhTSblj TvNfrr/H8/3mPS20fW8oA==
UI-OutboundReport: notjunk:1;M01:P0:ejr7qOgWHSI=;X5ZlyGWRg+AT1OWu2XiRC4Zt3fM GMRkPGWUHtNKF6yxtHVEhgE7cND6VV8xYM7Awn1v1oB9MhQuEvcrNQ4/Jli9P3fjUbT51j8L3 Hv3wFiN1LvgwCwZk0KFkVHfN7nDvoaqBoyoiufqFcL7Tv7wBb0hCZi05/0SCdbpg4ZKyI2RjX w5PeF429+XbyaNZLYz0mD+LLU39E7V9Om7AjO8z232m6qMVHLPbM87SKUJ3D5SH/cjZbz06bP unxbvLKd90c2bFExLUi1qIxghdbRdy1BPU2p+61L7wbWMOVETaVx5Yd8n02k9bWCJYHpmYMVp sMpikm6ismCAMX1W97V9pRBTBQnA0wneQpan8Lzx2KBN5qskOtWvLqJBrfxgZWEvkH0JYXPHN hUwHLFpA+1jE15+AoWvmz4Dr4PKa0FhXZMup8QFy5FR20lijd5lFYp5lbNttH6l/pXPuO4jMX BFgC2qIui0ciO2Xk9mGNaCU9dNUmPrWkODMl0a17U3LZ7CASLQIy/I045RO88PXqgAhFsrS2R PhFbwNUbtqC+UphmTYr7FBSEIyma+5rdv7xBRdbe9wB8XC192J8V/u3gUpnQ0JqrCJp0LCBBT onahFCdPAEsbacEFU/jKLwMcmuZ8lwQYAcD7/YmLDvZGk9FveXQILTkwMRv5zWF2Slc9y+fy1 kmeH7YwFv/S8eHw1r3KPBAa6lb2N93vgiBCD3X/8NlixnaZ9dyvt8TGtMoQaJdayGcCQcReZW JwK8IprkouVHcN4L6U6EYPbAUkoANf72IP0glMkNztek/eTO6Ws8vsHSJ9H4M/hs8pFbPU6Tr Vb78F2wY9UH1k3VWHhFUXOb0GfI+QVhH9iOEVwBMen8uk=
Archived-At: <https://mailarchive.ietf.org/arch/msg/tsvwg/YTAVkKSdV6TpltHRXXszYs0_CKE>
Subject: Re: [tsvwg] Why L4S need a separate queue?
X-BeenThere: tsvwg@ietf.org
X-Mailman-Version: 2.1.39
Precedence: list
List-Id: Transport Area Working Group <tsvwg.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/tsvwg>, <mailto:tsvwg-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/tsvwg/>
List-Post: <mailto:tsvwg@ietf.org>
List-Help: <mailto:tsvwg-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/tsvwg>, <mailto:tsvwg-request@ietf.org?subject=subscribe>
X-List-Received-Date: Thu, 18 Apr 2024 08:28:53 -0000

Hi Ed,


> On 18. Apr 2024, at 09:58, Vasilenko Eduard <vasilenko.eduard=40huawei.com@dmarc.ietf.org> wrote:
> 
> If somebody in the future would develop yet another CC algorithm (nobody could predict future requirements),
> But dependency between algorithms (formula) is hard-coded in the router’s hardware,
> Then it is an infrastructure replacement again.
> It was wrong to put a formula for CC cooperation/fairness into hardware.

[SM] I actually agree that both redefining what response is expected for a single CE mark and the imprecise coupling between two queues are obvious flaws of L4S, that could and should have been avoided. However IMHO the solution really is to actually signal the piece of information end points need to adapt to the situation at the bottleneck, and that IMHO would be a multibit signal about the bottleneck state sent in every individual packet. Flows can then reflect this information and adjust their rate according to the veridical dynamics of the bottleneck load.  That also seems to be the way the data center is going (see e.g. https://www.usenix.org/conference/nsdi23/presentation/wang-weitao) abandoning the approach of emulating a true per-packet signal with a rate code of the same state bit essentially toggling a single bit. 

>  It may have a small positive effect on the transition period to isolate new traffic from legacy,

[SM] IMHO isolation really is the key, but just separating traffic based on the expected differential CE response as L4S does is really just the bare minimum, and is mostly owed to the fact that these two responses are inherently incompatible and that normal internet traffic expects/requires deeper queues.

> But BBR has surpassed half of the traffic already and resolved this problem differently.

[SM] Mmmh, but has it? BBRv3, as far as I know actually evaluates and responds to L4S style CE marks.

>  Especially in the situation when the formula was analytically derived from the wrong assumptions.
> It is a time of errata for RFC 9332 where RENO and CUBIC dependencies are claimed on the square root of marking probability.

[SM] Maybe it is time you present a proof of the claim that the square root 'law' is not 'good enough'? CoDel (https://datatracker.ietf.org/doc/html/rfc8289) is built ion that assumption and in practice works pretty well. To me this implies that the theory is 'good enough'.

>  By the way, RFC 9332 references PI2 publication which claims different formulas for RENO and CUBIC. CUBIC is cubical as anybody could guess – very different from RENO.

[SM] Indeed, but depending on the regime cubic is in it is close tro Reno, and IIRC was designed to peacefully coexist with Reno.

> CUBIC has almost replaced RENO on the market, but RFC 9330 and RFC 9332 propose to use a formula from RENO. It is not logical at all.

[SM] I seem to recall that at the time Reno was the only TCP variant that had a published standard RFC, so honouring this as precedent seems totally expected for the IETF, no?

> Even if nobody spotted that analytical formulas are wrong, why the formula from RENO was chosen for DualQ, not for CUBIC?

[SM] See above, as Reno has RFC statius while cubic has not, and cubic tries to accommodate Reno.

> It is an additional errata for the RFC 9332.
>  Opensource has won IETF again and unfortunately, I suspect why.

[SM] Excuse me? That comes quite unexpected, what do you mean with opensource here, and what is your exact criticism, it is not that RFCs based on proprietary commercial work are always perfect either...

> This silence is a good sign of what is going on in TSVWG.
> Nobody wants to discuss the problem in essence. Everything has already been decided by politics.

[SM] As much as I agree with the view that politics has a stronger influence on IETF decisions than the self-description of the IETF's processes would imply, I also consider that to be pretty much expected when a larger number of folks need to agree on things... But typically well-made arguments will elicit a response in this mailing list by actual experts (and not just the peanut gallery visitors like me) but that can take a bit, so maybe have a bit more patience?

Regards
	Sebastian

> Eduard
> From: tsvwg <tsvwg-bounces@ietf.org> On Behalf Of Vasilenko Eduard
> Sent: Tuesday, April 16, 2024 17:06
> To: tsvwg@ietf.org
> Subject: [tsvwg] Why L4S need a separate queue?
>  Hi Experts,
> For sure, some balancing is needed between the new CCA against the old CCA to achieve the fairness. It may have a complex formula – no problem with that.
> But why this formula has been put on routers?
> BBR (without ECN feedback) has finally developed a good enough formula to play nicely with CUBIC.
> I have not seen a test for BBR with ECN against CUBIC with ECN or against CUBIC without ECN. If anybody saw it – please share.
> No doubt that such a formula could be developed for BBR with ECN (if it is not a part of BBR already).
> Any CCA MUST have fairness with CUBIC in one queue – the world has a huge number of already installed hardware (with fixed schedulers).
>  Then the only thing that would be needed from routers is to support ECN instead (or in addition) of drop.
> And ECN(1) would not be needed.
>  I do not see a value in the requirement for an additional scheduler type on routers.
> It would not affect the performance. Because any new CCA MUST have fairness for one queue anyway for adoption strategy.
> It just additionally complicated L4S adoption (with rip&replace approach).
>  PS: everything above is my personal opinion.
>  Best Regards
> Eduard Vasilenko
> Senior Architect
> Network Algorithm Laboratory
> Tel: +7(985) 910-1105