Re: [tsvwg] a new method of congestion control

"Shihang(Vincent)" <shihang9@huawei.com> Mon, 05 June 2023 07:46 UTC

From: "Shihang(Vincent)" <shihang9@huawei.com>
To: Mitchell Erblich <erblichs@earthlink.net>, "touch@strayalpha.com" <touch@strayalpha.com>
CC: tsvwg IETF list <tsvwg@ietf.org>
Date: Mon, 05 Jun 2023 07:46:39 +0000
Message-ID: <5b4d9427390649abb2b977e8280a2117@huawei.com>
References: <CAA93jw6TJEciW8QhgbSe=0ZTk6njhpxMTQ3ETxzy73hhcP0yAw@mail.gmail.com> <C21F06B7-79DE-4726-9DD7-D91BF9DB9AC5@earthlink.net> <16922F7E-EED2-4298-98EC-AA7784A0D4AD@strayalpha.com> <0EB4FCC9-99F3-4E29-BB12-22B2DAC9AB95@earthlink.net>
In-Reply-To: <0EB4FCC9-99F3-4E29-BB12-22B2DAC9AB95@earthlink.net>
Archived-At: <https://mailarchive.ietf.org/arch/msg/tsvwg/dWhOPSOP8e0gByGTIYXJL-iHocI>

From https://nvidianews.nvidia.com/news/nvidia-announces-spectrum-high-performance-data-center-networking-infrastructure-platform :
With 51.2Tbps aggregate ASIC bandwidth supporting 128 ports of 400GbE, combined with adaptive routing and enhanced congestion control mechanisms, Spectrum-4 optimizes RDMA over Converged Ethernet fabrics and dramatically accelerates data centers.
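The two figures in the quoted announcement are consistent with each other, which a quick check shows. This is only a sketch: it assumes "400GbE" means a nominal 400 Gb/s per port and decimal units (1 Tb/s = 1000 Gb/s); the function and constant names are made up for illustration.

```python
# Sanity check of the quoted figures: port count times per-port rate.
PORTS = 128            # port count from the quoted announcement
PORT_RATE_GBPS = 400   # assumption: 400GbE taken as a nominal 400 Gb/s per port


def aggregate_tbps(ports: int, rate_gbps: int) -> float:
    """Return aggregate bandwidth in Tb/s, assuming 1 Tb/s = 1000 Gb/s."""
    return ports * rate_gbps / 1000


total = aggregate_tbps(PORTS, PORT_RATE_GBPS)
print(f"{PORTS} x {PORT_RATE_GBPS} Gb/s = {total} Tb/s")
```

So the 51.2Tbps number is just the sum of the port rates; it says nothing by itself about which transport or congestion control runs on top.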

AI workloads run mostly on RoCE, not TCP.

Best,
Hang

-----Original Message-----
From: tsvwg <tsvwg-bounces@ietf.org> On Behalf Of Mitchell Erblich
Sent: Monday, June 5, 2023 12:14 PM
To: touch@strayalpha.com
Cc: tsvwg IETF list <tsvwg@ietf.org>
Subject: Re: [tsvwg] a new method of congestion control

Joe,

	One would need to duplicate Appropriate Byte Counting (ABC) by Allman / Pacing in Ethernet, but why would you want to when you already have it within an experimental CA in TCP?

	If you flood packets and an intermediate system (router) does a tail drop or a random early drop, you still drop packets.

	If you have out-of-orderness that jumps because… then you drop packets

	If you fragment and drop a fraction of the packet you drop…
	
	If you re-route because of a change in the routing table and you re-order, you drop…

	Yes, we have selective acks… But not in Ethernet or IPvX.

	Ethernet supports two sizes of IP (IPv4 and IPv6) … do you do something special for a fraction of the packets in your network?

	You need to have 1-hop networks… Again, weird…

	Can’t guarantee no drop of packets… Yes, you can mitigate if and if and if… but isn’t it just worth using Selective Acks and other existing RFCs?

	Why create a wheel when it already exists???


Group, sorry about the verbose response…

Mitchell Erblich
erblichs@earthlink.net



> On Jun 4, 2023, at 8:15 PM, touch@strayalpha.com wrote:
> 
> Ethernet has a LOT of extensions:
> 
> IEEE 802.1
> en.wikipedia.org
> 
> Could this either be some subset of newer extensions or revival/vendor-specific variants of past attempts?
> 
> Joe
> 
> —
> Dr. Joe Touch, temporal epistemologist www.strayalpha.com
> 
>> On Jun 4, 2023, at 7:50 PM, Mitchell Erblich <erblichs@earthlink.net> wrote:
>> 
>> Weird, In my Opinion (IMO),
>> 
>> 	How do you separate out AI versus non-AI flows, so you can tailor one type of network flow to AI-aware applications?
>> 
>> 	Is AI consuming even 20% of network bandwidth at this time?
>> 
>> 	Isn’t Apple’s Siri an AI application approaching 10 years of age? I find it far from perfect.
>> 
>> 	It is like having a different Ethernet protocol for UDP versus TCP.
>> 
>> 	The article must be missing critical information or the writer doesn’t understand TCP and/or CA.
>> 
>> Mitchell Erblich
>> erblichs@earthlink.net
>> 
>> 
>> 
>>> On Jun 4, 2023, at 5:53 PM, Dave Taht <dave.taht@gmail.com> wrote:
>>> 
>>> announced by nvidia here:
>>> https://www.zdnet.com/article/nvidia-unveils-new-kind-of-ethernet-for-ai-grace-hopper-superchip-in-full-production/
>>> 
>>> I have no idea what an AI workload looks like, familiarity with a 
>>> ton of DC l2 protocols, and there are hints in this post about telemetry.
>>> 
>>> Anyone have a clue here?
>>> 
>>> --
>>> Podcast: 
>>> https://www.linkedin.com/feed/update/urn:li:activity:7058793910227111937/
>>> Dave Täht CSO, LibreQos
>>> 
>> 
>