Re: [ippm] [tsvwg] [iccrg] New Internet Draft: Congestion Signaling (CSIG)

Sebastian Moeller <moeller0@gmx.de> Wed, 13 September 2023 06:14 UTC

Return-Path: <moeller0@gmx.de>
X-Original-To: ippm@ietfa.amsl.com
Delivered-To: ippm@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id CFCECC15108D; Tue, 12 Sep 2023 23:14:29 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.553
X-Spam-Level:
X-Spam-Status: No, score=-2.553 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, FREEMAIL_ENVFROM_END_DIGIT=0.25, FREEMAIL_FROM=0.001, RCVD_IN_DNSWL_LOW=-0.7, RCVD_IN_MSPIKE_H3=0.001, RCVD_IN_MSPIKE_WL=0.001, RCVD_IN_ZEN_BLOCKED_OPENDNS=0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, T_SCC_BODY_TEXT_LINE=-0.01, URIBL_BLOCKED=0.001, URIBL_DBL_BLOCKED_OPENDNS=0.001, URIBL_ZEN_BLOCKED_OPENDNS=0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=gmx.de
Received: from mail.ietf.org ([50.223.129.194]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 0xhk9nYrhrDo; Tue, 12 Sep 2023 23:14:25 -0700 (PDT)
Received: from mout.gmx.net (mout.gmx.net [212.227.17.22]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange ECDHE (P-256) server-signature RSA-PSS (2048 bits) server-digest SHA256) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 94379C151085; Tue, 12 Sep 2023 23:14:24 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=gmx.de; s=s31663417; t=1694585660; x=1695190460; i=moeller0@gmx.de; bh=JU0cFu3j+nb818/ozFZve2/n8eYa3aZXVht7vfxfwC0=; h=X-UI-Sender-Class:Date:From:To:CC:Subject:In-Reply-To:References; b=qriacOGGZqCbcMt/iIOKb80dz+tUzU2jUOl6qMlzyQI4zW+xhgmMPeGo8DiTrpNAb/fyBjx JC51G4nODOgxYsorcaXFXUq5et4bo538ycSkOYqzJcK/AiC0rjnYXj5fVzVLk0Bp1krf2ZouL mvjSUYuA+IFl26hgqBYX4uIcXxpiYcaoieoymsQRL6DE6RTJ9FRTVnEcXHjbGEd845dNparkG IgaaLApvOgbwxqejh8tGoehbOja/2ErnOe4R9Rdsu1ozjTJYlxBEGqefidwrOkq4Q618kunWb PYhHFppSHmhmPo141JYDEoyXPaSh6h/KkR/edmo7m44GluGfgBxw==
X-UI-Sender-Class: 724b4f7f-cbec-4199-ad4e-598c01a50d3a
Received: from [127.0.0.1] ([134.76.241.253]) by mail.gmx.net (mrgmx104 [212.227.17.168]) with ESMTPSA (Nemesis) id 1MNsw4-1r4Qey24QC-00OKhQ; Wed, 13 Sep 2023 08:14:20 +0200
Date: Wed, 13 Sep 2023 08:14:15 +0200
From: Sebastian Moeller <moeller0@gmx.de>
To: Neal Cardwell <ncardwell=40google.com@dmarc.ietf.org>, Vidhi Goel <vidhi_goel=40apple.com@dmarc.ietf.org>
CC: "Shihang(Vincent)" <shihang9=40huawei.com@dmarc.ietf.org>, "Huangyihong (Rachel)" <rachel.huang=40huawei.com@dmarc.ietf.org>, Tom Herbert <tom=40herbertland.com@dmarc.ietf.org>, Abhiram Ravi <abhiramr=40google.com@dmarc.ietf.org>, IETF IPPM WG <ippm@ietf.org>, tsvwg <tsvwg@ietf.org>, "ccwg@ietf.org" <ccwg@ietf.org>, "iccrg@irtf.org" <iccrg@irtf.org>, Nandita Dukkipati <nanditad@google.com>, Naoshad Mehta <naoshad@google.com>, Jai Kumar <jai.kumar@broadcom.com>
User-Agent: K-9 Mail for Android
In-Reply-To: <CADVnQynjcK-eZFBj_RNnnNRm0MgA4rpvL-e9W5idHBe=VXjzHA@mail.gmail.com>
References: <92a6a6b54105447db6998d15961b1f8e@huawei.com> <2cc3f954aa2447dcb475f2a630841859@huawei.com> <2F15B386-EFF2-4637-8A3D-AF3CDD61114D@apple.com> <CADVnQynjcK-eZFBj_RNnnNRm0MgA4rpvL-e9W5idHBe=VXjzHA@mail.gmail.com>
Message-ID: <FD4CE122-A7CD-46C3-86C4-8960E979FBF0@gmx.de>
MIME-Version: 1.0
Content-Type: text/plain; charset="utf-8"
Content-Transfer-Encoding: quoted-printable
X-Provags-ID: V03:K1:LUPq2My4IcFfFTaWm9EFB+3RdjV7kXCAWjEX9NrE/v+o60uRG0k /D0XTkY75aSHDWg50ppKJ/fnc3XzO0efBeMyNLqnjPRuChZMLO4XmQlKAwE/ROdJzO6ZZKj X+HrEexSN0Af9Wl5Pie8ukX4NU1hDWBtddmFll/h38JGGvRmV2YZIF/+lQVvY4sNFTK8QD1 m6aR+GDKYt8aH4EIVRczw==
UI-OutboundReport: notjunk:1;M01:P0:IVC37dXoxXU=;4pQY5r/hB6+SLs+7yBbquqNmHnw r8zN0hVkFwnz6sybosJLOS/nmEaAPGt2eS5ANVYcELcs6XOquBmT8nR/7w5I3YpMvhlkcK2X3 ohto+NuApy2rt633w2pqIiopuc3124FcXTAlkBDdIJy+krTkc40kGyTR9ZlbhuaEwR2iv2iLX jl/lKrecnZLZU4Y2zC7mn3eTMKqAVOuSSb89UNlYXFyxcSBtUe8YdMdmKquZW8fdim//o4lhw 6Mx3LebVIR9YhF6r3evRHORpTbH56TRj7U+LubIB+W88C1sKxDYQiWtlun1ogANKQcsuJ8lFw vSe6fTO/MoNokIAibrqJHBEPtKKsDXhK2NNMRQyHHAcqQRNtri9UDsdjkgc7lMhkKITDXdxPV L4AGqD9XbvjGlZ9EmcIClMzH4etsos6Qfg6dlNjWgf/dUNGitE5EHmwo2PCh/NarrWPmkbubp 1aLrsHd8AkopgGilExlWzpmuvQJtJ1D4+EhFNpbC1U2EzskdAlYuxE191F4Vy5SlJOdS+mhYQ NzT53uFMAPsHJ3kCY5zRfSkldCnAoxN5lg7rmCsAgCPsoL9bQ6jr/9p+5tCaOMS6KqAh5qKbT 4TawpMPEq+dnVyvPz/GcB7yLdmt2RCRxv9R4LhMXWqhL62qoPH6Y7flypWcMOkZKq69GB2sKy 9EwjkGrjn43xb5pI9S90W9vIQ2OrSh1CaCzZHi/wtNp+D64flIxx6uE+6PGpScgrldFGq449M DcqNJpr7bp3N9PYx6y8QF5sooX9DA8nLBjXVW6Q0V26SXymofRqC/3HVfQKprK9TT58LLsgMT dfeRLblrf+LDtr6GD+DcPMxrI6yZPAaZ855bePSYuX7Nh0Q//k0dMY8iEZz4varAc/jAX5Om+ UxsCJns83WBpUvD3O0YgPKZx2j0iuTm5mdgzUEC3KbY1hrDCjb5OBOtHACvvsZczUaxWINRWR 38o/H1xPKDrWzo5zH2MyVJWR7iM=
Archived-At: <https://mailarchive.ietf.org/arch/msg/ippm/bKIoeJ0dHpE5-Dm8uWtv3gQrmBI>
X-Mailman-Approved-At: Wed, 13 Sep 2023 00:08:55 -0700
Subject: Re: [ippm] [tsvwg] [iccrg] New Internet Draft: Congestion Signaling (CSIG)
X-BeenThere: ippm@ietf.org
X-Mailman-Version: 2.1.39
Precedence: list
List-Id: IETF IP Performance Metrics Working Group <ippm.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/ippm>, <mailto:ippm-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/ippm/>
List-Post: <mailto:ippm@ietf.org>
List-Help: <mailto:ippm-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/ippm>, <mailto:ippm-request@ietf.org?subject=subscribe>
X-List-Received-Date: Wed, 13 Sep 2023 06:14:29 -0000

Hi Neal,

On 13 September 2023 03:50:38 CEST, Neal Cardwell <ncardwell=40google.com@dmarc.ietf.org> wrote:
>On Tue, Sep 12, 2023 at 7:59 PM Vidhi Goel <vidhi_goel=
>40apple.com@dmarc.ietf.org> wrote:
>
>> Not sure why we are coming up with so many new techniques when ECN just
>> works fine.
>> ECN is a 2 bit field (not 1 bit) and seems to be sufficient to indicate
>> extent of congestion by marking it per packet. Adding more complexity to
>> any layer whether it is L2 or L3 doesn’t work well in deployments. Our goal
>> should be to simplify things and only add new headers if absolutely
>> necessary.
>>
>
>ECN is great. But it's only Explicit Congestion Notification. It only
>notifies you when there's congestion. When the congestion goes away, the
>signal goes away, and flows don't know whether the bottleneck link is 0.1%
>utilized or 99.9% utilized.
>
>So with only ECN and a queue-free bottleneck, flows must either (a) be
>willing to increase cwnd quickly and run the risk of overshooting and
>causing massive delay/loss (as with CUBIC and its ceiling of 1.5x per round
>trip cwnd growth), or (b) increase very slowly to avoid significant
>overshoot (as with Reno and its 1 MSS per round trip cwnd growth).
>
>With a signal like CSIG, which provides bandwidth utilization information,
>then a CC can increase its sending rate as a function of the available
>bandwidth, to quickly increase the sending rate only in cases where the
>bottleneck has low utilization. This allows more quickly utilizing
>underutilized links with lower risk of queuing and loss damage from
>overshoot.

[SM] 'lower risk' is well put, but this needs empirical validation. What happens if bottlenecks hand out immediate capacity/rate estimates and all greedy flows start increasing their cwin really fast?
IMHO these is something to be said for low gain increases during congestion control phases... (as is for restricting slow start to doubling cwin each uncongested RTT). There clearly is a difference in what can be considered optimal rate increase rate depending on whether one looks from the perspective of an individual flow/application or looking at the network in total.

Regards
        Sebastian


>
>best regards,
>neal
>
>
>
>
>> Vidhi
>>
>> On Sep 12, 2023, at 3:12 AM, Shihang(Vincent) <shihang9=
>> 40huawei.com@dmarc.ietf.org> wrote:
>>
>> Hi,
>> I agree L2 may not be the best choice to carry the congestion signaling
>> end-to-end and more bits are needed. We have submitted a draft to carry the
>> multi-bits congestion signaling in L3. We call it Advanced ECN. See
>> https://datatracker.ietf.org/doc/draft-shi-ccwg-advanced-ecn/.
>>
>> Thanks,
>> Hang
>>
>> *From:* CCWG <ccwg-bounces@ietf.org> *On Behalf Of *Huangyihong (Rachel)
>> *Sent:* Tuesday, September 12, 2023 5:41 PM
>> *To:* Tom Herbert <tom=40herbertland.com@dmarc.ietf.org>; Abhiram Ravi
>> <abhiramr=40google.com@dmarc.ietf.org>
>> *Cc:* IETF IPPM WG <ippm@ietf.org>; tsvwg <tsvwg@ietf.org>; ccwg@ietf.org;
>> iccrg@irtf.org; Nandita Dukkipati <nanditad@google.com>; Naoshad Mehta <
>> naoshad@google.com>; Jai Kumar <jai.kumar@broadcom.com>
>> *Subject:* Re: [CCWG] [iccrg] [tsvwg] New Internet Draft: Congestion
>> Signaling (CSIG)
>>
>> Hi,
>>
>> I also have the same feeling. Implementing in L2 may be difficult to be
>> used in e2e transport. Of course it can work well in limited domain, like
>> DC or HPC clusters. However, I also look for some solutions that could be
>> able to go through internet. We have submitted a draft to describe the
>> transport challenges. See
>> https://datatracker.ietf.org/doc/html/draft-huang-tsvwg-transport-challenges
>> .
>>
>> I share the same opinion that the congestion signal is useful and current
>> 1-bit ECN solution is not fully sufficient. But I also feel like the more
>> straight way is to extend L3, or l4, like update IOAM, to carry the
>> information. For L2 solution, it should be developed together with IEEE
>> 802.1.
>>
>> BR,
>> Rachel
>>
>> *发件人:* iccrg <iccrg-bounces@irtf.org> *代表 *Tom Herbert
>> *发送时间:* 2023年9月10日 0:10
>> *收件人:* Abhiram Ravi <abhiramr=40google.com@dmarc.ietf.org>
>> *抄送:* IETF IPPM WG <ippm@ietf.org>; tsvwg <tsvwg@ietf.org>; ccwg@ietf.org;
>>  iccrg@irtf.org; Nandita Dukkipati <nanditad@google.com>; Naoshad Mehta <
>> naoshad@google.com>; Jai Kumar <jai.kumar@broadcom.com>
>> *主题:* Re: [iccrg] [tsvwg] New Internet Draft: Congestion Signaling (CSIG)
>>
>> Hi, thanks for draft!
>>
>> The first thing that stands out to me is the carrier of the new packet
>> headers. In the forward path it would be in L2 and in reflection it would
>> be L4. As the draft describes, this would entail having to support the
>> protocol in multiple L2 and multiple L4 protocols-- that's going to be a
>> pretty big lift! Also, L2 is not really an end-to-end protocol (would
>> legacy switches in the path also forward the header)l?).
>>
>> The signaling being described in the draft is network layer information,
>> and hence IMO should be conveyed in network layer headers. That's is L3
>> which conveniently is the average of L2+L4 :-)
>>
>> IMO, the proper carrier of the signal data is Hop-by-Hop Options. This is
>> end-to-end and allows modification of data in-flight. The typical concern
>> with Hop-by-Hop Options is high drop rates on the Internet, however in this
>> case the protocol is explicitly confined to a limited domain so I don't see
>> that as a blocking issue for this use case.
>>
>> The information being carried seems very similar to that of IOAM (IOAM
>> uses Hop-by-Hop Options and supports reflection). I suppose the differences
>> are that this protocol is meant to be consumed by the transport Layer and
>> the data is a condensed summary of path characteristics. IOAM seems pretty
>> extensible, so maybe it could be adapted to carry the signals of this draft?
>>
>> A related proposal might be FAST draft-herbert-fast. Where the CSIG is
>> network to host signaling, FAST is host to network signaling for the
>> purposes of requesting network services. These might be complementary and
>> options for both may be in the same packet. FAST also uses reflection, so
>> we might be able to leverage some common implementation at a destination.
>>
>> Tom
>>
>> On Fri, Sep 8, 2023, 7:43 PM Abhiram Ravi <abhiramr=
>> 40google.com@dmarc.ietf.org> wrote:
>>
>> Hi IPPM folks,
>>
>> I am pleased to announce the publication of a new internet draft,
>> Congestion Signaling (CSIG):
>> https://datatracker.ietf.org/doc/draft-ravi-ippm-csig/
>>
>> CSIG is a new end-to-end packet header mechanism for in-band signaling
>> that is simple, efficient, deployable, and grounded in concrete use cases
>> of congestion control, traffic management, and network debuggability. We
>> believe that CSIG is an important new protocol that builds on top of
>> existing in-band network telemetry protocols.
>>
>> We encourage you to read the CSIG draft and provide your feedback and
>> comments. We have also cc'd the TSVWG, CCWG, and ICCRG mailing lists, as we
>> believe that this work may be of interest to their members as well.
>>
>> Thank you for your time and consideration.
>>
>> Sincerely,
>> Abhiram Ravi
>> On behalf of the CSIG authors
>>
>>
>>

-- 
Sent from my Android device with K-9 Mail. Please excuse my brevity.