Re: New Version Notification for draft-miao-rtgwg-hpccplus-00.txt

"Miao, Rui" <miao.rui@alibaba-inc.com> Tue, 22 March 2022 21:22 UTC

Return-Path: <miao.rui@alibaba-inc.com>
X-Original-To: rtgwg@ietfa.amsl.com
Delivered-To: rtgwg@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 4CF7B3A0A84 for <rtgwg@ietfa.amsl.com>; Tue, 22 Mar 2022 14:22:08 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -7.106
X-Spam-Level:
X-Spam-Status: No, score=-7.106 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, FROM_EXCESS_BASE64=0.001, HTML_MESSAGE=0.001, RCVD_IN_DNSWL_HI=-5, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, T_SCC_BODY_TEXT_LINE=-0.01, UNPARSEABLE_RELAY=0.001, URIBL_BLOCKED=0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (1024-bit key) header.d=alibaba-inc.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id o4LEgmnESL7c for <rtgwg@ietfa.amsl.com>; Tue, 22 Mar 2022 14:22:03 -0700 (PDT)
Received: from out0-155.mail.aliyun.com (out0-155.mail.aliyun.com [140.205.0.155]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id CD3C43A0A80 for <rtgwg@ietf.org>; Tue, 22 Mar 2022 14:22:02 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=alibaba-inc.com; s=default; t=1647984120; h=Date:From:To:Message-ID:Subject:MIME-Version:Content-Type; bh=wHmBbP0rMiCr7Hre/yuC1S2bQI99j2AQtmxGHFjPsLU=; b=YIMITqKa+olRCW/gJloCZD93t4kfBF17bgV74Isss7JvqCwUDAepPxWododlJVH0bFVTHSzyUVtYBBnr//NU4yUC/zYGZHPI+minX8dZIKgoRoE4bV1zTnCaGMMzpS/sil8EFJz4njKsYWJ+6dTGtTsTQH8uWJZQgHdEPDc8YqY=
X-Alimail-AntiSpam: AC=PASS; BC=-1|-1; BR=01201311R201e4; CH=green; DM=||false|; DS=||; FP=0|-1|-1|-1|0|-1|-1|-1; HT=ay29a033018047203; MF=miao.rui@alibaba-inc.com; NM=1; PH=DW; RN=3; SR=0; TI=W4_0.1.10_v5ForWebDing_21281EA8_1647984041951_o7001c104k;
Received: from WS-web (miao.rui@alibaba-inc.com[W4_0.1.10_v5ForWebDing_21281EA8_1647984041951_o7001c104k]) by ay29a011140100061.et135 at Wed, 23 Mar 2022 05:21:59 +0800
Date: Wed, 23 Mar 2022 05:21:59 +0800
From: "Miao, Rui" <miao.rui@alibaba-inc.com>
To: Greg Mirsky <gregimirsky@gmail.com>
Cc: "Acee Lindem (acee)" <acee@cisco.com>, rtgwg <rtgwg@ietf.org>
Reply-To: "Miao, Rui" <miao.rui@alibaba-inc.com>
Message-ID: <13faa1fc-6cb8-4372-9898-eac73fafdb81.miao.rui@alibaba-inc.com>
Subject: Re: New Version Notification for draft-miao-rtgwg-hpccplus-00.txt
X-Mailer: [Alimail-Mailagent revision 1][W4_0.1.10][v5ForWebDing][Chrome]
MIME-Version: 1.0
References: <164669251153.9143.12929435321386079793@ietfa.amsl.com> <ebab0566-782b-411c-9baa-08ca82032f60.miao.rui@alibaba-inc.com> <0FF1A8D3-1ECB-4540-A1E7-25F71DB9BA80@cisco.com> <bcd4cbc6-c269-4f5f-b54d-0809c9bbfbe5.miao.rui@alibaba-inc.com> <CA+RyBmV4YkKHaPBMwSoGB5W=os=e75ZMQSVp-yAaC-WDyTfCqA@mail.gmail.com> <d26ce944-1fc9-4201-971a-914aab12314e.miao.rui@alibaba-inc.com> <CA+RyBmWeixmTui3PK9U2NVk5x1-xmmSC0RGENWhmR4ZeDGZmfA@mail.gmail.com>, <CA+RyBmUDx1f8ynXj-2qtgUBSEVZxBs9SAhX+-AarbMJ5ofXkvA@mail.gmail.com>
x-aliyun-mail-creator: W4_0.1.10_v5ForWebDing_M3LTW96aWxsYS81LjAgKE1hY2ludG9zaDsgSW50ZWwgTWFjIE9TIFggMTBfMTVfNykgQXBwbGVXZWJLaXQvNTM3LjM2IChLSFRNTCwgbGlrZSBHZWNrbykgQ2hyb21lLzk4LjAuNDc1OC4xMDkgU2FmYXJpLzUzNy4zNg==vN
In-Reply-To: <CA+RyBmUDx1f8ynXj-2qtgUBSEVZxBs9SAhX+-AarbMJ5ofXkvA@mail.gmail.com>
Content-Type: multipart/alternative; boundary="----=ALIBOUNDARY_85676_7f4b35487700_623a3df7_7d93fb"
Archived-At: <https://mailarchive.ietf.org/arch/msg/rtgwg/VtSkCEzdnSKGU52guVbbjLBhJZ0>
X-BeenThere: rtgwg@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: Routing Area Working Group <rtgwg.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/rtgwg>, <mailto:rtgwg-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/rtgwg/>
List-Post: <mailto:rtgwg@ietf.org>
List-Help: <mailto:rtgwg-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/rtgwg>, <mailto:rtgwg-request@ietf.org?subject=subscribe>
X-List-Received-Date: Tue, 22 Mar 2022 21:22:10 -0000

Thanks Greg and Acee for the comments. 

We will iterate your concerns and address them in our updates. 

Thanks,
Rui


------------------------------------------------------------------
From:Greg Mirsky <gregimirsky@gmail.com>
Sent At:2022 Mar. 21 (Mon.) 08:16
To:Rui <miao.rui@alibaba-inc.com>
Cc:Acee Lindem (acee) <acee@cisco.com>; rtgwg <rtgwg@ietf.org>
Subject:Re: New Version Notification for draft-miao-rtgwg-hpccplus-00.txt

Dear Authors,
thank you for the presentation at RTGWG meeting. I have several further comments for your consideration:

I agree with Dean's comment that waiting for the congestion to occur and acting on the notification with unbounded latency will likely cause further degradation of the network condition because of the reactive mechanism. Polling, proposed by Dean, is one way to use telemetry data in a proactive manner. I would refer to the work on Precision Availability Metrics in Multi-SLO services we've presented earlier at the IPPM WG meeting. Monitoring the network and reporting crossed SLO thresholds may be the path to proactively detecting degradation in the network and avoiding congestion altogether.
Overall, it appears that there are two drafts in one - one describing HPPCC++ itself, and another on what telemetry data are required for HPPCC++. It seems like separating these might help the discussion and finding proper places to continue the work.
The draft and presentation emphasize the use of "in-band" OAM methods. Is the intention to collect telemetry information on each data packet? It could be helpful to generalize document and define not how information is obtained (hybrid, active, passive methods) but which information is required to enable HPPCC++.
Regards,
Greg
On Tue, Mar 15, 2022 at 7:25 PM Greg Mirsky <gregimirsky@gmail.com> wrote:

Hi Rui,
thank you for the expedient responses to my questions. I think that there are some aspects of how to obtain the information used to detect the congestion that I'd like to clarify with your help:

From its description, HPCC++ depends on the high-precision information collected along the path, including timestamps. All methods listed in the Section 6.1 collect information into the trigger packet. But such a collection method negatively affects the accuracy of a timestamp because the time value cannot be taken in the course of physical transmission of the packet. What methods and techniques could be used to improve the accuracy of time values recorded?
As I understand it, HPCC++ relies on the collection of telemetry data on all nodes traversed by the packet. If that is true, then that would require that all nodes have their clocks synchronized. And to provide the high precision time values, the accuracy of clock synchronization in the domain must be at least 10 times higher then the required accuracy of time measurements. Hence my next question, What should be the accuracy of clock synchronization for HPCC++?
If I understand your answer #3 correctly, you consider IOAM as an active OAM method. According to draft-ietf-ippm-data and RFC 7799, IOAM is an example of hybrid type 1 measurement method. And since a useful active OAM protocol is always in-band with the monitored flow, I think that the list of protocols in Section 6.1 can be significantly expanded to include other known measurement protocols. What are your thoughts?
Regards,
Greg
On Tue, Mar 15, 2022 at 6:40 PM Miao, Rui <miao.rui@alibaba-inc.com> wrote:
Thanks Greg for the comments.

1. ``inband'' here means the host queries the network telemetry information along the path where its traffic traverses through in real time.
2.  Although the "out-of-band telemetry" may provide feedback-based control via management plane, the target scenario achieves congestion control for low latency (~10 us round trip time) where inband fits better.
3. I would think so. In fact, IOAM (draft-ietf-ippm-ioam-data-17) is part of our implementation examples. 


-Rui




------------------------------------------------------------------
From:Greg Mirsky <gregimirsky@gmail.com>
Sent At:2022 Mar. 15 (Tue.) 16:23
To:Rui <miao.rui@alibaba-inc.com>
Cc:Acee Lindem (acee) <acee@cisco.com>; rtgwg <rtgwg@ietf.org>
Subject:Re: New Version Notification for draft-miao-rtgwg-hpccplus-00.txt

Hi Rui,
it seems that the proposed congestion control mechanism is dependent on "inband telemetry". If my understanding is correct, I have several questions:

I see that OAM methods listed in Section 6.1 can be classified, based on RFC 7799, as hybrid measurement methods. It is not clear why they are referred to as "inband". Could you please clarify the interpretation of "inband telemetry" in the draft.
Furthemore, do you see the case of using "out-of-band telemetry" as the basis for the proposed congestion control mechanism?
And lastly, in your opinion, can active OAM methods be used to provide the information necessary for the HPCC++?
Regards,
Greg
On Tue, Mar 15, 2022 at 2:18 PM Miao, Rui <miao.rui=40alibaba-inc.com@dmarc.ietf.org> wrote:
Hi Acee, 

Thanks for the comment. 

HPCC++ proposes an approach to leverage the in-band telemetry for traffic admission and path selection, which we think is more relevant to routing decisions. Besides, HPCC++ is orthogonal to any transports and thus we have several example implementations in the draft. 

Thanks,
Rui

------------------------------------------------------------------
From:Acee Lindem (acee) <acee@cisco.com>
Sent At:2022 Mar. 15 (Tue.) 12:43
To:Rui <miao.rui@alibaba-inc.com>; rtgwg <rtgwg@ietf.org>
Subject:Re: New Version Notification for draft-miao-rtgwg-hpccplus-00.txt

Why is this draft in the Routing WG? This work is more applicable to the Transport or Internet Area. 
Acee
From: rtgwg <rtgwg-bounces@ietf.org> on behalf of "Miao, Rui" <miao.rui=40alibaba-inc.com@dmarc.ietf.org>
Reply-To: "Miao, Rui" <miao.rui@alibaba-inc.com>
Date: Tuesday, March 15, 2022 at 2:42 PM
To: Routing WG <rtgwg@ietf.org>
Subject: Fw: New Version Notification for draft-miao-rtgwg-hpccplus-00.txt
Hello All,

We have posted an updated version of HPCC++ at https://datatracker.ietf.org/doc/draft-miao-rtgwg-hpccplus/ .

There are two major changes since the last presentation. First, since HPCC++ is orthogonal to different in-band telemetry formats and transport protocols, we introduce a few reference implementations over those protocols for people to quickly pick up. 
Second, we advocate a better routing scheme for path selection and traffic admission, building on top of HPCC++'s precise link load information.

Please provide any comments you may have on the draft or its extensions.

Thanks,
Rui




------------------------------------------------------------------
From:internet-drafts <internet-drafts@ietf.org>
Sent At:2022 Mar. 7 (Mon.) 14:35
To:Harry <hongqiang.liu@alibaba-inc.com>; Barak Gafni <gbarak@mellanox.com>; Changhoon Kim <chang.kim@intel.com>; Jeff Tantsura <jefftantsura@microsoft.com>; Jeongkeun Lee <jk.lee@intel.com>; Rong Pan <rong.pan@intel.com>; Rui <miao.rui@alibaba-inc.com>; Yuval Shpigelman <yuvals@nvidia.com>
Subject:New Version Notification for draft-miao-rtgwg-hpccplus-00.txt


 A new version of I-D, draft-miao-rtgwg-hpccplus-00.txt
 has been successfully submitted by Rui Miao and posted to the
 IETF repository.

 Name:  draft-miao-rtgwg-hpccplus
 Revision: 00
 Title:  HPCC++: Enhanced High Precision Congestion Control
 Document date: 2022-03-07
 Group:  Individual Submission
 Pages:  20
 URL:             https://www.ietf.org/archive/id/draft-miao-rtgwg-hpccplus-00.txt
 Status:          https://datatracker.ietf.org/doc/draft-miao-rtgwg-hpccplus/
 Htmlized:        https://datatracker.ietf.org/doc/html/draft-miao-rtgwg-hpccplus


 Abstract:
   Congestion control (CC) is the key to achieving ultra-low latency,
   high bandwidth and network stability in high-speed networks.
   However, the existing high-speed CC schemes have inherent limitations
   for reaching these goals.

   In this document, we describe HPCC++ (High Precision Congestion
   Control), a new high-speed CC mechanism which achieves the three
   goals simultaneously.  HPCC++ leverages inband telemetry to obtain
   precise link load information and controls traffic precisely.  By
   addressing challenges such as delayed signaling during congestion and
   overreaction to the congestion signaling using inband and granular
   telemetry, HPCC++ can quickly converge to utilize all the available
   bandwidth while avoiding congestion, and can maintain near-zero in-
   network queues for ultra-low latency.  HPCC++ is also fair and easy
   to deploy in hardware, implementable with commodity NICs and
   switches.




 The IETF Secretariat
 _______________________________________________
 rtgwg mailing list
rtgwg@ietf.org
https://www.ietf.org/mailman/listinfo/rtgwg