Re: [tsvwg] 回复:Takeaways for HPCC++

"Scharf, Michael" <Michael.Scharf@hs-esslingen.de> Mon, 02 November 2020 12:53 UTC

Return-Path: <Michael.Scharf@hs-esslingen.de>
X-Original-To: tsvwg@ietfa.amsl.com
Delivered-To: tsvwg@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id B70D73A0F0C; Mon, 2 Nov 2020 04:53:07 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.996
X-Spam-Level:
X-Spam-Status: No, score=-1.996 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, HTML_MESSAGE=0.001, HTTPS_HTTP_MISMATCH=0.1, SPF_HELO_NONE=0.001, SPF_NONE=0.001, URIBL_BLOCKED=0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (1024-bit key) header.d=hs-esslingen.de
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id T13zHbmaeriN; Mon, 2 Nov 2020 04:53:05 -0800 (PST)
Received: from mail.hs-esslingen.de (mail.hs-esslingen.de [134.108.32.78]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 831B13A0F06; Mon, 2 Nov 2020 04:53:03 -0800 (PST)
Received: from localhost (localhost.localdomain [127.0.0.1]) by mail.hs-esslingen.de (Postfix) with ESMTP id 227EE25A15; Mon, 2 Nov 2020 13:53:02 +0100 (CET)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=hs-esslingen.de; s=mail; t=1604321582; bh=qG9ndRMdptbBmaPkmBgfiJDCwuVnp0PP6nGKRAASlMQ=; h=From:To:CC:Subject:Date:References:In-Reply-To:From; b=KgmgchpxY68rpEhKTwbhmHaTpJaH6OS1KPz3ha2E390vZRZq2+8BToYTbA9Jtr236 qjMiGfgtd40X+PruZZoDqjMaA3ksllxiFHWqOTyFq1poT8jIC/q5pBF0dtYfU4GcCJ ESThf77GQ7cc+KvRZHJhaDKpYOFA8B7vbjZz1l7s=
X-Virus-Scanned: by amavisd-new-2.7.1 (20120429) (Debian) at hs-esslingen.de
Received: from mail.hs-esslingen.de ([127.0.0.1]) by localhost (hs-esslingen.de [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 84H4zibjrLfX; Mon, 2 Nov 2020 13:53:00 +0100 (CET)
Received: from rznt8202.rznt.rzdir.fht-esslingen.de (rznt8202.hs-esslingen.de [134.108.48.165]) (using TLSv1.2 with cipher AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by mail.hs-esslingen.de (Postfix) with ESMTPS; Mon, 2 Nov 2020 13:53:00 +0100 (CET)
Received: from rznt8202.rznt.rzdir.fht-esslingen.de (134.108.48.165) by rznt8202.rznt.rzdir.fht-esslingen.de (134.108.48.165) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256) id 15.1.1979.3; Mon, 2 Nov 2020 13:53:00 +0100
Received: from rznt8202.rznt.rzdir.fht-esslingen.de ([fe80::aca4:171a:3ee1:57e0]) by rznt8202.rznt.rzdir.fht-esslingen.de ([fe80::aca4:171a:3ee1:57e0%3]) with mapi id 15.01.1979.006; Mon, 2 Nov 2020 13:53:00 +0100
From: "Scharf, Michael" <Michael.Scharf@hs-esslingen.de>
To: Richard Li <richard.li@futurewei.com>, "Rui, Miao" <miao.rui@alibaba-inc.com>, tsvwg <tsvwg@ietf.org>
CC: Barak Gafni <gbarak@mellanox.com>, tsvwg-chairs <tsvwg-chairs@ietf.org>
Thread-Topic: [tsvwg] 回复:Takeaways for HPCC++
Thread-Index: AQHWiBkX5UP5xQKJq0Cr7TYhOkAYu6mtkk2AgAavHQCAANA/sA==
Date: Mon, 02 Nov 2020 12:53:00 +0000
Message-ID: <3ccd7022d25f4d059e2024db995ec7b2@hs-esslingen.de>
References: <725865b7-b8a5-4099-8019-e8c43c84f9b4.miao.rui@alibaba-inc.com> <4331364e-306f-41d0-b867-7dbc9aaa6bcc.miao.rui@alibaba-inc.com> <3f0684890ec9401080c26ccc9fa78ea6@hs-esslingen.de> <BYAPR13MB227942191D509AB9636B112487100@BYAPR13MB2279.namprd13.prod.outlook.com>
In-Reply-To: <BYAPR13MB227942191D509AB9636B112487100@BYAPR13MB2279.namprd13.prod.outlook.com>
Accept-Language: de-DE, en-US
Content-Language: de-DE
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
x-originating-ip: [134.108.48.168]
Content-Type: multipart/alternative; boundary="_000_3ccd7022d25f4d059e2024db995ec7b2hsesslingende_"
MIME-Version: 1.0
Archived-At: <https://mailarchive.ietf.org/arch/msg/tsvwg/OU4llyISybaEN9FVK8Xhgag8ebE>
Subject: Re: [tsvwg] 回复:Takeaways for HPCC++
X-BeenThere: tsvwg@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: Transport Area Working Group <tsvwg.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/tsvwg>, <mailto:tsvwg-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/tsvwg/>
List-Post: <mailto:tsvwg@ietf.org>
List-Help: <mailto:tsvwg-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/tsvwg>, <mailto:tsvwg-request@ietf.org?subject=subscribe>
X-List-Received-Date: Mon, 02 Nov 2020 12:53:08 -0000

RFC 6077 Section 3.1 summarizes challenges for all congestion control schemes that rely on network support known at the time of publicatiob.

In addition to the general issues, RFC 6077 Section 3.1 also includes some specific literature pointers to known issues for XCP. As far as I recall, some of the issues in XCP were fairly fundamental. The corresponding papers are referenced in the RFC.

Before publication of RFC 6077, I have myself worked with XCP simulations and XCP hardware implementations. In my PhD thesis I have compared, amonst others, XCP with a simpler solution developed in the IETF (Quick-Start). My personal findings can be found in Sections 4, 5 and 6 of my Phd thesis:

http://content.ikr.uni-stuttgart.de/Content/Publications/Archive/Sf_Diss_40112.pdf

Specifically, I ran into issues when a router supporting XCP cannot correctly determine the capacity of a link (see section 6.5.3.3 in the thesis).

RFC 6077 explains why it is not trivial to actually know what link capacity in reality (i.e., outside simulations). Examples for such cases include virtualization, tunneling, brownfield deployments with legacy devices, etc.

I don’t understand HPCC++ well enough to know how this problem is solved in actual deployments. If you want to repeat the experiment I did in my PhD thesis for XCP (cf. Section 6.5.3.3), you would have to run e.g. experiments in which the actual link capacity is either by factor 10 larger or by factor 10 smaller than the capacity assumed in the algorithm (I guess it is the variable B_j in the draft). If a congestion control algorithm shall be robust, IMHO it would have to be able to deal with such situations. According to my own results, XCP did not work well in such cases of capacity over- or under-estimation.

BTW, the I-D „draft-irtf-panrg-what-not-to-do“ also discusses related lessons learnt.

Michael


From: Richard Li <richard.li@futurewei.com>
Sent: Monday, November 2, 2020 1:44 AM
To: Scharf, Michael <Michael.Scharf@hs-esslingen.de>; Rui, Miao <miao.rui@alibaba-inc.com>; tsvwg <tsvwg@ietf.org>
Cc: Barak Gafni <gbarak@mellanox.com>; tsvwg-chairs <tsvwg-chairs@ietf.org>
Subject: RE: [tsvwg] 回复:Takeaways for HPCC++

> it is has specifically taken into account lessons learnt from protocols such as XCP

Would you mind sharing what lessons exactly you learnt from XCP?

Thanks,

Richard




From: tsvwg <tsvwg-bounces@ietf.org<mailto:tsvwg-bounces@ietf.org>> On Behalf Of Scharf, Michael
Sent: Wednesday, October 28, 2020 10:48 AM
To: Rui, Miao <miao.rui@alibaba-inc.com<mailto:miao.rui@alibaba-inc.com>>; tsvwg <tsvwg@ietf.org<mailto:tsvwg@ietf.org>>
Cc: Barak Gafni <gbarak@mellanox.com<mailto:gbarak@mellanox.com>>; tsvwg-chairs <tsvwg-chairs@ietf.org<mailto:tsvwg-chairs@ietf.org>>
Subject: Re: [tsvwg] 回复:Takeaways for HPCC++

Hi all,
An editorial comment: ICCRG has published RFC 6077 about ten years ago. Obviously, the document may be a bit dated, but it is has specifically taken into account lessons learnt from protocols such as XCP or Quick-Start. Not all content of RFC 6077 may apply to current data centers. Nonetheless, some of the content in RFC 6077 is quite generic, and it could be useful to better explain in a description of HPC++ how this architecture addresses the challenges.
Best regards
Michael (as co-author of RFC 6077)

From: tsvwg <tsvwg-bounces@ietf.org<mailto:tsvwg-bounces@ietf.org>> On Behalf Of Rui, Miao
Sent: Friday, September 11, 2020 10:53 AM
To: Miao, Rui(缪睿) <miao.rui@alibaba-inc.com<mailto:miao.rui@alibaba-inc.com>>; tsvwg <tsvwg@ietf.org<mailto:tsvwg@ietf.org>>
Cc: Barak Gafni <gbarak@mellanox.com<mailto:gbarak@mellanox.com>>; tsvwg-chairs <tsvwg-chairs@ietf.org<mailto:tsvwg-chairs@ietf.org>>
Subject: [tsvwg] 回复:Takeaways for HPCC++

Dear WG members,

At IETF 108, we presented HPCC++, a new architecture for congestion control designed for high-speed networks. We have modified the draft according to the feedback from the meeting. The newer version is available at https://tools.ietf.org/html/draft-pan-tsvwg-hpccplus-02<https://nam11.safelinks.protection.outlook.com/?url=https%3A%2F%2Ftools.ietf.org%2Fhtml%2Fdraft-pan-tsvwg-hpccplus-02&data=04%7C01%7Crichard.li%40futurewei.com%7C5f95b2910d2b41a59fe208d87b699b9a%7C0fee8ff2a3b240189c753a1d5591fedc%7C1%7C1%7C637395040817739023%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C2000&sdata=Q%2FDwV%2Bm73NlD3m%2BmYoOIA0OGUUlgtUYCUFz2uwAC3Qk%3D&reserved=0>

Thanks, and please feel free to let us know if you have more comments.

Thanks,
Rui

------------------------------------------------------------------
发件人:Miao, Rui(缪睿) <miao.rui@alibaba-inc.com<mailto:miao.rui@alibaba-inc.com>>
发送时间:2020年8月2日(星期日) 11:21
收件人:tsvwg <tsvwg@ietf.org<mailto:tsvwg@ietf.org>>
抄 送:tsvwg-chairs <tsvwg-chairs@ietf.org<mailto:tsvwg-chairs@ietf.org>>; Wesley Eddy <wes@mti-systems.com<mailto:wes@mti-systems.com>>; "Pan, Rong" <rong.pan@intel.com<mailto:rong.pan@intel.com>>; Barak Gafni <gbarak@mellanox.com<mailto:gbarak@mellanox.com>>
主 题:Takeaways for HPCC++


Hello TSVWG members,

It was our pleasure to present HPCC++ in the IETF-108 meeting this week. We appreciate your valuable feedback. Our current draft is located at https://tools.ietf.org/html/draft-pan-tsvwg-hpccplus-01<https://nam11.safelinks.protection.outlook.com/?url=https%3A%2F%2Ftools.ietf.org%2Fhtml%2Fdraft-pan-tsvwg-hpccplus-01&data=04%7C01%7Crichard.li%40futurewei.com%7C5f95b2910d2b41a59fe208d87b699b9a%7C0fee8ff2a3b240189c753a1d5591fedc%7C1%7C1%7C637395040817748978%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C2000&sdata=mrAKAuPvwwdWCBghjQDoMYqNftztOQtCmGJivkqwgpE%3D&reserved=0>.
Here, we have summarized a list of takeaways to further improve the draft.

1. HPCC++ is a generic congestion control scheme that works for high-speed networks. It requires to have window limit and ACK self-clocking, which is out of the definition of RoCEv2 but should instead work well for TCP/iWRAP. In addition, HPCC++ conforms to the paradigm of existing TCP congestion control, such as switch marking, receiver feedback, and sender enforcement. The major changes are using inband telemetry as the congestion feedback and it is incremental. In the next revision, we describe clearly how it works for TCP/UDP and related protocols.

2. HPCC++ works for both data centers and the Internet. We will add descriptions and results for the Internet-scale size and latency in the next revision.

Thanks again for the comments. Please feel free to let us know if you have more thoughts.

All the best,
Rui, Rong, Barak