Re: [tcpm] [EXTERNAL] apparent ambiguities in draft spec for Hystart++ window start/end?

Yi Huang <huanyi@microsoft.com> Thu, 06 January 2022 02:08 UTC

Return-Path: <huanyi@microsoft.com>
X-Original-To: tcpm@ietfa.amsl.com
Delivered-To: tcpm@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 5956A3A13DA; Wed, 5 Jan 2022 18:08:16 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.575
X-Spam-Level:
X-Spam-Status: No, score=-2.575 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIMWL_WL_HIGH=-0.576, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, HTML_MESSAGE=0.001, HTTPS_HTTP_MISMATCH=0.1, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (1024-bit key) header.d=microsoft.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id mcTr1DszD_yy; Wed, 5 Jan 2022 18:08:11 -0800 (PST)
Received: from na01-obe.outbound.protection.outlook.com (mail-cusazon11020016.outbound.protection.outlook.com [52.101.61.16]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id B76ED3A13D8; Wed, 5 Jan 2022 18:08:08 -0800 (PST)
ARC-Seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=g1RZaso6KhV1ybvMYsqgkrdsyO/4YiQX0LBYSCbDvaa4sgpCxx00ywE+i0fjC6h/4wFgbm9ruvXmCL+O5YvYYyskBUe+YZrHSmL5uTSnSPWGkkya9fhm3tsZRG/nnULRK67x62xlBpUKHnCMKq8EcZzBYdTwwRnZKKh/pw+EwMMgNvIMx05Zje1dHUvIX8kSxzu5vvDMyG2X/5mExBH1BchD+L+68CFVKZl9E+qqWHRmnSCbbHO5wmibKaPVh8blBqLAobHnpuilGRuZ6lCvVC/UQgU1I79YXBA6K+pQ/HQn8251VXs43qDo9kAVFl9y1Bu11Oy60PPRGVwH6bk66Q==
ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-AntiSpam-MessageData-ChunkCount:X-MS-Exchange-AntiSpam-MessageData-0:X-MS-Exchange-AntiSpam-MessageData-1; bh=VRUSYEX9EH4db/FEmBZv20BRG5aY+avichz/dKmbP9A=; b=edAt4Zl0Xkik92JgWZ3FgBxqkTgbwQQoPVe/+GvBtyzvZ187ql4J5cMDUWKjb1Dih7hPF+A2FOqE/d+lyPtA9EQ//ffXcmdjj9iPKwe712rAl4iL8DpadNLSuCihBKfFazC7ULfbyfqO7vtEv48qNHmV6gdk+RBRSl5fZgkNs0Tq6XfNxRVEMhXQf0nEAPz5t0m1x8G245nCfDQhJ78UCT/py3vQb826bZ5QOTWgLjGYDKV8T8BpWuT+aNCXC7U4eCnBllMkkLPS/zs+Ft6QfQqGFHKSIOP9EdOiLsdQ8XO0selhrcpf/SXtai6W1x5T37fuubeGzbJuWpMtfvDdpw==
ARC-Authentication-Results: i=1; mx.microsoft.com 1; spf=pass smtp.mailfrom=microsoft.com; dmarc=pass action=none header.from=microsoft.com; dkim=pass header.d=microsoft.com; arc=none
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=selector2; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=VRUSYEX9EH4db/FEmBZv20BRG5aY+avichz/dKmbP9A=; b=GmGosLSIi6oHzjYy9enyxx0Q1Q5P4rMccFhJCdKo7SnXlNnc7JmIwmfT8hGqjDHiHc3iGbmAg/+0ZXo8IWk06NYXUHliS/qtaFNyIVS6FwrWQ0CBLI/dsEi6CAbzRZG3+6hbKiA1GusK0KepiFiWX3F5yQSMWs43dRkLqIPE5ZE=
Received: from CO1PR00MB1324.namprd00.prod.outlook.com (2603:10b6:303:15f::23) by MW2PR00MB0348.namprd00.prod.outlook.com (2603:10b6:302:9::19) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.4908.0; Thu, 6 Jan 2022 02:07:58 +0000
Received: from CO1PR00MB1324.namprd00.prod.outlook.com ([fe80::b926:ee67:e738:72d5]) by CO1PR00MB1324.namprd00.prod.outlook.com ([fe80::b926:ee67:e738:72d5%3]) with mapi id 15.20.4908.000; Thu, 6 Jan 2022 02:07:58 +0000
From: Yi Huang <huanyi@microsoft.com>
To: Neal Cardwell <ncardwell@google.com>, Praveen Balasubramanian <pravb@microsoft.com>, "draft-ietf-tcpm-hystartplusplus@ietf.org" <draft-ietf-tcpm-hystartplusplus@ietf.org>
CC: tcpm <tcpm@ietf.org>
Thread-Topic: [EXTERNAL] apparent ambiguities in draft spec for Hystart++ window start/end?
Thread-Index: AQHX5XhWw7SPZNpn3UCY/3Z5js3+M6xVeNgs
Date: Thu, 06 Jan 2022 02:07:58 +0000
Message-ID: <CO1PR00MB13244BCA19541DB6AC1BC872C34C9@CO1PR00MB1324.namprd00.prod.outlook.com>
References: <CADVnQym=Uq_wK-YTvEwSw7AkdzGA4pvuY8ibaT6z2Hyq2sZX=A@mail.gmail.com>
In-Reply-To: <CADVnQym=Uq_wK-YTvEwSw7AkdzGA4pvuY8ibaT6z2Hyq2sZX=A@mail.gmail.com>
Accept-Language: en-US
Content-Language: en-US
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
msip_labels: MSIP_Label_f42aa342-8706-4288-bd11-ebb85995028c_Enabled=True; MSIP_Label_f42aa342-8706-4288-bd11-ebb85995028c_SiteId=72f988bf-86f1-41af-91ab-2d7cd011db47; MSIP_Label_f42aa342-8706-4288-bd11-ebb85995028c_SetDate=2022-01-06T02:07:56.790Z; MSIP_Label_f42aa342-8706-4288-bd11-ebb85995028c_Name=General; MSIP_Label_f42aa342-8706-4288-bd11-ebb85995028c_ContentBits=0; MSIP_Label_f42aa342-8706-4288-bd11-ebb85995028c_Method=Standard;
authentication-results: dkim=none (message not signed) header.d=none;dmarc=none action=none header.from=microsoft.com;
x-ms-publictraffictype: Email
x-ms-office365-filtering-correlation-id: 0017637a-8da6-441d-142c-08d9d0b95c14
x-ms-traffictypediagnostic: MW2PR00MB0348:EE_
x-ms-exchange-atpmessageproperties: SA|SL
x-microsoft-antispam-prvs: <MW2PR00MB03488F09F88248E8AD0ADAF3C34C9@MW2PR00MB0348.namprd00.prod.outlook.com>
x-ms-oob-tlc-oobclassifiers: OLM:10000;
x-ms-exchange-senderadcheck: 1
x-ms-exchange-antispam-relay: 0
x-microsoft-antispam: BCL:0;
x-microsoft-antispam-message-info: dEkFlLGxiWx3JQq23UBNtqnaZHRtIOPB/1VtuTrqq6qm4GpNGkouNyyZdigrs13CT3rnHubxKphEA85NRkBSBnDIg0+SFPCfq7dW6exL0WNEVdjIlLfRg7N6H5KOJUvSW2/p9qiEsh9YkMWhdXwnCPJbtnNB964Ea5aUn214oaK0oNimfnCtFBDUJ2yiYn3biyJeR3CeWUbDE0B7SDGwxOglMvur3PdJN08kxmDHZxymGNldJV508hc+fmqn886abHUS6EKo/RXNg1iFfD/+WJBA9qK+mNx2lq/nTc5GpUb5+s+yrpUzfXe0YBNSSvyiXMmA0WxhvovNek6dS5tj9+eqUMbwPzcdRN44b8EIP65TsOFZOK0FdTOOAc/yCsrDCR5MMNzhuXjJMMGcHr/vYvr15WgZmozUgCamY96BVpRgIwlYKh9RfpY7eaqxj73cAtvrKTlL1D7VxH7jRGn23U1TPwzt+8VNMkjP43MUKcqaZITN8fjLUfSKCvEVkXY6fW68M9ARnR4NQ3I6K5YBa1uOxiFCcvsvvpbtBfp7273j56KFszZ6lcDAVP3BVU2odcAAVjyVVTJFTpU3RNMePP8cyUxsuFgy29yq5Ib2xudZUCxiwgLNijsZVj9p0UJ1a9EEwKmlUAI/VEh5DLpsWm3IOVS0Qo2Ziu7DNXq8rlxB1fI+cJxTrz86KKh14irzO0mBervLKyWpt504nAZkYiIiEs6XTRtUURd8OThpIFoodJn5UbeH/isWwpomPWtQA3Ms7mt76/J2M44NsVrC8DiLkZb/GBBbpQHd9ooqF0PzImBia6+suv7ZOXIruNk8CJCuHDoAce5Go+GDtRB7fybcmlAedI8pRaAl2GS1Yth5Mm+G88hUyTl+dSwBnFpO
x-forefront-antispam-report: CIP:255.255.255.255; CTRY:; LANG:en; SCL:1; SRV:; IPV:NLI; SFV:NSPM; H:CO1PR00MB1324.namprd00.prod.outlook.com; PTR:; CAT:NONE; SFS:(13230001)(4636009)(366004)(508600001)(4326008)(19627405001)(66476007)(66446008)(82950400001)(83380400001)(2906002)(8936002)(8676002)(66946007)(66556008)(82960400001)(186003)(55016003)(64756008)(52536014)(5660300002)(966005)(122000001)(71200400001)(86362001)(38100700002)(76116006)(316002)(7696005)(9686003)(53546011)(38070700005)(110136005)(8990500004)(166002)(6506007)(10290500003)(33656002)(20210929001); DIR:OUT; SFP:1102;
x-ms-exchange-antispam-messagedata-chunkcount: 1
x-ms-exchange-antispam-messagedata-0: yaol6w2e+j+0aA5xsgwMkms69PrGozRcrotPSDfnEeB6tvz7x+WE/qabghe6qUWtG0Vt8Ds/B9ZkpKmX+m+wverEwMoMTCsDcuhWp87nMz3A5cZdoCzwosOb6oM1h1FxNDEySR2E1HLPDT6jqfdoW5HTQ8q5BML7Oxzkr88xTvvS7SP/UEVoe1NoI49vF//ftEGpmwg34VQ5LlXn+R0EYuH9HXlrgYJ0r4pFN+MyUYR+4NIpyD3M2FpEe3C+HqWgfrMkBWUeVL9Yj/hM+rWGd4TxXKaHiREOjh75pBy6u6vE0PrT951EPyRR1Jf9c2sVhuYW8gTsIp84NoZsKu4zh41e8xuRb3ykN/kyF4RmRk+6JJ/05xQuYcY3IQ7G6PnUeRet2qP8eg6PpBCIKgYluX+ho861n4g0xlaBL4ItsMTeZBBCpOea04xGwctBvRHGbZbCRpAJEZZh6j/XHW8kjHIO1a8Txd4wFcgaBzoQSLZBJUDqi4Qoq4nqSraGGQrFfltqpEVqR0iTbKmN+Sx3LZ7iW9HgdvKWSdqghI9xd0YB163Fex4RlNp7J/xzXHAKCIf/WRKmmRIlrowYfB+s6O0GUg1bQFwhcJzlEHHmIPzThrl5i2vxAXbBRudat8h3LB7wpioJE5yVzUNokf56I8ul54FjsDATQMe/Z/BeqWQ76YeooUSh/Apfqkipl/jF2IJwC0rRtXPW2Z1SPq9cK+YKRbPYz9GaGFY0qRq7joTcwClSrLlCsYFb4S+3xVtZeKdSQFyNp5I7slxQdfc4h4e4xVWBRj5weP23klxvBL2zLhlUh00t2XL/tcXKNPce6ibB3uhNCuVurmI0jTTNB4Bj5ljOuBgNIyWRInFJPvw8ZYgHD6Pl/Hx0sNoadY3mhFNMZB5Ve4UBpZkxToy54QdLbcjh3qKjpFWwwz3LmclhXhkttq/vYE/O6PNHp0dw9JD6KNXOyus3eCyRnIU1/GLNuoBMSLAvIijsi0UeXKFOz/YOYuQ6Gw0Ql7jWQXn8hiQDAZRjswl55bLx5Q4WkmR5+PUanOzqva+tAR4ST0CRwDS4U5uqqpDrRFEflR/e3xYLz+3b5VsSCCbWaMdpLS7AecM4xRA4Ct0u08ZgR8GdiNdOHyUobgJNi8Q5HJzRxuXR0vGpvhg/QEvoaQEdWFD2x+e2d7Hz6D+N9Zlhl2y5+n+ufJSm05MxeExhSKMLanTLtYn8Z6QtQAMzKM3C7TOAiBfJoPCrR9L0l2TLLpZ5t3Wbk+3w9MLJ/EcHgPMMj1ezX8dKlJ8Gn5qmcass1ON9bgwCUOsyoUOLG9Rs2nlTjgOktBLrWMrPN+Iy1mvsILOwjF07qmbOsOHOSx11DMaH8HuHkPpuEgHNbPpJJYP6KHRtnTP6gbPMs5awkcZq6Ken27H1kGnfiJrl6fLZnPE0RABO1SDfhkOpbsIqmh1//BV8FiZ+Wsw7PMdUiPBeL8pVCXyr52dtLUJolfXQvNHURAovsoBoCGewpZ6YlG/m/jx2F2gJ90VNmqoVC4EufoRUv5R9iLjmJQddU7+MBi02afWpTWUfISRmu4E0eE7bQgfYhkZyKlovMIHv5M/fqWYAqeReWcRDN/Sawg5PRQFIadLs8xYRT6kD0pD68CSgijmYWyxwZr6Jp6CPodll7OwELKh7zdgsCoMQsvbiOA==
Content-Type: multipart/alternative; boundary="_000_CO1PR00MB13244BCA19541DB6AC1BC872C34C9CO1PR00MB1324namp_"
MIME-Version: 1.0
X-OriginatorOrg: microsoft.com
X-MS-Exchange-CrossTenant-AuthAs: Internal
X-MS-Exchange-CrossTenant-AuthSource: CO1PR00MB1324.namprd00.prod.outlook.com
X-MS-Exchange-CrossTenant-Network-Message-Id: 0017637a-8da6-441d-142c-08d9d0b95c14
X-MS-Exchange-CrossTenant-originalarrivaltime: 06 Jan 2022 02:07:58.0533 (UTC)
X-MS-Exchange-CrossTenant-fromentityheader: Hosted
X-MS-Exchange-CrossTenant-id: 72f988bf-86f1-41af-91ab-2d7cd011db47
X-MS-Exchange-CrossTenant-mailboxtype: HOSTED
X-MS-Exchange-CrossTenant-userprincipalname: TUDNKKXlbbZ9LcQBHFqP36aegxASdFTGT3x/+Zjk/vi6lzJ8nRdAtF49SqXDggr8jOE6itmsdSZuiDZATPKCjg==
X-MS-Exchange-Transport-CrossTenantHeadersStamped: MW2PR00MB0348
Archived-At: <https://mailarchive.ietf.org/arch/msg/tcpm/fu4xMhWFdr6Ha7PvmOnKTOisrYo>
Subject: Re: [tcpm] [EXTERNAL] apparent ambiguities in draft spec for Hystart++ window start/end?
X-BeenThere: tcpm@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: TCP Maintenance and Minor Extensions Working Group <tcpm.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/tcpm>, <mailto:tcpm-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/tcpm/>
List-Post: <mailto:tcpm@ietf.org>
List-Help: <mailto:tcpm-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/tcpm>, <mailto:tcpm-request@ietf.org?subject=subscribe>
X-List-Received-Date: Thu, 06 Jan 2022 02:08:17 -0000

Hi Neal,

Thanks for the review and sorry for the delay!

1. Our Hystart++ implementation actually does (b) and the draft intends (b) too.
>> Normally when I read that a sequence number X "is ACKed" I think of the (b) case, i.e. SND.UNA >= X.
I am somewhat confused by this because SND.UNA means the first byte of data that has not been ACKed. If it equals X, then X has not been ACKed yet.

We tried (a) in the past and we had issues with it. With ACK compression, it's common (at least for Windows) to see only 1 or a few ACKs for a whole window of packets especially in the first couple of RTT rounds. In this case, a round will very likely last for 2 RTT rounds (when it's waiting for the only ACK for the next window worth of data.)

We are not happy with (b) either. We set windowEnd to SndNxt when a round ends but SndNxt was updated by the previous ACK that triggered new data, which means the ACK that covers windowEnd (AckNo >= windowEnd) might come in anytime leading to marking rounds inaccurately.

We have been experimenting with a new method to mark rounds- set windowEnd to SndUna + min(CWnd, advertised window from the peer) when a round ends. We have been using it in LEDBAT++ already and rounds are aligned with RTTs more consistently. How does this sound?

2. We have not done any targeted testing of app-limited scenarios unfortunately. Our implementation does assume that the end of a previous round is the start of next round. I understand the situation you described where the next round might actually start a few days later and we will be comparing current round's minRTT with a stale last round's minRTT. One mitigation for this issue would be to clear last round's minRTT when idle is detected. E.g. if the time diff between two consecutive rounds is greater than a threshold, we skip the current round.

3. I agree it's not true in app-limited scenarios and we should remove it.

Thanks,

Yi
________________________________
From: Neal Cardwell <ncardwell@google.com>
Sent: Monday, November 29, 2021 3:23 PM
To: Praveen Balasubramanian <pravb@microsoft.com>; draft-ietf-tcpm-hystartplusplus@ietf.org <draft-ietf-tcpm-hystartplusplus@ietf.org>
Cc: tcpm <tcpm@ietf.org>
Subject: [EXTERNAL] apparent ambiguities in draft spec for Hystart++ window start/end?

Hi,

Thanks again for the nice Hystart++ draft!

Below are three quick comments about what AFAICT are some ambiguities about the spec for detecting and handling window starts/ends, plus a suggestion for wording that section:

(1) Currently, the draft at:
  https://datatracker.ietf.org/doc/html/draft-ietf-tcpm-hystartplusplus#section-4.2<https://nam06.safelinks.protection.outlook.com/?url=https%3A%2F%2Fdatatracker.ietf.org%2Fdoc%2Fhtml%2Fdraft-ietf-tcpm-hystartplusplus%23section-4.2&data=04%7C01%7Chuanyi%40microsoft.com%7Cc36d089c6c1747a652fe08d9b38f5212%7C72f988bf86f141af91ab2d7cd011db47%7C1%7C0%7C637738251037372864%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000&sdata=p3NISukUXfgsSbNFq6CTmjwhX8jdbXCr98MYgSQrWz0%3D&reserved=0>
says:

      When windowEnd is ACKed, the current round ends and windowEnd is
      set to SND.NXT

IMHO this is a bit ambiguous as to whether it means:
(a)  if (SND.UNA > windowEnd)
or
(b)  if (SND.UNA >= windowEnd)

Normally when I read that a sequence number X "is ACKed" I think of the (b) case, i.e. SND.UNA >= X. But for application-limited flights of data, I think that (b) interpretation would lead to the Hystart++ logic for the start/end of rounds potentially being executed twice for each flight of data: once for the first segment in the flight, and once for the last segment in the flight (because while processing ACKs for an application-limited flight the value of SND.NXT may not change).

So I imagine the Hystart++ algorithm actually intends the (a) semantics. The (a) approach is what Linux TCP CUBIC Hystart does, and the AFAICT  (a) semantics give the correct once-per-round behavior even for application-limited flights. I think it would be good to express that aspect of the algorithm more precisely to indicate the approach that is intended.

(2) A second, related, issue: for per-round behavior section 4.2 of the algorithm specifies how to detect the end of a round ("When windowEnd is ACKed, the current round ends"), but only specifies what action is to be taken at the start of a round ("At the start of each round during standard slow start ([RFC5681]) and CSS:").

I suppose the text is implicitly considering that the start of one round is the same as the end of the previous round. For bulk data that assumption might be acceptable; it might be fine to think of the start of one round as the same as the end of the previous round. But for application-limited flights of data that assumption does not hold, and the start of a round and end of a round are very different events with very different timing and characteristics. So for the common case of application-limited flights of data currently the draft does not specify how to detect the "start of each round" event that triggers the per-round actions; it only defines how to detect the "end" of a round.

(3)  A third, related, issue: the text in section 4.2 says:


   A round is chosen to be approximately the Round-Trip Time (RTT).

For application-limited flights this text is a poor or confusing fit. For application-limited flights,  the duration of time between two successive rounds, aka the length of a round (IMHO), can be arbitrarily long; it can be much longer than the RTT and perhaps hours or even days. I would suggest just dropping that sentence, to avoid confusion.

---

(a suggestion):

To clear up these ambiguities I would suggest tweaking the existing text from:

(before):
"""

   A round is chosen to be approximately the Round-Trip Time (RTT).  We
   recommend that rounds be measured using sequence numbers.  Round can
   be approximated using sequence numbers as follows:

      Define windowEnd as a sequence number initialize to SND.UNA

      When windowEnd is ACKed, the current round ends and windowEnd is
      set to SND.NXT

   At the start of each round during standard slow start ([RFC5681<https://nam06.safelinks.protection.outlook.com/?url=https%3A%2F%2Fdatatracker.ietf.org%2Fdoc%2Fhtml%2Frfc5681&data=04%7C01%7Chuanyi%40microsoft.com%7Cc36d089c6c1747a652fe08d9b38f5212%7C72f988bf86f141af91ab2d7cd011db47%7C1%7C0%7C637738251037372864%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000&sdata=d8QjJJDqAlTEi4xYVNByJm3Dk7tYjY%2BA2wv7w%2BanKck%3D&reserved=0>]) and
   CSS:

"""

...to be something more like the following:

(after):
"""

   Hystart++ measures rounds using sequence numbers, as follows:

      Define windowEnd as a sequence number initialized to SND.UNA.

      Upon receiving an ACK, if (SND.UNA > windowEnd) then
      a new round has started and the sender sets windowEnd to
      SND.NXT.

   At the start of each new round during standard slow start ([RFC5681])
   and CSS:
"""

best regards,
neal