[tsvwg] Re: WGLC for Convergence of Congestion Control from Retained State (draft-ietf-tsvwg-careful-resume)

Gorry Fairhurst <gorry@erg.abdn.ac.uk> Mon, 10 February 2025 20:04 UTC

Return-Path: <gorry@erg.abdn.ac.uk>
X-Original-To: tsvwg@ietfa.amsl.com
Delivered-To: tsvwg@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 588DDC1F58AB for <tsvwg@ietfa.amsl.com>; Mon, 10 Feb 2025 12:04:34 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.003
X-Spam-Level:
X-Spam-Status: No, score=-2.003 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, HTML_MESSAGE=0.001, RCVD_IN_DNSWL_BLOCKED=0.001, RCVD_IN_VALIDITY_RPBL_BLOCKED=0.001, RCVD_IN_VALIDITY_SAFE_BLOCKED=0.001, RCVD_IN_ZEN_BLOCKED_OPENDNS=0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, T_SCC_BODY_TEXT_LINE=-0.01, URIBL_DBL_BLOCKED_OPENDNS=0.001, URIBL_ZEN_BLOCKED_OPENDNS=0.001] autolearn=unavailable autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=erg.abdn.ac.uk
Received: from mail.ietf.org ([50.223.129.194]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 2R4-k7ix2LA4 for <tsvwg@ietfa.amsl.com>; Mon, 10 Feb 2025 12:04:30 -0800 (PST)
Received: from pegasus.erg.abdn.ac.uk (pegasus.erg.abdn.ac.uk [137.50.19.135]) by ietfa.amsl.com (Postfix) with ESMTP id AF162C09E1BA for <tsvwg@ietf.org>; Mon, 10 Feb 2025 12:04:23 -0800 (PST)
Received: from [192.168.1.191] (fgrpf.plus.com [212.159.18.54]) by pegasus.erg.abdn.ac.uk (Postfix) with ESMTPSA id 5B2151B002A4; Mon, 10 Feb 2025 20:04:17 +0000 (GMT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=erg.abdn.ac.uk; s=default; t=1739217859; bh=FRc2MSpAfMNPxGTie3BICai0+pNybbY3Quw2dcFqOus=; h=Date:Subject:To:References:From:Cc:In-Reply-To:From; b=K77S4S6T1P2H28XfUD2bUtgVO5gyqImiob6vWyJI8stxWWW/myYn2OCkw8/8uLJyI 48bOxh9x7x+vGjX+aMYpn+2mondiYBptahwrtN1K6kIJlJWOzYQrkcP6+Iyt0P3U0N aN1gts7Xno783eW0z4y1UFGT33epwqA/HEIQGKWlO+Nl0AjgPBLzx0QFlGl4q9ZtD+ TVP8XKjo84lqBpuFZuad4kmKuGT0N0i/iuF1OahPg22bZvHSGkn5OR011QNiQcpqjV 5SHIVAflOOk2kR3QjUZzTuSfqocs9CakR1/nX8W1LbroQI2TZOx03pMx7r7/LZ89vh 4EyzffuMdl7RQ==
Content-Type: multipart/alternative; boundary="------------KqBHPOb5YCy0DPE6z1S0p30g"
Message-ID: <3f679501-8442-416d-8c5e-a20395061720@erg.abdn.ac.uk>
Date: Mon, 10 Feb 2025 20:04:16 +0000
MIME-Version: 1.0
User-Agent: Mozilla Thunderbird
Content-Language: en-GB
To: Mirja Kuehlewind <mirja.kuehlewind=40ericsson.com@dmarc.ietf.org>
References: <CAOYVs2psDQEAwB0v+ThLpdBGn917_+4z2w8wBROfb+rSRVtwcA@mail.gmail.com> <CAEh=tce1yPBtbXjWN__VMSW0t69uXRcirqDZsx7RD7ezAddwoQ@mail.gmail.com> <E3CF8CE5-917A-4AE7-BE2A-78DDFBDCE45D@ericsson.com>
From: Gorry Fairhurst <gorry@erg.abdn.ac.uk>
Organization: UNIVERSITY OF ABERDEEN
In-Reply-To: <E3CF8CE5-917A-4AE7-BE2A-78DDFBDCE45D@ericsson.com>
Message-ID-Hash: I2F2SX27PULDFQNGFPGOUADASSZKMWDE
X-Message-ID-Hash: I2F2SX27PULDFQNGFPGOUADASSZKMWDE
X-MailFrom: gorry@erg.abdn.ac.uk
X-Mailman-Rule-Misses: dmarc-mitigation; no-senders; approved; emergency; loop; banned-address; member-moderation; header-match-tsvwg.ietf.org-0; nonmember-moderation; administrivia; implicit-dest; max-recipients; max-size; news-moderation; no-subject; digests; suspicious-header
CC: "tsvwg@ietf.org" <tsvwg@ietf.org>
X-Mailman-Version: 3.3.9rc6
Precedence: list
Subject: [tsvwg] Re: WGLC for Convergence of Congestion Control from Retained State (draft-ietf-tsvwg-careful-resume)
List-Id: Transport Area Working Group <tsvwg.ietf.org>
Archived-At: <https://mailarchive.ietf.org/arch/msg/tsvwg/VKMZtKEshuvY5BQkRO5l8iVgCWs>
List-Archive: <https://mailarchive.ietf.org/arch/browse/tsvwg>
List-Help: <mailto:tsvwg-request@ietf.org?subject=help>
List-Owner: <mailto:tsvwg-owner@ietf.org>
List-Post: <mailto:tsvwg@ietf.org>
List-Subscribe: <mailto:tsvwg-join@ietf.org>
List-Unsubscribe: <mailto:tsvwg-leave@ietf.org>

On 10/02/2025 14:13, Mirja Kuehlewind wrote:
>
> Hi all,
>
> sorry for my slightly late review. I think this document is very clear 
> and ready for publication. I only have one high-level comment that 
> could maybe need some additional clarification and a few minor and 
> fully editorial comments below.
>
Thanks Mirja for reading  this - it really helps to have people offering 
comments !!!

I've just posted rev -13, to try to address the WGLC comments so far.

> I think it is not fully clear what is meant exactly by “normal rules 
> for the current congestion controller“ e.g. here:
>
>    *  Validating Phase (Updating CWND): The CWND is updated using the
>       normal rules for the current congestion controller, this typically
>       allows CWND to be increased for each received acknowledgement that
>       indicates a packet has been successfully sent across the path.
>
> Is that meant to be Slow Start behavior or Congestion Avoidance. More 
> generally I think the document could be more explicit about slow 
> start, e.g. if you detect congestion during validation and exit into 
> normal mode, this means going into congestion avoidance because 
> congestion would have ended slow start. However, would be good to 
> spell this out clearly.
>
Yes well this is harder than it should be. So first, I did add more text 
on whether this is SS or CA.

Safe Retreat is actually odd. The problem is that the loss that triggers 
congestion could have been an arbitrary overload of the path - it's not 
just a x2 overshoot as in SS. That requires a large reduction, based on 
the partial knowledge the sender has at the time when congetsion is 
detected.

Long answer: Even when all the loss is recovered, the sender still does 
not know whether it used capacity which would have fairly been someone 
elses, hence we reset ssthresh on exit to the detected bottleneck rate, 
but still allow ss to grow the cwnd towards that bottleneck rate, in 
case there are other flows also sharing that bottleneck.

> Related to this, this doc says in the intro:
>
> “The present document specifies a CC mechanism, called Careful Resume,…”
>
> I think this is more confusing than helpful. It really doesn’t place 
> any congestion control. It rather can be used together with all kinds 
> of congestion controls.
>
I see it as a mechanism, not an algorithm (with many mechanisms and 
rules) - but usage of language varies, so I tried to adapt the text a 
little in rev -13.
>
> Some more editorial comments:
>
> I find some of the information in sec 4 a bit random or even 
> redundant. Especially section 4.5.  on Implementation Notes for using 
> BBR seems suddenly very detailed and I would have more likely expected 
> that in the appendix. Also on BBR in general: of course it is good to 
> mention in various places, because it’s quite different than 
> loss-based congestion control, however, maybe it would be better to 
> generalize this and talk about rate-based congestion control more in 
> general and name BBR only as an example.**
>
I'm really grateful to all those who helped accumulate this guidance for 
BBR, but I see what you say and I have gathered much more of the 
BBR-specific text in one place as an appendix as you suggest. I 
personally like it better this way, but I'm OK if people see better ways 
to present this.
>
> One minor editorial comment in Sec 1.5 (and also 4.2 actually):
>
> „e.g., using the recommended maximum IW for the first
>
>       RTT of transmitting data [RFC9000].“
>
> I think this should be RFC9002, and potentially also RFC6928.
>
Indeed it should be section 7.2 of RFC9002; and also RFC 6928, I'll 
check next PR (which likely will be soon).
>
> Also there are 2-3 places in the document where it talks about QUIC 
> specifically. However, I wonder if it really should only talk about 
> QUIC in these places or also say something about TCP as well.
>
OK. I think I have fixed in the next PR.
>
> And one nit in section 4.2 (if not already corrected):
>
> A sender also verifies that the initial data was acknowledged.
>    (i.e., loss could otherwise could be indicative of persistent
>    congestion).
>
> -> Remove “.” before the bracket; and two times “could” in the brackets.
>
Thanks - now fixed in the PR.
>
> Mirja
>
I'll try to work out how to handle the comments we haver received on 
qlog and then make another rev.

Gorry

> *From: *Zaheduzzaman Sarker <zahed.sarker.ietf@gmail.com>
> *Date: *Wednesday, 29. January 2025 at 18:45
> *To: *Marten Seemann <martenseemann@gmail.com>
> *Cc: *tsvwg <tsvwg@ietf.org>
> *Subject: *[tsvwg] Re: WGLC for Convergence of Congestion Control from 
> Retained State (draft-ietf-tsvwg-careful-resume)
>
> Hi all,
>
> Marten will be on vacation when this last call will end and Gorry is 
> an author of this document, he need to recuse from making consensus 
> call here. With that I am extending this last call for one more week.
>
> There were some promises to review this document and I haven’t seen 
> much feedback/ review yet, so please use this extra week to send your 
> comments/ issues/ support.
>
> // Zahed
>
> On Sun, 19 Jan 2025 at 15:25, Marten Seemann <martenseemann@gmail.com> 
> wrote:
>
>     This email starts a 2 week WG Last Call call to determine if the
>     following TSVWG ID is ready to publish:
>
>     https://datatracker.ietf.org/doc/draft-ietf-tsvwg-careful-resume/
>
>     This document targets Proposed Standard.
>
>     The WGLC will conclude on February 2, 2025.
>
>     Please do read the draft, and send any comments/concerns to the
>     TSVWG mailing list, including notes on whether these are ready to
>     publish (or send an email directly to the chairs).
>
>     Best wishes,
>     Gorry and Marten
>     (tsvwg co-chairs)
>