Re: [tcpm] FW: New Version Notification for draft-balasubramanian-tcpm-hystartplusplus-00.txt

Neal Cardwell <ncardwell@google.com> Tue, 09 July 2019 22:54 UTC

Return-Path: <ncardwell@google.com>
X-Original-To: tcpm@ietfa.amsl.com
Delivered-To: tcpm@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id ECFA412009C for <tcpm@ietfa.amsl.com>; Tue, 9 Jul 2019 15:54:29 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -16.104
X-Spam-Level:
X-Spam-Status: No, score=-16.104 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIMWL_WL_MED=-0.001, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, ENV_AND_HDR_SPF_MATCH=-0.5, HTML_MESSAGE=0.001, HTTPS_HTTP_MISMATCH=0.1, PDS_NO_HELO_DNS=1.295, RCVD_IN_DNSWL_NONE=-0.0001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001, USER_IN_DEF_DKIM_WL=-7.5, USER_IN_DEF_SPF_WL=-7.5] autolearn=no autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=google.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id SdGQUiLHf8YX for <tcpm@ietfa.amsl.com>; Tue, 9 Jul 2019 15:54:26 -0700 (PDT)
Received: from mail-ot1-x334.google.com (mail-ot1-x334.google.com [IPv6:2607:f8b0:4864:20::334]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 9606012006D for <tcpm@ietf.org>; Tue, 9 Jul 2019 15:54:26 -0700 (PDT)
Received: by mail-ot1-x334.google.com with SMTP id q10so167432otk.10 for <tcpm@ietf.org>; Tue, 09 Jul 2019 15:54:26 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20161025; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=A604BZYGFMjZYZZvEOVDC6Wq7Q8OT7l92p67dURBx20=; b=BmnPbMtvYjvSQi9MI3kE3TqzCUu9bQd0zbHj7SgN/x84s1I+26Fqen35pC6+YVGcUg FIoydLgqnoxw/FhCFahE0uQeUX24R2Y6oAEfFLauezs27Gr/fLkCbreX5L6wZtzsaVgP rmlxxRgIgdIIjH1Wjt/e76wVyCsd9U1ZH5/weuvf4Wu88APw/AFOCxciuUFq/QjwPOCv F7z+Ib5IXBSgpaC9hjoL53vzEOI9Oaigadq/ZVSPzS8uobCW7m99Kwp2ReKVDgqcAMQO Wo5TaGREJBY7G4D8zhEuy1qjTpVXuK3EbrksKAKNQ+Sv2mT0df7mZsauiK0vPmjB4ewV o5Cw==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=A604BZYGFMjZYZZvEOVDC6Wq7Q8OT7l92p67dURBx20=; b=qAEBcG5cCEESR0XHCHEoQ83OSwog+91Ez80WguHYR29bj0vsSrK+Kc4jBVprlzQov5 pHrEe1CgErWGKwocpgVD0TzJlKNF4zlhfYnV1irxgId7mbJKHKchvnQrZoLy87YCczm8 XhKug8lTWkx5VIOmoAMjkZHgp33LYbKVTpoSwHgCM417zBEL97mEIGcqMKVcsSab9Dr7 fo11z267Td16nVsJWhEvQHnMO9LAHfJKjZqqA3tie2AMCgw3KllO14943Ck8OrTtGX48 24/pcVM2+32Z5oAVcx52rkdsKurDNxmiuoUP/jAOOrB9Ic5lXQcIcLMHMsVfrpXe9UWh qrFA==
X-Gm-Message-State: APjAAAUmL6KB6QS+qd4hLz3u7b+CDqrCfaVez2cKyeJTHN84NmrNFAMd A4jHuqA+M3oa6RT5tghoy1UuG+hr3Y6+HCts9h/yOA==
X-Google-Smtp-Source: APXvYqzGZJeT5/4SRUdA8jjv1+r+YKpEYcjxZZsPXmSs0AvR1xu5f5/cnu09CQtSFYQb5uLYRlsTRS3sV3qfiqgLxEc=
X-Received: by 2002:a9d:6742:: with SMTP id w2mr21613123otm.371.1562712865403; Tue, 09 Jul 2019 15:54:25 -0700 (PDT)
MIME-Version: 1.0
References: <156262678491.777.1949510542578853249.idtracker@ietfa.amsl.com> <MW2PR2101MB10495E1951F40C4CE6526413B6F60@MW2PR2101MB1049.namprd21.prod.outlook.com> <CADVnQykFdbxfzzukb9nXtQX2gYFhEi7w+WBFozD4ivNg510EYA@mail.gmail.com> <MW2PR2101MB1049A6A61D6D29619D3A71D0B6F10@MW2PR2101MB1049.namprd21.prod.outlook.com>
In-Reply-To: <MW2PR2101MB1049A6A61D6D29619D3A71D0B6F10@MW2PR2101MB1049.namprd21.prod.outlook.com>
From: Neal Cardwell <ncardwell@google.com>
Date: Tue, 9 Jul 2019 18:54:07 -0400
Message-ID: <CADVnQymauhrLBNn0PrhF3UZHf6sDkiLhzGQ059p8oU3ABQcTNQ@mail.gmail.com>
To: Praveen Balasubramanian <pravb=40microsoft.com@dmarc.ietf.org>
Cc: Praveen Balasubramanian <pravb@microsoft.com>, "tcpm@ietf.org" <tcpm@ietf.org>, Matt Olson <maolson@microsoft.com>, Yi Huang <huanyi@microsoft.com>
Content-Type: multipart/alternative; boundary="0000000000003ae129058d477107"
Archived-At: <https://mailarchive.ietf.org/arch/msg/tcpm/6bxaVARRkz8VKeZYbKcqklRM0B8>
Subject: Re: [tcpm] FW: New Version Notification for draft-balasubramanian-tcpm-hystartplusplus-00.txt
X-BeenThere: tcpm@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: TCP Maintenance and Minor Extensions Working Group <tcpm.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/tcpm>, <mailto:tcpm-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/tcpm/>
List-Post: <mailto:tcpm@ietf.org>
List-Help: <mailto:tcpm-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/tcpm>, <mailto:tcpm-request@ietf.org?subject=subscribe>
X-List-Received-Date: Tue, 09 Jul 2019 22:54:30 -0000

On Tue, Jul 9, 2019 at 6:18 PM Praveen Balasubramanian <pravb=
40microsoft.com@dmarc.ietf.org>; wrote:

> Re (1). Great catch Neal. That's a bug in the draft and not in our
> implementation. Eta is computed based on lastRoundMinRTT and not
> currentRoundMinRTT.. We will fix this in next revision.
>

Got it. Thanks!


> What was the reason for Linux to switch to lifetimeMinRTT? On Wifi links
> RTT does fluctuate a lot.
>

I'm not sure what the motivation was. The "Taming the Elephants" paper and
dissertation both use the lastRoundMinRTT, but the code that was submitted
to Linux (ae27e98a51526595837 on Oct 29 2008) uses the  lifetimeMinRTT,
without a comment about the rationale. I can imagine both approaches having
pros and cons.


> Re (2). I agree that the goal is to find a function that's faster than CA
> and slower than slow start. LSS works well in practice. For long lived
> flows on high BDP links incorporating the CUBIC CA early can help and
> that's a good idea. But given that this is an informational RFC describing
> the current implementation, we'd have to experiment and deploy any changes
> before updating the text. We will look into it.
>

Makes sense.

thanks,
neal


-----Original Message-----
> From: tcpm <tcpm-bounces@ietf.org>; On Behalf Of Neal Cardwell
> Sent: Tuesday, July 9, 2019 8:07 AM
> To: Praveen Balasubramanian <pravb=40microsoft.com@dmarc.ietf.org>;
> Cc: tcpm@ietf.org; Matt Olson <maolson@microsoft.com>;
> Subject: Re: [tcpm] FW: New Version Notification for
> draft-balasubramanian-tcpm-hystartplusplus-00.txt
>
> On Mon, Jul 8, 2019 at 7:11 PM Praveen Balasubramanian <pravb=
> 40microsoft..com@dmarc.ietf.org>; wrote:
> >
> > We submitted a draft for modified slow start using HyStart and Limited
> Slow Start. Feedback is welcome.
>
> Thanks for posting this draft! (
>
> https://nam06.safelinks.protection.outlook.com/?url=https%3A%2F%2Ftools.ietf.org%2Fhtml%2Fdraft-balasubramanian-tcpm-hystartplusplus-00&amp;data=02%7C01%7Cpravb%40microsoft.com%7Ccf008cd4a3ae41e87e7d08d7047f2c33%7C72f988bf86f141af91ab2d7cd011db47%7C1%7C0%7C636982816578522537&amp;sdata=GttDyDYhw3sG%2BxDGAFSefMYeRyNfjeUih6XVvbJa7MQ%3D&amp;reserved=0
> )
>
> This is a very nice, clear document, IMHO.
>
> I was curious about a couple of items:
>
> (1) In the draft the condition for exiting slow start is:
>
>             Eta = clamp(MIN_ETA, currentRoundMinRTT / 8, MAX_ETA)
>             if (currentRoundMinRTT >= (lastRoundMinRTT + Eta))
>
> The use of currentRoundMinRTT/8 to compute Eta is counterintuitive to me,
> and is different from both the Hystart paper and the Linux CUBIC Hystart
> implementation from the CUBIC authors.
>
> The Hystart paper used (modified pseudocode):
>
>   if (currentRoundMinRTT> lastRoundMinRTT + clamp(lastRoundMinRTT / 16))
>
> The Linux implementation from the Hystart author Sangtae Ha in 2008 used
> the approach (pseudocode):
>
>   if (currentRoundMinRTT> lifetimeMinRTT + clamp(lifetimeMinRTT / 16))
>
> Eric Dumazet changed the Linux approach in 2014 to use a different
> constant scaling factor (also pseudocode):
>
>   if (currentRoundMinRTT> lifetimeMinRTT + clamp(lifetimeMinRTT / 8))
>
> I am curious about the rationale for:
>
> (a) Using an RTT threshold for exiting slow-start that is based on
> currentRoundMinRTT/8, rather than being purely a function of the RTT values
> from previous rounds (as in the Hystart paper) or the lifetime of the
> connection (as in Linux)?
>
> (b) Using a function of lastRoundMinRTT and currentRoundMinRTT for the
> exit threshold, rather than a function of the min RTT over the lifetime of
> the connection ("lifetimeMinRTT" in my pseudocode, above)?
> Is the rationale here partly to deal with fluctuations in RTT due to
> cellular/wifi/DOCSIS paths with noisy RTTs?
>
> Any background on the motivation would be interesting to hear, or perhaps
> to include in future drafts.
>
> (2) The Limited Slow-Start approach described in the draft seems to boil
> down to increasing cwnd by roughly ssthresh/4 each round trip.
> It does seem like that would be considerably faster than Reno or CUBIC
> would normally grow in congestion avoidance, for the first few seconds
> after exiting slow start. But it does seem to be essentially just a large
> additive increase. And so it would seem that if CUBIC were instead in its
> native CUBIC congestion avoidance instead of limited slow-start, then CUBIC
> would eventually (e.g. after a few seconds) tend to reach the rapid-growth
> convex portion of the cubic curve, and eventually grow much faster
> (eventually reaching exponential growth of 1.5x per round trip). I wonder
> if it would be advantageous for CUBIC connections exiting HyStart++ to use
> a  strategy of growing the cwnd using the maximum of the schedule
> calculated using limited slow-start and the schedule calculated using the
> normal CUBIC congestion avoidance logic. That seems like it might have the
> potential to mitigate performance issues for CUBIC connections that
> spuriously exit slow-start too soon?
>
> best regards,
> neal
>
> _______________________________________________
> tcpm mailing list
> tcpm@ietf.org
>
> https://nam06.safelinks.protection.outlook.com/?url=https%3A%2F%2Fwww.ietf.org%2Fmailman%2Flistinfo%2Ftcpm&amp;data=02%7C01%7Cpravb%40microsoft.com%7Ccf008cd4a3ae41e87e7d08d7047f2c33%7C72f988bf86f141af91ab2d7cd011db47%7C1%7C0%7C636982816578522537&amp;sdata=0Sjqt2FWz1rKTqply%2FMJIzGeleyolWWNObL58KdCxm4%3D&amp;reserved=0
>