From nobody Tue Jul 19 23:39:33 2022
MIME-Version: 1.0
References: <alpine.DEB.2.21.2206141500480.7292@hp8x-60.cs.helsinki.fi>
 <alpine.DEB.2.21.2207112144430.7292@hp8x-60.cs.helsinki.fi>
 <7CF26B3A-D6C3-48F6-AA82-424231DD95D4@apple.com>
 <CADVnQykd9z=vgkQ-FkQ8-sj_E0BrQnpwhsj8AoF9QgQiQNQEhg@mail.gmail.com>
 <CAAK044TTg1p8ebJ9yd7uEES+KQskVFYw=wHimj9qrSJXDTASUA@mail.gmail.com>
 <alpine.DEB.2.21.2207200131430.7292@hp8x-60.cs.helsinki.fi>
In-Reply-To: <alpine.DEB.2.21.2207200131430.7292@hp8x-60.cs.helsinki.fi>
From: Yoshifumi Nishida <nsd.ietf@gmail.com>
Date: Tue, 19 Jul 2022 23:39:17 -0700
Message-ID: <CAAK044QZxWR6EMi6x+KFWrzkx885BnoQAAbPLqf-EqRHOc_htw@mail.gmail.com>
To: Markku Kojo <kojo@cs.helsinki.fi>
Cc: Neal Cardwell <ncardwell@google.com>, Vidhi Goel <vidhi_goel@apple.com>, 
 "tcpm@ietf.org Extensions" <tcpm@ietf.org>
Content-Type: multipart/alternative; boundary="000000000000e45c8f05e436dc11"
Archived-At: <https://mailarchive.ietf.org/arch/msg/tcpm/rCcKR6_JCpV5fiT4wlkMX6S3Vk8>
Subject: Re: [tcpm] CUBIC rfc8312bis / WGLC Issue 2
X-BeenThere: tcpm@ietf.org
X-Mailman-Version: 2.1.39
Precedence: list
List-Id: TCP Maintenance and Minor Extensions Working Group <tcpm.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/tcpm>,
 <mailto:tcpm-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/tcpm/>
List-Post: <mailto:tcpm@ietf.org>
List-Help: <mailto:tcpm-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/tcpm>,
 <mailto:tcpm-request@ietf.org?subject=subscribe>
X-List-Received-Date: Wed, 20 Jul 2022 06:39:32 -0000

--000000000000e45c8f05e436dc11
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable

Hi Markku, folks,

As I understand it, CUBIC makes a stronger assumption than Reno: that the
last cwnd at which packet loss occurred is reasonably close to the
available bandwidth.
If this assumption holds, CUBIC can utilize the bandwidth efficiently.
However, if the last cwnd deviates from the available bandwidth, CUBIC may
cause more packet losses than Reno and, as a result, may perform poorly.
I don't believe this leads to congestion collapse, although beta=0.7 may
result in slower convergence than Reno. I think that is part of the design
choice.
I agree that the 40% overshoot is an unfortunate case, but I am not sure we
should adjust the entire design for this kind of case, at least for now.

Thanks,
--
Yoshi

On Tue, Jul 19, 2022 at 5:16 PM Markku Kojo <kojo@cs.helsinki.fi> wrote:

> Hi Yoshi, all,
>
> On Tue, 19 Jul 2022, Yoshifumi Nishida wrote:
>
> > Hi folks,
> > I think I understand this issue, but I'm personally not sure how bad
> > this is.
> > This looks like a rather pathological case to me, and I don't think
> > it can cause congestion collapse, as this is still multiplicative
> > decrease.
>
> It is multiplicative decrease from a FlightSize (cwnd) that is double the
> available network capacity. If the decrease factor is 0.5, we end up
> sending at exactly the full rate the network allows, i.e., there is no
> unused capacity. Decrease factor 0.7 means that the flow is
> effectively unresponsive for 40% of the packets it injects after the cwnd
> decrease.
>
> If you consider a congested scenario where new flows start up
> continuously (e.g., a large number of Web users sharing a heavily
> congested bottleneck router), it resembles a situation where the flows do
> not appropriately react to congestion but keep on sending up to 40% of
> undelivered packets. Congestion collapse does not necessarily mean full
> (100%) collapse; several different degrees of congestion collapse are
> possible (please see the description of undelivered packets in RFC 2914).
> In this case we may see up to 40% congestion collapse because these
> undelivered packets eat up useful capacity from other users.
>
> The question to answer is: what sense does it make and what is the
> justification for a flow to inject that many packets into the network
> unnecessarily knowing that they will get dropped (or cause drops for
> other flows) with a tail-drop bottleneck router?
>
> Apologies for the strong words, but to me this would be an insane design.
>
> > It seems to me that this is a kind of shooting oneself in the foot, a
> > suboptimal case. However, there are some advantages in the current
> > logic.
>
> Could you possibly elaborate?
>
> > I'm not very sure if we should sacrifice better results to address some
> > rare cases. I think we will need more analysis of the pros and cons for
> > this.
>
> I don't think this could be considered a rare (or corner) case as this
> occurs potentially every time a flow starts and every time a flow
> encounters RTO with sudden congestion (the latter is very bad because
> in the face of heavy congestion it is extremely important that every flow
> reacts appropriately).
>
> Thanks,
>
> /Markku
>
> > Thanks,
> > --
> > Yoshi
> >
> > On Wed, Jul 13, 2022 at 7:17 AM Neal Cardwell <ncardwell@google.com> wrote:
> >       Hi Markku and TCPMers,
> >
> > My understanding of Markku's concern here is that in slow start the cwnd
> > can continue to grow in response to ACKs after the lost packet was sent,
> > so that the cwnd is often twice the level of in-flight data at which the
> > loss happened, by the time the loss is detected. So the cwnd ends up at
> > 2 * 0.7 = 1.4x the level at which losses happened, which causes an
> > unnecessary follow-on round with losses, in order to again cut the cwnd,
> > this time to 1.4 * 0.7 = 0.98x of the level that causes losses, which is
> > likely to finally fit in the network path.
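
To make the arithmetic above concrete, here is a minimal Python sketch. It
assumes, as a simplification, that cwnd does not grow between successive
decreases; the function name and parameters are hypothetical and are not
taken from the draft.

```python
def follow_on_loss_rounds(growth: float, beta: float) -> int:
    """Count extra loss rounds needed after the first multiplicative decrease.

    growth: cwnd at loss detection, as a multiple of the in-flight level
            at which the loss happened (2.0 for unrestricted slow start).
    beta:   multiplicative decrease factor (0.7 for CUBIC, 0.5 for Reno).
    """
    if not 0.0 < beta < 1.0:
        raise ValueError("beta must be strictly between 0 and 1")
    ratio = growth * beta  # cwnd relative to the loss level after the first cut
    extra_rounds = 0
    while ratio > 1.0:     # still above the level that caused losses
        ratio *= beta
        extra_rounds += 1
    return extra_rounds


slow_start_cubic = follow_on_loss_rounds(2.0, 0.7)  # 2 -> 1.4 -> 0.98
slow_start_reno = follow_on_loss_rounds(2.0, 0.5)   # 2 -> 1.0
```

With growth=2.0 the sketch needs one follow-on round for beta=0.7 and none
for beta=0.5, matching the 1.4x and 0.98x figures quoted above.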
> >
> > However, there are two technical issues with this concern, as expressed
> > in the proposed draft text in this thread:
> >
> > (1) The analysis for slow-start is not correct for the very common case
> > where the flow is application-limited in slow-start, in which case the
> > cwnd would not grow at all between the packet loss and the time the loss
> > is detected. So the text is needlessly strict in this case.
> >
> > (2) For CUBIC the problematic dynamic (of cwnd growth between loss and
> > loss detection exceeding the multiplicative decrease) can also occur
> > outside of slow-start, in congestion avoidance. The CUBIC cwnd growth in
> > congestion avoidance can be up to 1.5x per round trip. So after a packet
> > loss the cwnd could grow by 1.5x before loss detection and then be cut
> > in response to loss by 0.7, causing the ultimate cwnd to be
> > 1.5 * 0.7 = 1.05x the volume of in-flight data at the time of the packet
> > loss. This would likely cause an unnecessary follow-on round of packet
> > loss due to failing to cut cwnd below the level that caused loss. So the
> > problem is actually wider than slow-start.
> >
> > AFAICT a complete/general fix for this issue is best achieved by
> > recording the volume of inflight data at the point of each packet
> > transmission, and then using that metric as the baseline for the
> > multiplicative decrease when packet loss is detected, rather than using
> > the current cwnd as the baseline. This is the approach that BBRv2 uses.
> > Perhaps there are other, simpler approaches as well.
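
A toy Python sketch of the idea described above: remember the in-flight
volume when each packet is sent, and decrease from that value on loss. This
is only an illustration under simplifying assumptions (packet units, no
retransmissions, no new transmissions triggered by ACKs). The class and
field names are hypothetical, and it is not meant to reflect how BBRv2 or
any real stack implements this.

```python
from dataclasses import dataclass, field
from typing import Dict


@dataclass
class InflightBaselineSender:
    """Toy sender (hypothetical); all quantities are in packets."""
    cwnd: float
    beta: float = 0.7
    inflight: int = 0
    next_seq: int = 0
    # seq -> number of packets in flight right after that packet was sent
    inflight_at_send: Dict[int, int] = field(default_factory=dict)

    def send(self) -> int:
        seq = self.next_seq
        self.next_seq += 1
        self.inflight += 1
        self.inflight_at_send[seq] = self.inflight
        return seq

    def on_ack(self, seq: int) -> None:
        self.inflight_at_send.pop(seq)
        self.inflight -= 1
        self.cwnd += 1  # slow-start style growth: one packet per ACK

    def on_loss(self, seq: int) -> None:
        baseline = self.inflight_at_send.pop(seq)
        self.inflight -= 1
        # Decrease from the in-flight level recorded at send time,
        # not from the (possibly much larger) current cwnd.
        self.cwnd = baseline * self.beta


sender = InflightBaselineSender(cwnd=10)
seqs = [sender.send() for _ in range(10)]
for seq in seqs[:-1]:
    sender.on_ack(seq)           # cwnd grows to 19 before the loss is detected
cwnd_before_loss = sender.cwnd
sender.on_loss(seqs[-1])         # baseline is 10 packets, not 19
```

In this run the cwnd after the loss is about 7 packets (10 * 0.7), whereas
cutting the current cwnd would give 19 * 0.7 = 13.3 packets, still above
the level at which the loss happened.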
> >
> > I also agree with Vidhi's concern, that a change to the multiplicative
> > decrease changes the algorithm substantially. To ensure that the
> > draft/RFC is not recommending something that has unforeseen significant
> > negative consequences, we shouldn't make such a significant change to
> > the text until we get experience w/ the new variation.
> >
> > best regards,
> > neal
> >
> >
> > On Tue, Jul 12, 2022 at 6:08 PM Vidhi Goel <vidhi_goel=40apple.com@dmarc.ietf.org> wrote:
> >       Hi Markku,
> >
> >       I emailed the other co-authors about this. We think that this
> >       change is completely untested for Cubic and that it could be
> >       considered for a future version of Cubic, not the current
> >       rfc8312bis.
> >       To change Beta from 0.7 to 0.5 during slow-start, we would at
> >       least need some experience either from lab testing or deployment,
> >       since all current deployments of Cubic for both TCP and QUIC use
> >       0.7 as Beta during slow start. Since a lot of implementations
> >       currently use hystart(++) along with Cubic, we don't see any high
> >       risk of overaggressive sending rate, and that is what the current
> >       rfc8312bis suggests as well. In fact, changing Beta from 0.7 to
> >       0.5 can still be aggressive without using hystart.
> >
> >       Thanks,
> >       Vidhi
> >
> >       > On Jul 11, 2022, at 5:55 PM, Markku Kojo <kojo=40cs.helsinki.fi@dmarc.ietf.org> wrote:
> >       >
> >       > Hi all,
> >       >
> >       > below please find proposed text to solve Issue 2 a). I will
> >       > propose text to solve 2 b) once we have come to a conclusion on
> >       > 2 a). For the description and arguments for issues 2 a) and
> >       > 2 b), please see the original issue descriptions below.
> >       >
> >       > Sec 4.6. Multiplicative Decrease
> >       >
> >       > Old:
> >       >   The parameter Beta__cubic_ SHOULD be set to 0.7, which is
> >       >   different from the multiplicative decrease factor used in
> >       >   [RFC5681] (and [RFC6675]) during fast recovery.
> >       >
> >       >
> >       > New:
> >       >   If the sender is not in slow start when the congestion event
> >       >   is detected, the parameter Beta__cubic_ SHOULD be set to 0.7,
> >       >   which is different from the multiplicative decrease factor
> >       >   used in [RFC5681] (and [RFC6675]).
> >       >   This change is justified in the Reno-friendly region during
> >       >   congestion avoidance because a CUBIC sender compensates for
> >       >   its higher multiplicative decrease factor, compared to that of
> >       >   Reno, by applying a lower additive increase factor during
> >       >   congestion avoidance.
> >       >
> >       >   However, if the sender is in slow start when the congestion
> >       >   event is detected, the parameter Beta__cubic_ MUST be set to
> >       >   0.5 [Jacob88].
> >       >   This results in the sender continuing to transmit data at the
> >       >   maximum rate that the slow start determined to be available
> >       >   for the flow.
> >       >   Using Beta__cubic_ with a value larger than 0.5 when the
> >       >   congestion event is detected in slow start would result in an
> >       >   overaggressive send rate where the sender injects excess
> >       >   packets into the network and each such packet is guaranteed to
> >       >   be dropped or to force a packet from a competing flow to be
> >       >   dropped at a tail-drop bottleneck router.
> >       >   Furthermore, injecting such undelivered packets creates a
> >       >   danger of congestion collapse (of some degree) "by delivering
> >       >   packets through the network that are dropped before reaching
> >       >   their ultimate destination." [RFC 2914]
> >       >
> >       >
> >       >   [Jacob88] V. Jacobson, Congestion avoidance and control,
> >       >   SIGCOMM '88.
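
The rule in the proposed "New" text can be sketched in a few lines of
Python. This only illustrates the proposal as quoted above, not the
behaviour of rfc8312bis or of any deployed stack; the constant and function
names are hypothetical.

```python
BETA_CUBIC = 0.7       # value from the existing draft text
BETA_SLOW_START = 0.5  # value the proposed text would require in slow start


def beta_for_congestion_event(in_slow_start: bool) -> float:
    """Pick the multiplicative decrease factor per the proposed text."""
    return BETA_SLOW_START if in_slow_start else BETA_CUBIC


def cwnd_after_congestion_event(cwnd: float, in_slow_start: bool) -> float:
    """Apply the selected factor to the cwnd at the time of detection."""
    return cwnd * beta_for_congestion_event(in_slow_start)
```

Whether "in slow start" should be judged when the lost packet was sent or
when the loss is detected is a detail this sketch leaves to the caller.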
> >       >
> >       > Thanks,
> >       >
> >       > /Markku
> >       >
> >       > On Tue, 14 Jun 2022, Markku Kojo wrote:
> >       >
> >       >> Hi all,
> >       >>
> >       >> this thread starts the discussion on issue 2: CUBIC is
> >       >> specified to use an incorrect multiplicative-decrease factor
> >       >> for a congestion event that occurs when operating in slow
> >       >> start. And applying HyStart++ does not remove the problem; it
> >       >> only mitigates it in some percentage of cases.
> >       >>
> >       >> I think it is useful to discuss this in two phases: 2 a) and
> >       >> 2 b) below.
> >       >> For anyone commenting/arguing on part 2 b), it is important to
> >       >> first acknowledge whether (s)he thinks the original design and
> >       >> logic by Van Jacobson is correct. If not, one should explain
> >       >> why Van's design logic is incorrect.
> >       >>
> >       >> Issue 2 a)
> >       >> ----------
> >       >>
> >       >> To begin with, let's put aside a potential use of HyStart++
> >       >> (also assume a tail-drop router unless otherwise mentioned).
> >       >>
> >       >> The use of an MD factor larger than 0.5 is against the theory
> >       >> and original design by Van Jacobson as explained in the
> >       >> congavoid paper [Jacob88]. Any MD factor value larger than 0.5
> >       >> will result in sending extra packets during Fast Recovery
> >       >> following the congestion event (drop). All extra packets will
> >       >> be dropped at a tail-drop bottleneck (if a lonely flow).
> >       >>
> >       >> Note that at the time when the drop is signalled at the TCP
> >       >> sender, the size of the cwnd is double the available network
> >       >> capacity that slow start determined for the flow. That is,
> >       >> using MD=0.5 is already as aggressive as possible, leaving no
> >       >> slack. Therefore, if MD=0.7 is used, the TCP sender enters fast
> >       >> recovery with a cwnd that is 40% larger than the determined
> >       >> network capacity and all excess packets are guaranteed to be
> >       >> dropped, or even worse, the excess packets are likely to force
> >       >> packets of competing flows to be unfairly dropped.
> >       >>
> >       >> Moreover, if NewReno loss recovery is in use, a CUBIC sender
> >       >> will operate overaggressively for a very long time. For
> >       >> example, if the available network capacity for the flow is 100
> >       >> packets, cwnd will have value 200 when the congestion is
> >       >> signalled and the CUBIC sender enters fast recovery with
> >       >> cwnd=140 and injects 40 excess packets for each of the
> >       >> subsequent 100 RTTs it stays in fast recovery, forcing 4000
> >       >> packets to be inevitably and totally unnecessarily dropped.
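
The 4000-packet figure in the example above can be reproduced with a short
Python sketch. It takes the example's own premises as assumptions: cwnd is
exactly twice the capacity at detection, and the excess stays constant for
every RTT spent in fast recovery. The function name is hypothetical.

```python
def excess_packets_in_recovery(capacity: int, beta: float,
                               rtts_in_recovery: int) -> int:
    """Total packets sent beyond capacity while in fast recovery.

    Assumes cwnd == 2 * capacity when the congestion event is signalled
    and a constant per-RTT excess for the whole recovery period.
    """
    cwnd_at_detection = 2 * capacity
    cwnd_in_recovery = round(beta * cwnd_at_detection)
    excess_per_rtt = max(cwnd_in_recovery - capacity, 0)
    return excess_per_rtt * rtts_in_recovery


cubic_excess = excess_packets_in_recovery(100, 0.7, 100)  # 40 per RTT
reno_excess = excess_packets_in_recovery(100, 0.5, 100)   # no excess
```

Under these premises beta=0.7 gives 40 * 100 = 4000 excess packets, while
beta=0.5 gives none.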
> >       >>
> >       >> Even worse, this behaviour of sending 'undelivered packets' is
> >       >> against the congestion control principles as it creates a
> >       >> danger of congestion collapse (of some degree) "by delivering
> >       >> packets through the network that are dropped before reaching
> >       >> their ultimate destination." [RFC 2914]
> >       >>
> >       >> Such undelivered packets unnecessarily eat capacity from other
> >       >> flows sharing the path before the bottleneck.
> >       >>
> >       >> RFC 2914 emphasises:
> >       >>
> >       >> "This is probably the largest unresolved danger with respect to
> >       >> congestion collapse in the Internet today."
> >       >>
> >       >> It is very easy to envision a realistic network setup where
> >       >> this creates a degree of congestion collapse where a notable
> >       >> portion of useful network capacity is wasted due to the
> >       >> undelivered packets.
> >       >>
> >       >>
> >       >> [Jacob88] V. Jacobson, Congestion avoidance and control,
> >       >> SIGCOMM '88.
> >       >>
> >       >>
> >       >> Issue 2 b)
> >       >> ----------
> >       >>
> >       >> The CUBIC draft suggests that HyStart++ should be used
> >       >> *everywhere* instead of the traditional Slow Start (see
> >       >> section 4.10).
> >       >>
> >       >> Although the draft does not say it, seemingly the authors
> >       >> suggest using HyStart++ instead of traditional Slow Start in
> >       >> order to avoid the problem of over-aggressive behaviour
> >       >> discussed above. This, however, has several issues.
> >       >>
> >       >> First, it is directly in conflict with the HyStart++
> >       >> specification, which says that HyStart++ should be used only
> >       >> for the initial Slow Start. However, the overaggressive
> >       >> behaviour after slow start is also a potential problem with
> >       >> slow start during an RTO recovery; in case of sudden congestion
> >       >> that reduces available capacity for a flow down to a fraction
> >       >> of the currently available capacity, it is very likely that an
> >       >> RTO occurs. In such a case the RTO recovery in slow start
> >       >> inevitably overshoots and it is crucial for all flows not to be
> >       >> overaggressive.
> >       >>
> >       >> Second, the experimental results for initial slow start in the
> >       >> HyStart++ draft suggest that while HyStart++ achieves good
> >       >> results, it is unable to exit slow start early and avoid
> >       >> overshoot in a significant percentage of cases.
> >       >>
> >       >> Given the above issues, the CUBIC draft must require that an MD
> >       >> of 0.5 is used when the congestion event occurs while the
> >       >> sender is (still) in slow start. The use of an MD other than
> >       >> 0.5 is an obvious stumble in the original CUBIC and the
> >       >> original CUBIC authors have already acknowledged this. It also
> >       >> seems obvious that instead of correcting the actual problem
> >       >> (use of MD other than 0.5), HyStart and HyStart++ have been
> >       >> proposed to address the design mistake. While HyStart++ is a
> >       >> useful method also when used with MD=0.5, when used alone it
> >       >> only mitigates the impact of the actual problem rather than
> >       >> solving the problem.
> >       >>
> >       >> What should be done for the cases where HyStart++ exits slow
> >       >> start but is not able to avoid (some level of) overshoot and
> >       >> dropped packets is IMO an open issue. Resolving it requires
> >       >> additional experiments and it should be resolved separately
> >       >> when we have more data. For now, when we do not have enough
> >       >> data and understanding of the behaviour, we should IMO follow
> >       >> the general IETF guideline "be conservative in what you send"
> >       >> and specify that MD = 0.5 should be used for a congestion event
> >       >> that occurs for a packet sent in slow start.
> >       >>
> >       >> Thanks,
> >       >>
> >       >> /Markku
> >       >>
> >       >
> >       > _______________________________________________
> >       > tcpm mailing list
> >       > tcpm@ietf.org
> >       > https://www.ietf.org/mailman/listinfo/tcpm

--000000000000e45c8f05e436dc11
Content-Type: text/html; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable

<div dir=3D"ltr"><div>Hi=C2=A0Markku, folks,</div><div><br></div><div>In my=
 understanding, compared to Reno, Cubic makes a strong assumption that the =
last cwnd which caused packet loss is more or less close=C2=A0to the availa=
ble=C2=A0bandwidth.<br></div><div>If this assumption is=C2=A0correct, it ca=
n utilize bandwidth efficiently.=C2=A0</div><div>However, if the assumption=
 deviates from the actual value, it may create more packet losses than Reno=
. As a result, Cubic may suffer poor performance in this case.</div><div>I =
don&#39;t believe it leads to congestion collapse although b=3D0.7 may resu=
lt in slower=C2=A0convergence than Reno. But I think it&#39;s a part of des=
ign choice.=C2=A0</div><div>I agree 40% overshooting case is=C2=A0an unfort=
unate case, but I am not sure we should adjust the entire design to this ki=
nd of case. At least for now.=C2=A0=C2=A0</div><div><br></div><div>Thanks,<=
/div><div>--</div><div>Yoshi</div><div><br></div><div class=3D"gmail_quote"=
><div dir=3D"ltr" class=3D"gmail_attr">On Tue, Jul 19, 2022 at 5:16 PM Mark=
ku Kojo &lt;<a href=3D"mailto:kojo@cs.helsinki.fi">kojo@cs.helsinki.fi</a>&=
gt; wrote:<br></div><blockquote class=3D"gmail_quote" style=3D"margin:0px 0=
px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">Hi Yo=
shi, all,<br>
<br>
On Tue, 19 Jul 2022, Yoshifumi Nishida wrote:<br>
<br>
&gt; Hi folks,=C2=A0<br>
&gt; I think I understand this issue, but I&#39;m personally not sure how b=
ad this is.<br>
&gt; Because this looks a rather pathological=C2=A0case to me,=C2=A0also I =
don&#39;t think this can cause congestion collapse=C2=A0as<br>
&gt; this is still multiplicative decrease.<br>
<br>
It is multiplicative decrease from a FlightSize (cwnd) that is double the <=
br>
available network capacity. If the decrease factor is 0.5, we end up <br>
sending at exactly the full rate the network allows, i.e., there is no <br>
unused capacity. Decrease factor 0.7 means that the flow is <br>
effectively unresponsive for 40% of the packets it injects after the cwnd <=
br>
decrease.<br>
<br>
If you consider a congested scenario where new flows start up <br>
continuously (e.g., a large number of Web users sharing a heavily <br>
congested bottleneck router), it resembles a situation where the flows do <=
br>
not appropriately react to congestion but keep on sending up to 40% of <br>
undelivered packets. Congestion collapse does not necessarily mean full <br=
>
(100%) collapse but several different degrees of congestion collapse is <br=
>
possible (pls. see the description of undelivered packets in RFC 2914).<br>
In this case we may see up to 40% congestion collapse because these <br>
undelivered packets eat up useful capacity from other users.<br>
<br>
The question to answer is: what sense does it make and what is the <br>
justification for a flow to inject that many packets into the network <br>
unnecessarily knowing that they will get dropped (or cause drops for <br>
other flows) with a tail-drop bottleneck router?<br>
<br>
Apologies for the strong words but for me this would be insane design.<br>
<br>
&gt; It seems to me that this is a kind of shooting in the foot, a suboptim=
al case. However,=C2=A0 there are some<br>
&gt; advantages in the current logic.=C2=A0<br>
<br>
Could you possibly elaborate?<br>
<br>
&gt; I&#39;m not very sure if we should sacrifice better results to address=
 some rare cases. I think we will need more<br>
&gt; analysis of the pros and cons for this.<br>
<br>
I don&#39;t think this could be considered a rare (or corner) case as this =
<br>
occurs potentially every time a flow starts and every time a flow <br>
encounters RTO with sudden congestion (the latter is very bad because <br>
in front of heavy congestion it is extremely important that every flow <br>
reacts appropriately).<br>
<br>
Thanks,<br>
<br>
/Markku<br>
<br>
&gt; Thanks,<br>
&gt; --<br>
&gt; Yoshi<br>
&gt; <br>
&gt; On Wed, Jul 13, 2022 at 7:17 AM Neal Cardwell &lt;<a href=3D"mailto:nc=
ardwell@google.com" target=3D"_blank">ncardwell@google.com</a>&gt; wrote:<b=
r>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0Hi Markku and TCPMers,<br>
&gt; <br>
&gt; My understanding of Markku&#39;s concern here is that in slow start th=
e cwnd can continue to grow in<br>
&gt; response to ACKs after the lost packet was sent, so that the cwnd is o=
ften twice the level of in-flight<br>
&gt; data at which the loss happened, by the time the loss is detected. So =
the cwnd ends up at 2 * 0.7 =3D 1.4x<br>
&gt; the level at which losses happened, which causes an unnecessary follow=
-on round with losses, in order to<br>
&gt; again cut the cwnd, this time to=C2=A01.4 * 0.7 =3D 0.98x of the level=
 that causes losses, which is likely to<br>
&gt; finally fit in the network path.<br>
&gt; <br>
&gt; However, there are two technical issues with this concern, as expresse=
d in the proposed draft text in<br>
&gt; this thread:<br>
&gt; <br>
&gt; (1) The analysis for slow-start is not correct for the very common cas=
e where the flow is<br>
&gt; application-limited in slow-start, in which case the cwnd would not gr=
ow at all between the packet loss<br>
&gt; and the time the loss is detected. So the text is needlessly strict in=
 this case.<br>
&gt; <br>
&gt; (2) For CUBIC the problematic dynamic (of cwnd growth between loss and=
 loss detection exceeding the<br>
&gt; multiplicative decrease) can also occur outside of slow-start, in cong=
estion avoidance. The CUBIC cwnd<br>
&gt; growth in congestion avoidance can be up to 1.5x per round trip. So af=
ter a packet loss the cwnd could<br>
&gt; grow by 1.5x before loss detection and then be cut in response to loss=
 by 0.7, causing the ultimate cwnd<br>
&gt; to be 1.5 * 0.7 =3D 1.05x the volume of in-flight data at the time of =
the packet loss. This would likely<br>
&gt; cause an unnecessary follow-on round of packet loss due to failing to =
cut cwnd below the level that<br>
&gt; caused loss. So the problem is actually wider than slow-start.<br>
&gt; <br>
&gt; AFAICT a complete/general fix for this issue is best solved by recordi=
ng the volume of inflight data at<br>
&gt; the point of each packet transmission, and then using that metric as t=
he baseline for the multiplicative<br>
&gt; decrease when packet loss is detected, rather than using the current c=
wnd as the baseline. This is the<br>
&gt; approach that BBRv2 uses. Perhaps there are other, simpler approaches =
as well.<br>
&gt; <br>
&gt; I also agree with Vidhi&#39;s concern, that a change to the multiplica=
tive decrease changes the algorithm<br>
&gt; substantially. To ensure that the=C2=A0draft/RFC is not recommending s=
omething that has unforeseen<br>
&gt; significant negative consequences, we shouldn&#39;t make such a signif=
icant change to the text until we get<br>
&gt; experience w/ the new variation.<br>
&gt; <br>
&gt; best regards,<br>
&gt; neal<br>
&gt; <br>
&gt; <br>
&gt; On Tue, Jul 12, 2022 at 6:08 PM Vidhi Goel &lt;vidhi_goel=3D<a href=3D=
"mailto:40apple.com@dmarc.ietf.org" target=3D"_blank">40apple.com@dmarc.iet=
f.org</a>&gt; wrote:<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0Hi Markku,<br>
&gt;<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0I emailed about this to other co-authors and=
 we think that this change is completely<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0untested for Cubic and we think that this co=
uld be considered of a future version of Cubic,<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0not the current rfc8312bis.<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0To change Beta from 0.7 to 0.5 during slow-s=
tart, we would at least need some experience<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0either from lab testing or deployment since =
all current deployments of Cubic for both TCP<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0and QUIC use 0.7 as Beta during slow start. =
Since a lot of implementations currently use<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0hystart(++) along with Cubic, we don=E2=80=
=99t see any high risk of overaggressive sending rate and<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0that is what the current rfc8312bis suggests=
 as well. In fact, changing Beta from 0.7 to 0.5<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0can still be aggressive without using hystar=
t.<br>
&gt;<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0Thanks,<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0Vidhi<br>
&gt;<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt; On Jul 11, 2022, at 5:55 PM, Markku Koj=
o &lt;kojo=3D<a href=3D"mailto:40cs.helsinki.fi@dmarc.ietf.org" target=3D"_=
blank">40cs.helsinki.fi@dmarc.ietf.org</a>&gt; wrote:<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt; Hi all,<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt; below please find proposed text to solv=
e the Issue 2 a). I will propose text to solve 2 b)<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0once we have come to conclusion with 2 a). F=
or description and arguments for issues 2 a) and<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A02 b), please see the original issue descript=
ions below.<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt; Sec 4.6. Multiplicative Decrease<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt; Old:<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;=C2=A0 =C2=A0The parameter Beta__cubic_ =
SHOULD be set to 0.7, which is different<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;=C2=A0 =C2=A0from the multiplicative dec=
rease factor used in [RFC5681] (and<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;=C2=A0 =C2=A0[RFC6675]) during fast reco=
very.<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt; New:<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;=C2=A0 =C2=A0If the sender is not in slo=
w start when the congestion event is<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;=C2=A0 =C2=A0detected, the parameter Bet=
a__cubic_ SHOULD be set to 0.7, which<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;=C2=A0 =C2=A0is different from the multi=
plicative decrease factor used in<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;=C2=A0 =C2=A0[RFC5681] (and [RFC6675]).<=
br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;=C2=A0 =C2=A0This change is justified in=
 the Reno-friendly region during<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;=C2=A0 =C2=A0congestion avoidance becaus=
e a CUBIC sender compensates the higher<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;=C2=A0 =C2=A0multiplicative decrease fac=
tor than that of Reno by applying<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;=C2=A0 =C2=A0a lower additive increase f=
actor during congestion avoidance.<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;=C2=A0 =C2=A0However, if the sender is i=
n slow start when the congestion event is<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;=C2=A0 =C2=A0detected, the parameter Bet=
a__cubic_ MUST be set to 0.5 [Jacob88].<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;=C2=A0 =C2=A0This results in the sender =
continuing to transmit data at the maximum<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;=C2=A0 =C2=A0rate that the slow start de=
termined to be available for the flow.<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;=C2=A0 =C2=A0Using Beta__cubic_ with a v=
alue larger than 0.5 when the congestion<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;=C2=A0 =C2=A0event is detected in slow s=
tart would result in an overaggressive send<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;=C2=A0 =C2=A0rate where the sender injec=
ts excess packets into the network and<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;=C2=A0 =C2=A0each such packet is guarant=
eed to be dropped or force a packet from<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;=C2=A0 =C2=A0a competing flow to be drop=
ped at a tail-drop bottleneck router.<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;=C2=A0 =C2=A0Furthermore, injecting such=
 undelivered packets creates a danger of<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;=C2=A0 =C2=A0congestion collapse (of som=
e degree) &quot;by delivering packets through<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;=C2=A0 =C2=A0the network that are droppe=
d before reaching their ultimate<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;=C2=A0 =C2=A0destination.&quot; [RFC 291=
4]<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;=C2=A0 =C2=A0[Jacob88] V. Jacobson, Cong=
estion avoidance and control, SIGCOMM &#39;88.<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt; Thanks,<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt; /Markku<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt; On Tue, 14 Jun 2022, Markku Kojo wrote:=
<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;&gt; Hi all,<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;&gt;<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;&gt; this thread starts the discussion o=
n the issue 2: CUBIC is specified to use an incorrect<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0multiplicative-decrease factor for a congest=
ion event that occurs when operating in slow<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0start. And, applying HyStart++ does not remo=
ve the problem, it only mitigates it in some<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0percentage of cases.<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;&gt;<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;&gt; I think it is useful to discuss thi=
s in two phases: 2 a) and 2 b) below.<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;&gt; For anyone commenting/arguing on th=
e part 2 b), it is important to first<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;&gt; acknowledge whether (s)he thinks th=
e original design and logic by Van Jacobson is<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0correct. If not, one should explain why Van&=
#39;s design logic is incorrect.<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;&gt;<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;&gt; Issue 2 a)<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;&gt; ----------<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;&gt;<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;&gt; To begin with, let&#39;s put aside =
a potential use of HyStart++ (also assume tail drop router<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0unless otherwise mentioned).<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;&gt;<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;&gt; The use of an MD factor larger than=
 0.5 is against the theory and original design by Van<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0Jacobson as explained in the congavoid =
paper [Jacob88]. Any MD factor value larger than 0.5<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0will result in sending extra packets during =
Fast Recovery following the congestion event<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0(drop). All extra packets will be dropped =
at a tail-drop bottleneck (if it is a lone flow).<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;&gt;<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;&gt; Note that at the time when the drop=
 becomes signalled at the TCP sender, the size of the<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0cwnd is double the available network capacit=
y that slow start determined for the flow. That<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0is, using MD=3D0.5 is already as aggressive =
as possible, leaving no slack. Therefore, if<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0MD=3D0.7 is used, the TCP sender enters =
fast recovery with cwnd that is 40% larger than the<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0determined network capacity and all excess p=
ackets are guaranteed to become dropped, or even<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0worse, the excess packets are likely to forc=
e packets for any competing flows to be<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0unfairly dropped.<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;&gt;<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;&gt; Moreover, if NewReno loss recovery =
is in use, a CUBIC sender will<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;&gt; operate overaggressively for a =
very long time. For example, if the<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;&gt; available network capacity for the =
flow is 100 packets, cwnd will have<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;&gt; value 200 when the congestion is si=
gnalled and the CUBIC sender enters<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;&gt; fast recovery with cwnd=3D140 and i=
njects 40 excess packets for each of<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;&gt; the subsequent 100 RTTs it stays in=
 fast recovery, forcing 4000 packets to become<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0inevitably and totally unnecessarily dropped=
.<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;&gt;<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;&gt; Even worse, this behaviour of sendi=
ng &#39;undelivered packets&#39; is against<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;&gt; the congestion control principles a=
s it creates a danger of congestion<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;&gt; collapse (of some degree) &quot;by =
delivering packets through the network<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;&gt; that are dropped before reaching th=
eir ultimate destination.&quot; [RFC 2914]<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;&gt;<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;&gt; Such undelivered packets unnecessar=
ily eat capacity from other flows<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;&gt; sharing the path before the bottlen=
eck.<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;&gt;<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;&gt; RFC 2914 emphasises:<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;&gt;<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;&gt; &quot;This is probably the largest =
unresolved danger with respect to<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;&gt; congestion collapse in the Internet=
 today.&quot;<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;&gt;<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;&gt; It is very easy to envision a reali=
stic network setup where this creates a degree of<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0congestion collapse where a notable portion =
of useful network capacity is wasted due to the<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0undelivered packets.<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;&gt;<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;&gt;<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;&gt; [Jacob88] V. Jacobson, Congestion a=
voidance and control, SIGCOMM &#39;88.<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;&gt;<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;&gt;<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;&gt; Issue 2 b)<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;&gt; ----------<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;&gt;<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;&gt; The CUBIC draft suggests that HySta=
rt++ should be used *everywhere* instead of the<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0traditional Slow Start (see section 4.10).<b=
r>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;&gt;<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;&gt; Although the draft does not say it,=
 seemingly the authors suggest using HyStart++ instead<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0of traditional Slow Start in order to avoid =
the problem of over-aggressive behaviour<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0discussed above. This, however, has several =
issues.<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;&gt;<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;&gt; First, it is directly in conflict w=
ith the HyStart++ specification, which says that HyStart++<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0should be used only for the initial Slow Sta=
rt. However, the overaggressive behaviour after<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0slow start is also a potential problem with =
slow start during an RTO recovery; in case of<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0sudden congestion that reduces available cap=
acity for a flow down to a fraction of the<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0currently available capacity, it is very lik=
ely that an RTO occurs. In such a case the RTO<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0recovery in slow start inevitably overshoots=
 and it is crucial for all flows not to be<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0overaggressive.<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;&gt;<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;&gt; Second, the experimental results fo=
r initial slow start in the HyStart++ draft suggest that<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0while HyStart++ achieves good results, it =
is unable to exit slow start early and avoid<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0overshoot in a significant percentage of cas=
es.<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;&gt;<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;&gt; Given the above issues, the CUBIC d=
raft must require that MD of 0.5 is used when the<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0congestion event occurs while the sender is =
(still) in slow start. The use of MD=3D0.5 is an<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0obvious stumble in the original CUBIC and th=
e original CUBIC authors have already<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0acknowledged this. It seems also obvious tha=
t instead of correcting the actual problem (use<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0of MD other than 0.5), HyStart and HyStart++=
 have been proposed to address the design<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0mistake. While HyStart++ is a useful method =
also when used with MD=3D0.5, when used alone it<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0only mitigates the impact of the actual prob=
lem rather than solves the problem.<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;&gt;<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;&gt; What should be done for the cases w=
here HyStart++ exits slow start but<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;&gt; is not able to avoid (some level of=
) overshoot and dropped packets is IMO an open issue.<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0Resolving it requires additional experiments=
 and it should be resolved separately when we<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0have more data. For now when we do not have =
enough data and understanding of the behaviour<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0we should IMO follow the general IETF guidel=
ine &quot;be conservative in what you send&quot; and<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0specify that MD =3D 0.5 should be used for a=
 congestion event that occurs for a packet sent in<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0slow start.<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;&gt;<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;&gt; Thanks,<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;&gt;<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;&gt; /Markku<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;&gt;<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt;<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt; _______________________________________=
________<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt; tcpm mailing list<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt; <a href=3D"mailto:tcpm@ietf.org" target=
=3D"_blank">tcpm@ietf.org</a><br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0&gt; <a href=3D"https://www.ietf.org/mailman=
/listinfo/tcpm" rel=3D"noreferrer" target=3D"_blank">https://www.ietf.org/m=
ailman/listinfo/tcpm</a><br>
&gt;<br>
&gt; <br>
&gt; <br>
&gt;</blockquote></div></div>

--000000000000e45c8f05e436dc11--