Re: Questions about recovery draft 29

Spencer Dawkins at IETF <spencerdawkins.ietf@gmail.com> Mon, 20 July 2020 15:23 UTC

Return-Path: <spencerdawkins.ietf@gmail.com>
X-Original-To: quic@ietfa.amsl.com
Delivered-To: quic@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 340FD3A0C4C for <quic@ietfa.amsl.com>; Mon, 20 Jul 2020 08:23:17 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.097
X-Spam-Level:
X-Spam-Status: No, score=-2.097 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, FREEMAIL_FROM=0.001, HTML_MESSAGE=0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id K4eeVMIBu3nB for <quic@ietfa.amsl.com>; Mon, 20 Jul 2020 08:23:15 -0700 (PDT)
Received: from mail-lf1-x136.google.com (mail-lf1-x136.google.com [IPv6:2a00:1450:4864:20::136]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 871113A0C4A for <quic@ietf.org>; Mon, 20 Jul 2020 08:23:14 -0700 (PDT)
Received: by mail-lf1-x136.google.com with SMTP id y18so9916498lfh.11 for <quic@ietf.org>; Mon, 20 Jul 2020 08:23:14 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=j8iFR2Bsap7/HgLv1BwRgonw2lmDKBjgCyf/Sv4Utco=; b=RrlLuBzqsVi7afUyWcWj4GivPVcOzDLDbUQBctV3lmiCD2YOdOfpS94BFuir+DuP5F 4jcuAPHUTBWDE6IsK1ZQGaI4J3z7x8wNM8wVkYRKbmXnCSai/cf4fychwcZAbo9Ds/tz EyRfZ1uhDzSj69L5ra5hk18RomJ6KBeF9+6SG4drZB3ZJ2PzcZZA2owm58nyWeDvjBYD 8AtsHyn6m1rouLy7jZH0AXkVCrQrHGOeEq1bWqJT+J1ApYE4qoBcc9LtlHot31Vz7Kyg CgnRHORtMqu343V31vCK/PfkB7YNuNtOSQid0zf41NarhU9QLOVLrW9hs9Xzoo1UUd0w VnnQ==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=j8iFR2Bsap7/HgLv1BwRgonw2lmDKBjgCyf/Sv4Utco=; b=UJN+/kV9ze6KZ0rVOjE1wRzmE7BhTMmtT3g9dcwiCiXt/uWsnY7H9qORHulGkfs5GW 1n6jDQAMbzO8Q2mqmb6hPcLvA1OtywtrhkZgO+rv61Ga6GIUGmspBS3khi5NmelE4Gr+ NB8FTMSejYHIgtKB19bChz049qHBdUUTgzuz65iJoI7zJL6LirOutj+6dXuXNS6Z8n0d qr66UUltblSbSlGz6pVfpRDGICjx1CVdojlVnrt/kQRGagCbT07Pvw5zBxODh4Tdujrd 7xwSH3Xcyzmt1kh35Op6QdsFSAUO7CaJ3rMKYLNTAi9qCK7po61QAvRq/rBX8dLr8zRw esFQ==
X-Gm-Message-State: AOAM5332t4Uvj6Y7Qi1RmWx6IMRbYNBob0I1xcMGiPUoieghdJcyY1H/ XqwZ0IvCyrjNQ+ijvcOw/7jbHx+DDwJ9eZyo0yQ=
X-Google-Smtp-Source: ABdhPJxFMS3QzAPFL4kqyQ+Emp3xmg8Aps0usUpBVY7KXRcFuBlztOCl83GVHKkR/GlTH5uZTR3u0HbyTOBBOxL1o1M=
X-Received: by 2002:ac2:4422:: with SMTP id w2mr10999543lfl.152.1595258592581; Mon, 20 Jul 2020 08:23:12 -0700 (PDT)
MIME-Version: 1.0
References: <CABC-CqayJAeipEhsTf4h8rkUVQvYLd=1wF4UBMkaYSNMfvgnXw@mail.gmail.com> <CADdTf+jxKyPw_zfJ6FWCz230rXqg83d7rQJhKBteBvoLCut1RQ@mail.gmail.com> <CAKKJt-dgEmOtvXR8hVT9qf+YgSe3m9LMUuE6=csU28j+j1V_ZQ@mail.gmail.com> <CACpbDce=G=ou1F5dQx3r+TbzACrJRkQkKO7OviTHvd2QxFKTyg@mail.gmail.com>
In-Reply-To: <CACpbDce=G=ou1F5dQx3r+TbzACrJRkQkKO7OviTHvd2QxFKTyg@mail.gmail.com>
From: Spencer Dawkins at IETF <spencerdawkins.ietf@gmail.com>
Date: Mon, 20 Jul 2020 10:22:46 -0500
Message-ID: <CAKKJt-fD3UdAgi6QL2M7n28hM8vVS3hMFqNQXdKhmWMHcm9Raw@mail.gmail.com>
Subject: Re: Questions about recovery draft 29
To: Jana Iyengar <jri.ietf@gmail.com>
Cc: Matt Joras <matt.joras@gmail.com>, IETF QUIC WG <quic@ietf.org>, Jonas Reynders <jonas.reynders@student.uhasselt.be>
Content-Type: multipart/alternative; boundary="000000000000bc5c6d05aae11538"
Archived-At: <https://mailarchive.ietf.org/arch/msg/quic/9cRhMhIh0lp7LiWx-XylR1Bj3E8>
X-BeenThere: quic@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: Main mailing list of the IETF QUIC working group <quic.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/quic>, <mailto:quic-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/quic/>
List-Post: <mailto:quic@ietf.org>
List-Help: <mailto:quic-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/quic>, <mailto:quic-request@ietf.org?subject=subscribe>
X-List-Received-Date: Mon, 20 Jul 2020 15:23:17 -0000

Hi, Jana,

On Fri, Jul 17, 2020 at 10:15 PM Jana Iyengar <jri.ietf@gmail.com> wrote:

> Spencer,
>
> It's a good idea, but I think it's a bit early to think of a roadmap. The
> core protocol components necessary for QUIC are exactly the three documents
> we have right now, nothing more. Once we have more extensions, other
> changes in more drafts, it might make sense to think of a roadmap.
>

I think we're talking about a race condition between whether we know enough
about QUIC to start a roadmap now that would be useful when the tsunami of
extension proposals arrives (as envisioned in discussions about the NEXT
recharter), or whether that's still mysterious enough that we have no
choice except to wait for extensions (and other draft changes), before we
can produce something useful.

You may very well be right, that we don't have much to say yet, so waiting
is best.

But let's remember that we had this conversation in another year to 18
months, OK? :-)

Best,

Spencer


>
> - jana
>
> On Mon, Jul 13, 2020 at 7:40 AM Spencer Dawkins at IETF <
> spencerdawkins.ietf@gmail.com> wrote:
>
>> Just to chime in here, so top-posting.
>>
>> Matt is correct (that specifications != implementations), but I note that
>> the transport area has produced, and revised,
>> https://datatracker.ietf.org/doc/rfc7414/ (A Roadmap for Transmission
>> Control Protocol (TCP) Specification Documents", that go through what Matt
>> characterized as the 'myriad RFCs that loosely define "IETF TCP"'
>>
>> The TCP community used Informational RFCs to document what's a good idea
>> and what's a bad idea at the per-specification level ("really important to
>> implement this, that is a really bad idea so don't implement it", because
>> RFCs were the tools we have.
>>
>> It might be worth thinking about the form a "QUIC Roadmap" might take,
>> given that we now have wikis, gitbub, and other tools that could allow the
>> QUIC roadmap and implementations to be more aware of each other, especially
>> if this was more granular than "this document".
>>
>> We talked about this briefly at IETF 106 in TSVAREA. From the minutes at
>> https://datatracker.ietf.org/meeting/106/materials/minutes-106-tsvarea-02
>>  ...
>>
>>    - Ted: Lots of creativity right now. people need to understand what
>>    they are doing *to* the protocol, not just *with* it. We should write "the
>>    hitchhiker's guide to QUIC". Intended for all audiences.
>>    - Mirja: That's what applicability and mgmt docs are for.
>>    - Ted: That's not what I'm looking for.
>>    - Spencer: Imagine if TCP roadmap had started before there were 150
>>    RFCs!
>>
>> Did people have thoughts about that?
>>
>> Best,
>>
>> Spencer
>>
>>
>> On Thu, Jul 9, 2020 at 2:10 PM Matt Joras <matt.joras@gmail.com> wrote:
>>
>>> I won't offer specific answers to all your questions as Ian or Jana
>>> likely have better ones, however I'd like to make a general point
>>> about this line of questioning: Linux TCP != IETF TCP, or, said
>>> another way, Implementations and Specifications are not the same
>>> thing. Recall that QUIC is a series of specifications being developed
>>> by the IETF, similar to the myriad RFCs that loosely define "IETF
>>> TCP". Linux's TCP stack is one such interpretation/implementation of
>>> TCP, and while it can inform the decisions of IETF workings, it should
>>> not be an undue determiner of the development of said specifications.
>>> Similarly we have an even wider and more diverse collection of QUIC
>>> interpretations/implementations in various states of maturity and
>>> deployment. Many diverge, sometimes heavily, from the specifications
>>> especially insofar as recovery is concerned. Chrome and us (mvfst) use
>>> BBR as the default congestion controller, others use Cubic or HSTCP or
>>> who knows what else. Chrome, AFAIK has a notion of adaptive reordering
>>> similar to Linux TCP and modifications to the draft-suggested time
>>> thresholds and PTO strategy.
>>>
>>> Whether some of these changes should be codified in the QUIC
>>> specifications is a good question. My hope is that these best
>>> practices will gain deployment experience and drive consensus at the
>>> IETF in future iterations of QUIC recovery and encounter fewer
>>> roadblocks than innovations based on TCP have historically
>>> experienced. As it is now though, QUIC's charter, by design, largely
>>> limits us to the adaptation of recovery and congestion control work
>>> that has already gained consensus via other IETF RFCs. That being
>>> said, what we have in the recovery specs may not be the "state of the
>>> art" exactly, but it is still quite good and I would argue has other
>>> significant advantages over what is commonly deployed in TCP, even
>>> Linux TCP.
>>>
>>> Matt Joras
>>>
>>> On Thu, Jul 9, 2020 at 9:22 AM Jonas Reynders
>>> <jonas.reynders@student.uhasselt.be> wrote:
>>> >
>>> > I have been looking at the recovery draft 29 of QUIC and comparing it
>>> to TCP, specifically the linux implementation in kernel v5.4. I have a
>>> couple of questions about the design choices that differ from TCP in linux.
>>> >
>>> > The first question is in regards to ack-based loss detection where
>>> QUIC uses both a packet-threshold and time-threshold method. Looking at the
>>> linux implementation (kernel v5.4), TCP uses only a time-threshold method
>>> (RACK) as default. From my understanding both methods were used in linux
>>> some time ago, around the time it was clarified to use both methods for
>>> QUIC as well. However since then TCP changed to RACK only as the default,
>>> from what I found out it was due to tests at google detecting a 10%
>>> reduction in recovery latency (reference:
>>> https://patchwork.ozlabs.org/project/netdev/cover/20180516234017.172775-1-ycheng@google.com/).
>>> My question is why QUIC still recommends using both methods as default? I
>>> can understand that in a datacenter context, where RTT values are really
>>> low, RACK can struggle due to timers not being fine grained.
>>> >
>>> > My second question is in regards to the time threshold
>>> calculation(6.1..2 in recovery draft). QUIC uses a constant value (9/8) for
>>> the time threshold, whilst in linux an adaptive threshold is used in the
>>> form of a reordering window variable. In the recovery draft it is mentioned
>>> that adaptive thresholds can be used, however it’s not mandatory. Is there
>>> a reason why QUIC doesn’t use the linux method for an adaptive threshold?
>>> The draft also mentions that reordering can be more common in QUIC and that
>>> adaptive thresholds have the advantage of being more aggressive when there
>>> is no reordering. The same question for the packet threshold, QUIC uses a
>>> constant threshold of 3 packets whilst in linux there is an adaptive
>>> threshold.
>>> >
>>> > My last question is that of merging the TLP and RTO into PTO. If I
>>> understand correctly, this was possible due to probe packet being
>>> ack-eliciting. And, that the time-threshold method would detect loss in a
>>> similar manner to RTO, with the RTO also causing a congestion collapse if
>>> no acks were received for consecutive RTOs. I assume that keeping RTO would
>>> unnecessarily complicate things, is that correct?
>>>
>>>