Re: [EXTERNAL] Re: WGLC review of draft-ietf-quic-recovery-29

Gorry Fairhurst <gorry@erg.abdn.ac.uk> Fri, 10 July 2020 08:21 UTC

Return-Path: <gorry@erg.abdn.ac.uk>
X-Original-To: quic@ietfa.amsl.com
Delivered-To: quic@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 609C63A0EBA for <quic@ietfa.amsl.com>; Fri, 10 Jul 2020 01:21:08 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.896
X-Spam-Level:
X-Spam-Status: No, score=-1.896 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, HTML_MESSAGE=0.001, SPF_HELO_NONE=0.001, SPF_NONE=0.001, URIBL_BLOCKED=0.001] autolearn=ham autolearn_force=no
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id v6alN8-fdlgE for <quic@ietfa.amsl.com>; Fri, 10 Jul 2020 01:21:05 -0700 (PDT)
Received: from pegasus.erg.abdn.ac.uk (pegasus.erg.abdn.ac.uk [137.50.19.135]) by ietfa.amsl.com (Postfix) with ESMTP id 008F13A0EC1 for <quic@ietf.org>; Fri, 10 Jul 2020 01:21:04 -0700 (PDT)
Received: from GF-MacBook-Pro.lan (fgrpf.plus.com [212.159.18.54]) by pegasus.erg.abdn.ac.uk (Postfix) with ESMTPSA id C7D021B000A3; Fri, 10 Jul 2020 09:21:01 +0100 (BST)
Subject: Re: [EXTERNAL] Re: WGLC review of draft-ietf-quic-recovery-29
To: Praveen Balasubramanian <pravb@microsoft.com>, IETF QUIC WG <quic@ietf.org>
References: <MN2PR00MB073663726DB5AFE6885D0A6BB6670@MN2PR00MB0736.namprd00.prod.outlook.com> <53187d65-f7b9-b99a-f68b-b267303ab399@erg.abdn.ac.uk> <CH2PR00MB0726D18611EC030BF548CA0DB6640@CH2PR00MB0726.namprd00.prod.outlook.com>
From: Gorry Fairhurst <gorry@erg.abdn.ac.uk>
Message-ID: <40818629-e1dc-f0f1-6173-984a8166c7c2@erg.abdn.ac.uk>
Date: Fri, 10 Jul 2020 09:21:00 +0100
User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.13; rv:68.0) Gecko/20100101 Thunderbird/68.10.0
MIME-Version: 1.0
In-Reply-To: <CH2PR00MB0726D18611EC030BF548CA0DB6640@CH2PR00MB0726.namprd00.prod.outlook.com>
Content-Type: multipart/alternative; boundary="------------7F5A35BA621E9AFAB1111B71"
Content-Language: en-GB
Archived-At: <https://mailarchive.ietf.org/arch/msg/quic/vRCD2HcWw1gUz7cWbISsV3Q1c4c>
X-BeenThere: quic@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: Main mailing list of the IETF QUIC working group <quic.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/quic>, <mailto:quic-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/quic/>
List-Post: <mailto:quic@ietf.org>
List-Help: <mailto:quic-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/quic>, <mailto:quic-request@ietf.org?subject=subscribe>
X-List-Received-Date: Fri, 10 Jul 2020 08:21:08 -0000

Thanks Praveen,

I'd like to talk more, see below:

On 09/07/2020 17:11, Praveen Balasubramanian wrote:
>
> Within data center mainly. To hit 40 Gbps line rate you effectively 
> have to batch as much as you can to minimize the CPU processing costs. 
> The burst size today is capped to 64k in most cases for TCP. Now that 
> we have segmentation offload for UDP the same should apply. Even for 
> Internet facing traffic, the bursts primarily impact the first hop 
> within the datacenter because subsequently you get statistical 
> multiplexing from ToR switch handling traffic forwarding for many 
> flows and ports.
>
I get this for this use-case and comments on high-speed NICs. I agree 
offload is important.

> Ok understood about there being no conflict. But I still contend that 
> the MUST regarding burst size is way too restrictive. We should allow 
> for experimentation here.
>
I am not sure. The IW spec for TCP was based on evidence back around 
2010, and the IETF agreed in RFC6928 that a burst size of 10 was 
acceptable as an EXP spec for TCP.

I think we can safely refer to that - even perhaps take the 
leap-of-faith that EXP for TCP is equivalent to PS for QUIC, given 
subsequent experience.  I think the IW  **is** the burst size that the 
IETF should specify (because it's dictated by buffering that is expected 
on the path).

I see opportunities here for experimentation and measurement 
opportunities, and new specs can be proposed. Or the Specs can be recast 
with more detail. I also know that practical experience is that many 
people have configured (probably tested, I hope) much larger IW's - and 
I'd hope they also make a reasonable effort to monitor for possible 
interactions with other Internet applications and services (as per 
section 12 of RFC6928).

> Or present data that this is universally bad on all networks.
>
I think you touch on something that the IETF hasn't been great at 
talking about:

I could myself indeed likely produce data this shows this is "bad" in 
some networks that I value, and also show side-effects of collateral 
damage to other flows and adverse effects of line-rate bursts into 
equipment with small buffers. I don't know how useful that would be at all.

The IETF has typically defined a spec for endpoint. The way a modern 
endpoint is implemented can be quite different in a DC from a single 
device connected to the Internet. In a DC, I understand it is common  to 
distribute transport functions to other parts of the network. e.g.: DDOS 
protection could be done on a server; or by a box in front of it - 
similarly what matters to the network CC is the  bursts of traffic that 
leaves a DC within a flow to a local receiver, not what leaves a 
particular server within the DC network - I think in future we may do 
well to consider these differences?

> Current production use of TCP shows there is no major problems with 
> bursting higher. In fact for low latency same rack scenarios this will 
> boost receive performance as well due to better opportunity for LRO etc.
>
It's also possible that QUIC LRO (or whatever) includes some notion of 
per-flow burst-mitigation. Is that impossible?

However, I personally would not hold-up QUIC, to arrive at a new consensus.

Gorry

> *From:* Gorry Fairhurst <gorry@erg.abdn.ac.uk>
> *Sent:* Thursday, July 9, 2020 1:39 AM
> *To:* Praveen Balasubramanian <pravb@microsoft.com>; IETF QUIC WG 
> <quic@ietf.org>
> *Subject:* [EXTERNAL] Re: WGLC review of draft-ietf-quic-recovery-29
>
> Hi Praveen,
>
> On 08/07/2020 22:29, Praveen Balasubramanian wrote:
>
>     Section 7.9
>
>     "Implementations MUST either use pacing or another method to limit
>     such bursts to the initial congestion window; see Section 7.2."
>
>     This seems to preclude use of segmentation offload of sizes
>     greater than IW. In datacenters we routinely send bursts that are
>     higher without causing loss.
>
> Is that within data centres? or from data centres to Internet 
> destinations? ... if the latter, how can the sender be sure this is 
> not impacting other traffic?
>
>     The MUST here seems unnecessary. It also conflicts with the
>     RECOMMENDED in an earlier sentence.
>
> I actually don't see this as conflicting. The recommendation in this 
> section is to pace. The requirement is that sender limits burst, using 
> some method, to the size of IW.
>
> Gorry
>