Re: [tsvwg] David Black (individual) on safety of L4S for the Internet

Jonathan Morton <chromatix99@gmail.com> Sat, 09 May 2020 09:42 UTC

Return-Path: <chromatix99@gmail.com>
X-Original-To: tsvwg@ietfa.amsl.com
Delivered-To: tsvwg@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 825E23A0949 for <tsvwg@ietfa.amsl.com>; Sat, 9 May 2020 02:42:12 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.849
X-Spam-Level:
X-Spam-Status: No, score=-1.849 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, FREEMAIL_ENVFROM_END_DIGIT=0.25, FREEMAIL_FROM=0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id SPUcAeHr2hF7 for <tsvwg@ietfa.amsl.com>; Sat, 9 May 2020 02:42:10 -0700 (PDT)
Received: from mail-lj1-x22f.google.com (mail-lj1-x22f.google.com [IPv6:2a00:1450:4864:20::22f]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 332BC3A0945 for <tsvwg@ietf.org>; Sat, 9 May 2020 02:42:10 -0700 (PDT)
Received: by mail-lj1-x22f.google.com with SMTP id y4so4235388ljn.7 for <tsvwg@ietf.org>; Sat, 09 May 2020 02:42:10 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:subject:from:in-reply-to:date:cc :content-transfer-encoding:message-id:references:to; bh=N1ZypOfm+wfVez4yVZW205la5LN8bVq5MrLbCWR2ml8=; b=WXxviNia28s8RZlmizb4nD/miD8KStmvTBYaUTR93ZoIOZtBAz/eJym8V2FX7hHgvA oST22JQLwY/z6rx0AQ0hFJ1YFjDWkJpkNQlQMMK8YeUpbOw6PnC2Jcp+p3NfK4mVJX8a Wqv/R53H79EIeuIv8q4NRarT963z2nrzx8NZaujzp9jSvi89r8bEKAGEzlRFHCE6JVuC yDT5sRY2DlEmNI4BU8OEpcyPCKU5cAlZyesJI2gUs1tvesQeG8ZXef96lQYCJPW9WTUf BtTRaS/0zSvYcq49fJgY4mr73FwZdw3KngE4ziLEJYKChCMrVkRpO0U3n6pEF3iIrQXQ Rv2w==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:subject:from:in-reply-to:date:cc :content-transfer-encoding:message-id:references:to; bh=N1ZypOfm+wfVez4yVZW205la5LN8bVq5MrLbCWR2ml8=; b=fqhCA1kiPo62bUShFEE0YmuS1Qm1jOY5Qoa92Q84QLar07JH7Ko56JfR8EUL7IYyPv qUNHNZgV6SSVN2NrnD1jU6eX5E1RXtgJBPAKMxnm5FO0wzr45HVmBI1JSP4J0h6yDEDN EHXtTPEB9i6auZliwSay078sSPdsafnL7C2VCVVQ5uFmNmzdoi44BA0b24tB7cse4SAi GzVEb+IJ6girvSXKvkz7QT1kvitesakmoCaVIGl7ENl8NTfe9CWI2jdbjzmI5V29G1jA U55TaDoAFKJdZs2a24h4CjCkeAE5iNGmGU1LHpt2JEp/VGULYThbc+bAEufmzHvFxUmI iIkA==
X-Gm-Message-State: AOAM531aNjM6rOCXfXdeuc7lsGwRLDe56dOLshaId5xN2ZsJbN0l0w1o A/dmmJR93jhgkc5F99RTglc=
X-Google-Smtp-Source: ABdhPJxKnmP1P3LVdUampCmvj+9ZU85bJVbZj/277SNZ0tgnNLatCtJVPeW7mJSKwgGpHbP0L/4Z0A==
X-Received: by 2002:a2e:988f:: with SMTP id b15mr4531698ljj.232.1589017328354; Sat, 09 May 2020 02:42:08 -0700 (PDT)
Received: from jonathartonsmbp.lan (83-245-235-192-nat-p.elisa-mobile.fi. [83.245.235.192]) by smtp.gmail.com with ESMTPSA id b2sm3476225lfi.14.2020.05.09.02.42.07 (version=TLS1_2 cipher=ECDHE-ECDSA-AES128-GCM-SHA256 bits=128/128); Sat, 09 May 2020 02:42:07 -0700 (PDT)
Content-Type: text/plain; charset="us-ascii"
Mime-Version: 1.0 (Mac OS X Mail 11.5 \(3445.9.1\))
From: Jonathan Morton <chromatix99@gmail.com>
In-Reply-To: <CADVnQymz-p_OF_QLOWHt2hPngLzUE=GkCm0epDb4SccbEDzRmA@mail.gmail.com>
Date: Sat, 09 May 2020 12:42:06 +0300
Cc: "Black, David" <David.Black@dell.com>, "tsvwg@ietf.org" <tsvwg@ietf.org>
Content-Transfer-Encoding: quoted-printable
Message-Id: <3C64B584-1A0E-4C8D-9325-09724289877C@gmail.com>
References: <MN2PR19MB4045DBC270D70DECE5F2B4AC83A20@MN2PR19MB4045.namprd19.prod.outlook.com> <CADVnQymXFe7o4M_GgwqxDP0x5UsAKE+1oQcavyDTF04gP-S34Q@mail.gmail.com> <6E67B51E-6F0A-4BC9-9A77-478F6C5B1222@gmail.com> <CADVnQymz-p_OF_QLOWHt2hPngLzUE=GkCm0epDb4SccbEDzRmA@mail.gmail.com>
To: Neal Cardwell <ncardwell@google.com>
X-Mailer: Apple Mail (2.3445.9.1)
Archived-At: <https://mailarchive.ietf.org/arch/msg/tsvwg/CEJQBubXtVUOHJuDhNb0mPxWTJk>
Subject: Re: [tsvwg] David Black (individual) on safety of L4S for the Internet
X-BeenThere: tsvwg@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: Transport Area Working Group <tsvwg.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/tsvwg>, <mailto:tsvwg-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/tsvwg/>
List-Post: <mailto:tsvwg@ietf.org>
List-Help: <mailto:tsvwg-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/tsvwg>, <mailto:tsvwg-request@ietf.org?subject=subscribe>
X-List-Received-Date: Sat, 09 May 2020 09:42:13 -0000

> On 9 May, 2020, at 5:02 am, Neal Cardwell <ncardwell@google.com> wrote:
> 
>> The problem with DCTCP is not its response to loss, though there was a serious
>> bug that apparently went unnoticed for a long time. Rather, it has to do with
>> the response to CE marks applied by an AQM. We can analyse this qualitatively
>> and show a clear difference which will always result in severe unfairness. This
>> is also easy to confirm by experiment.
> 
> I see. Thanks for clarifying. I'm aware of the issues with DCTCP
> operating in an RFC3168 AQM. The sentence introducing this thread
> ("DCTCP-like traffic outcompetes the TCP-like traffic") did not
> specify an AQM (RFC3168 or RFC8257), so absent the mention of an AQM I
> was imagining this was referring to the default case of a non-ECN
> single-queue drop-tail FIFO.
> 
> However, I suppose I should have been able to deduce from the
> structure of the argument that there was implicitly an RFC3168 AQM in
> this scenario, so my apologies for not deducing that.

It's a bit more subtle than that: the unfairness exists regardless of the type of AQM, so long as it cannot distinguish a DCTCP flow from a conventional flow (for which FQ is sufficient).  The unfairness stems from a different interpretation of the common congestion signal, and is ultimately caused by DCTCP, not by the AQM.

And that is the case for the "shallow marking" switches you mentioned earlier.  Even though they were designed with DCTCP in mind, they are not designed to provide fairness between DCTCP (or a transport that resembles it, as TCP Prague does) and conventional TCP (or a transport that resembles it, such as TCP-SCE or QUIC).  So while a conventional TCP passing through such a switch my merely experience poor utilisation due to the shallow buffer, a conventional TCP crossing DCTCP traffic in that same switch may find itself completely starved.

It's crucial that everyone in this debate fully understands this point, because DCTCP-like behaviour underpins every aspect of L4S.

A consequence of that is that L4S expects the network to be upgraded, in full, to either understand its classification signal or implement FQ.  The *whole Internet*, from datacentre to CDN to core to backhaul to last-mile to third-world farmer just trying to check his e-mail.  That is not going to happen within any reasonable timeframe.

There are alternatives which recognise the existence of AQMs already deployed, and explicitly distinguish the high-fidelity and conventional congestion signals so that they aren't confused with each other upon receipt.  The SCE team has proposed one, and I now see that Jake Holland has proposed another.  I think both of these alternatives should be given serious consideration.

 - Jonathan Morton