Re: [tsvwg] David Black (individual) on safety of L4S for the Internet

"Scheffenegger, Richard" <rs.ietf@gmx.at> Tue, 12 May 2020 15:01 UTC

Return-Path: <rs.ietf@gmx.at>
X-Original-To: tsvwg@ietfa.amsl.com
Delivered-To: tsvwg@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 1730B3A0B30 for <tsvwg@ietfa.amsl.com>; Tue, 12 May 2020 08:01:56 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.899
X-Spam-Level:
X-Spam-Status: No, score=-1.899 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, FREEMAIL_FROM=0.001, RCVD_IN_MSPIKE_H2=-0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (1024-bit key) header.d=gmx.net
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id tC3fJXrLSw2I for <tsvwg@ietfa.amsl.com>; Tue, 12 May 2020 08:01:53 -0700 (PDT)
Received: from mout.gmx.net (mout.gmx.net [212.227.17.22]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 453C73A0BAC for <tsvwg@ietf.org>; Tue, 12 May 2020 08:01:53 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=gmx.net; s=badeba3b8450; t=1589295711; bh=ewzisY0jmV8mTfGsVkrfP4XlxRsQL0jB9mB2ENaJwP8=; h=X-UI-Sender-Class:Subject:To:References:From:Date:In-Reply-To; b=dIT4qRckaH7viMBEOg9Ku42WQzlhxetDFuUUu7Yje2a/rhFvsUKXcJhLO4DVkeAKJ N0uzz7yHgDhbNlWex615A90fXF9tQ1A1eLTP1J9UPMe6BKuEQW9ruZGawl9iQWwQyc DPy95/bapKFit2LwL9YAwGoaDi+dN3IhgeMHVHT0=
X-UI-Sender-Class: 01bb95c1-4bf8-414a-932a-4f6e2808ef9c
Received: from [192.168.233.104] ([185.236.167.136]) by mail.gmx.com (mrgmx104 [212.227.17.168]) with ESMTPSA (Nemesis) id 1MYNJq-1jcO1E3n3C-00VNTj for <tsvwg@ietf.org>; Tue, 12 May 2020 17:01:50 +0200
To: tsvwg@ietf.org
References: <MN2PR19MB4045DBC270D70DECE5F2B4AC83A20@MN2PR19MB4045.namprd19.prod.outlook.com> <CADVnQymXFe7o4M_GgwqxDP0x5UsAKE+1oQcavyDTF04gP-S34Q@mail.gmail.com> <6E67B51E-6F0A-4BC9-9A77-478F6C5B1222@gmail.com>
From: "Scheffenegger, Richard" <rs.ietf@gmx.at>
Message-ID: <711afebb-456b-abfd-e910-f1d31789bae1@gmx.at>
Date: Tue, 12 May 2020 17:01:50 +0200
User-Agent: Mozilla/5.0 (Windows NT 10.0; WOW64; rv:68.0) Gecko/20100101 Thunderbird/68.7.0
MIME-Version: 1.0
In-Reply-To: <6E67B51E-6F0A-4BC9-9A77-478F6C5B1222@gmail.com>
Content-Type: text/plain; charset="utf-8"; format="flowed"
Content-Transfer-Encoding: quoted-printable
X-Provags-ID: V03:K1:5QDBlBlW/9IiFo7Zw7vyFwnc0mV7qdm7rIcs0KUPHyxAM5V5TuP KEvK57GPt1s2sGdzHEN5IIaELF3idczFEpPelC5kbz/lKwOtbZiq1I2WJ8BmeJVIUFBVCS5 qhnvJEjbPNRKg8+8A+5Ah7VDzsYtXbWfYdioFWLAE1XLqoo/fwsi4djJgQXNWK6CkACVIcx IygHuvUHPKfwFGhoeR9iw==
X-UI-Out-Filterresults: notjunk:1;V03:K0:WzJolCKufGw=:NG4bMXNV3EcmkMtwNVV5iG XHgnTMABcLg8hizu+z9tlkCoM9LeGi25U40Kf8VOpVIkc0iL2CzRCPgOn5IBk7QvyBRUlGvS9 xiQEHSOX1Obz2Jn4CmZ5WYME9tnXfFv/+McyYibGVMzoLKTUvr/8wCwgPWoHZz+AbOSJK/kbD LQgJ/2YBbZd+/lsxuSSD+drBBo57K3w8Mkxu9q3SiLgVdwy8C9pMAzesM3UgYeVUwT2Cr1Q7I BN/YLzSHW+RbFmRzgP79MyEj9nqHU7hT9XUDhMRP4hlDVgzvj265Beaf+z5qqyvYWLSGJQK07 s9tuusgZg9tbh2HyfqukvNAjxjFrPJpAfwLy0St2rRATf2UBBzshCIAlDpPQBa3BZnTPj3RWe GgWHbUZTUXqDl34vC4O4NAp3krIan80pfTs/kwGFR3PzrUWQAna3Xh6//fu47afmNtUCkrku3 cDZoaKIkPRXXEoNGdmD7RWXnr+OJfq554AQhkdjbBPKZoz/F+vLU9SHVOMqY/pP2yJxCZvXvK SN7P4vawPqDnxUvRmDnS7pZp2k6yiL+0wuWPGW9Fd1UoWj0MkuBijxCc3PISJzthC6VHZXk13 gL5NzMXmlauB3RJTMpJyGqQhaTrhct5cT+bh8EJLfE7bi+R/LvsROxjO/YmdsyR0J630QSvUg vZfyWHe1KqVNhf7HLbgkkKQdz5gzFnMYXb6sgmMqL1lWWahaT+TU96HoYW59RoXJeqPLmBEsG BsgS15iFllOnXRZnL/j+WQ5UrJl/oEzucWvQbN8+53EkLsvkh7DFRPlucm1k9x/GNGEQDKyms ExEs+lh6CORRoDXDg0m6A6wAFQzyZjLZj07LGn8lsWlNEyKz/yKAkD7bd7dnCwPnmQhJE866e O+cnVdzVIApI/GGxe2S1Tq9m+pFDCCl+9Lv40vGwKjucYMRuq3Ht4mUBIqe5mNeHa2/9t0Dtf buHwyqQ9fVbuoYETYtOznMY4hTk/onloJ71NtMC0iiIe7q++CQmYXrb2XTNGXPAeKwskZj9zC mnHDQt3Hn70eUVScxZAHSynCdzKbhXtjX1Z+q8mLT6TmbyUuC7IuOS7sutJ9pAmF7YznsEdlz YOk69crg1InkKCe2p+STKw5FQMKsZ9hLnIo0wmp+vJnIdtSR7Rse3YFAjNZTMiYOtibFHsz+5 M9eDXWDY5gYx6mM4Q8WNlqNQDuXkYs1U5db2YT9u3SWNN9W55+FnO8xg6FHvJM7jJUUYdr4oY bMdooQjbX5jFkRGsm
Archived-At: <https://mailarchive.ietf.org/arch/msg/tsvwg/eLIFW4SE61wNP5YN-QQEeYJETYE>
Subject: Re: [tsvwg] David Black (individual) on safety of L4S for the Internet
X-BeenThere: tsvwg@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: Transport Area Working Group <tsvwg.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/tsvwg>, <mailto:tsvwg-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/tsvwg/>
List-Post: <mailto:tsvwg@ietf.org>
List-Help: <mailto:tsvwg-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/tsvwg>, <mailto:tsvwg-request@ietf.org?subject=subscribe>
X-List-Received-Date: Tue, 12 May 2020 15:01:56 -0000

Hi Jonathan,

A small nitpick:

 > In particular, to obtain the 50% reduction to a single CE mark that
NewReno implements, an entire congestion window's worth of segments must
be marked CE by the AQM.

The DCTCP sender has to believe that one window's worth of segments was
received with CE set; This is incidentally the backwards compatibility
mode of DCTCP against RFC3168 receivers, which will latch ECE until an
(acceptable) CWR has been received. (And based on the readily available
DCTCP / RFC3168 implementations, this is something that tends to happen
frequently).

You are, of course, correct in stating that shallow-threshold CE marking
is detremental to 1/sqrt(p) CC. But I can also support Neal's claim from
independent findings, that hardware is being deployed which only
supports 1/p marking in heterogeneous environments today (well, since 2
years ago).

Having an classifier which is more persistent across the internet than
DiffServ is IMHO key, if a consistent low-latency service is something
we strive for.

(And yes, L4S as it currently stands, may have issues to varying degrees
in certain scenarios. Nevertheless, I remain convinced, that those
problems can be resolved in a feasible way.

Richard



Am 09.05.2020 um 00:37 schrieb Jonathan Morton:
>> On 9 May, 2020, at 1:04 am, Neal Cardwell <ncardwell=40google.com@dmarc.ietf.org> wrote:
>>
>>> Current practice is not to mix DCTCP-like traffic (1/p-class congestion
>>> control) with TCP-like traffic (1/sqrt(p)-class congestion control, e.g.,
>>> NewReno) in the same queue because the DCTCP-like traffic outcompetes the
>>> TCP-like traffic to the point of starvation of the latter.
>>
>> For my education, does someone have a pointer to the experiments
>> underlying that description of DCTCP outcompeting Reno/CUBIC? I'm not
>> aware of that being a fundamental property of DCTCP. The DCTCP
>> algorithm specifies a Reno-style response to loss -
>> https://tools.ietf.org/html/rfc8257#section-3.5 - and has Reno-style
>> increase functions as well. So the algorithm should be Reno friendly,
>> AFAIK.
>
> The problem with DCTCP is not its response to loss, though there was a serious bug that apparently went unnoticed for a long time.  Rather, it has to do with the response to CE marks applied by an AQM.  We can analyse this qualitatively and show a clear difference which will always result in severe unfairness.  This is also easy to confirm by experiment.
>
> Assume that a single queue exists at the bottleneck, with a single ECN AQM which does not discriminate between flows and classes of traffic.  This queue and AQM are shared by diverse traffic flows, some of which adhere to RFC3168 and RFC-8511, some of which are Not-ECT, and some of which are DCTCP.  The Not-ECT packets will, in accordance with RFC-3168, be dropped if they would have been marked CE.
>
> The steady state for both Not-ECT and RFC-3168/8511 transports (which include NewReno and CUBIC) is multiple RTTs between CE marks.  Each round-trip containing a CE mark (or a packet loss) causes a Multiplicative Decrease to between 50% and 85% of the previous congestion window, as defined by RFC-8511 and earlier well-known specifications.  It then takes several RTTs to grow the congestion window to its previous value, where a further congestion signal can be expected.
>
> The steady state for DCTCP is two CE marks per round-trip.  Each CE mark causes half a segment to be subtracted from the congestion window, on average.  This is qualitatively different from the response specified by RFC-8511.  In particular, to obtain the 50% reduction to a single CE mark that NewReno implements, an entire congestion window's worth of segments must be marked CE by the AQM.
>
> Two results should be obvious from the above.  First, an AQM response that causes RFC-8511 compliant transports to back off substantially may cause very little response from DCTCP.  Second, an AQM response that produces the steady state in DCTCP will also apply CE marks (or packet loss) to almost every round-trip of other competing flows, which will cause conventional TCPs' congestion windows to collapse to very small values.
>
> That is why DCTCP is specified for deployment only in controlled environments where its interactions with conventional TCP can be strictly managed.
>
>   - Jonathan Morton
>