Re: [tsvwg] Update to Position Statement on ECT(1)

"Holland, Jake" <jholland@akamai.com> Thu, 28 May 2020 04:47 UTC

Return-Path: <jholland@akamai.com>
X-Original-To: tsvwg@ietfa.amsl.com
Delivered-To: tsvwg@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 71C363A07E5 for <tsvwg@ietfa.amsl.com>; Wed, 27 May 2020 21:47:17 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.1
X-Spam-Level:
X-Spam-Status: No, score=-2.1 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIMWL_WL_HIGH=-0.001, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=akamai.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id woxNDEyKyRWn for <tsvwg@ietfa.amsl.com>; Wed, 27 May 2020 21:47:15 -0700 (PDT)
Received: from mx0a-00190b01.pphosted.com (mx0a-00190b01.pphosted.com [IPv6:2620:100:9001:583::1]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id D2B123A07DC for <tsvwg@ietf.org>; Wed, 27 May 2020 21:47:15 -0700 (PDT)
Received: from pps.filterd (m0122332.ppops.net [127.0.0.1]) by mx0a-00190b01.pphosted.com (8.16.0.42/8.16.0.42) with SMTP id 04S4gYk1030969; Thu, 28 May 2020 05:46:31 +0100
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=akamai.com; h=from : to : cc : subject : date : message-id : references : in-reply-to : content-type : content-id : content-transfer-encoding : mime-version; s=jan2016.eng; bh=uWqdSJbnfq0vX892v1sMqsY8/NUz4ATqBqy0AId0ED8=; b=cg3+yq1OuynwFIRTHZEpNxuc/lMxGLJaLZQh6hZpDcc9jN20A/6+LTlRZcC1SXECMTg4 eLu6AbVktw7JpPUaRkbUJrz7jsIoNNpQ4KlYSczyCmPWAWpXTwqwei2vMk+miaiAqYfw Yda6OrPGXfbfY2slgGuYzIWHEF5WUdraycZVJ/KIHDzxDOQuMWibcguijx3Ly5Rl+nvS hyDxAykekOFdwUD09M7k/1EJ7nTceyHRdYtzIW5ZbVVChf5QGgUfuqzeStlSVk+E+Irw EHwQbUd4D4ZM/ZsEHPeIMVnK0oxXaYB1wSDbk5jVCeb7W/d6SzVE2h3q9ozlb5bXOQRV kQ==
Received: from prod-mail-ppoint6 (prod-mail-ppoint6.akamai.com [184.51.33.61] (may be forged)) by mx0a-00190b01.pphosted.com with ESMTP id 3188n0ca5j-1 (version=TLSv1.2 cipher=ECDHE-RSA-AES256-GCM-SHA384 bits=256 verify=NOT); Thu, 28 May 2020 05:46:31 +0100
Received: from pps.filterd (prod-mail-ppoint6.akamai.com [127.0.0.1]) by prod-mail-ppoint6.akamai.com (8.16.0.27/8.16.0.27) with SMTP id 04S42XL9030803; Thu, 28 May 2020 00:46:29 -0400
Received: from email.msg.corp.akamai.com ([172.27.123.57]) by prod-mail-ppoint6.akamai.com with ESMTP id 316y5uu70g-1 (version=TLSv1.2 cipher=ECDHE-RSA-AES256-SHA384 bits=256 verify=NOT); Thu, 28 May 2020 00:46:29 -0400
Received: from usma1ex-dag1mb6.msg.corp.akamai.com (172.27.123.65) by usma1ex-dag1mb6.msg.corp.akamai.com (172.27.123.65) with Microsoft SMTP Server (TLS) id 15.0.1497.2; Thu, 28 May 2020 00:46:28 -0400
Received: from usma1ex-dag1mb6.msg.corp.akamai.com ([172.27.123.65]) by usma1ex-dag1mb6.msg.corp.akamai.com ([172.27.123.65]) with mapi id 15.00.1497.006; Thu, 28 May 2020 00:46:28 -0400
From: "Holland, Jake" <jholland@akamai.com>
To: Bob Briscoe <in@bobbriscoe.net>, Martin Duke <martin.h.duke@gmail.com>
CC: TSVWG <tsvwg@ietf.org>
Thread-Topic: [tsvwg] Update to Position Statement on ECT(1)
Thread-Index: AQHWJXDCMj0HjdV7wk+M4pcr+ovB2aihqRUAgAAqnYCADOcbAP//2jKAgAFm54D//7rTAIAAwvOA//+WIYAAMzsDAAFl4pkA
Date: Thu, 28 May 2020 04:46:28 +0000
Message-ID: <8484B9B9-A209-401A-9759-703CEF0EFB65@akamai.com>
References: <BE44EAE9-5CFB-4F5D-85B8-05AFA516C151@akamai.com> <CACL_3VEbUHB-Omwp1-g5Tq3G3J-kKj9N3jPZLcfruicw3X=AsA@mail.gmail.com> <2CBBD8CD-2088-4E41-B113-EED665853D3C@akamai.com> <CAM4esxSFCBcxXjz5JJJg1z6+wwfN3mTrtJ8bKiBsj2TeOmmFSw@mail.gmail.com> <1D8D2AF8-F805-4BAC-8126-355A8337D830@akamai.com> <CAM4esxSMELAi0BMBRynYTx44iY6f-yLEWng4QQ2Pxt9J-haxFg@mail.gmail.com> <DE770902-CA1E-405C-A944-F12114AF2C3B@akamai.com> <CAM4esxQTyDNfNiAFhiHL9Zb3OPr9jivkrD2u8DtvhsMw_2Yv-g@mail.gmail.com> <789B55A8-A560-4C93-94FA-C62498B33411@akamai.com> <f60e36b7-0c04-be4b-c907-be1fd8ec66a3@bobbriscoe.net>
In-Reply-To: <f60e36b7-0c04-be4b-c907-be1fd8ec66a3@bobbriscoe.net>
Accept-Language: en-US
Content-Language: en-US
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
user-agent: Microsoft-MacOutlook/16.37.20051002
x-ms-exchange-messagesentrepresentingtype: 1
x-ms-exchange-transport-fromentityheader: Hosted
x-originating-ip: [172.19.88.116]
Content-Type: text/plain; charset="utf-8"
Content-ID: <C8F78FDBB981E7458AB8E2359FC4FB3E@akamai.com>
Content-Transfer-Encoding: base64
MIME-Version: 1.0
X-Proofpoint-Virus-Version: vendor=fsecure engine=2.50.10434:6.0.216, 18.0.687 definitions=2020-05-28_01:2020-05-28, 2020-05-27 signatures=0
X-Proofpoint-Spam-Details: rule=notspam policy=default score=0 suspectscore=0 malwarescore=0 phishscore=0 bulkscore=0 spamscore=0 mlxscore=0 mlxlogscore=999 adultscore=0 classifier=spam adjust=0 reason=mlx scancount=1 engine=8.0.1-2004280000 definitions=main-2005280020
X-Proofpoint-Virus-Version: vendor=fsecure engine=2.50.10434:6.0.216, 18.0.687 definitions=2020-05-28_02:2020-05-28, 2020-05-27 signatures=0
X-Proofpoint-Spam-Details: rule=notspam policy=default score=0 adultscore=0 lowpriorityscore=0 mlxlogscore=999 suspectscore=0 spamscore=0 mlxscore=0 impostorscore=0 malwarescore=0 phishscore=0 priorityscore=1501 cotscore=-2147483648 clxscore=1011 bulkscore=0 classifier=spam adjust=0 reason=mlx scancount=1 engine=8.12.0-2004280000 definitions=main-2005280025
Archived-At: <https://mailarchive.ietf.org/arch/msg/tsvwg/1Ds7uFAMqksgyRZREctvbBNQXXc>
Subject: Re: [tsvwg] Update to Position Statement on ECT(1)
X-BeenThere: tsvwg@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: Transport Area Working Group <tsvwg.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/tsvwg>, <mailto:tsvwg-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/tsvwg/>
List-Post: <mailto:tsvwg@ietf.org>
List-Help: <mailto:tsvwg-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/tsvwg>, <mailto:tsvwg-request@ietf.org?subject=subscribe>
X-List-Received-Date: Thu, 28 May 2020 04:47:18 -0000

Hi Bob,

Again, I'm trimming down to the summary in hopes of getting
better clarity, but for reference the full message I'm replying
to is here:
https://mailarchive.ietf.org/arch/msg/tsvwg/h6DPXNbPR9kXXzsZTuD9x7_Bs88/

I just wanted to respond to a few points.

> From: Bob Briscoe <in@bobbriscoe.net>
> Date: Wednesday, May 20, 2020 at 11:59 AM
...
> In summary,
> * There is only one unlikely condition (no.1) for spurious 
> retransmissions as a result of ECT1->0.
> * Whereas ECT0->CE only causes spurious retransmissions if all 5 
> unlikely or extremely unlikely conditions occur together.

I roughly agree with the analysis you gave; I do think you'd get
more spurious retransmissions in the case of multiple loaded
dualqs in series using the 2-signal scheme that Martin and I were
discussing than using the scheme in l4s-id.  (And thanks for
the references and the deeper dive, and the engagement with this
discussion.)

But the thing to compare is the overall consequences between the
2 schemes, of which reordering is one part.  (Not the direct
comparison of the reordering differences in a vacuum.)

The 2-signal scheme under discussion provides one major benefit,
namely that it would not redefine the meaning of a ECT(1)->CE
transition, and thus would not need the flag day I argued the
current L4S spec would need in my other recent message here:
https://mailarchive.ietf.org/arch/msg/tsvwg/THuFmguARLFQHWXG_VOyt0Wj5R8/

> Having engaged with this argument, I do want to emphasize that the 
> tunnel argument causes ECT1->0 not to work at all over 60-70% of the 
> Internet for many years.

I also agree the tunnel problem is a bigger downside of the proposed
2-signal scheme than the reordering issues.

You convinced me tunnels are a significant and serious issue, which
is in large part the driver on this discussion, and why I framed it
as possibly coming down to a choice between the 1/p signal and the
1/sqrt(p) signal, if we can't have both safely out of a tunnel.

However, organizing a flag day in order to enable an experiment that's
incompatible with a standard as widely implemented as RFC 3168 is a
heavy lift, and that proposal might not achieve the kind of consensus
that would be necessary to move that forward.

If no solution can be found that makes L4S's use of CE compatible with
RFC 3168 queues sharing with classic traffic, and no flag day can be
organized to responsibly deprecate the existing meaning of the
ECT(1)->CE transition, then the question becomes how to do something
worthwhile.  Even if that means starting with the 30-40% of the
internet that *can* be addressed in the near term without breaking
the one signal (CE) that can currently be passed through a tunnel
egress (with an eventual path forward over time if it proves useful).

This is the context where I thought our examination of the reordering
consequences made sense, in terms of thinking through the tradeoffs for
a proposal that does not redefine CE.

I'm not sure whether you're saying it's not worth doing anything at all
if the L4S 1/p redefinition of CE can't be deployed, but if you are I'd
tend to disagree.  A 1/p congestion signal seems worthwhile to try to
make happen and to use in the places where it can be used, even if those
are limited at first.  I just don't see it as clear-cut that the CE
semantic redefinition is going to be able to move forward, given the
poor interaction with existing implementations and the fundamentally
fragile heuristics in the proposed classic queue detection solution
(not to mention the poor behavior under testing that the SCE team and
your own collaborators have observed).

As I think Gorry put it, we're faced with a shortage of bits here, so
we might have to take a choice that's not as globally optimal as it
could be if we could change deployed systems whenever we wanted.  A
flag day is one path forward if there's a sufficiently broad consensus,
but I'm not convinced it'll be doable.

And I'd tend to be against it today, since I think we should keep
expanding on the deployment of the existing queues.  But I'm not
opposed to a standards track spec that says to treat ECT(1) as NECT
instead of as ECT(0), I think that's a fine idea that could open up
the possibility of using ECT(1) the way you want within a decade or so,
as support rolls out for avoiding the ambiguous signaling.  So it's
another reasonable path forward that might make L4S-style signaling
possible to set up for success down the road.

But again, it doesn't solve the near term problem of companies that
would like to get something rolling out sooner, but have so far been
working with a signal that's an incompatible use of an existing signal.

So to me it still seems worth considering a 2-signal approach first,
since it works right away for 30-40% of paths, doesn't break regular
AQM and CE (which, I must emphasize again, is already widely available
and finally in use at endpoints, and does most of the latency job
compared with drop-tail, which is the proper point of comparison for
the internet paths that are suffering from queuing delay today), and
could move forward with a longer term incremental improvement from
there.

Short of a magical bitless packet classifier, I'm not sure what other
options can help achieve anything that improves the congestion
signaling fidelity at internet scope in the near term.

Best regards,
Jake