Re: [tsvwg] dispute (Re: Results of consensus call on ECT(1) usage), Regarding testing

Sebastian Moeller <moeller0@gmx.de> Sat, 23 May 2020 09:29 UTC

Return-Path: <moeller0@gmx.de>
X-Original-To: tsvwg@ietfa.amsl.com
Delivered-To: tsvwg@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id AAAD93A0AAD; Sat, 23 May 2020 02:29:08 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.649
X-Spam-Level:
X-Spam-Status: No, score=-1.649 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, FREEMAIL_ENVFROM_END_DIGIT=0.25, FREEMAIL_FROM=0.001, RCVD_IN_MSPIKE_H2=-0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=no autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (1024-bit key) header.d=gmx.net
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id hjQIGG50GRms; Sat, 23 May 2020 02:29:06 -0700 (PDT)
Received: from mout.gmx.net (mout.gmx.net [212.227.15.18]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 40EA43A0AAC; Sat, 23 May 2020 02:29:06 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=gmx.net; s=badeba3b8450; t=1590226100; bh=eYxWm898RHTRGwMYHezY3Oc93dD9G+70B3H6/vD+d+I=; h=X-UI-Sender-Class:Subject:From:In-Reply-To:Date:Cc:References:To; b=aeyctnLjWqzzCfx6SO/f53JMmW0CXfTQUbchZ4b3iruBc+MY/4+ZUpFXWfZygvDbP wwrAF+Ls4NwViCYweUBWrBP1dNGdZknH3jk9yYFrZT7Oldo966xHPO/1y75OEmsmEE JK0ILy94zGe9gV30tZApJkXp4vZDoJRHTg6OzZPk=
X-UI-Sender-Class: 01bb95c1-4bf8-414a-932a-4f6e2808ef9c
Received: from hms-beagle2.lan ([95.112.110.247]) by mail.gmx.com (mrgmx005 [212.227.17.190]) with ESMTPSA (Nemesis) id 1MRTN9-1jQXxu3ieD-00NPzh; Sat, 23 May 2020 11:28:19 +0200
Content-Type: text/plain; charset="us-ascii"
Mime-Version: 1.0 (Mac OS X Mail 12.4 \(3445.104.14\))
From: Sebastian Moeller <moeller0@gmx.de>
In-Reply-To: <026e270b-0a3a-85c9-2c45-7560b50050eb@bobbriscoe.net>
Date: Sat, 23 May 2020 11:28:16 +0200
Cc: Paul Vixie <paul@redbarn.org>, "tsvwg@ietf.org" <tsvwg@ietf.org>, "tsvwg-chairs@ietf.org" <tsvwg-chairs@ietf.org>
Content-Transfer-Encoding: quoted-printable
Message-Id: <71FDB139-DE8A-49F8-AD94-33418202FEF3@gmx.de>
References: <d182f539-e0a2-e924-9556-db6577f47357@mti-systems.com> <3228077.bNJ4EoEDyu@linux-9daj> <026e270b-0a3a-85c9-2c45-7560b50050eb@bobbriscoe.net>
To: Bob Briscoe <in@bobbriscoe.net>
X-Mailer: Apple Mail (2.3445.104.14)
X-Provags-ID: V03:K1:yCO+M+G8jAFQS7Ra8k5SZFESBQkIJKV9vV8NtlZZA8qdcHiCUd5 8ddqGVXeMHuInZVruBxJYJWE2wXiW5ziQqpvEfnz/J+fQruo0wE8KcxqnPb3aG+WMt6BdCO v/2EMVOhc2u+4mOP8D4SZ/WHEZCS2aFtGWvnOXGjFSe+nObTiuJcW/tsFlLBaX0hVpWwQZ2 cOWDa8cCBZQE984XvE2FQ==
X-UI-Out-Filterresults: notjunk:1;V03:K0:7zEy2vOY/24=:JNMTnSENDGShEKAla+cI9J oZYqCNNR+uSHcVXrrzDfVBuWlIlXQjlGjQ0AgbUorMqAtq4pf+Y/lfYPekJ6f2MbRgggt1YOh BB8qqYcUwVsJnuj460B8LF1f9KWjGxBqnM/5ohobBB4tQBECzsBaETm88Fd0Iyp3sssap9PKg cdIrAJaX4cKkAAm8w7XkrYkmGBJ5Z4vw+NDEoOj51JAtuYeqk6tkWysmOQ3/Z5TT6BkD1ksju AzMlcNRxoyGWXJTGmLsnI89vELKZtbVU7JaXKslyl+yvBdVE4afvbRtXzrySvioAq7EMi86QX erqiExHecMLnhWSOVcEOWKrM/FYwFh8074wPkopHl3nxD/twANq9sKAUPPZlLJWRJcUeafdGz 955KNHmV7PQkriBQZzPy6DgyTC4FpKXBM6VKaqceZ48ZnWjFte60IT6hjZiLp8AvUhnrHaymD 8tVxhnGKee9MfZd2f+DXld5BKFHcotUDZhs/CEdlrTDJqUgBLRXnWJoJ/smfVB47WYOim5+YE 3A3osUmGldl2jTOY7u6bboCbrcm+lspAZanfn8+hY6aQ4ZAllDSx3EAdvz3ppDflbTbgOVmqt cBcZuyRTqlzfX6Gn4KbYDVfG2iuMD27sGCi0XMFUNPe/t0az22qjSmN7VARibiKBG7scdoBB2 htqQUqTvvWWH0fzZ4B/TnS8XM3pwn1I2CcuTPACeEnwevCpw4ynlc7O5c/D3847LtC8xUiWxe 6HxzKxUXMEBTadlwYsi20olkRB3ngODeV7u8eRmKAdb0Tto4fOS1Mqp8ORejU4ISJoDdWoP78 VuKzAaTHSybro2ruUKHC0F4tvZDeAqCWL+j1c9O1Sofa1zZOHJGlGFTThNcC+5fCYk8eTgn59 3QBktvAPfMlYcvOVLD5Elchm98I0R+oNpCza8kTMjgms4nFzw9MHPCKD2wqqeVauWSgL7AZ61 pFNNVJDBOejZBISuRQF8n3cQqF5XDTPs1m7kk56QjReu0GRMZkQ/vZFPMcptVf5pVaGz/HbYa ljVNrAqpZNNLloXMVTgqvB7ljO1T9iKtHNuIWgdEK0jlv/YAnDl4ypIi314ShJAYNRsPYJlJK M/N3etpIgO1HPaCDgHvGyZ6Y3y4djJLVvsP5VmY8JyBCRUx7KHesl5/HWxOOx22nIMx2y+ToS oHULT/DNMUOeJqSLdVoknmhKaNgamNKvKRb54TgVHJRY+n8R5F22kQTxPvacEkD59Qc8+ACUG A4xshGIje0KZxTGOf
Archived-At: <https://mailarchive.ietf.org/arch/msg/tsvwg/tC0IWg8f_-nwxYJB1j_7MoFgQ9o>
Subject: Re: [tsvwg] dispute (Re: Results of consensus call on ECT(1) usage), Regarding testing
X-BeenThere: tsvwg@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: Transport Area Working Group <tsvwg.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/tsvwg>, <mailto:tsvwg-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/tsvwg/>
List-Post: <mailto:tsvwg@ietf.org>
List-Help: <mailto:tsvwg-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/tsvwg>, <mailto:tsvwg-request@ietf.org?subject=subscribe>
X-List-Received-Date: Sat, 23 May 2020 09:29:09 -0000

Bob, chairs, list,

Top post, because this is not a on-to-ne response to Bob points, but to the sentiment of his argument about testing, as I understood it; since this was triggered by Bob' response I opt to keep it in thread instead of opening a new thread.


I want to take the opportunity to register my objections to Bob's implicit claim below, that L4S has seen sufficient  and diverse enough testing, and that legacy testing  results with DCTCP fro the L4S's effort early days is simply transferable to TCP Prague. 
	Also I want to note that quantity of testing is a imprecise (bad) proxy for quality of testing. In a Popperian sense, I got the impression, that L4S went out of its way to document where white swans can be found instead of spending time and effort trying to find non-white swans, as a result the hypothesis all swans are white (or precisely the proposed L4S design works robustly and reliably) is still insufficiently tested.

1) Quantity: 
	I agree that the L4S team has presented an impressive array of testing results, so I will concede the sheer number. 

2) Diversity:
	Here the effort is considerably less impressive, it took the SCE's team effort to present data for the natural cases of asymmetric links and bi-directional saturation. For a technique primarily intended to the ISP/end-user edge these two are not extreme corner-cases but rather the bread and butter any AQM/QoS solution needs to deal with acceptably, period.
	I find it immensely troubling that Bob, in one earlier e-mail conceded that L4S would probably not work well with bi-directional saturation, and never ever bothered to make that a testing priority. This is the looking for more white swans approach I allude to above, but seemingly by conscious choice and not by simply overlooking the critical cases.
	I am also puzzled that the WG here seems determined to push L4S/ETC(1) through even without sufficient evidence that all the pie-in-the-sky promises actually are kept. All tire-kicking so far implies that L4S over-promises and under-delivers once one looks beyond the "normal" conditions which admittedly have been tested in considerable quantity, but essentially testing the same condition that L4S deals with well multiple times does not increase the relevant depth of testing, IMHO.

3) Legacy testing:
	The amount of major bugs, that the SCE team found in TCP Prague when doing comparison testing reveals that TCP Prague development does not seem to use regular regression testing. Based on that fact and the fact the TCP Prague by now has diverged from DCTCP in a noticeable fashion. I question whether the old DCTCP testing results have more that historic value by now, as it seems far from clear that TCP Prague will behave similarly, either because of evolutionary divergence or because of newly introduced but yet unfixed bugs. If the L4S team wants to have this testing taken seriously I encourage them to repeat these tests with a current TCP Prague version. Otherwise these tests carry little vsalue for predicting how current L4S implementation will behave.



Best Regards
	Sebastian

P.S.: I realize that engineering is the "art of the possible" and that often perfect is a worse goal than good enough. But my objections against the current L4S implementation & design are that they simply are not :good enough" (yet?), and that I consider it to be an odd situation that the WG seemingly steers down a road of spending the last ECN codepoint on a sheer promise alone. Then again I predicted that outcome immediately after the call for consensus was made to the lists, as the chairs might remember.



> On May 22, 2020, at 22:47, Bob Briscoe <in@bobbriscoe.net> wrote:
> 
> Paul, Chairs
> 
> On 22/05/2020 04:03, Paul Vixie wrote:
>> On Friday, 22 May 2020 02:32:45 UTC Wesley Eddy wrote:
>>> ...
>>> 
>>> Result of consensus call: TSVWG has interest in working on a low-latency
>>> service using ECT(1) as an input (e.g. L4S) in preference to working on
>>> enabling high-fidelity congestion control using ECT(1) as an network
>>> output signal (e.g. SCE). The chairs see strong support for this
>>> direction, including significant interest from network operators and
>>> vendors.
>> the relevant section of RFC 2026 is as follows:
>> 
>>> 6.5.1 Working Group Disputes
>>> 
>>>    An individual (whether a participant in the relevant Working Group or
>>>    not) may disagree with a Working Group recommendation based on his or
>>>    her belief that either (a) his or her own views have not been
>>>    adequately considered by the Working Group, or (b) the Working Group
>>>    has made an incorrect technical choice which places the quality
>>>    and/or integrity of the Working Group's product(s) in significant
>>>    jeopardy.  The first issue is a difficulty with Working Group
>>>    process;  the latter is an assertion of technical error.  These two
>>>    types of disagreement are quite different, but both are handled by
>>>    the same process of review.
>> on a personal note, my own expressed position was, that if something had to be
>> done it should be an input not an output. so, the chairs' declared result is
>> compatible with what i said. therefore i'm not claiming that my own views were
>> not adequately considered (a, above). rather, i'd concerned about (b) above.
>> 
>> this choice was not made based on technical merit, by the chair's own
>> description (quoted above). both the rough consensus and the running code
>> looked to me more like choice 3 ("more testing"). what the operators and
>> vendors who can afford to attend these meetings are interested in, may differ
>> from the community's long term interests.
>> 
>>>    A person who disagrees with a Working Group recommendation shall
>>>    always first discuss the matter with the Working Group's chair(s),
>>>    who may involve other members of the Working Group (or the Working
>>>    Group as a whole) in the discussion.
>> we are "here." i hope the chairs can explain why choice 3, more testing, was
>> not the rough consensus they think they witnessed,
> 
> [BB] Can I suggest that the chairs ask those asking for more testing to write a review of all the tests that have been done so far (by proponents and opposition), and identify the specific further tests that they believe are necessary. Then the chairs or the WG or whoever is judging this dispute can decide whether the tests they are asking for are critical prerequisites for assignment of an experimental codepoint in the IP header.
> 
> Reasoning: There have been a number of cases where people have complained in general terms about testing, and subsequently it became apparent that they had not read the test reports - some of which had even been verified independently and published by third parties.
> 
>> and also, how the time
>> value of a choice of 1 or 2 "now" is higher than the merit value of getting to
>> an evidence-based consensus "later".
> 
> [BB] I notice that you (Paul) said you'd been following this since the first interim. You may not be aware that this work was brought to the IRTF in March 2015, then brought to the IETF in July 2015 along with thousands of test results. A very large BoF was held in Jul 2016 that led to L4S drafting work being started in tsvwg and tcpm. The debate over the pros and cons of different codepoints was completed around Nov 2017, when ECT(1) was settled on and that debate was written up in the ecn-l4s-id draft. In March 2019, L4S was approaching WGLC, when the SCE proposal for a different codepoint arrangement was first published, and proponents of both approaches have published large volumes of test results in the 14 months since then.
> 
> So those taking part in the recent consensus call had a wealth of test data on which to base their decision, from both proponents and opponents of the outcome that the chairs have just announced.
> 
> 
> In the past, I have been in a similar position, when I have discovered that a major decision was imminent that would close off areas that were important to me. The closest example was when draft-ietf-tsvwg-ecn [RFC3168-to-be] was approaching WGLC. I took it upon myself to thoroughly read the draft and all its references, to catch up on all the mail archives, and to write a draft articulating our concerns (with Jon Crowcroft at that time). As I put our concerns, I could feel the time pressure of everyone involved breathing down my neck. By putting in the effort to put our concerns concisely and rapidly, the WG got them resolved quickly - Jon & I had to concede some aspects in the interest of keeping progress moving. As a result, RFC3168 went through WGLC soon after and was published only 7 months after we first intervened.
> 
> So, yes, there is a huge degree of time pressure now. The closing of this consensus call allows us to get moving in anger. You will have seen the number of companies that plan to deploy. No-one has bottomless project funding. I (and hopefully others) will try to point you at the resources that are relevant for your concerns. But I hope you will understand that, at the closing stages of the debate, we can't be expected to retype all the discussion each time a new person joins.
> 
> Cheers
> 
> 
> 
> Bob
> 
> 
> 
>>>    If the disagreement cannot be resolved in this way, any of the
>>>    parties involved may bring it to the attention of the Area
>>>    Director(s) for the area in which the Working Group is chartered.
>>>    The Area Director(s) shall attempt to resolve the dispute.
>>>        If the disagreement cannot be resolved by the Area Director(s) any of
>>>    the parties involved may then appeal to the IESG as a whole.  The
>>>    IESG shall then review the situation and attempt to resolve it in a
>>>    manner of its own choosing.
>>>        If the disagreement is not resolved to the satisfaction of the
>>>    parties at the IESG level, any of the parties involved may appeal the
>>>    decision to the IAB.  The IAB shall then review the situation and
>>>    attempt to resolve it in a manner of its own choosing.
>> i think a challenge was inevitable no matter what outcome was designated as
>> the consensus, and that this probably is a matter to be settled by IETF as a
>> body rather than by this working group in isolation.
>> 
> 
> -- 
> ________________________________________________________________
> Bob Briscoe                               http://bobbriscoe.net/
>