Re: [tsvwg] Gorry Fairhurst Individual thoughts on choosing whether/how to advance ECN work.

Sebastian Moeller <moeller0@gmx.de> Sun, 24 May 2020 08:17 UTC

Return-Path: <moeller0@gmx.de>
X-Original-To: tsvwg@ietfa.amsl.com
Delivered-To: tsvwg@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 956FF3A00DC for <tsvwg@ietfa.amsl.com>; Sun, 24 May 2020 01:17:08 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.649
X-Spam-Level:
X-Spam-Status: No, score=-1.649 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, FREEMAIL_ENVFROM_END_DIGIT=0.25, FREEMAIL_FROM=0.001, RCVD_IN_MSPIKE_H2=-0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=no autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (1024-bit key) header.d=gmx.net
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 2ZLlwNCGPxdo for <tsvwg@ietfa.amsl.com>; Sun, 24 May 2020 01:17:06 -0700 (PDT)
Received: from mout.gmx.net (mout.gmx.net [212.227.15.18]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id CE7243A00E1 for <tsvwg@ietf.org>; Sun, 24 May 2020 01:17:05 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=gmx.net; s=badeba3b8450; t=1590308222; bh=ulKQALlZDbC5ZTr+uL48iDRVYPwWSx1DSZZ0wSdCd1w=; h=X-UI-Sender-Class:Subject:From:In-Reply-To:Date:Cc:References:To; b=hPDGdkfGWrpr/9CHpVpvotDd5KLJvWejSQe/QTwc8rXSKrmyQ7EYcUBuW47VPZiri uaS3FjMTXG7WQGj4bkp3/UbheHQwZKL3vxh9pUK0uHg0+LDT+LGkaBUFWbJZVITJIE 8ohWhNqApusQ3JsrfJSjm3IWE5INJIlVTzlsuSv8=
X-UI-Sender-Class: 01bb95c1-4bf8-414a-932a-4f6e2808ef9c
Received: from hms-beagle2.lan ([95.112.110.247]) by mail.gmx.com (mrgmx005 [212.227.17.190]) with ESMTPSA (Nemesis) id 1MXGvM-1jWlCT0bza-00YeP1; Sun, 24 May 2020 10:10:55 +0200
Content-Type: text/plain; charset="utf-8"
Mime-Version: 1.0 (Mac OS X Mail 12.4 \(3445.104.14\))
From: Sebastian Moeller <moeller0@gmx.de>
In-Reply-To: <af3e6272-c04d-9e81-763c-e690ed521749@bobbriscoe.net>
Date: Sun, 24 May 2020 10:10:52 +0200
Cc: tsvwg@ietf.org
Content-Transfer-Encoding: quoted-printable
Message-Id: <5E7ABD06-792B-479A-B4AE-B38B6E9DA669@gmx.de>
References: <dbc71da6-70f1-7369-1d2d-f08fb3b08b69@erg.abdn.ac.uk> <21483444.sDhFMENYeD@linux-9daj> <a85600da-e69f-9190-7ca1-d23a7e7246f9@bobbriscoe.net> <3267993.nvHYsSR2bi@linux-9daj> <dd8e3896-2951-537f-e3d1-9954c93348dd@bobbriscoe.net> <67CBAA42-BD37-4A19-B650-68F511FC244A@gmx.de> <af3e6272-c04d-9e81-763c-e690ed521749@bobbriscoe.net>
To: Bob Briscoe <ietf@bobbriscoe.net>
X-Mailer: Apple Mail (2.3445.104.14)
X-Provags-ID: V03:K1:3xlni7ks88Hl4DBIjRHnJqPP/pHhtqAxI3soTK/PCQsXeZ8BaKm /Q34ZrC1G+mUbXVUuGMYD8pT0vjCqjT66EmvIxLd8wXNr06vZf6OdjBh8XwZQxyOrIUvDZw 7W9RJogvvZko6RmIofIl7XkpMU5MkioKZdoeEJn8dyYK7LuC7PomTngETFfAeV3FXpyaO41 akRpwAArMtt3zpvs5Hobg==
X-UI-Out-Filterresults: notjunk:1;V03:K0:9Ek9dtNzfLg=:DSFN6CLeq1D4E1+5dckXxN xeCBzRLfp+qijt8P5Q9SPAz+4hWnY/JEQVjh9dJu8sbJtFLejqRoohW4d41Row0QyX9fuDMsJ hHt5wSCS8t3hTFxZ0K7F1B4wbh9UQU+WEvDlpkw8NybfnfgPhjCvdo/8Izrt7Xa0qhZHESS9Y ST+AhTHWXJVpf0FH+NYTr9XFuxiVlk2zBHeq6YKqTEHXyu3A+3dnt5FKzmmjAZf4KJ2vtJ+AP okfzIiaLcFMtbzNOQEX6C2rbdbDRR13Zwxda9A39KupXf46QTlMob+dTa6zyKXDa+NRrrTop/ zqOLoBKY+dBupQ7BME75S3MsmHCzMKFGLXcV9FghKHf23pSQRzLu4ZhYDovthH/WwLntNcynL P6nl2uFzade7tRIK43kc5MoTlfo7BIvlmmDYRDPWaInzUC3ZxmhgoyTWZ0rxLslXM5TBZvrx+ YUEFLgBbsufSVkeYG9lwYRcmtpX3OfwMbcYII3IXAFjrgs4Qd6bZ16N87fcNtrJm1T80L0rtl LuRNv47yGWxZkfg3lMjmou9vMjvgbgxInPrSHg84h9J4oP2RKXkZe28MgkynyEtuS8LnvLTvl 1dbOK19Zui1+TTBKy3SeCcWT2qxqPaFfcjqwoC0uyJGDk6hng9sMthutVfEn+XqAL9X6UGk3+ ZDms8Oql8ISdqlG+2s9Z+35NAMb0Lm77U6omE0wpdUQYMQduHrzQ4kwaOr+Dx8xfN0Lr4ILUZ KrU3ocOA5FNEw/4oguV0CcqZSCaua4+5crCRDn/J41lunX/Qq2sAG3te6ZtAiwAF94nTfRQro Kgqts2Etxbc9IxUITfZegHBI5j8eH2zzPkkcnTMPDNvG8120CTIEAcWSpTJyxEEa1DAYpqoLY d0Iwm0CggAXTQjcQac9AUaoaVuon9esQbFFgX/Xu+2lYh6Qagte7+L8JHS9GJcI3xfaFiAV/C 2ZRVP0CbQDpEkUrG/54lSbOJR9sygi1oYHGRK9w3SZ1q2uP2ODrjnkR7R/E7yvNGqUM5jbp+B XGzxHtEjCmZlS/0lcQ08to4syjL1SGOxb+H+gENfwu5BCmzEzEUSd9shZa9TNHyDMW4gMk23f Q/lnpH1LSTYl9Gp+IVUrsgqhKCp14zvDsV6fC9OcKAXYYqK22Lqyfsmen2u24ZuPlPuyDjAgd 2N921Fz72DmbycmQfTRv5YPsajVugqSxoWUOKUaaIEZo82iuDqJiG5rOpAQtZGCFT9iinCcgm yOdWIdP7veG8NXSJ9
Archived-At: <https://mailarchive.ietf.org/arch/msg/tsvwg/XGrpV9ofPUNmAzzHd9mDK2tTtBU>
Subject: Re: [tsvwg] Gorry Fairhurst Individual thoughts on choosing whether/how to advance ECN work.
X-BeenThere: tsvwg@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: Transport Area Working Group <tsvwg.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/tsvwg>, <mailto:tsvwg-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/tsvwg/>
List-Post: <mailto:tsvwg@ietf.org>
List-Help: <mailto:tsvwg-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/tsvwg>, <mailto:tsvwg-request@ietf.org?subject=subscribe>
X-List-Received-Date: Sun, 24 May 2020 08:17:09 -0000

Hi Bob,

Tl, dr: L4S offers some quantitative improvements over the status quo, but nothing qualitative novel. 



> On May 24, 2020, at 01:53, Bob Briscoe <ietf@bobbriscoe.net> wrote:
> 
> Sebastian,
> 
> On 23/05/2020 16:33, Sebastian Moeller wrote:
>> Bob,
>> 
>> 
>> 
>>> On May 23, 2020, at 17:07, Bob Briscoe <ietf@bobbriscoe.net>
>>>  wrote:
>>> [...]
>>> 
>>> Another explanation is that L4S enables new interactive applications that are currently infeasible over the WAN.
>>> 
>> 	[SM] I have asked this before, but I am happy if I get a response this time: Which novel applications are possible now that the queuing delay has been reduced from ~10ms (fq_codel/cake, measured under real life traffic conditions on life internet access links) to ~1ms (L4S. dual queue coupled AQM under laboratory conditions)? This is the state of the art you compare against, not multiple 100s milliseconds of latency-under-load.
> 
> As explained in the email you are responding to, tail latency is important for real-time.

	[SM] This is not wrong, but the fact of the matter is, that the three services I already listed, google stadia, nvidia gforce now, shadow cloud gaming, all are real-time sensitive applications that are DEPLOYED und OPERATE in today's internet. So even the status quo is good enough for real-time coordination between actions and resulting control flow changes to be convincing enough for people to spend money on. That is a bg part of your potential market, that clearly is OKAY today indicating that improving tail latency is not an urgent issue for internet-scale real-time applications.



> The 10ms you report for FQ_CoDel is roughly its 90th percentile queue latency, which is an irrelevant metric for real-time (if the stream was being rendered after the 90%ile delay, 10% of the packets would arrive too late to be rendered). 

	[SM] This is not wrong, assuming your total latency is below one frame period and you are rendering at >= 100 Hz (for the 10ms limit). Soft real-time applications like games already deal with such delay variations though (to over-come both information transfer and CPU/GPU-cycle limitations), so improving your PDV slightly is not going to bring qualitative changes, no matter how hard we might wish.


> 
> In http://bobbriscoe.net/presents/2004ietf/202005L4S-brief.pdf (slide 4) I've used the 99.9th percentile delay for the rendering deadline, which reduces the fraction of packets that cannot be used to 0.1%.

	[SM] Your assumptions for that plot are a bit simplistic, but let's work with those. A) I claim that L4S is in reality not going to work reliably in the green circles (which you made from extrapolating a lab measurement, not from real-world measurements). B) CDN/data centers tend to be closer to the red radii not the green ones, so the internet already "solved" that issue for you, and e.g. 5G proponents advertize moving data center nodes even into the base stations, so I see no tendency currently to re-centralize data centers, do you?


> In the experiment reported in slide 3 you'll see the P99.9 latencies are 32ms for FQ_CoDel and 4.5ms for L4S.
> On slide 4 this is translated into reach in fibre, taking the generally accepted 50ms max delay for 'responsive feel' and subtracting application and OS delays.

	[SM] The point is that 500km radius is already plenty sufficient, unless you are trying to sell L4S exclusively to operators of big data centers, so that they can re-centralize their sites.

> 
> 
> See the link I put in the email you are responding to for the list, repeated here:
>     http://bobbriscoe.net/presents/2004ietf/202005L4S-brief.pdf
> To be clear, I'm not advocating these. I'm merely justifying my assertion about applications that are not currently possible. Personally, I'm not excited by gadgetry and social distancing. 

	[SM] Again, which of these a) make sense over the internet and b) do not work already today? As far as I can see the intersection of that set is rather empty.
Let's face it, L4S offers nothing qualitative novel, it is (assuming it works) an quantitative improvement, and hence might make a few things work a bit better, but the claim for novel applications really is wishful thinking.

> 
> Of course, we can't predict what other new applications will be developed once this amount of tail latency is removed over the wide area.

	[SM] Well, I can offer a prediction: no novel applications will grow out of this.

> Also, please bear in mind that I cannot generally pass on information about product plans.

	[SM] Fair enough, but a product plan, does not make a successful new application, nor does it demonstrate that the application absolutely requires L4S like network feedback.

> The original L4S teams from Alcatel-Lucent and Simula demonstrated a number of these applications at the Multimedia System 2016 conference: “Ultra-Low Delay for All: Live Experience, Live Analysis“, Proc. ACM Multimedia Systems; Demo Session (May 2016).
> Particularly cloud-rendered panoramic camera control and cloud-rendered VR goggles, were run at the conference over DCTCP and L4S with emulated wide-area round trip delay. 

	[SM] People use shadow cloud gaming with oculus rift headsets to do cloud-rendered VR already today... L4S (if it works) might make shadow's job a bit easier, but that is simply nothing novel. As noted before shadow is not alone in that respect.


> 
> Over today's state of the art AQMs, the delay is too inconsistent to rely on doing any of the following real-time interactive applications over wide area distances, or rendered remotely (e.g. in the cloud), rather than just confined to a LAN or metropolitan area:

	[SM] A claim that either hinges on your definition of "wide area distances" or is simply already refuted by reality. As expected, as all you are offering is a quantitative improvement in one metric, queueing latency, while the proposed applications are limited by total latency and total latency variation.


> * cloud-rendered interactive control (like this: https://riteproject.eu/dctth/#1511dispatchwg )

	[SM] Shadow, stadia, gforce now, do this right now without any AQM on the consumer side/return channel, so clearly something is wrong with your analysis?

> * remote presence,

	[SM] Video conferencing already works today; interestingly this tends to work better with a competent home network set-up with airtime fairness on the wifi side and a properly debloated access link (the wifi tends to be the bigger issue for many users though, L4S is not going to help there, is it)?

> * interactive real-time remote control or supervisory monitoring of cameras, drones, machinery (oil rigs,

	[SM] This is asinine, but I will bite, these off-shore oil-rigs most likely are connected via satellite internet, and geostationary satellites at that, the variable queueing delay is just going to be drop in the latency bucket (~270ms one-way propagation delay); and good luck doing anything interactive over such a delay.


> power plants, machine shops, cranes, ), medical procedures, using (high res.) video to monitor what is being controlled. 

	[SM] Whoever thinks about routing important medical procedures information required to make that procedure successful over the open internet, needs to be separated from his/her medical license. Anything that is not important for the process, like a comented upon video feed can easily tolerate delays.


> * interactive light-field experiences

	[SM] Just like "cloud-rendered panoramic camera control" that seems to be something playing to L4S perceived strength, but that has currently zero market share behind it. Sort of like an on-line version of the laser-disc.


> 
> 
>> 	Note, how for this question, I accept your claim of 1ms queueing delay even though testing indicates that this does not hold under all traffic patterns one is prone to encounter on a real internet access link (e.g. congestion on the reverse path). But really which application can not tolerate this additional 9ms of delay, yet is still a viable idea to deploy over the open internet? I note that real time control applications/reaction-time bound game services like google stadia and nvidia gforce now, are already deployed in today's internet, there really is nothing novel left as far as I can see
> 
> [BB] The novelty isn't the ability to demonstrate a "concept application" in a marketing video;

	[SM] But that is all that you offer ATM; I asked explicitly to get some real world killer apps for L4S and all I got was generalities.

> it's large numbers of real people actually being able to use it over the wide area for extended periods, without getting stressed, sick, or exhausted. And with just their regular Internet, rather than having to set up special network arrangements, or a special reserved session.

	[SM] As before, because you keep ignoring the elephant in the room, like Google Stadia, NVidia GeForce Now, and shadow cloud gaming. This ship has sailed without L4S, all you can offer is incremental improvements.

> 
> In the case of industrial or professional applications, quite often it's the ability for expertise to be deployed to a far-flung or inaccessible site instantly that would be the novel aspect, without experts or supervisors having to already sit on-call close by the site.

	[SM] Bob, either the behavior can be conserved in a video, which can be easily and reliably transported over the best-effort internet to the remote expert to scrutinize, or we are back to the critically stupid and dangerous idea of running important real-time control loops with real life consequences over the best-effort internet (games are fine, the failure mode, you loose a round, is benign, as compared you blowing out the well-head of an oil-rig in the gulf of mexico, I assume you agree).


As before, my assessment of L4S boils down to "too little, too late". Too little improvement over the status quo (at too high side-effects/costs) almost a decade later than originally envisioned.


Best Regards
	Sebastian

> 
> 
> 
> Bob
> 
>> 	Thanks in advance for the list of relevant new use-cases.
> 
> 
> 
> 
> 
>> 
>> Best Regards
>> 	Sebastian
>> 
> 
> -- 
> ________________________________________________________________
> Bob Briscoe                               
> http://bobbriscoe.net/