Re: [PANRG] On the new text on ECN: draft-irtf-panrg-what-not-to-do-16

Bob Briscoe <ietf@bobbriscoe.net> Thu, 11 March 2021 17:13 UTC

Return-Path: <ietf@bobbriscoe.net>
X-Original-To: panrg@ietfa.amsl.com
Delivered-To: panrg@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 358A53A147A for <panrg@ietfa.amsl.com>; Thu, 11 Mar 2021 09:13:04 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.433
X-Spam-Level:
X-Spam-Status: No, score=-1.433 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, NICE_REPLY_A=-0.001, RCVD_IN_DNSWL_BLOCKED=0.001, SPF_HELO_NONE=0.001, SPF_SOFTFAIL=0.665, URIBL_BLOCKED=0.001] autolearn=no autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=bobbriscoe.net
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id kZ3-um4kOEmp for <panrg@ietfa.amsl.com>; Thu, 11 Mar 2021 09:13:02 -0800 (PST)
Received: from mail-ssdrsserver2.hosting.co.uk (mail-ssdrsserver2.hosting.co.uk [185.185.84.51]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 011073A1495 for <panrg@irtf.org>; Thu, 11 Mar 2021 09:12:59 -0800 (PST)
DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=bobbriscoe.net; s=default; h=Content-Transfer-Encoding:Content-Type: In-Reply-To:MIME-Version:Date:Message-ID:From:References:To:Subject:Sender: Reply-To:Cc:Content-ID:Content-Description:Resent-Date:Resent-From: Resent-Sender:Resent-To:Resent-Cc:Resent-Message-ID:List-Id:List-Help: List-Unsubscribe:List-Subscribe:List-Post:List-Owner:List-Archive; bh=K53tUG9PdGW9wh7VU4hwxrvc2vZ4n/jZ948DGFpfsq0=; b=grPCGsw/3kG/LjmVR+2EpfHYEX W5rrd5Jit0HOq+2/z1bW16R0YnToA6HDh+drpM7DToq+b9soregwWqVBi5gu/9y41z/XmGkjUHr7E 1Whf8F2zRypIYHhPRYCQqvapstRg3kwpvtHwrFXmmL9pjqQR+7U3xea26aW//HOWcjL/mD6BwBTFc VaG0Y7hVXGDNJPI0I26MczmhOBcPd1MalkLTwrlsyu40RW/fVXqcvGzZPWHe+N6xWi5bybM8iZJRI vYl0FdCSUMj4pzchS1LCLWcYIy1zXuxRq2DTykQMJ3f6+ZnDYj4GEFCyawc8kFJa6pJYg+AF2/1hb torhmTbw==;
Received: from 67.153.238.178.in-addr.arpa ([178.238.153.67]:49260 helo=[192.168.1.11]) by ssdrsserver2.hosting.co.uk with esmtpsa (TLS1.2) tls TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256 (Exim 4.93) (envelope-from <ietf@bobbriscoe.net>) id 1lKOrm-0003Rf-10 for panrg@irtf.org; Thu, 11 Mar 2021 17:12:54 +0000
To: panrg@irtf.org
References: <d072512f-ed66-bb8b-7338-58ed720210a2@erg.abdn.ac.uk> <45a9076d-d573-b976-8465-3e731081169a@erg.abdn.ac.uk>
From: Bob Briscoe <ietf@bobbriscoe.net>
Message-ID: <455c5465-7a2a-211e-a712-1c6b412c73b4@bobbriscoe.net>
Date: Thu, 11 Mar 2021 17:12:52 +0000
User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:78.0) Gecko/20100101 Thunderbird/78.7.1
MIME-Version: 1.0
In-Reply-To: <45a9076d-d573-b976-8465-3e731081169a@erg.abdn.ac.uk>
Content-Type: text/plain; charset="utf-8"; format="flowed"
Content-Transfer-Encoding: 8bit
Content-Language: en-GB
X-AntiAbuse: This header was added to track abuse, please include it with any abuse report
X-AntiAbuse: Primary Hostname - ssdrsserver2.hosting.co.uk
X-AntiAbuse: Original Domain - irtf.org
X-AntiAbuse: Originator/Caller UID/GID - [47 12] / [47 12]
X-AntiAbuse: Sender Address Domain - bobbriscoe.net
X-Get-Message-Sender-Via: ssdrsserver2.hosting.co.uk: authenticated_id: in@bobbriscoe.net
X-Authenticated-Sender: ssdrsserver2.hosting.co.uk: in@bobbriscoe.net
X-Source:
X-Source-Args:
X-Source-Dir:
Archived-At: <https://mailarchive.ietf.org/arch/msg/panrg/t2vy3On1kLabqNpDYTWNUBxDUEM>
Subject: Re: [PANRG] On the new text on ECN: draft-irtf-panrg-what-not-to-do-16
X-BeenThere: panrg@irtf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: "Path Aware Networking \(Proposed\) Research Group discussion list" <panrg.irtf.org>
List-Unsubscribe: <https://www.irtf.org/mailman/options/panrg>, <mailto:panrg-request@irtf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/panrg/>
List-Post: <mailto:panrg@irtf.org>
List-Help: <mailto:panrg-request@irtf.org?subject=help>
List-Subscribe: <https://www.irtf.org/mailman/listinfo/panrg>, <mailto:panrg-request@irtf.org?subject=subscribe>
X-List-Received-Date: Thu, 11 Mar 2021 17:13:04 -0000

Gorry, and panrg,

I'd agree with Gorry, about one chance per generation.

I'd also say that there are two main prongs to the ECN story. Not just 
the "one chance" point.
The other is "Outperforming End-to-end Protocol Mechanisms"

When I was asked to write the business case for deploying ECN in BT, my 
colleagues in the tech strategy team pushed me on this point of 
outperforming e2e mechanisms. Jamal Salim's performance evaluation 
[RFC2884] showed ECN gave no performance benefit, except for short 
flows. I had to admit that e2e FEC for short flows would be more likely 
to win out. And at the time, I'd just found Damon Wischik paper about 
how little the extra percentage of traffic volume would be, if every 
sender just duplicated the first few packets of each flow [Wischik08].

So the business case flat-lined. End systems had long since worked out 
tonnes of loss-hiding tricks. There was far too great a risk that, by 
the time the 3-part deployment had got anywhere (sender, receiver and 
network), the problem would have been solved e2e, and the high-risk 
high-cost investment would all have been wasted.

This was actually the start of my journey to realize that the "ECN = 
drop" rule was the problem, 'cos it disallowed the real benefit of ECN - 
to cut queuing delay, by providing a finer-grained signal than loss. I 
did some calculations to work out that the noise in the delay signal was 
too great to get queue delay down as low as would be achievable with ECN 
(which could use virtual queues once ISPs realized that bandwidth had 
become plentiful enough). That was actually when I started working on 
chirping (bringing in Mirja as a research fellow) and came to the 
conclusion that we could only get a better delay signal out of the noise 
by creating more noise with the chirps. That's when I realized the 
chirps should only be used at start-up, and ECN was the only way to keep 
queueing extremely low under load. Today you can see the limits of using 
e2e delay to reduce queuing in BBR, although of course BBR is unlikely 
to be the last word in e2e delay reduction.

I know it's unlikely that every ISP went through such a rigorous 
exercise, but still none deployed it - probably intuitively reaching the 
same conclusion. The main reason being the deployment pain wasn't worth 
the small performance gain. Which is a TL;DR summary of all the above.

The issue with Linksys home routers crashing wasn't a biggie in that 
assessment of ECN - there were few enough of that model around by then 
that it could be worked round with pre-deployment validation testing 
from the OS. This supports Gorry's "one chance per generation" point.

Reference
[Wischik08] Wischik, D. "Short Messages" Philosophical Transactions of 
the Royal Society A, 2008, 366, 1941-1953


Bob

On 11/03/2021 16:17, Gorry Fairhurst wrote:
> Resend: On 11/03/2021 15:06, Gorry Fairhurst wrote:
>> HI,
>>
>> Two observations on ECN in what may have been learned?
>>
>> * I think the "one chance" is per one /Generation/ of equipment for 
>> hardware, possibly less time for software updates - but hard to 
>> eliminate deployed critical bugs completely.
>>
>> * "Measure widely before, but important also gently test during 
>> initial deployment", to avoid the pain when there are issues and 
>> therefore to provide a chance to refine the design if there is a 
>> problem, and reduce the pain of trying.
>>
>> I like the additional thought from Eric (in the RG meeting) on 
>> mitigating user pain by using fallback techniques, so they do not 
>> share your pain when something happens to go wong.
>>
>> ————
>>
>> Is this a typo?: /Cannot be recovered at TCP layer/
>> - it seems like a partial sentence, does need a /This.../
>>
>> Gorry
>>
>> _______________________________________________
>> Panrg mailing list
>> Panrg@irtf.org
>> https://www.irtf.org/mailman/listinfo/panrg
>
>
> _______________________________________________
> Panrg mailing list
> Panrg@irtf.org
> https://www.irtf.org/mailman/listinfo/panrg

-- 
________________________________________________________________
Bob Briscoe                               http://bobbriscoe.net/