Re: [re-ECN] re-echo of drop

Bob Briscoe <rbriscoe@jungle.bt.co.uk> Mon, 02 November 2009 17:01 UTC

Return-Path: <rbriscoe@jungle.bt.co.uk>
X-Original-To: re-ecn@core3.amsl.com
Delivered-To: re-ecn@core3.amsl.com
Received: from localhost (localhost [127.0.0.1]) by core3.amsl.com (Postfix) with ESMTP id D7D4528C198 for <re-ecn@core3.amsl.com>; Mon, 2 Nov 2009 09:01:36 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -0.897
X-Spam-Level:
X-Spam-Status: No, score=-0.897 tagged_above=-999 required=5 tests=[AWL=-0.569, BAYES_05=-1.11, DNS_FROM_RFC_BOGUSMX=1.482, MIME_8BIT_HEADER=0.3, RCVD_IN_DNSWL_LOW=-1]
Received: from mail.ietf.org ([64.170.98.32]) by localhost (core3.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id fpfZUeHg0OM9 for <re-ecn@core3.amsl.com>; Mon, 2 Nov 2009 09:01:30 -0800 (PST)
Received: from smtp2.smtp.bt.com (smtp2.smtp.bt.com [217.32.164.150]) by core3.amsl.com (Postfix) with ESMTP id BCE1728C195 for <re-ecn@ietf.org>; Mon, 2 Nov 2009 09:01:29 -0800 (PST)
Received: from i2kc06-ukbr.domain1.systemhost.net ([193.113.197.70]) by smtp2.smtp.bt.com with Microsoft SMTPSVC(6.0.3790.3959); Mon, 2 Nov 2009 17:01:48 +0000
Received: from cbibipnt05.iuser.iroot.adidom.com ([147.149.196.177]) by i2kc06-ukbr.domain1.systemhost.net with Microsoft SMTPSVC(6.0.3790.3959); Mon, 2 Nov 2009 17:01:48 +0000
Received: From bagheera.jungle.bt.co.uk ([132.146.168.158]) by cbibipnt05.iuser.iroot.adidom.com (WebShield SMTP v4.5 MR1a P0803.399); id 1257181307544; Mon, 2 Nov 2009 17:01:47 +0000
Received: from MUT.jungle.bt.co.uk ([10.73.5.180]) by bagheera.jungle.bt.co.uk (8.13.5/8.12.8) with ESMTP id nA2H1iTW026695; Mon, 2 Nov 2009 17:01:44 GMT
Message-Id: <200911021701.nA2H1iTW026695@bagheera.jungle.bt.co.uk>
X-Mailer: QUALCOMM Windows Eudora Version 7.1.0.9
Date: Mon, 02 Nov 2009 17:01:44 +0000
To: =?iso-8859-1?Q?Jo=E3o?= Taveira =?iso-8859-1?Q?Ara=FAjo?= <j.araujo@ee.ucl.ac.uk>
From: Bob Briscoe <rbriscoe@jungle.bt.co.uk>
In-Reply-To: <4AE9B9A4.3000107@ee.ucl.ac.uk>
References: <4AE26E9B.8060205@thinkingcat.com> <200910242327.n9ONRbZt023456@bagheera.jungle.bt.co.uk> <4AE4CBDB.4030806@thinkingcat.com> <200910261228.n9QCSCp0030099@bagheera.jungle.bt.co.uk> <20091026133640.GA62345@verdi> <200910262116.n9QLGTBE010898@bagheera.jungle.bt.co.uk> <4AE6E99B.6050907@thinkingcat.com> <fc0ff13d0910271822n7e0ec0ceq575b9121678539e6@mail.gmail.com> <4AE82B4C.5000100@thinkingcat.com> <200910291508.n9TF8ua3018965@bagheera.jungle.bt.co.uk> <4AE9B9A4.3000107@ee.ucl.ac.uk>
Mime-Version: 1.0
Content-Type: text/plain; charset="iso-8859-1"; format=flowed
Content-Transfer-Encoding: quoted-printable
X-Scanned-By: MIMEDefang 2.56 on 132.146.168.158
X-OriginalArrivalTime: 02 Nov 2009 17:01:48.0168 (UTC) FILETIME=[2A290880:01CA5BDE]
Cc: Matt Mathis <matt.mathis@gmail.com>, re-ecn@ietf.org
Subject: Re: [re-ECN] re-echo of drop
X-BeenThere: re-ecn@ietf.org
X-Mailman-Version: 2.1.9
Precedence: list
List-Id: re-inserted explicit congestion notification <re-ecn.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/listinfo/re-ecn>, <mailto:re-ecn-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/re-ecn>
List-Post: <mailto:re-ecn@ietf.org>
List-Help: <mailto:re-ecn-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/re-ecn>, <mailto:re-ecn-request@ietf.org?subject=subscribe>
X-List-Received-Date: Mon, 02 Nov 2009 17:01:37 -0000

Joao,

At 15:49 29/10/2009, João Taveira Araújo wrote:
>Bob,
>
>I talked to you offline about this the other 
>day, but here goes my main concern..
>
>Bob Briscoe wrote:
>>For those who don't want the technical details, here's a summary
>>1/ Since draft-00 (Oct 2005), the re-ECN spec 
>>has always said re-echo whether congestion is drop or ECN-marking.
>
>Which suggests there is good reason to separate 
>both. The endpoint behaviour doesn't change, but 
>the network's understanding of what is happening 
>does, and so you'll get an overestimation of 
>upstream congestion. With a loss echo, you can 
>give a neutral value (as with cautious packets) 
>but still rate limit at the ingress. However...
>
>>2/ Re-ECN currently doesn't re-echo on drop 
>>unless the receiver is at least ECN-capable. 
>>But there's an easy way to make that happen, 
>>but with possible devil in details.
>
>You don't want to mix FNE's with l-echos. 
>Besides being a weak abstraction, the FNE flag 
>is critical in creating state, which you use for 
>policing flows and sampling interdomain. Another 
>problem with loss echos without re-ECN is it 
>makes it impossible for networks to verify the 
>percentage of e2e loss over bulk traffic, since 
>they don't know what traffic marks loss or not.

[BB] I am aware that everything you say under 1/& 2/ will be a problem.
This is why I think we need to work out whether 
there is a better codepoint arrangement. But it's 
not easy - for instance, I'm not convinced that 
the assignment for re-echo on drop that you have 
chosen is a good one. I.e. using the 'currently 
unused' codepoint left free in the re-ECN spec (ECT(1) with the RE flag set).

That involves sending an ECN-capable codepoint to 
a receiver that doesn't understand ECN. If this 
passes through a congested ECN-capable queue 
(different from the queue that did the drop in 
the last round) these packets will get ECN marked 
as congestion experienced (CE). The receiver is 
not ECN-capable so it will not understand and the 
congestion notifications will all get lost.


>>3/ Yes, black hole detection is critical (to 
>>re-ECN on drop as much as to ECN).
>
>
>I'm using the currently unused codepoint to 
>signal loss echos. Correlating the amount of 
>loss, congestion and RECT traffic to a 
>destination might be a better way of detecting 
>ECN black holes, especially as ISPs have an 
>incentive to prod the deployment of something 
>like this along. It might not be the easiest 
>solution, but I think it's cleaner: this way you 
>can tell if it's an "evil bit" black hole 
>(dropped FNEs), or just ECN averse (FNE sets up 
>state but packet which follows through is lost. Multiple loss echos follow?).

[BB] ECN black-hole generally means, when a 
client sends SYN with ECN flags in the TCP header, either:
- home router crashes, or
- home router drops the SYN, possibly returning a TCP RESET, possibly not
(See RFC3360)

This means there won't be any packets to measure 
- nothing gets past the home router, because of 
the TCP header, not the IP header. Am I missing something?

[BTW, there are many other ways for crap 
implementors to screw up everything... Recently, 
folks at UCL have also seen a router (probably at 
UCL's border) clearing ECN-capable codepoints in 
the IP header of UDP datagrams, back to Not-ECT (00). Dangerous, but true.]


Bob


>Joao.

________________________________________________________________
Bob Briscoe,                                BT Innovate & Design