Re: [re-ECN] TCP's "Dynamic Range"

Bob Briscoe <rbriscoe@jungle.bt.co.uk> Mon, 26 October 2009 21:16 UTC

Return-Path: <rbriscoe@jungle.bt.co.uk>
X-Original-To: re-ecn@core3.amsl.com
Delivered-To: re-ecn@core3.amsl.com
Received: from localhost (localhost [127.0.0.1]) by core3.amsl.com (Postfix) with ESMTP id 7E73028C17C for <re-ecn@core3.amsl.com>; Mon, 26 Oct 2009 14:16:27 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.827
X-Spam-Level:
X-Spam-Status: No, score=-1.827 tagged_above=-999 required=5 tests=[AWL=0.290, BAYES_00=-2.599, DNS_FROM_RFC_BOGUSMX=1.482, RCVD_IN_DNSWL_LOW=-1]
Received: from mail.ietf.org ([64.170.98.32]) by localhost (core3.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id z7ICxGgiyP8q for <re-ecn@core3.amsl.com>; Mon, 26 Oct 2009 14:16:21 -0700 (PDT)
Received: from smtp1.smtp.bt.com (smtp1.smtp.bt.com [217.32.164.137]) by core3.amsl.com (Postfix) with ESMTP id 271DE28C347 for <re-ecn@ietf.org>; Mon, 26 Oct 2009 14:16:20 -0700 (PDT)
Received: from i2kc08-ukbr.domain1.systemhost.net ([193.113.197.71]) by smtp1.smtp.bt.com with Microsoft SMTPSVC(6.0.3790.3959); Mon, 26 Oct 2009 21:16:34 +0000
Received: from cbibipnt08.iuser.iroot.adidom.com ([147.149.100.81]) by i2kc08-ukbr.domain1.systemhost.net with Microsoft SMTPSVC(6.0.3790.3959); Mon, 26 Oct 2009 21:16:33 +0000
Received: From bagheera.jungle.bt.co.uk ([132.146.168.158]) by cbibipnt08.iuser.iroot.adidom.com (WebShield SMTP v4.5 MR1a P0803.399); id 1256591792872; Mon, 26 Oct 2009 21:16:32 +0000
Received: from MUT.jungle.bt.co.uk ([10.73.146.30]) by bagheera.jungle.bt.co.uk (8.13.5/8.12.8) with ESMTP id n9QLGTBE010898; Mon, 26 Oct 2009 21:16:29 GMT
Message-Id: <200910262116.n9QLGTBE010898@bagheera.jungle.bt.co.uk>
X-Mailer: QUALCOMM Windows Eudora Version 7.1.0.9
Date: Mon, 26 Oct 2009 21:16:29 +0000
To: John Leslie <john@jlc.net>
From: Bob Briscoe <rbriscoe@jungle.bt.co.uk>
In-Reply-To: <20091026133640.GA62345@verdi>
References: <4AE26E9B.8060205@thinkingcat.com> <200910242327.n9ONRbZt023456@bagheera.jungle.bt.co.uk> <4AE4CBDB.4030806@thinkingcat.com> <200910261228.n9QCSCp0030099@bagheera.jungle.bt.co.uk> <20091026133640.GA62345@verdi>
Mime-Version: 1.0
Content-Type: text/plain; charset="us-ascii"; format=flowed
X-Scanned-By: MIMEDefang 2.56 on 132.146.168.158
X-OriginalArrivalTime: 26 Oct 2009 21:16:33.0847 (UTC) FILETIME=[983E4C70:01CA5681]
Cc: re-ecn@ietf.org
Subject: Re: [re-ECN] TCP's "Dynamic Range"
X-BeenThere: re-ecn@ietf.org
X-Mailman-Version: 2.1.9
Precedence: list
List-Id: re-inserted explicit congestion notification <re-ecn.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/listinfo/re-ecn>, <mailto:re-ecn-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/re-ecn>
List-Post: <mailto:re-ecn@ietf.org>
List-Help: <mailto:re-ecn-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/re-ecn>, <mailto:re-ecn-request@ietf.org?subject=subscribe>
X-List-Received-Date: Mon, 26 Oct 2009 21:16:27 -0000

John,

I really like your list of principles. Where in the BoF do you 
suggest we put it?

And I agree with everything you've said in this email, with just a 
couple of inline comments...

At 13:36 26/10/2009, John Leslie wrote:
>Bob Briscoe <rbriscoe@jungle.bt.co.uk> wrote:
> > At 22:06 25/10/2009, Leslie Daigle wrote:
> >>Bob Briscoe wrote:
> >>>
> >>> I've been thinking... We should add an item to the many purposes list:
> >>>
> >>> - evolution path beyond TCP (running out of dynamic range)
> >>
> >> Which is pretty cool, but on the bof agenda might lead to some
> >> ratholing on whether we're just bashing TCP, no?
> >
> > [BB] Not at all. Everyone in the transport area knows TCP has run out
> > of dynamic range. It's a well-known problem.
>
>    I've been thinking somewhat along this line, though I wasn't using
>the term "dynamic range", but rather considering signal-to-noise in
>our feedback function.
>
>    Beyond any question, TCP has been successful at curing "congestion
>collapse" -- which is, after all, what it aimed to do.
>
>    But packet loss, as a feedback signal, is frankly terribly unsuited
>to fine-tuning how to share bandwidth at the bottlenecks. It's not,
>IMHO, "bashing" TCP to point out this difference.
>
>    Network operators, at best, can only estimate packet loss, not
>measure it meaningfully, and even if they could measure it with 100%
>accuracy, they'd have no hope of relating packet loss to the backoff
>it signals.
>
>    That's because the multiplicative decrease it signals is based
>on end-to-end flows at a higher layer. We're looking at five or more
>orders of magnitude difference in the amount of backoff signaled by
>a packet lost. For a network operator, the signal is quite lost in
>the noise!
>
>    As we review the history of TCP, we notice that attempts to move
>away from packet loss have failed to dependably avoid congestion
>collapse: thus implementations tend to depend entirely on packet
>loss to signal backoff; and other congestion-notification tends to
>be ignored or even suppressed.
>
>    (BTW, that's an issue we need to be prepared to discuss: how can
>re-ecn operate when ECN marks are suppressed? Even though most of
>the suppression history concerns ICMP, there will be folks who think
>ECN will suffer similar suppression.)

As this sentence is in the passive, I assume you mean suppression by 
the transport or some other link than the congested one (not 
suppression by the congested link itself).

That's why we brought re-ECN to the IETF - because we had solved that 
problem. The draft-briscoe-tsvwg-re-ecn-tcp-motivation-01.txt 
explains the mechanisms that can be built over re-ECN to detect & 
prevent suppression.

I've actually got a PhD thesis of proofs on this now, with pseudocode 
of the mechanisms, simulations etc etc, but I haven't got round to 
asking my company for permission to publish (mainly because they are 
more focused on sacking people than authorising publications at the 
mo). I promise it will be published soon.


>    We should recognize that multiplicative-decrease on packet loss
>is a proven winner for avoiding congestion collapse, and concentrate
>on what's a better signal for management of bottleneck bandwidth.
>I propose a few principles:
>
>1) The signal needs to be visible to the network manager managing
>    the bottleneck;
>
>2) The signal should be distinct enough to be the basis for cost
>    allocation for upgrading the bottleneck;
>
>3) The signal should be visible to end-systems, giving a decent
>    measure of how much to backoff their sending rate;
>
>4) The signal should enable "backpressure" to allow network managers
>    to avoid forwarding too many packets towards the bottleneck;
>
>5) The signal should not involve packet loss.
>
>    I believe that a properly-designed signalling system can work
>for at least eight or nine orders of magnitude of sender bandwidth.
>To be complete, a proposal should probably get into how many bits
>per signal, but I'm personally convinced that re-ecn can work
>beyond five orders of magnitude.

Have you been following Matt Mathis's work on Relentless TCP? And 
generally on TCP algos with window proportional to 1/p, rather than 
1/sqrt(p) like current TCP. The idea is these maintain the same 
number of loss or ECN signals per window however fast you go.

Is there some reason for choosing 8 or 9 orders of magnitude? I would 
have thought 1/p would scale indefinitely, but you may be thinking of 
other factors I've missed.

Scaling was also one of the main motivations for Kelly's primal algo. 
And it was one of my motivations for introducing re-ECN so we could 
shift from the TCP-friendly (1/sqrt(p)) track painlessly onto a 
scalable 1/p track without worrying about flow fairness.


Bob


>    So, avoiding the bits-per-signal question, are the five principles
>above sufficient? Are they necessary? Can we come up with a good
>presentation of such principles for the BoF?
>
>--
>John Leslie <john@jlc.net>

________________________________________________________________
Bob Briscoe,                                BT Innovate & Design