Re: [tcpPrague] Experimental dual-queue ECN

"Black, David" <david.black@emc.com> Tue, 28 June 2016 20:53 UTC

Return-Path: <david.black@emc.com>
X-Original-To: tcpprague@ietfa.amsl.com
Delivered-To: tcpprague@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id B18ED12D894 for <tcpprague@ietfa.amsl.com>; Tue, 28 Jun 2016 13:53:22 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -5.728
X-Spam-Level:
X-Spam-Status: No, score=-5.728 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, RCVD_IN_DNSWL_MED=-2.3, RCVD_IN_MSPIKE_H2=-0.001, RP_MATCHES_RCVD=-1.426, SPF_PASS=-0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (1024-bit key) header.d=emc.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id h7p9_uktcZJx for <tcpprague@ietfa.amsl.com>; Tue, 28 Jun 2016 13:53:20 -0700 (PDT)
Received: from mailuogwdur.emc.com (mailuogwdur.emc.com [128.221.224.79]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id CFA7412D8AF for <tcpPrague@ietf.org>; Tue, 28 Jun 2016 13:52:52 -0700 (PDT)
Received: from maildlpprd51.lss.emc.com (maildlpprd51.lss.emc.com [10.106.48.155]) by mailuogwprd51.lss.emc.com (Sentrion-MTA-4.3.1/Sentrion-MTA-4.3.0) with ESMTP id u5SKq0Ar022762 (version=TLSv1.2 cipher=ECDHE-RSA-AES256-GCM-SHA384 bits=256 verify=NO); Tue, 28 Jun 2016 16:52:00 -0400
X-DKIM: OpenDKIM Filter v2.4.3 mailuogwprd51.lss.emc.com u5SKq0Ar022762
DKIM-Signature: v=1; a=rsa-sha1; c=relaxed/relaxed; d=emc.com; s=jan2013; t=1467147121; bh=JjR4aydAw8hT14IVk9pdUVXyqiA=; h=From:To:CC:Subject:Date:Message-ID:References:In-Reply-To: Content-Type:Content-Transfer-Encoding:MIME-Version; b=LkNUIbC5Qp6YX20UFlyYJW8ZVDySN9Hse8KHyiWc8KvNYmTcju9qPx3NCMfr/dPY/ t+y7YLAQ0bTu0mG9uMyfd0ZcYai6kPaEler/B/Gd1ybBfMlyCNFCZMHiz8RP13iJjJ o1rhVBv0E5VyEvoE1CshGwnOZsYPlgdMeG3cYK9M=
X-DKIM: OpenDKIM Filter v2.4.3 mailuogwprd51.lss.emc.com u5SKq0Ar022762
Received: from mailusrhubprd02.lss.emc.com (mailusrhubprd02.lss.emc.com [10.253.24.20]) by maildlpprd51.lss.emc.com (RSA Interceptor); Tue, 28 Jun 2016 16:51:02 -0400
Received: from MXHUB317.corp.emc.com (MXHUB317.corp.emc.com [10.146.3.95]) by mailusrhubprd02.lss.emc.com (Sentrion-MTA-4.3.1/Sentrion-MTA-4.3.0) with ESMTP id u5SKpqKl013429 (version=TLSv1.2 cipher=ECDHE-RSA-AES256-SHA384 bits=256 verify=FAIL); Tue, 28 Jun 2016 16:51:53 -0400
Received: from MX307CL04.corp.emc.com ([fe80::849f:5da2:11b:4385]) by MXHUB317.corp.emc.com ([10.146.3.95]) with mapi id 14.03.0266.001; Tue, 28 Jun 2016 16:51:52 -0400
From: "Black, David" <david.black@emc.com>
To: Michael Welzl <michawe@ifi.uio.no>, "gorry@erg.abdn.ac.uk" <gorry@erg.abdn.ac.uk>
Thread-Topic: [tcpPrague] Experimental dual-queue ECN
Thread-Index: AQHRzjoOU9hADIZ3ekqkVV5CUdcAP5/5JeOAgAAtgYCAASkvgIAAA5UAgACMioCABFQDQA==
Date: Tue, 28 Jun 2016 20:51:51 +0000
Message-ID: <CE03DB3D7B45C245BCA0D243277949362F5B2801@MX307CL04.corp.emc.com>
References: <574F2A2D.9070407@bobbriscoe.net> <574F4F29.9040409@bobbriscoe.net> <20160601215312.GA25116@verdi> <0898e249-03dd-aff9-7179-03cc8642efea@erg.abdn.ac.uk> <5762567D.8010609@bobbriscoe.net> <3f8fa637-17b5-853b-b835-db486a2a69f6@erg.abdn.ac.uk> <CAKKJt-cjncm7zsfj3=7pqB-uSNTxMPfjPY=qpSNnDncVmy+enA@mail.gmail.com> <20160624170118.GA52708@verdi> <576D70CB.8060108@erg.abdn.ac.uk> <8D9E4035-23E9-4BAD-B689-BF82C54BC98F@ifi.uio.no> <20160625140803.GB52708@verdi> <576E9344.2010909@erg.abdn.ac.uk> <ED8A00A3-7FE4-4972-955B-67058B880CFD@ifi.uio.no>
In-Reply-To: <ED8A00A3-7FE4-4972-955B-67058B880CFD@ifi.uio.no>
Accept-Language: en-US
Content-Language: en-US
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
x-originating-ip: [10.238.45.60]
Content-Type: text/plain; charset="us-ascii"
Content-Transfer-Encoding: quoted-printable
MIME-Version: 1.0
X-Sentrion-Hostname: mailusrhubprd02.lss.emc.com
X-RSA-Classifications: public
Archived-At: <https://mailarchive.ietf.org/arch/msg/tcpprague/kzgwdf6oRygQEyGdgXHXX4s6g3U>
Cc: TCP Prague List <tcpPrague@ietf.org>, "Black, David" <david.black@emc.com>, Bob Briscoe <ietf@bobbriscoe.net>, John Leslie <john@jlc.net>, Spencer Dawkins <spencerdawkins.ietf@gmail.com>
Subject: Re: [tcpPrague] Experimental dual-queue ECN
X-BeenThere: tcpprague@ietf.org
X-Mailman-Version: 2.1.17
Precedence: list
List-Id: "To coordinate implementation and standardisation of TCP Prague across platforms. TCP Prague will be an evolution of DCTCP designed to live alongside other TCP variants and derivatives." <tcpprague.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/tcpprague>, <mailto:tcpprague-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/tcpprague/>
List-Post: <mailto:tcpprague@ietf.org>
List-Help: <mailto:tcpprague-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/tcpprague>, <mailto:tcpprague-request@ietf.org?subject=subscribe>
X-List-Received-Date: Tue, 28 Jun 2016 20:53:23 -0000

+1 on a simple draft to revise RFC 3168 solely to clear the "space" needed
for this sort of experimentation without getting into any details of what the
experiments will or will not do (NB: that sort of speculation on what's coming
is part of what got the RTP circuit breaker draft into its current situation, so
let's not repeat that ...).

Thanks, --David


> -----Original Message-----
> From: tcpPrague [mailto:tcpprague-bounces@ietf.org] On Behalf Of Michael
> Welzl
> Sent: Saturday, June 25, 2016 6:44 PM
> To: gorry@erg.abdn.ac.uk
> Cc: Bob Briscoe; John Leslie; Spencer Dawkins; TCP Prague List
> Subject: Re: [tcpPrague] Experimental dual-queue ECN
> 
> how nice to see, we all agree!
> 
> Sent from my iPhone
> 
> > On 25. jun. 2016, at 16.20, Gorry Fairhurst <gorry@erg.abdn.ac.uk> wrote:
> >
> > Email is a wodnerful media... we seem to have a lot more agreement than I'd
> hoped.
> >
> > (... and yes: If we have more than one simple PS update to 3168, and everyone
> agrees, I think these could form part of the same document).
> >
> > Gorry
> >
> >> On 25/06/2016 15:08, John Leslie wrote:
> >>    I seem to have flunked Robert Heinlein's class in writing orders!
> >>
> >>    Gorry succeeded in misunderstanding my point 1: "that reaction to ECN-CE
> >> should be the 'same as' drop". :^)
> >>
> >>    It was "obvious" to me that RFC 3168 now requires that, and that we
> >> need to change so that is no longer required.
> >>
> >>    Fortunately, it seems the three of us agree it needs to change. :^)
> >>
> >> Michael Welzl<michawe@ifi.uio.no>  wrote:
> >>> From: Michael Welzl<michawe@ifi.uio.no>
> >>> ...
> >>>> On 24. jun. 2016, at 19.41, Gorry Fairhurst<gorry@erg.abdn.ac.uk>  wrote:
> >>>> ...
> >>>> On 24/06/2016 18:01, John Leslie wrote:
> >>>>> Spencer Dawkins at IETF<spencerdawkins.ietf@gmail.com>   wrote:
> >>>>>> ...
> >>>>>> Are you thinking that wouldn't work here?
> >>>>> IMHO, such a path to  update RFC 3168 is impractical.
> >>>>> ...
> >>>>> IMHO, we need to break off a limited part of it for Experimental
> >>>>> protocols:
> >>>>>
> >>>>> 1. that reaction to ECN-CE should be the "same as" drop; and
> >>>> I disagree of course - the CE-marking proposal has already been
> >>>> discussed at the IETF - and I suggest no ECN is likely to be foundi
> >>>> using RED - and ECN-marked RED is now anyway now deprecated.
> >>>> Many modern AQM methods CE-mark on a shallow queue - and I think
> >>>> we need to update the PS to reflect this.
> >>    Fundamentally, I agree: but my concern was the minimum change needed
> >> to 3168.
> >>
> >>> +1 to Gorry  (unsurprisingly).
> >>>
> >>> The ECN ???Experiment??? has hardly happened, so on what basis de
> >>> we say that a reaction that is the ???same as??? drop is safe?
> >>    IMHO, that is what 3168 meant to say. I've never agreed with it.
> >>
> >>> If we just take operational experience, Cubic has first used a backoff
> >>> (multiplication) factor of 0.8, then 0.7, deployed in Linux and widely
> >>> used in the Internet.
> >>> This isn???t even limited to an ECN signal, which is likely to be
> >>> produced by an AQM mechanism, and hence much more likely to indicate
> >>> a shallow queue than loss.
> >>    I agree.
> >>
> >>> So: I think we can now assert that the Internet won???t melt down if
> >>> we'd back off using a different multiplication factor than 0.5.
> >>    I agree that would be good background material for a PS RFC relaxing
> >> that rule of 3168.
> >>
> >>    (But I don't think we need to limit the difference to applying a
> >> differentt factor: that is a really obvious way, and it has effectively
> >> been tested in actual use; but it is not the _only_ way.
> >>
> >>> Using such a larger factor *only* in response to ECN is even more
> >>> conservative, so even safer.
> >>    Exactly!
> >>
> >>> Adding to this, what exactly is the logic that makes ???react to
> >>> marking the same way as you would react to drop??? particularly safe?
> >>    The argument, as best I recall, was that otherwise one flow would
> >> starve the other. But there are other ways to treat that problem!
> >>
> >>> I can only assume that this assumes the same behavior in the network
> >>> for ECN-marking and dropping, and so, if we keep as much as possible
> >>> similar, this would be a safe way to go.
> >>    I don't see any reason to "keep as much as possible similay" if
> >> it's for an Experimental RFC. We should aim for _significant_ improvement.
> >>
> >>> Reality is different. Please see, for example Figure 13 in this pdf:
> >>> https://www.duo.uio.no/bitstream/handle/10852/37381/khademi-
> AQM_Kids_TR434.pdf
> >>    I shall get around to reading this today.
> >>
> >>> ...
> >>> In conclusion, I struggle to see the big reason why an ???exactly like
> >>> loss??? backoff is standard behavior and experimenting with other values
> >>> should be prohibited.
> >>    We three agree it shouldn't.
> >>
> >>    But...
> >>
> >>    Implementors should be able to read RFC3168 and its metadate and
> >> understand that their ten-year-old code may no longer be compliant.
> >>
> >>    It's about a warning-label!
> >>
> >>> A new particular value may constitute an experiment, but the
> >>> ???equal to loss??? limitation simply isn???t a good thing and should
> >>> be removed - this removal isn???t an experiment, it's a bugfix.
> >>    Exactly!
> >>
> >>> ...
> >>>> I also don't see a specific experiment that is needed here  - what
> >>>> would be needed to test for safety in deployment? I *think*
> >>>> particular update can be taken directly to PS.
> >>> +1
> >>
> >>    (I believe we could treat two issues in the same RFC, in order to
> >> make implementors' jab a little easier. YMMV...)
> >>
> >>> ...
> >> --
> >> John Leslie<john@jlc.net>
> >
> 
> _______________________________________________
> tcpPrague mailing list
> tcpPrague@ietf.org
> https://www.ietf.org/mailman/listinfo/tcpprague