[tsvwg] Fwd: RE: [REVISED I-D]: Document writeup for draft-ietf-tsvwg-byte-pkt-congest
Bob Briscoe <bob.briscoe@bt.com> Thu, 11 October 2012 17:09 UTC
Return-Path: <bob.briscoe@bt.com>
X-Original-To: tsvwg@ietfa.amsl.com
Delivered-To: tsvwg@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 4FB3E21F866C for <tsvwg@ietfa.amsl.com>; Thu, 11 Oct 2012 10:09:40 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.105
X-Spam-Level:
X-Spam-Status: No, score=-2.105 tagged_above=-999 required=5 tests=[AWL=-0.346, BAYES_00=-2.599, J_CHICKENPOX_72=0.6, RCVD_IN_DNSWL_LOW=-1, SARE_LWSHORTT=1.24]
Received: from mail.ietf.org ([64.170.98.30]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 1rdhcPIbJnWn for <tsvwg@ietfa.amsl.com>; Thu, 11 Oct 2012 10:09:37 -0700 (PDT)
Received: from hubrelay-rd.bt.com (hubrelay-rd.bt.com [62.239.224.99]) by ietfa.amsl.com (Postfix) with ESMTP id 86B1621F8687 for <tsvwg@ietf.org>; Thu, 11 Oct 2012 10:09:36 -0700 (PDT)
Received: from EVMHR02-UKBR.domain1.systemhost.net (193.113.108.41) by EVMHR67-UKRD.bt.com (10.187.101.22) with Microsoft SMTP Server (TLS) id 8.3.279.1; Thu, 11 Oct 2012 18:09:35 +0100
Received: from dyw02134app01.domain1.systemhost.net (193.113.249.13) by EVMHR02-UKBR.domain1.systemhost.net (193.113.108.41) with Microsoft SMTP Server (TLS) id 8.3.279.1; Thu, 11 Oct 2012 18:09:34 +0100
Received: from cbibipnt05.iuser.iroot.adidom.com (147.149.196.177) by dyw02134app01.domain1.systemhost.net (10.35.25.214) with Microsoft SMTP Server id 14.2.309.2; Thu, 11 Oct 2012 18:09:22 +0100
Received: From bagheera.jungle.bt.co.uk ([132.146.168.158]) by cbibipnt05.iuser.iroot.adidom.com (WebShield SMTP v4.5 MR1a P0803.399); id 1349975301665; Thu, 11 Oct 2012 18:08:21 +0100
Received: from MUT.jungle.bt.co.uk ([10.73.16.16]) by bagheera.jungle.bt.co.uk (8.13.5/8.12.8) with ESMTP id q9BH8InG013834; Thu, 11 Oct 2012 18:08:18 +0100
Message-ID: <201210111708.q9BH8InG013834@bagheera.jungle.bt.co.uk>
X-Mailer: QUALCOMM Windows Eudora Version 7.1.0.9
Date: Thu, 11 Oct 2012 18:08:15 +0100
To: "Tschofenig, Hannes (NSN - FI/Espoo)" <hannes.tschofenig@nsn.com>
From: Bob Briscoe <bob.briscoe@bt.com>
MIME-Version: 1.0
Content-Type: text/plain; charset="us-ascii"; format="flowed"
X-Scanned-By: MIMEDefang 2.56 on 132.146.168.158
Cc: gorry@erg.abdn.ac.uk, philip.eardley@bt.com, tsvwg IETF list <tsvwg@ietf.org>, tsvwg-chairs@ietf.org
Subject: [tsvwg] Fwd: RE: [REVISED I-D]: Document writeup for draft-ietf-tsvwg-byte-pkt-congest
X-BeenThere: tsvwg@ietf.org
X-Mailman-Version: 2.1.12
Precedence: list
List-Id: Transport Area Working Group <tsvwg.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/tsvwg>, <mailto:tsvwg-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/tsvwg>
List-Post: <mailto:tsvwg@ietf.org>
List-Help: <mailto:tsvwg-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/tsvwg>, <mailto:tsvwg-request@ietf.org?subject=subscribe>
X-List-Received-Date: Thu, 11 Oct 2012 17:09:40 -0000
Hannes, Brainfart - sorry. for 'CoDel's 100ms target', read 5ms. Still crazy for hi-speed. Bob >Date: Thu, 11 Oct 2012 17:06:57 +0100 >To: "Tschofenig, Hannes (NSN - FI/Espoo)" <hannes.tschofenig@nsn.com> >From: Bob Briscoe <bob.briscoe@bt.com> >Subject: RE: [tsvwg] [REVISED I-D]: Document writeup for >draft-ietf-tsvwg-byte-pkt-congest >Cc: ext Manner Jukka <jukka.manner@aalto.fi>, ><gorry@erg.abdn.ac.uk>, <philip.eardley@bt.com>, ><tsvwg-chairs@ietf.org>, tsvwg IETF list <tsvwg@ietf.org> > >Hannes, > >This doc is very general. The advice on AQM not biasing for small >packets applies equally to CoDel as to RED (it has already been >applied to PCN, which is another AQM). The doc gives specific advice >for RED as an exemplar, but it's not just about RED (it's worrying >that you think it is - which text gave this impression?). Everywhere >that RED is mentioned, the wording should carefully say it's an example. > >Similarly, it gives guidelines on end-to-end transports, and uses >TCP as an concrete example. > >[Changing the subject to "Who said CoDel is a panacea anyway?" CoDel >isn't really applicable on hi-speed links, certainly not if one >wants the main benefit of being insensitive to the chosen hard-coded >parameters. Even in the CoDel paper's simulations, CoDel gave higher >queuing delay than RED in all the cases shown except the lowest link >speed experiments. CoDel's 100ms target would be a crazy delay >target in a high speed network. To deploy CoDel in hi-speed >networks, you have to tune the parameters away from the hard-coded >ones, which loses the only benefit it has. >] > > >Bob > >At 13:57 11/10/2012, Tschofenig, Hannes (NSN - FI/Espoo) wrote: >>Hi Jukka, >> >>With QoS there is no such thing as "short term" (as we already had to >>learn painfully). >> >>That makes me wonder what is more useful: publish a document that >>describe how to configure RED correctly or to go for an AQM technique >>that (according to the authors) works almost zero configuration. >> >>My take-away from the Vancouver congestion control workshop was that it >>is better to ignore RED. (Maybe my impression was wrong -- someone else >>should confirm.) >> >>Ciao >>Hannes >> >> >> > -----Original Message----- >> > From: ext Manner Jukka [mailto:jukka.manner@aalto.fi] >> > Sent: Thursday, October 11, 2012 3:50 PM >> > To: Tschofenig, Hannes (NSN - FI/Espoo) >> > Cc: Bob Briscoe; <gorry@erg.abdn.ac.uk>; <philip.eardley@bt.com>; >> > <tsvwg-chairs@ietf.org>; tsvwg IETF list >> > Subject: Re: [tsvwg] [REVISED I-D]: Document writeup for draft-ietf- >> > tsvwg-byte-pkt-congest >> > >> > Hi, this document talks about the current and short term goals and >> > operations, while AFAIK CoDel is a more futuristic concept, >>interesting >> > concept, sure, but not deployed at large. >> > >> > cheers, >> > Jukka >> > >> > On Oct 11, 2012, at 3:44 PM, Tschofenig, Hannes (NSN - FI/Espoo) >>wrote: >> > >> > > Hi Jukka, Hi Bob, >> > > >> > > having listened to the presentations about CoDel I am wondering >> > whether >> > > this document may have taken over by events and is obsolete by now. >> > > What's your view on that? >> > > >> > > Ciao >> > > Hannes >> > > >> > > >> > >> -----Original Message----- >> > >> From: tsvwg-bounces@ietf.org [mailto:tsvwg-bounces@ietf.org] On >> > Behalf >> > >> Of ext Manner Jukka >> > >> Sent: Thursday, October 11, 2012 3:34 PM >> > >> To: Bob Briscoe >> > >> Cc: <gorry@erg.abdn.ac.uk>; <philip.eardley@bt.com>; <tsvwg- >> > >> chairs@ietf.org>; tsvwg IETF list >> > >> Subject: Re: [tsvwg] [REVISED I-D]: Document writeup for >>draft-ietf- >> > >> tsvwg-byte-pkt-congest >> > >> >> > >> Hi, >> > >> >> > >> Yeah, please at the point about cheap devices, still, I wonder what >> > > the >> > >> cost of AQM is on those devices. Maybe 1 cnt in mass production? >> > Thus, >> > >> I still wonder if it is worth arguing something about the future >> > that >> > >> strongly. >> > >> >> > >> Sorry, but the same goes for AQM in general, and where it is, or >> > will >> > >> not be deployed. I understand technical reasons why not something >>is >> > >> feasible, but business or monetary reasons can change very quickly. >> > ;) >> > >> >> > >> Jukka >> > >> >> > >> On Oct 11, 2012, at 3:08 PM, Bob Briscoe wrote: >> > >> >> > >>> Jukka, >> > >>> >> > >>> What I had in mind is all low level buffers in cheap pure L2 >> > > devices, >> > >> in cheap NATs etc. I was going to list some of these, but instead >> > > chose >> > >> to say "no-one expects AQM to be universally deployed". This >>clearly >> > >> missed the mark. >> > >>> >> > >>> Should we include some of these concrete examples of where AQM >> > won't >> > >> be deployed instead? >> > >>> >> > >>> [I've cc'd the list, given we're starting to wordsmith >> > significantly >> > >> new text added since WG last call.] >> > >>> >> > >>> >> > >>> Bob >> > >>> >> > >>> At 20:04 10/10/2012, Manner Jukka wrote: >> > >>>> Hi, looked good to me. Just one comment, I wouldn't put that much >> > >> emphasis on what is a probably state of things, e.g. >> > >>>> >> > >>>> "no-one expects AQM to ever be .." > in the future it might, we >> > >> don't know, and since the paragraph says that it doesn't matter >> > > anyway, >> > >> I would formulate this paragraph a bit towards that we have partial >> > >> deployment now and even with a full deployment, ... >> > >>>> >> > >>>> and similarly with the unrealistic deployment of AQM in the >> > >> subsequent paragraph. >> > >>>> >> > >>>> cheers, >> > >>>> Jukka >> > >>>> >> > >>>> On Oct 10, 2012, at 6:25 PM, Bob Briscoe wrote: >> > >>>> >> > >>>>> Phil, >> > >>>>> >> > >>>>> Sorry for delay, my round-robin scheduler is taking so long to >> > > get >> > >> round all the tasks that I keep finding dead robins (that reminds >>me >> > > of >> > >> a bike ride my grandmother took. On her return, she said there were >> > a >> > >> worryingly large number of robins run over on the roads. It turned >> > out >> > >> that she had been going round the same small triangle of roads >> > > multiple >> > >> times). >> > >>>>> >> > >>>>> I've done a complete re-write of 3.4. I hope you will agree that >> > >> it really is a motivating argument now and it can stay where it is >>- >> > > it >> > >> is not just a preamble. >> > >>>>> >> > >>>>> I've gone back to the very first version of 3.4 I wrote and I >>now >> > >> realise that I had written the core of the argument so badly that >> > even >> > >> I forgot what it was meant to say when we started editing it. In >>the >> > >> draft-08 version, the original meaning is lost and in your re- >> > wording >> > >> it is entirely lost. The link with the draft-08 version is nearly >> > >> unrecognisable, but it's the argument I intended here. >> > >>>>> >> > >>>>> >> > >> >> > > >> > >>======================================================================= >> > >> =============== >> > >>>>> 3.4. Permanent Partial Deployment of AQM >> > >>>>> >> > >>>>> In overview, the argument in this section runs as follows: >> > >>>>> * Because the network will not and cannot always drop packets in >> > >> proportion to their size, it shouldn't be given the task of making >> > > drop >> > >> signals depend on packet size at all. >> > >>>>> * Transports on the other hand don't always want to make their >> > >> rate response proportional to the size of dropped packets, but if >> > they >> > >> want to, they always can. >> > >>>>> >> > >>>>> The argument is similar to the end-to-end argument that says >> > >> "Don't do X in the network if end-systems can do X by themselves, >> > and >> > >> they want to be able to choose whether to do X anyway." Actually >>the >> > >> following argument is stronger; in addition it says "Don't give the >> > >> network task X that could be done by the end-systems, if X won't >> > ever >> > >> be deployed on all network nodes, and end-systems won't be able to >> > > tell >> > >> whether their network is doing X, or whether they need to do X >> > >> themselves." In this case, the X in question is "making the >>response >> > > to >> > >> congestion depend on packet size". >> > >>>>> >> > >>>>> We will now re-run this argument taking each step in more depth. >> > >> The argument applies solely to drop, not ECN marking. >> > >>>>> >> > >>>>> A queue drops packets for either of two reasons: a) to signal to >> > >> host congestion controls that they should reduce the load and b) >> > >> because there is no buffer left to store the packets. Active queue >> > >> management tries to use drops as a signal for hosts to slow down >> > (case >> > >> a) so that drop due to buffer exhaustion (case b) should not be >> > >> necessary. >> > >>>>> >> > >>>>> No-one expects AQM to ever be universally deployed in every >>queue >> > >> in the Internet; and, even if AQM were universal, it has to be able >> > to >> > >> cope with buffer exhaustion (by switching to a behaviour like tail- >> > >> drop), in order to cope with unresponsive or excessive transports. >> > >> Therefore networks will often be dropping packets as a last resort >> > >> (case b) rather than under AQM control (case a). >> > >>>>> >> > >>>>> When buffers are exhausted (case b), they don't naturally drop >> > >> packets in proportion to their size. The network can only reduce >>the >> > >> probability of dropping smaller packets if it has enough space to >> > > store >> > >> them somewhere while it waits for a larger packet that it can drop. >> > If >> > >> the buffer is exhausted, it does not have this choice. Admittedly >> > > tail- >> > >> drop does naturally drop somewhat fewer small packets, but exactly >> > how >> > >> few depends more on the mix of sizes than the size of the packet in >> > >> question. Nonetheless, in general, if we wanted networks to do >>size- >> > >> dependent drop, we would need universal deployment of (packet-size >> > >> dependent) AQM code, which is unrealistic. >> > >>>>> >> > >>>>> A host transport cannot know whether any particular drop was a >> > >> deliberate signal from an AQM or a sign of a queue shedding packets >> > > due >> > >> to buffer exhaustion. Therefore, because the network cannot >> > > universally >> > >> do size-dependent drop, it should not do it all. >> > >>>>> >> > >>>>> Whereas universality is desirable in the network, diversity is >> > >> desirable between different transport layer protocols - some, like >> > >> NewReno TCP [RFC5681], may not choose to make their rate response >> > >> proportionate to the size of each dropped packet, while others will >> > >> (e.g. TFRC-SP [RFC4828]). >> > >>>>> >> > >> >> > > >> > >>======================================================================= >> > >> =============== >> > >>>>> >> > >>>>> >> > >>>>> >> > >>>>> >> > >>>>> Bob >> > >>>>> >> > >>>>> At 13:13 04/10/2012, gorry@erg.abdn.ac.uk wrote: >> > >>>>>> That would be brilliant! >> > >>>>>> >> > >>>>>> Gorry >> > >>>>>> >> > >>>>>>> Will try to look at this later today. >> > >>>>>>> >> > >>>>>>> >> > >>>>>>> Bob >> > >>>>>>> >> > >>>>>>> At 10:42 02/10/2012, philip.eardley@bt.com wrote: >> > >>>>>>>> Think the token is with bob >> > >>>>>>>> >> > >>>>>>>> Re your comment - yes, I think you could move 3.4 to the end >> > > of >> > >> S3 - >> > >>>>>>>> or to the start - and do some re-phrasing, maybe something >> > > like >> > >> I tried. >> > >>>>>>>> >> > >>>>>>>> -----Original Message----- >> > >>>>>>>> From: Manner Jukka [mailto:jukka.manner@aalto.fi] >> > >>>>>>>> Sent: 01 October 2012 20:18 >> > >>>>>>>> To: Eardley,PL,Philip,DUB8 R; Briscoe,RJ,Bob,DUB8 R >> > >>>>>>>> Cc: tsvwg-chairs@ietf.org; Gorry Fairhurst >> > >>>>>>>> Subject: Re: [REVISED I-D]: Document writeup for >> > >>>>>>>> draft-ietf-tsvwg-byte-pkt-congest >> > >>>>>>>> >> > >>>>>>>> Phil, Bob, can we close this finally? It's three weeks now >> > >> since >> > >>>>>>>> this previous message. >> > >>>>>>>> >> > >>>>>>>> Jukka >> > >>>>>>>> >> > >>>>>>>> On Sep 10, 2012, at 4:42 PM, Jukka Manner wrote: >> > >>>>>>>> >> > >>>>>>>>> Ack, I'm getting there...;) >> > >>>>>>>>> >> > >>>>>>>>> So, we could move 3.4 at the end of Section 3 and rephrase >> > >> it to >> > >>>>>>>> not talk about scaling per se, but rather about how Req 2.2 >> > >>>>>>>> eventually leads to Req 2.3? >> > >>>>>>>>> >> > >>>>>>>>> Jukka >> > >>>>>>>>> >> > >>>>>>>>> On Sep 7, 2012, at 4:07 PM, <philip.eardley@bt.com> >> > >>>>>>>>> <philip.eardley@bt.com> wrote: >> > >>>>>>>>> >> > >>>>>>>>>> 3.4 is, in my mind, completely different from 3.1, 3.2, >> > > 3.3 >> > >> and 3.5. >> > >>>>>>>>>> >> > >>>>>>>>>> 3.1, 3.2, 3.3 and 3.5 present motivating arguments for >> > >> Recommendation >> > >>>>>>>>>> 2.2 & a lesser extent 2.3 >> > >>>>>>>>>> 3.4 makes doesn't make a motivating argument - it explains >> > >> that Rec >> > >>>>>>>>>> 2.2 & 2.3 come as a pair (if you recommend one, then >> > >> inevitably you >> > >>>>>>>>>> recommend the other) (delta some politeness about TCP) >> > >>>>>>>>>> >> > >>>>>>>>>> >> > >>>>>>>>>> >> > >>>>>>>>>> -----Original Message----- >> > >>>>>>>>>> From: Manner Jukka [mailto:jukka.manner@aalto.fi] >> > >>>>>>>>>> Sent: 07 September 2012 13:24 >> > >>>>>>>>>> To: Eardley,PL,Philip,DUB8 R >> > >>>>>>>>>> Cc: Briscoe,RJ,Bob,DUB8 R; <tsvwg-chairs@ietf.org>; >> > >>>>>>>>>> <gorry@erg.abdn.ac.uk> >> > >>>>>>>>>> Subject: Re: [REVISED I-D]: Document writeup for >> > >>>>>>>>>> draft-ietf-tsvwg-byte-pkt-congest >> > >>>>>>>>>> >> > >>>>>>>>>> Hi, I'm trying to follow your reasoning, with little luck. >> > >> ;) >> > >>>>>>>>>> >> > >>>>>>>>>> After having read Section 3 again, I could think of a few >> > >>>>>>>> updates, but I'm not fully following what you want to see: >> > >>>>>>>>>> >> > >>>>>>>>>> - S3.5 is kind of small and lonely by itself. It could be >> > >> either >> > >>>>>>>> longer or merged somewhere. Not a major issue though and >> > > could >> > >> remain as >> > >>>>>>>> is. >> > >>>>>>>>>> >> > >>>>>>>>>> - S3.3: Yes, probably could be turned upside down, good >> > >> first, >> > >>>>>>>> bad later. Don't have a strong opinion. >> > >>>>>>>>>> >> > >>>>>>>>>> - S3.4: Here the scaling is used many times and in the >> > >> title, >> > >>>>>>>> although we talk about the end host side. This could be >> > >> rephrased >> > >>>>>>>> and slightly reworded to talk about hosts and their operation >> > >>>>>>>> altogether. Also the word scaling can turn the readers >> > >> expectation >> > >>>>>>>> into the wrong direction. >> > >>>>>>>>>> >> > >>>>>>>>>> - S3 altogether: It would help to have introductory text >> > >> about >> > >>>>>>>> networks vs. end host side, but I fail to see why (& how) to >> > >>>>>>>> restructure everything. >> > >>>>>>>>>> >> > >>>>>>>>>> cheers, >> > >>>>>>>>>> Jukka >> > >>>>>>>>>> >> > >>>>>>>>>> On Sep 6, 2012, at 6:18 PM, >> > >>>>>>>> <philip.eardley@bt.com> <philip.eardley@bt.com> wrote: >> > >>>>>>>>>> >> > >>>>>>>>>>> Ps otherwise Bob's comments are fine, though I suggest >> > > for >> > >> S3.5 you >> > >>>>>>>>>>> give more context (ie are you knocking down what you >> > >> recommend >> > >>>>>>>>>>> against, or building up what you recommend?) and in S3.3 >> > >> think >> > >>>>>>>>>>> would be better if you re-structured slightly [first talk >> > >> about what >> > >>>>>>>>>>> you like - penult para + 1st sentence of last para - then >> > >> talk about >> > >>>>>>>>>>> what you don't like - rest of text] >> > >>>>>>>>>>> >> > >>>>>>>>>>> From: Eardley,PL,Philip,DUB8 R >> > >>>>>>>>>>> Sent: 06 September 2012 16:12 >> > >>>>>>>>>>> To: Briscoe,RJ,Bob,DUB8 R; Manner Jukka >> > >>>>>>>>>>> Cc: tsvwg-chairs@ietf.org; Gorry Fairhurst >> > >>>>>>>>>>> Subject: RE: [REVISED I-D]: Document writeup for >> > >>>>>>>>>>> draft-ietf-tsvwg-byte-pkt-congest >> > >>>>>>>>>>> >> > >>>>>>>>>>> Thanks bob. Have now re-booted some state. >> > >>>>>>>>>>> I don't like S3.4. My basic problem with it is that it >> > >> isn't a >> > >>>>>>>> "motivating argument" (S3 title). It's an explanation about >> > >> the >> > >>>>>>>> consequences of assuming either [option 1] do pkt-size >> > > biasing >> > >> in >> > >>>>>>>> the transport; [option 2]; do pkt-size biasing in the nw. >> > >>>>>>>>>>> Perhaps your effort to buy peace with those who won't >> > >> change >> > >>>>>>>> TCP has obfuscated the basic point? >> > >>>>>>>>>>> The title of S3.4 (Scaling...) and the early text >> > > "scaling >> > >>>>>>>> argument" suggests that you're going to argue there is some >> > >> reason >> > >>>>>>>> why X scales as order N rather than N^2 or something like >> > > that >> > >> - >> > >>>>>>>> which isn't what S3.4 is about. >> > >>>>>>>>>>> >> > >>>>>>>>>>> I actually think it would be better to put S3.4 at the >> > >> start of >> > >>>>>>>> S3 - it's background where you say there are 2 options. After >> > >> that, >> > >>>>>>>> you can explain the reasons why option 1 [pkt biasing in the >> > >>>>>>>> transport] is better - the reasons being:- S3.1 (security >> > > etc >> > >>>>>>>> attacks if small pkts get preferential treatment), S3.2 >> > >> (really a >> > >>>>>>>> side note to S3.1; ie doesn't really present a "motivating >> > >>>>>>>> argument"), S3.3 (nw guessing MTU vs transport knowing pkt >> > >> size), >> > >>>>>>>> S3.5 (easier implementation). >> > >>>>>>>>>>> >> > >>>>>>>>>>> Something like... >> > >>>>>>>>>>> The behaviour of the network and transport together needs >> > >> to >> > >>>>>>>>>>> alleviate congestion. We assume the network is bit- >> > >> congestible >> > >>>>>>>> (currently packet-congestible resources are rate). Therefore >> > >>>>>>>> overall the response of the {network + transport} should be >> > >> the >> > >>>>>>>> same for the same congestion - whether the congestion is >> > >> caused by >> > >>>>>>>> a lot of small packets or a smaller number of larger packets. >> > >> The >> > >>>>>>>> implication is that the transport's behaviour should depend >> > > on >> > >>>>>>>> whether the network takes account (or not) of packet size >> > > when >> > >> it >> > >>>>>>>> generates congestion indications:- Case 1: If the network >> > >>>>>>>> implements packet-mode drop, ie its algorithm treats all >> > >> packets >> > >>>>>>>> equally regardless of their size, then the transport should >> > >> take >> > >>>>>>>> account of the size of the packet - in other words, the >> > >> transport's >> > >>>>>>>> congestion response should depend on the number of dropped >> > >> bytes. >> > >>>>>>>>>>> Case 2: If the network implements byte-mode drop, ie its >> > >>>>>>>> algorithm treats packets differently depending on their size, >> > >> then >> > >>>>>>>> the transport should not take account of the size of the >> > >> packet - >> > >>>>>>>> in other words, the transport's congestion response should >> > >> depend >> > >>>>>>>> on the number of dropped packets. >> > >>>>>>>>>>> >> > >>>>>>>>>>> These cases are explained in more detail in sub-section >> > >> below. >> > >>>>>>>>>>> >> > >>>>>>>>>>> In both cases the same discussion applies if packets are >> > >>>>>>>> ECN-marked rather than dropped. >> > >>>>>>>>>>> In the following sections we explain why Case 1 is >> > >> preferable, >> > >>>>>>>> and hence why we make the Recommendations of S2.2 and S3.3 >> > >>>>>>>> (respectively, drop/mark pkts equally regardless of their >> > >> size, and >> > >>>>>>>> the strength of the transport's response should be >> > >> proportionate to >> > >>>>>>>> the size of the dropped/marked pkt). >> > >>>>>>>>>>> Note that TCP responds to dropped or marked packets, >> > > which >> > >> differs >> > >>>>>>>>>>> from our recommendation for transports. However, we are >> > >> not >> > >>>>>>>>>>> recommending that TCP should be changed; to date it >> > >> hasn't been a >> > >>>>>>>>>>> significant problem because most TCP implementations >> > > have >> > >> been >> > >>>>>>>> used with similar packet sizes. But we do recommend that >> > >> future >> > >>>>>>>> transport protocols should respond to dropped or marked >> > > bytes >> > >> (as >> > >>>>>>>> for example TFRC-SP [RFC4828]effectively does). >> > >>>>>>>>>>> >> > >>>>>>>>>>> >> > >>>>>>>>>>> Sub-section >> > >>>>>>>>>>> You could use much of S3.3, I suggest supplement with >> > >> numerical >> > >>>>>>>>>>> examples continuing S1.2 >> > >>>>>>>>>>> -- >> > >>>>>>>>>>> << This gives a get-out clause to any transport that >> > > isn't >> > >>>>>>>>>>> packet-size dependent (e.g. current TCP). Lacking packet- >> > >> size >> > >>>>>>>>>>> dependence just means they don't scale correctly with >> > >> packet size - >> > >>>>>>>>>>> but they are at liberty not to scale with packet size >> > >> (particularly >> > >>>>>>>>>>> given MTU growth isn't often being used to increase link >> > >> rates at >> > >>>>>>>>>>> this stage in history).>> >> > >>>>>>>>>>> >> > >>>>>>>>>>> This is introducing a different point. it's saying that >> > >> your >> > >>>>>>>> recommendation 2 [drop/mark pkts equally regardless of their >> > >> size] >> > >>>>>>>> is a really strong reco whilst recommendation 3 [strength of >> > >>>>>>>> transport's response proportionate to the size of the >> > >>>>>>>> dropped/marked pkt] is a much less important reco. I can see >> > >> this >> > >>>>>>>> might be true, but I don't think the i-d talks about this. >> > >>>>>>>>>>> >> > >>>>>>>>>>> -- >> > >>>>>>>>>>> >> > >>>>>>>>>>> From: Briscoe,RJ,Bob,DUB8 R >> > >>>>>>>>>>> Sent: 05 September 2012 16:41 >> > >>>>>>>>>>> To: Manner Jukka >> > >>>>>>>>>>> Cc: tsvwg-chairs@ietf.org; Gorry Fairhurst; >> > >> Eardley,PL,Philip,DUB8 R >> > >>>>>>>>>>> Subject: Re: [REVISED I-D]: Document writeup for >> > >>>>>>>>>>> draft-ietf-tsvwg-byte-pkt-congest >> > >>>>>>>>>>> >> > >>>>>>>>>>> Jukka, >> > >>>>>>>>>>> >> > >>>>>>>>>>> I've repeated Phil's suggestions as quoted email then >> > > made >> > >> some >> > >>>>>>>> alterations. >> > >>>>>>>>>>> >> > >>>>>>>>>>> Phil asked me to send this, then he will think about it. >> > >>>>>>>>>>> >> > >>>>>>>>>>> I'll respond about John Leslie's argument next... >> > >>>>>>>>>>> >> > >>>>>>>>>>> >> > >>>>>>>>>>> 2.2. Recommendation on Encoding Congestion Notification >> > >>>>>>>>>>> >> > >>>>>>>>>>> [...] >> > >>>>>>>>>>> >> > >>>>>>>>>>> 1. AQM algorithms such as RED SHOULD use packet-mode >> > >> drop, ie they >> > >>>>>>>>>>> SHOULD NOT use byte-mode drop. The latter is more >> > >> complex, >> > >>>>>>>>>>> it creates the perverse incentive to fragment >> > >> segments into >> > >>>>>>>> tiny >> > >>>>>>>>>>> pieces and it is vulnerable to floods of small >> > >> packets. >> > >>>>>>>>>>> >> > >>>>>>>>>>> 2. If a vendor has implemented byte-mode drop, and an >> > >> operator has >> > >>>>>>>>>>> turned it on, it is RECOMMENDED to turn it off, >> > > after >> > >>>>>>>> establishing >> > >>>>>>>>>>> if there are any implications on the relative >> > >> performance of >> > >>>>>>>>>>> applications using different packet sizes. >> > >>>>>>>>>>> >> > >>>>>>>>>>> RED as a whole SHOULD NOT be turned off. Without >> > > RED, >> > >> a drop >> > >>>>>>>> tail >> > >>>>>>>>>>> queue biases against large packets >> > >>>>>>>>>>> >> > >>>>>>>>>>> [BB adds]: and it is >> > >> vulnerable to >> > >>>>>>>> floods >> > >>>>>>>>>>> of small-packets. >> > >>>>>>>>>>> >> > >>>>>>>>>>> (I suggest the last para just qualifies item 2. It >> > > doesn't >> > >> warrant >> > >>>>>>>>>>> another item number.) >> > >>>>>>>>>>> >> > >>>>>>>>>>> <Agree with Phil; Don't indent "Note well..."> >> > >>>>>>>>>>> >> > >>>>>>>>>>> [PE:] S3.3 >> > >>>>>>>>>>> The section could be slightly clearer. Starting with the >> > >> second >> > >>>>>>>> para, where you start talking about something you don't >> > >> recommend >> > >>>>>>>> (ie you have to remember that you're talking about the >> > >> consequences >> > >>>>>>>> of not following your recommendation). >> > >>>>>>>>>>> >> > >>>>>>>>>>> 3.3 Transport-Independent Network >> > >>>>>>>>>>> >> > >>>>>>>>>>> TCP congestion control ensures that flows competing for >> > >> the same >> > >>>>>>>>>>> resource each maintain the same number of segments in >> > >> flight, >> > >>>>>>>>>>> irrespective of segment size. So under similar >> > >> conditions, flows >> > >>>>>>>>>>> with different segment sizes will get different >> > > bit-rates. >> > >>>>>>>>>>> >> > >>>>>>>>>>> [OLD:] >> > >>>>>>>>>>> One motivation for the network biasing congestion >> > >> notification by >> > >>>>>>>>>>> packet size is to counter this effect and try to equalise >> > >> the bit- >> > >>>>>>>>>>> rates of flows with different packet sizes. >> > >>>>>>>>>>> [SUGGESTED:] >> > >>>>>>>>>>> To counter this effect it seems tempting not to follow >> > >> our >> > >>>>>>>>>>> recommendation, and instead for the network to bias >> > >> congestion >> > >>>>>>>>>>> notification by packet size in order to equalise the >> > > bit- >> > >> rates of >> > >>>>>>>>>>> flows with different packet sizes. >> > >>>>>>>>>>> [CONTINUE AS BEFORE:] >> > >>>>>>>>>>> However, in order to do this, the queuing >> > >> algorithm >> > >>>>>>>>>>> has to make assumptions about the transport, which >> > > become >> > >> embedded >> > >>>>>>>>>>> in the network. Specifically: >> > >>>>>>>>>>> [...] >> > >>>>>>>>>>> >> > >>>>>>>>>>> >> > >>>>>>>>>>> >> > >>>>>>>>>>> S3.3 >> > >>>>>>>>>>> o The queuing algorithm has to assume how aggressively >> > >> the >> > >>>>>>>> transport >> > >>>>>>>>>>> will respond to congestion (see Section 4.2.4). If >> > >> the network >> > >>>>>>>>>>> assumes the transport responds as aggressively as TCP >> > >> NewReno, >> > >>>>>>>> it >> > >>>>>>>>>>> will be wrong for Compound TCP and differently wrong >> > >> for Cubic >> > >>>>>>>>>>> TCP, etc. To achieve equal bit-rates, each transport >> > >> then has >> > >>>>>>>> to >> > >>>>>>>>>>> guess what assumption the network made, and work out >> > >> how to >> > >>>>>>>>>>> replace this assumed aggressiveness with its own >> > >> aggressiveness. >> > >>>>>>>>>>> o Also, if the network biases congestion notification >> > > by >> > >> packet >> > >>>>>>>> size >> > >>>>>>>>>>> it has to assume a baseline packet size--all proposed >> > >> algorithms >> > >>>>>>>>>>> use the local MTU. Then transports have to guess >> > >> which link was >> > >>>>>>>>>>> congested and what its local MTU was, in order to >> > > know >> > >> how to >> > >>>>>>>>>>> tailor their congestion response to that link. >> > >>>>>>>>>>> >> > >>>>>>>>>>> So you're saying that for byte-mode drop, the network >> > >> element >> > >>>>>>>> has to compare the size of the current pkt with the MTU, and >> > >> if >> > >>>>>>>> it's 25* smaller then reduce the pkt loss probably by 25*? >> > > And >> > >> if >> > >>>>>>>> the MTU was bigger, then the current pkt would be dropped >> > > with >> > >>>>>>>> greater probability? And both of these things are hard to do. >> > >>>>>>>>>>> >> > >>>>>>>>>>> [BB]: SUGGESTED REPLACEMENT to 2nd bullet: >> > >>>>>>>>>>> >> > >>>>>>>>>>> o Also, if the network biases congestion notification >> > > by >> > >> packet >> > >>>>>>>> size >> > >>>>>>>>>>> it has to assume a baseline packet size--all proposed >> > >> algorithms >> > >>>>>>>>>>> use the local MTU (for example see the byte-mode loss >> > >>>>>>>> probability >> > >>>>>>>>>>> formula in Table 1). Then if the non-Reno transports >> > >> mentioned >> > >>>>>>>> above >> > >>>>>>>>>>> are trying to reverse engineer what the network >> > >> assumed, they >> > >>>>>>>> also >> > >>>>>>>>>>> have to guess the MTU of the congested link. >> > >>>>>>>>>>> >> > >>>>>>>>>>> [Aside for Phil: The byte-mode algo uses MTU as its >> > >> baseline because >> > >>>>>>>>>>> a link cannot send a packet bigger than its MTU so the >> > >> algo won't >> > >>>>>>>>>>> ever give a drop probability >1).] >> > >>>>>>>>>>> >> > >>>>>>>>>>> >> > >>>>>>>>>>> S3.4 (& knock-on effect on 3.3) >> > >>>>>>>>>>> I think this could be better. >> > >>>>>>>>>>> I agree with your arguments in S3.1 & 3.2 that dropping >> > > or >> > >>>>>>>> marking should be independent of pkt size. >> > >>>>>>>>>>> You now go into a couple of pages, which I think could be >> > >> boiled >> > >>>>>>>>>>> down to a paragraph, saying something like:- >> > >>>>>>>>>>> * the overall system (network + transport) has to be >> > >> packet-size >> > >>>>>>>>>>> dependent in order to drive out the right amount of >> > >> traffic so the >> > >>>>>>>>>>> resource becomes uncongested (since we assume all >> > >> resources are in >> > >>>>>>>>>>> practice byte rather than packet limited) >> > >>>>>>>>>>> * from S3.1 and S3.2 the IETF is convinced that the >> > >> network >> > >>>>>>>> should notify independent of packet-size. >> > >>>>>>>>>>> * therefore the transport must act dependent on packet- >> > >> size. >> > >>>>>>>>>>> >> > >>>>>>>>>>> It seems to me this is a necessary consequence of your >> > >> earlier >> > >>>>>>>> assumptions /arguments. >> > >>>>>>>>>>> >> > >>>>>>>>>>> [BB:] This isn't at all obvious. Until I worked it >> > >> through, it >> > >>>>>>>> wasn't obvious to me. I think it needs to be spelled out. >> > >>>>>>>>>>> >> > >>>>>>>>>>> >> > >>>>>>>>>>> >> > >>>>>>>>>>> [On the other hand, if someone disagrees with the >> > >> arguments and >> > >>>>>>>>>>> thinks that the network should notify dependent on >> > > packet- >> > >> size, then >> > >>>>>>>>>>> it follows that the transport can act independent of >> > >> packet-size.] >> > >>>>>>>>>>> >> > >>>>>>>>>>> I think in S3.3 you're trying to make another argument >> > >> (?):- >> > >>>>>>>>>>> * we can either do packet-size biasing in the network >> > >> element or on >> > >>>>>>>>>>> the end host transport (see argument above) >> > >>>>>>>>>>> * packet-size biasing in the network is quite tricky, >> > >> because >> > >>>>>>>> then both the network element & the transport have to make >> > >> some >> > >>>>>>>> guesses about MTU - whereas with packet-size biasing in the >> > >>>>>>>> transport, neither network element nor transport needs to >> > > know >> > >> what the >> > >>>>>>>> MTU is. >> > >>>>>>>>>>> * it is also easier to implement packet-size biasing in >> > >> the >> > >>>>>>>> transport than in the network. >> > >>>>>>>>>>> >> > >>>>>>>>>>> [BB] ...and all the other arguments about vulnerability >> > > to >> > >>>>>>>> flooding etc, transport-independence etc. >> > >>>>>>>>>>> >> > >>>>>>>>>>> Our argument is sort of like how Phil summarises it, but >> > >> with >> > >>>>>>>> an added nuance. It's also saying no-one has to do >> > > packet-size >> > >>>>>>>> biasing anyway. It's not saying "The transport should given >> > >> the >> > >>>>>>>> network shouldn't". It's saying "*If the transport wants to >> > >> claim >> > >>>>>>>> to be scalable wrt packet size* it should if the network >> > >> shouldn't". >> > >>>>>>>>>>> >> > >>>>>>>>>>> This gives a get-out clause to any transport that isn't >> > >>>>>>>> packet-size dependent (e.g. current TCP). Lacking packet-size >> > >>>>>>>> dependence just means they don't scale correctly with packet >> > >> size - >> > >>>>>>>> but they are at liberty not to scale with packet size >> > >> (particularly >> > >>>>>>>> given MTU growth isn't often being used to increase link >> > > rates >> > >> at >> > >>>>>>>> this stage in history). >> > >>>>>>>>>>> >> > >>>>>>>>>>> None of what Phil says convinces me we could say 3.4 (or >> > >> 3.3) >> > >>>>>>>> any more briefly or differently. It's easier to summarise >> > > what >> > >> you >> > >>>>>>>> understand having read something. But that's not the same as >> > >>>>>>>> writing the thing that helped you understand. There doesn't >> > >> seem to >> > >>>>>>>> be any argument for why anything needs to be changed, so I'm >> > >> going >> > >>>>>>>> to leave it as is, unless pushed. >> > >>>>>>>>>>> >> > >>>>>>>>>>> >> > >>>>>>>>>>> >> > >>>>>>>>>>> On the last bullet, I didn't really follow your argument >> > >> in >> > >>>>>>>> S3.5. Might be easier to follow if S3.5 also talked about how >> > >> a >> > >>>>>>>> network element would take account of pkt size? >> > >>>>>>>>>>> Also, does the following argument also apply?: >> > >>>>>>>>>>> * if pkt-size biasing is done in the transport, then the >> > >> work (to do >> > >>>>>>>>>>> the biasing) is spread over many end hosts, >> > >>>>>>>>>>> * whereas if pkt-size biasing is done in the network, >> > > then >> > >> the >> > >>>>>>>> work is concentrated in just the congested network element, >> > >> which >> > >>>>>>>> has to operate on all the traffic at line speed. >> > >>>>>>>>>>> >> > >>>>>>>>>>> [BB]: 3.5. Implementation Efficiency >> > >>>>>>>>>>> >> > >>>>>>>>>>> [ADD:] >> > >>>>>>>>>>> Biasing against large packets typically requires an >> > > extra >> > >> multiply >> > >>>>>>>>>>> and divide in the network (see the example byte-mode >> > > drop >> > >>>>>>>> formula in table 1). >> > >>>>>>>>>>> [CONTINUE AS BEFORE:] >> > >>>>>>>>>>> Allowing for packet size at the transport rather than in >> > >> the >> > >>>>>>>>>>> network ensures that neither the network nor the >> > >> transport needs to >> > >>>>>>>>>>> do a multiply operation--multiplication by packet size >> > > is >> > >>>>>>>>>>> effectively achieved as a repeated add when the >> > > transport >> > >> adds to >> > >>>>>>>>>>> its count of marked bytes as each congestion event is >> > > fed >> > >> to it. >> > >>>>>>>>>>> [ADD:] >> > >>>>>>>>>>> Also the work to do the biasing is spread over many >> > >> hosts, rather >> > >>>>>>>>>>> than concentrated in just the congested network element. >> > >>>>>>>>>>> [CONTINUE AS BEFORE:] >> > >>>>>>>>>>> >> > > These >> > >> aren't >> > >>>>>>>>>>> principled reasons in themselves, but they are a happy >> > >> consequence >> > >>>>>>>>>>> of the other principled reasons. >> > >>>>>>>>>>> >> > >>>>>>>>>>> >> > >>>>>>>>>>> Bob >> > >>>>>>>>>>> >> > >>>>>>>>>>> At 13:36 05/09/2012, Manner Jukka wrote: >> > >>>>>>>>>>> >> > >>>>>>>>>>> Hi Bob, >> > >>>>>>>>>>> >> > >>>>>>>>>>> Since Phil is your colleague at BT, even at Ipswich (?), >> > >> can >> > >>>>>>>> you check those and agree with Phil how to go forward? Some >> > > of >> > >>>>>>>> those are clear, others need discussion with Phil. >> > >>>>>>>>>>> >> > >>>>>>>>>>> Gorry, on point 2) below, I don't follow Leslie's >> > >> arguments. I >> > >>>>>>>> asked from him detailed comments and resolutions months ago, >> > >> and nothing >> > >>>>>>>> came. >> > >>>>>>>>>>> >> > >>>>>>>>>>> cheers, >> > >>>>>>>>>>> Jukka >> > >>>>>>>>>>> >> > >>>>>>>>>>> On Sep 5, 2012, at 3:26 PM, Gorry Fairhurst wrote: >> > >>>>>>>>>>> >> > >>>>>>>>>>>> Bob and Jukka, >> > >>>>>>>>>>>> >> > >>>>>>>>>>>> I think I need your help to resolve this... Some of the >> > >>>>>>>> comments raised by John Leslie appear to have substance, and >> > >> I'm >> > >>>>>>>> not sure what you plan to do to address these. >> > >>>>>>>>>>>> >> > >>>>>>>>>>>> It would be good to do something quick so that I can >> > >> complete >> > >>>>>>>> the submission to the IESG. >> > >>>>>>>>>>>> >> > >>>>>>>>>>>> The issues I particularly note are: >> > >>>>>>>>>>>> >> > >>>>>>>>>>>> 1) This contribution from Phil appears to be within >> > > scope >> > >> and >> > >>>>>>>> it seems may have been overlooked (sorry): >> > >>>>>>>>>>>> http://www.ietf.org/mail- >> > >> archive/web/tsvwg/current/msg11234.html >> > >>>>>>>>>>>> >> > >>>>>>>>>>>> 2) Can you be clear here about what is actually been >> > > said >> > >> - >> > >>>>>>>> because the point below seems to be a misunderstanding of >> > >> language >> > >>>>>>>> to me, and I hoped we did not claim this: >> > >>>>>>>>>>>> >> > >>>>>>>>>>>>>> ] Alas, when I read it carefully this week, I find >> > > that >> > >> Bob is >> > >>>>>>>>>>>>>> actually ] saying that transport-layer should _only_ >> > >> consider >> > >>>>>>>>>>>>>> "congested bytes", not ] "congested packets". >> > >>>>>>>>>>>>>> ] >> > >>>>>>>>>>>> >> > >>>>>>>>>>>> 3) You'll need to post the updated draft and send a note >> > >> to >> > >>>>>>>> the group to confirm that the new version indeed concludes >> > > the >> > >>>>>>>> WGLC. I'll add a note to the writeup to indicate the draft >> > >> content >> > >>>>>>>> has been stable since Oct 2011. >> > >>>>>>>>>>>> >> > >>>>>>>>>>>> Gorry (with my TSVWG Documnent Shepherd hat) >> > >>>>>>>>>>>> cc: Co-Chairs for info. >> > >>>>>>>>>>>> >> > >>>>>>>>>>>>> >> > >>>>>>>>>>>>>> -----Original Message----- >> > >>>>>>>>>>>>>> From: Gorry Fairhurst [ mailto:gorry@erg.abdn.ac.uk] >> > >>>>>>>>>>>>>> Sent: Mittwoch, 15. August 2012 21:41 >> > >>>>>>>>>>>>>> To: tsvwg chair >> > >>>>>>>>>>>>>> Subject: Fwd: Re: [tsvwg] FYI: Document writeup for >> > >>>>>>>>>>>>>> draft-ietf-tsvwg- byte-pkt-congest >> > >>>>>>>>>>>>>> >> > >>>>>>>>>>>>>> >> > >>>>>>>>>>>>>> >> > >>>>>>>>>>>>>> Hi guys, >> > >>>>>>>>>>>>>> >> > >>>>>>>>>>>>>> >> > >>>>>>>>>>>>>> Apart form the usual rhetoric, there are some points >> > >> here that we >> > >>>>>>>>>>>>>> should note as chairs. What are you own thoughts? And >> > >> was >> > >>>>>>>>>>>>>> anything noted at the IETF in Vancouver concerning >> > > this >> > >> draft? >> > >>>>>>>>>>>>>> >> > >>>>>>>>>>>>>> Gorry >> > >>>>>>>>>>>>>> >> > >>>>>>>>>>>>>> >> > >>>>>>>>>>>>>> >> > >>>>>>>>>>>>>> -------- Original Message -------- >> > >>>>>>>>>>>>>> Subject: Re: [tsvwg] FYI: Document writeup for >> > >>>>>>>>>>>>>> draft-ietf-tsvwg-byte- pkt-congest >> > >>>>>>>>>>>>>> Date: Wed, 15 Aug 2012 15:07:28 -0400 >> > >>>>>>>>>>>>>> From: John Leslie <john@jlc.net> >> > >>>>>>>>>>>>>> To: Gorry Fairhurst <gorry@erg.abdn.ac.uk> >> > >>>>>>>>>>>>>> CC: tsvwg@ietf.org >> > >>>>>>>>>>>>>> >> > >>>>>>>>>>>>>> Gorry Fairhurst <gorry@erg.abdn.ac.uk> wrote: >> > >>>>>>>>>>>>>>> >> > >>>>>>>>>>>>>>> The following document has been revised after >> > >> receiving >> > >>>>>>>> WGLC comments. >> > >>>>>>>>>>>>>> >> > >>>>>>>>>>>>>> The changes are mostly refreshing a document >> > >> scheduled to >> > >>>>>>>>>>>>>> expire this month. >> > >>>>>>>>>>>>>> >> > >>>>>>>>>>>>>>> No issues were found: >> > >>>>>>>>>>>>>> >> > >>>>>>>>>>>>>> Is that with WGC-hat-on? >> > >>>>>>>>>>>>>> >> > >>>>>>>>>>>>>> Neither my issues nor Philip Eardley's comments were >> > >> addressed. >> > >>>>>>>>>>>>>> In fact, not even the changes Bob Briscoe mentioned >> > > on- >> > >> list have >> > >>>>>>>>>>>>>> been included. >> > >>>>>>>>>>>>>> >> > >>>>>>>>>>>>>>> As required by RFC 4858, this is the current template >> > >> for the >> > >>>>>>>>>>>>>> Document >> > >>>>>>>>>>>>>>> Shepherd Write-Up. >> > >>>>>>>>>>>>>>> ... >> > >>>>>>>>>>>>>>> (1) What type of RFC is being requested (BCP, >> > > Proposed >> > >> Standard, >> > >>>>>>>>>>>>>>> Internet Standard, Informational, Experimental, or >> > >> Historic)? >> > >>>>>>>>>>>>>>> Why is this the proper type of RFC? Is this type of >> > >> RFC >> > >>>>>>>>>>>>>>> indicated in the title page header? >> > >>>>>>>>>>>>>>> >> > >>>>>>>>>>>>>>> This document is intended as BCP. (This was discussed >> > >> at IETF-81 >> > >>>>>>>>>>>>>>> and that the status changed from Informational to BCP >> > >> since it >> > >>>>>>>>>>>>>>> provides guidance to implementors and people >> > >> configuring >> > >>>>>>>> routers and hosts). >> > >>>>>>>>>>>>>> >> > >>>>>>>>>>>>>> Thank you for finally answering my question about >> > > BCP >> > >> status. >> > >>>>>>>>>>>>>> >> > >>>>>>>>>>>>>>> ... >> > >>>>>>>>>>>>>>> Working Group Summary >> > >>>>>>>>>>>>>>> >> > >>>>>>>>>>>>>>> There was consensus to publish this as a WG document >> > >> and >> > >>>>>>>>>>>>>>> agreement at >> > >>>>>>>>>>>>>>> IETF-82 that the document was now complete. >> > >>>>>>>>>>>>>> >> > >>>>>>>>>>>>>> Was that reviewed on-list? >> > >>>>>>>>>>>>>> >> > >>>>>>>>>>>>>>> ... >> > >>>>>>>>>>>>>>> (3) Briefly describe the review of this document that >> > >> was >> > >>>>>>>>>>>>>>> performed >> > >>>>>>>>>>>>>> by >> > >>>>>>>>>>>>>>> the Document Shepherd. If this version of the >> > > document >> > >> is not >> > >>>>>>>>>>>>>>> ready for publication, please explain why the >> > > document >> > >> is being >> > >>>>>>>>>>>>>>> forwarded to the IESG. >> > >>>>>>>>>>>>>>> >> > >>>>>>>>>>>>>>> The document was presented at IEWTF-82 (Taipei), with >> > >> a request >> > >>>>>>>>>>>>>>> for >> > >>>>>>>>>>>>>> WGLC. >> > >>>>>>>>>>>>>>> WGLC concluded with some discussion but no >> > > substantive >> > >> changes >> > >>>>>>>>>>>>>>> on Friday 30th March 2012. >> > >>>>>>>>>>>>>> >> > >>>>>>>>>>>>>> Yes, I guess that is true; but ignoring substantive >> > >> comments >> > >>>>>>>>>>>>>> isn't a good practice IMHO. >> > >>>>>>>>>>>>>> >> > >>>>>>>>>>>>>>> (4) Does the document Shepherd have any concerns >> > > about >> > >> the depth >> > >>>>>>>>>>>>>>> or breadth of the reviews that have been performed? >> > >>>>>>>>>>>>>>> >> > >>>>>>>>>>>>>>> No - the original draft had a lot of background >> > >> material, much >> > >>>>>>>>>>>>>>> of >> > >>>>>>>>>>>>>> this >> > >>>>>>>>>>>>>>> has been condensed or removed, resulting in a smaller >> > >> document. >> > >>>>>>>>>>>>>> >> > >>>>>>>>>>>>>> That is indeed what I basically asked for, but I >> > >> don't see any >> > >>>>>>>>>>>>>> evidence of it: >> > >>>>>>>>>>>>>> >> > >>>>>>>>>>>>>> - diff(07,08) 48 lines changed or deleted; 53 lines >> > >> changed or >> > >>>>>>>> added >> > >>>>>>>>>>>>>> - diff(06,07) 8 lines changed or deleted; 13 lines >> > >> changed or >> > >>>>>>>> added >> > >>>>>>>>>>>>>> - diff(05,06) 64 lines changed or deleted; 69 lines >> > >> changed or >> > >>>>>>>>>>>>>> added >> > >>>>>>>>>>>>>> - diff(04,05) 823 lines changed or deleted; 864 lines >> > >> changed or >> > >>>>>>>>>>>>>> added >> > >>>>>>>>>>>>>> >> > >>>>>>>>>>>>>> (Need I go farther? That gets us back to March >> > > 2011...) >> > >>>>>>>>>>>>>> >> > >>>>>>>>>>>>>>> (9) How solid is the WG consensus behind this >> > >> document? Does it >> > >>>>>>>>>>>>>>> represent the strong concurrence of a few >> > > individuals, >> > >> with >> > >>>>>>>>>>>>>>> others being silent, or does the WG as a whole >> > >> understand >> > >>>>>>>> and agree with it? >> > >>>>>>>>>>>>>>> >> > >>>>>>>>>>>>>>> The document has WG support and there is consensus to >> > >> publish. >> > >>>>>>>>>>>>>> >> > >>>>>>>>>>>>>> This, IMHO, doesn't answer the question. :^( >> > >>>>>>>>>>>>>> >> > >>>>>>>>>>>>>> Does it represent the concurrence of a few >> > >> individuals, or does >> > >>>>>>>>>>>>>> the WG as a whole understand and agree with it? >> > >>>>>>>>>>>>>> >> > >>>>>>>>>>>>>> For reference, my comments included: >> > >>>>>>>>>>>>>> ] >> > >>>>>>>>>>>>>> ] When I previously skimmed this document, I believed >> > >> it >> > >>>>>>>>>>>>>> contained only ] common ground: that AQM should >> > >> drop/mark packets >> > >>>>>>>>>>>>>> without favoring small ] packets, and that to whatever >> > >> extent >> > >>>>>>>>>>>>>> packet size _is_ considered, that ] should be a >> > >>>>>>>> transport-layer responsibility. >> > >>>>>>>>>>>>>> ] >> > >>>>>>>>>>>>>> ] Alas, when I read it carefully this week, I find >> > > that >> > >> Bob is >> > >>>>>>>>>>>>>> actually ] saying that transport-layer should _only_ >> > >> consider >> > >>>>>>>>>>>>>> "congested bytes", not ] "congested packets". >> > >>>>>>>>>>>>>> ] >> > >>>>>>>>>>>>>> ] In fact, Bob openly endorses replacing TCP >> > >> congestion-control >> > >>>>>>>>>>>>>> with an ] algorithm which calculates "fair-share >> > > bytes" >> > >> and >> > >>>>>>>>>>>>>> doesn't back off at all ] (even in the presence of 20% >> > >> packet >> > >>>>>>>>>>>>>> loss) unless you're already sending ] more than this >> > >>>>>>>> "fair-share". >> > >>>>>>>>>>>>>> ] >> > >>>>>>>>>>>>>> ] I do not believe that is the consensus of this >> > > WG; >> > >> and I >> > >>>>>>>> believe >> > >>>>>>>>>>>>>> if >> > >>>>>>>>>>>>>> ] that were the consensus it would exceed our charter. >> > >>>>>>>>>>>>>> ] >> > >>>>>>>>>>>>>> ] The draft contains a number of things I _do_ >> > > support; >> > >> and I'd >> > >>>>>>>>>>>>>> be happy ] to support a considerably shorter draft >> > >> which >> > >>>>>>>>>>>>>> concentrates on the AQM ] recommendation, omitting any >> > >>>>>>>>>>>>>> suggestions of modifying TCP congestion ] control. >> > >>>>>>>>>>>>>> >> > >>>>>>>>>>>>>> and Philip's comments included: >> > >>>>>>>>>>>>>> ] >> > >>>>>>>>>>>>>> ] S2.2 I thought the numbered bullets could be better >> > >> phrased, >> > >>>>>>>>>>>>>> something like ] +this:- ] >> > >>>>>>>>>>>>>> ] 1. AQM algorithms such as RED SHOULD use packet- >> > >> mode >> > >>>>>>>> drop, ie they >> > >>>>>>>>>>>>>> SHOULD NOT >> > >>>>>>>>>>>>>> ] +use byte-mode drop. The latter ] is more complex, >> > >>>>>>>>>>>>>> ] it creates the perverse incentive to fragment >> > >> segments >> > >>>>>>>> into >> > >>>>>>>>>>>>>> tiny >> > >>>>>>>>>>>>>> ] pieces and it is vulnerable to floods of >> > >> small- >> > >>>>>>>>>>>>>> ] packets. >> > >>>>>>>>>>>>>> ] >> > >>>>>>>>>>>>>> ] 2. If a vendor has implemented byte-mode drop, >> > >> and an >> > >>>>>>>> operator >> > >>>>>>>>>>>>>> has >> > >>>>>>>>>>>>>> ] turned it on, it is RECOMMENDED to turn it >> > >> off, after >> > >>>>>>>>>>>>>> establishing if >> > >>>>>>>>>>>>>> ] +there are any implications on >> > >>>>>>>>>>>>>> ] the relative performance of >> > >>>>>>>>>>>>>> ] applications using different packet sizes. >> > >>>>>>>>>>>>>> ] >> > >>>>>>>>>>>>>> ] 3. RED SHOULD NOT be turned off. Without RED, a >> > >> drop tail >> > >>>>>>>>>>>>>> ] queue biases against large packets. >> > >>>>>>>>>>>>>> ] >> > >>>>>>>>>>>>>> ] Nit - the following paragraph shouldn't be indented >> > >> (Note well >> > >>>>>>>>>>>>>> that >> > >>>>>>>>>>>>>> RED's...) >> > >>>>>>>>>>>>>> ] >> > >>>>>>>>>>>>>> ] S3.3 >> > >>>>>>>>>>>>>> ] The section could be slightly clearer. Starting with >> > >> the second >> > >>>>>>>>>>>>>> para, where you ] +start talking about something you >> > >> don't >> > >>>>>>>>>>>>>> recommend (ie you have to remember that ] +you're >> > >> talking about >> > >>>>>>>>>>>>>> the consequences of not following your >> > > recommendation). >> > >>>>>>>>>>>>>> ] >> > >>>>>>>>>>>>>> ] S3.3 >> > >>>>>>>>>>>>>> ] o Also, if the network biases congestion >> > >> notification by >> > >>>>>>>> packet >> > >>>>>>>>>>>>>> size >> > >>>>>>>>>>>>>> ] it has to assume a baseline packet size--all >> > >> proposed >> > >>>>>>>>>>>>>> algorithms >> > >>>>>>>>>>>>>> ] use the local MTU. Then transports have to >> > >> guess which >> > >>>>>>>> link >> > >>>>>>>>>>>>>> was >> > >>>>>>>>>>>>>> ] congested and what its local MTU was, in order >> > >> to know >> > >>>>>>>> how to >> > >>>>>>>>>>>>>> ] tailor their congestion response to that link. >> > >>>>>>>>>>>>>> ] So you're saying that for byte-mode drop, the >> > > network >> > >> element >> > >>>>>>>>>>>>>> has to compare the ] +size of the current pkt with the >> > >> MTU, and >> > >>>>>>>>>>>>>> if it's 25* smaller then reduce the ] +pkt loss >> > >> probably by 25*? >> > >>>>>>>>>>>>>> And if the MTU was bigger, then the current pkt would >> > > ] >> > >> +be >> > >>>>>>>>>>>>>> dropped with greater probability? And both of these >> > >> things >> > >>>>>>>> are hard to do. >> > >>>>>>>>>>>>>> ] >> > >>>>>>>>>>>>>> ] S3.4 (& knock-on effect on 3.3) ] I think this could >> > >> be better. >> > >>>>>>>>>>>>>> ] I agree with your arguments in S3.1 & 3.2 that >> > >> dropping or >> > >>>>>>>>>>>>>> marking should be ] +independent of pkt size. >> > >>>>>>>>>>>>>> ] You now go into a couple of pages, which I think >> > >> could be >> > >>>>>>>>>>>>>> boiled down to a ] +paragraph, saying something like:- >> > >> ] * the >> > >>>>>>>>>>>>>> overall system (network + transport) has to be packet- >> > >> size >> > >>>>>>>>>>>>>> dependent in ] >> > >>>>>>>>>>>>>> +order to drive out the right amount of traffic so the >> > >> resource >> > >>>>>>>>>>>>>> becomes ] >> > >>>>>>>>>>>>>> +uncongested (since we assume all resources are in >> > >> practice byte >> > >>>>>>>>>>>>>> +rather >> > >>>>>>>>>>>>>> than ] +packet limited) ] * from S3.1 and S3.2 the >> > > IETF >> > >> is >> > >>>>>>>>>>>>>> convinced that the network should notify ] >> > > +independent >> > >> of >> > >>>>>>>> packet-size. >> > >>>>>>>>>>>>>> ] * therefore the transport must act dependent on >> > >> packet-size. >> > >>>>>>>>>>>>>> ] >> > >>>>>>>>>>>>>> ] It seems to me this is a necessary consequence of >> > >> your earlier >> > >>>>>>>>>>>>>> assumptions ] +/arguments. >> > >>>>>>>>>>>>>> ] >> > >>>>>>>>>>>>>> ] [On the other hand, if someone disagrees with the >> > >> arguments and >> > >>>>>>>>>>>>>> thinks that the ] +network should notify dependent on >> > >>>>>>>>>>>>>> packet-size, then it follows that the ] +transport can >> > >> act >> > >>>>>>>>>>>>>> independent of >> > >>>>>>>>>>>>>> packet- size.] ] ] I think in S3.3 you're trying to >> > >> make another >> > >>>>>>>>>>>>>> argument (?):- ] * we can either do packet-size >> > >> biasing in the >> > >>>>>>>>>>>>>> network element or on the end host ] +transport (see >> > >> argument >> > >>>>>>>>>>>>>> above) ] * packet-size biasing in the network is quite >> > >> tricky, >> > >>>>>>>>>>>>>> because then both the ] +network element & the >> > >> transport have to >> > >>>>>>>>>>>>>> make some guesses about MTU - whereas ] >> > >>>>>>>>>>>>>> +with packet-size biasing in the transport, neither >> > >> network >> > >>>>>>>>>>>>>> +element >> > >>>>>>>>>>>>>> nor ] +transport needs to know what the MTU is. >> > >>>>>>>>>>>>>> ] * it is also easier to implement packet-size biasing >> > >> in the >> > >>>>>>>>>>>>>> transport than in ] +the network. >> > >>>>>>>>>>>>>> ] >> > >>>>>>>>>>>>>> ] On the last bullet, I didn't really follow your >> > >> argument in >> > >>>>>>>> S3.5. >> > >>>>>>>>>>>>>> Might be ] +easier to follow if S3.5 also talked about >> > >> how a >> > >>>>>>>>>>>>>> network element would take ] +account of pkt size? >> > >> Also, does the >> > >>>>>>>>>>>>>> following argument also apply?: >> > >>>>>>>>>>>>>> ] * if pkt-size biasing is done in the transport, then >> > >> the work >> > >>>>>>>>>>>>>> (to do the ] +biasing) is spread over many end hosts, >> > > ] >> > >> * whereas >> > >>>>>>>>>>>>>> if pkt-size biasing is done in the network, then the >> > >> work is ] >> > >>>>>>>>>>>>>> +concentrated in just the congested network element, >> > >> which has to >> > >>>>>>>>>>>>>> operate on all ] +the traffic at line speed. >> > >>>>>>>>>>>>>> >> > >>>>>>>>>>>>>> (Has there perhaps been some confusion about what >> > >> went into the >> > >>>>>>>>>>>>>> -08 >> > >>>>>>>>>>>>>> version?) >> > >>>>>>>>>>>>>> >> > >>>>>>>>>>>>>> -- >> > >>>>>>>>>>>>>> John Leslie <john@jlc.net> >> > >>>>>>>>>>>>>> >> > >>>>>>>>>>>>>> >> > >>>>>>>>>>>>> >> > >>>>>>>>>>>> >> > >>>>>>>>>>> >> > >> ________________________________________________________________ >> > >>>>>>>>>>> Bob Briscoe, BT Innovate & >> > >> Design >> > >>>>>>>>>>> >> > >>>>>>>>>> >> > >>>>>>>>> >> > >>>>>>> >> > >>>>>>> >> > >> ________________________________________________________________ >> > >>>>>>> Bob Briscoe, BT Innovate & >> > >> Design >> > >>>>>>> >> > >>>>> >> > >>>>> ________________________________________________________________ >> > >>>>> Bob Briscoe, BT Innovate & Design >> > >>> >> > >>> ________________________________________________________________ >> > >>> Bob Briscoe, BT Innovate & Design >> > > > >________________________________________________________________ >Bob Briscoe, BT Innovate & Design ________________________________________________________________ Bob Briscoe, BT Innovate & Design
- Re: [tsvwg] [REVISED I-D]: Document writeup for d… Hannes Tschofenig
- [tsvwg] Fwd: RE: [REVISED I-D]: Document writeup … Bob Briscoe
- Re: [tsvwg] Fwd: RE: [REVISED I-D]: Document writ… Andrew McGregor