Re: [aqm] floating a draft charter

Andrew Mcgregor <andrewmcgr@google.com> Wed, 05 June 2013 08:28 UTC

Return-Path: <andrewmcgr@google.com>
X-Original-To: aqm@ietfa.amsl.com
Delivered-To: aqm@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 9C6F621F99FF for <aqm@ietfa.amsl.com>; Wed, 5 Jun 2013 01:28:15 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.863
X-Spam-Level:
X-Spam-Status: No, score=-2.863 tagged_above=-999 required=5 tests=[AWL=-0.113, BAYES_00=-2.599, FM_FORGED_GMAIL=0.622, HTML_MESSAGE=0.001, RCVD_IN_DNSWL_LOW=-1, SARE_SUB_OBFU_Q1=0.227]
Received: from mail.ietf.org ([12.22.58.30]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id UmLUiVvCyAcs for <aqm@ietfa.amsl.com>; Wed, 5 Jun 2013 01:28:10 -0700 (PDT)
Received: from mail-vc0-f173.google.com (mail-vc0-f173.google.com [209.85.220.173]) by ietfa.amsl.com (Postfix) with ESMTP id 6D60921F997B for <aqm@ietf.org>; Wed, 5 Jun 2013 01:28:07 -0700 (PDT)
Received: by mail-vc0-f173.google.com with SMTP id ht11so904535vcb.4 for <aqm@ietf.org>; Wed, 05 Jun 2013 01:28:04 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :cc:content-type; bh=lCHb5u01UqkmLPG5frDi8hdnrhoiF63K4vv/teG1nOU=; b=EuTVVusDrRMolk4Jw0MqyYYiEyQT9VrGhJ4Qrq/4OJpHho2up1wrWRI3iQ0JhyuT8I 1vMnWPblSPAdxB4Gj/i1BqwVzY/ebDOTnXfWCsYdDFWxvKnv4amFjLcoaWCaQSaUQGEQ BVtW8oUVA1UTMSlbkY8bC2HZKQDX8DZbUDkPqVRfNL77oDY9QdVMKbYxZJK/VsL3DAQU TsdR/8RegdsGWj6N61Z3HBEEhT6EaC22lJuaUzdtoOvVPgbo2BTWkxq0PSyiUsNDGuPJ 9VkSuxUbtJ5k3YyfE1eysO1dhYQEDbID/sHcb+oluHZne9/KgZa649BLLz87pmTLg1WS SU2g==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :cc:content-type:x-gm-message-state; bh=lCHb5u01UqkmLPG5frDi8hdnrhoiF63K4vv/teG1nOU=; b=YTznYxjIQ+8xDyZwIrXMl7hBoZpU0SVY+E1bPZfQuH1u7YHOrh8OSgM/97aoyjjpfn 8nhX4D8Pkl6VBzReV7eA0Uc97Y3yWzE0kbvSkailroJBpr0Dhsc7dCBwxl2lr+jAAt1G Cj74x8MWPaRYu9zCE1SnsWN47qvJPRIDIQ9L3CEr5ylDMCRXYmoMiRDDHV3HccZKVmKF rjfUPkoMr3t9GJiOFPwpNeB56Q0FfAU6NAtKD3MI77hwtXj4ny6/FHiXKpgMiHUglKAl O/7gFmo7XyOlsTyGnImF0SBYclMm2Zvl+7PvxFbvq40hNiw1tht8eh+Txa+UHKNd+6R9 sjfA==
MIME-Version: 1.0
X-Received: by 10.221.4.131 with SMTP id oc3mr19848713vcb.49.1370420884372; Wed, 05 Jun 2013 01:28:04 -0700 (PDT)
Received: by 10.220.209.20 with HTTP; Wed, 5 Jun 2013 01:28:04 -0700 (PDT)
In-Reply-To: <BF2CBE9A994B89438F7F3B9FD58C2B7F185CD31C@xmb-aln-x15.cisco.com>
References: <CAGhGL2DVU4xLXFMeHi75g6MTFehgjNBZkcpKVFb_K0804EKViw@mail.gmail.com> <BF2CBE9A994B89438F7F3B9FD58C2B7F185CD31C@xmb-aln-x15.cisco.com>
Date: Wed, 05 Jun 2013 18:28:04 +1000
Message-ID: <CAPRuP3mvT+FVyqqQZcG4U7YPAwOB1MABQ89RQQDu3zRpjpUvdg@mail.gmail.com>
From: Andrew Mcgregor <andrewmcgr@google.com>
To: "Rong Pan (ropan)" <ropan@cisco.com>
Content-Type: multipart/alternative; boundary="089e01293fe201a50804de63fad6"
X-Gm-Message-State: ALoCoQnVJ8q91+FR7IT+G+LTvg+RvxR6TRbNDBdmw30Q5pHu9HHeFIXePS+lSG9hY+x/YQ+tb+sp/2UZ08V+145lP2ae8sZie24BPMlzGxO0g8TXOjF1K8b2JjltJSeROoyVjMRlAB5U2CGAH79+UY4TDt0fm+EDx7sqbVhB7lCoFw6BFQSXFO0GaztQPEapuHC5uY86h9lO
Cc: Wesley Eddy <wes@mti-systems.com>, Mirja Kuehlewind <mirja.kuehlewind@ikr.uni-stuttgart.de>, Jim Gettys <jg@freedesktop.org>, "aqm@ietf.org" <aqm@ietf.org>
Subject: Re: [aqm] floating a draft charter
X-BeenThere: aqm@ietf.org
X-Mailman-Version: 2.1.12
Precedence: list
List-Id: "Discussion list for active queue management and flow isolation." <aqm.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/aqm>, <mailto:aqm-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/aqm>
List-Post: <mailto:aqm@ietf.org>
List-Help: <mailto:aqm-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/aqm>, <mailto:aqm-request@ietf.org?subject=subscribe>
X-List-Received-Date: Wed, 05 Jun 2013 08:28:15 -0000

On 5 June 2013 17:09, Rong Pan (ropan) <ropan@cisco.com> wrote:

>  I understand packet scheduling is important. However, most network
> devices today treat these two issues separately: classed-based
> scheduling/queueing and drop/mark algorithms. In class-based queueing,
> vendors differ in how many classes that they support: 2, 4 or 8. In
> drop/mark algorithm, vendors tend to implement both tail-drop and WRED.
>

I'm aware of chips, not including software forwarding, with 1, 2, 4, 5, 6,
8, 9, 16, 64 and arbitrary numbers of classes limited by available memory,
with strict- or soft-priority, DRR, WRR, and drop/mark algorithms including
tail-drop, head-drop, WRED, priority- or class- weighted buffer pressure
management with head or tail eviction, and that's without using rate ACL
mechanisms.  So even restricting the scope to ethernet switch chips there
is far more variety than you're suggesting.

Software can of course do nearly anything, and given that these days
software forwarding is slightly cheaper than silicon forwarding even up to
around 32x10G levels, it is entirely realistic to expect to run software
forwarding in many parts of the network.


> I think we can continue to treat these two issues separately as it has
> been done in practice.
>

Any form of buffer pressure management that takes classes into account
couples these, and two widely deployed switch silicon families that I know
of have that.  That amounts to weak AQM.


> We can prove the values of delay-based AQM and we can also prove the value
> of FQ. The goal of AQM is to give early congestion notification before the
> buffer becomes full or the delay gets big. Scheduling algorithms determine
> bandwidth allocation and weighted fairness among different classes/flows.
> We don't have to mandate a particular scheduling algorithm with a
> particular AQM scheme. Vendors shall be free to choose their best
> performance-cost tradeoff point. I think it would be challenging to force
> vendors to adopt a fixed set.
>
>  Best Regards,
>
>  Rong
>
>
More than challenging, that's impossible.  Not to mention completely
undesirable, because we're well aware that anything we have right now is
imperfect.

Andrew


>
>
>   From: Jim Gettys <jg@freedesktop.org>
> Date: Tuesday, June 4, 2013 8:47 AM
> To: Wesley Eddy <wes@mti-systems.com>
> Cc: Mirja Kuehlewind <mirja.kuehlewind@ikr.uni-stuttgart.de>, "
> aqm@ietf.org" <aqm@ietf.org>
>
> Subject: Re: [aqm] floating a draft charter
>
>   There are a number of things I want to see:
>
>  1) packet scheduling is as important as drop/mark algorithms: it is the
> two together that is such a win with fq_codel.  So I, for one, would be
> *very* unhappy to see a charter that meant it was out of scope to discuss
> and document scheduling and flow queuing strategies.  And packet scheduling
> can interact with drop mark algorithms.
>
>  Optimal scheduling depends on where you are in the network (e.g. a host,
> versus home router, vs. ISP head end, etc.); documenting what makes the
> most sense where would likely help both implementors of software and
> hardware and operators of those networks.  I've given some thought to the
> area, but until we have running code, it's hand waving, but probably won't
> be hand waving in a year or two.
>
>  2) many mark/drop algorithms can co-exist (and generating informational
> RFC's to help customers write RFP's is worth while): but so far, there is
> no universal "one size fits all" mark/drop algorithm that is optimally
> effective everywhere.  Kathy and Van say a primary goal of CoDel was
> "first, do no harm", rather than "optimal under all circumstances".  The
> goal was an algorithm that never hurt you and would always be safe to have
> enabled. CoDel's constants were chosen so that CoDel will work at the edge
> of the network with planetary length paths in the face of highly variable
> bandwidth.
>
>  But CoDel inside a datacenter (unless you mess with its constants) is
> not effective in a timely fashion.
>
>  People have feared tuneable algorithms such as (W)RED can hurt them, so
> what we have had for AQM is sometimes not present at all, or not enabled
> (even in nodes that it would help greatly).  Note that one of Dave Taht's
> discoveries was that RED had been broken in the Linux source tree for 5
> years and *no one had noticed*.
>
>  Maybe someday we'll reach nirvana with a drop/mark algorithm that "just
> works" everywhere well all the time and can never hurt and always works
> perfectly, but that's a lofty goal left for research; in the meanwhile,
> helping people understand what to use where in the network and the
> tradeoffs is important.
>
>  But if CoDel (as we believe) or some other algorithm really is "first,
> do no harm" without tuning,  it has great value to be on by default just to
> prevent the disaster an operator not enabling any AQM at all; the world
> will never be worse than that, and usually be radically better. Anything
> that *requires* tuning of knobs to work is a not a candidate....
>
>  3) Some algorithms may be hard to retrofit into existing platforms.  For
> example, if you can't timestamp a packet, it's really hard to use CoDel,
> which is an algorithm easy to implement from scratch, but could be really
> hard retrofit in an existing hardware design. I don't want to discourage
> people doing things a lot better than nothing as a stop-gap, but people
> need to understand that it may be a second best solution.
>
>  Again, advice as to what/where algorithms may be applicable will help
> designers and operators both.  We face a 5-10 year transition here, even if
> we succeed this time.
>
>  4) There are "feagtures" or lack of features that today's hardware does
> that can/does make life for software very hard. Changes could make life
> much easier.
>
>  For example, today's hardware makes it hard to do true head drop (which
> hardware assist could make trivial).  Some have noticed that at low
> bandwidth we may adjust CoDel parameters to work around the lack of head
> drop and unaccounted for time in the driver's ring buffers (which CoDel
> needs), since Linux's qdisc's can't see the time in the rings. So there are
> a set of recommendations to hardware and OS designers that would help the
> state of the network: this simple head drop feature, packet pacing, making
> GSO/TSO less evil, and so on, war stories on all the places we've been
> working on debloating Linux to guide other OS developers, etc.
>
>  This is really the other side of queue management: try to encourage
> traffic and systems that is less bursty and gentle to the network, so the
> AQM/scheduling algorithms don't have to work as hard.
>
>  Such things should go someplace in the IETF, and if not here, where?
>  Doing nothing anywhere is what I want to avoid.
>
>  Best regards,
>     Jim
>
>
>
> On Thu, May 30, 2013 at 12:13 PM, Wesley Eddy <wes@mti-systems.com> wrote:
>
>> On 5/30/2013 8:20 AM, Mirja Kuehlewind wrote:
>> >
>> > But my point is, we should not only to standardize more and more AQM
>> > mechanisms because that's a large research area which maybe even should
>> be
>> > homed in the IRTF, but we need to find actually deployment strategies.
>> Having
>> > a working group on AQM will already help to make people aware of the
>> topic
>> > and maybe think about using AQM. But only standardizing more and more
>> AQM
>> > queue, might end up in investing a lot of working that no-one will ever
>> use.
>>
>>
>>  I suggest appending something like this to the charter:
>>
>>   Many AQM algorithms have been proposed in academic literature, but
>>   very few are widely implemented and deployed.  A goal of the working
>>   group is to produce recommendations that will actually be used, and
>>   algorithms that will actually be implemented, deployed in equipment,
>>   and enabled.  Towards these ends, the group actively encourages
>>   participation from operators and implementers, and will coordinate
>>   with the IETF OPS area and other relevant parts of the IETF and
>>   Internet community.  Wider research and evaluation of AQM mechanisms
>>   shall be coordinated with the IRTF/ICCRG, and significant
>>   participation in this WG from the academic and research community is
>>   highly desirable, when it is directly relevant to implementation and
>>   deployment.
>>
>>
>> We will definitely need to get engagement from some of the operators
>> that participate in the IETF, and should solicit participation more
>> widely outside as well (e.g. advertise to NANOG list, bufferbloat
>> list, etc.) in order to attract operators and implementers that aren't
>> already aware of this activity.
>>
>> Would this help to address your concern?
>>
>>
>> --
>> Wes Eddy
>> MTI Systems
>>  _______________________________________________
>> aqm mailing list
>> aqm@ietf.org
>> https://www.ietf.org/mailman/listinfo/aqm
>>
>
>
> _______________________________________________
> aqm mailing list
> aqm@ietf.org
> https://www.ietf.org/mailman/listinfo/aqm
>
>


-- 
Andrew McGregor | SRE | andrewmcgr@google.com | +61 4 8143 7128