Re: [aqm] AQM schemes: Queue length vs. delay based

Naeem Khademi <naeem.khademi@gmail.com> Thu, 14 November 2013 20:25 UTC

Return-Path: <naeem.khademi@gmail.com>
X-Original-To: aqm@ietfa.amsl.com
Delivered-To: aqm@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id CADEF11E810D for <aqm@ietfa.amsl.com>; Thu, 14 Nov 2013 12:25:42 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.317
X-Spam-Level:
X-Spam-Status: No, score=-2.317 tagged_above=-999 required=5 tests=[AWL=0.055, BAYES_00=-2.599, HTML_MESSAGE=0.001, NO_RELAYS=-0.001, SARE_SUB_OBFU_Q1=0.227]
Received: from mail.ietf.org ([12.22.58.30]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id netoCuAbxCfw for <aqm@ietfa.amsl.com>; Thu, 14 Nov 2013 12:25:42 -0800 (PST)
Received: from mail-vb0-x22f.google.com (mail-vb0-x22f.google.com [IPv6:2607:f8b0:400c:c02::22f]) by ietfa.amsl.com (Postfix) with ESMTP id A865511E80FA for <aqm@ietf.org>; Thu, 14 Nov 2013 12:25:35 -0800 (PST)
Received: by mail-vb0-f47.google.com with SMTP id g10so2115590vbg.34 for <aqm@ietf.org>; Thu, 14 Nov 2013 12:25:35 -0800 (PST)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :cc:content-type; bh=WHFH51DJdeWY+oNnrvp8jVfYoKI2RN129fqiHrLGVBw=; b=jGvog39jbVpfPUQ9gl9dbJsjglKl97P477lnebXIbypsuzVtkt6HL/RiSpKjwIYc6X +WDRasKvavC1RVPpzBFwP/VxXUZzm8aTxggjX1Eesm8HDWWyPFYsLEQKj2qG6XIN89zN 5HjnYk3K2woyviCiX9xD6lsU/1Wtav0yHPPHRIgi2eBiLI2f1M00QfsfabI08Sdf390i jM03rZ1Lwgm+EHwqTliwFk/Of1f+1SNffY5FONjP37abv3qqioeGtaIt4jnWt4cdz2r+ IcDgmEiAAGkRo7vojI1CGlPc5snuIJwuInYQ+rraiba35XEJvbOyQTBd78YVX1G4o/vr RdGQ==
MIME-Version: 1.0
X-Received: by 10.58.133.77 with SMTP id pa13mr1855105veb.21.1384460735124; Thu, 14 Nov 2013 12:25:35 -0800 (PST)
Received: by 10.220.109.5 with HTTP; Thu, 14 Nov 2013 12:25:35 -0800 (PST)
In-Reply-To: <CEAA66F5.4B52B%prenatar@cisco.com>
References: <CAEjQQ5VhMfgik_QS2GxWRbMDJQf7VXEC3-e_LP0tx5P16pC9PQ@mail.gmail.com> <CEAA66F5.4B52B%prenatar@cisco.com>
Date: Thu, 14 Nov 2013 21:25:35 +0100
Message-ID: <CAEjQQ5V9FrouYRWfjKD-w=ztdEf=zS6ML2VXvK_ucUwR_-wr0w@mail.gmail.com>
From: Naeem Khademi <naeem.khademi@gmail.com>
To: Preethi Natarajan <preethi.cis@gmail.com>
Content-Type: multipart/alternative; boundary="047d7b67339852ce4e04eb28e2f5"
Cc: "aqm@ietf.org" <aqm@ietf.org>, Michael Welzl <michawe@ifi.uio.no>
Subject: Re: [aqm] AQM schemes: Queue length vs. delay based
X-BeenThere: aqm@ietf.org
X-Mailman-Version: 2.1.12
Precedence: list
List-Id: "Discussion list for active queue management and flow isolation." <aqm.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/aqm>, <mailto:aqm-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/aqm>
List-Post: <mailto:aqm@ietf.org>
List-Help: <mailto:aqm-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/aqm>, <mailto:aqm-request@ietf.org?subject=subscribe>
X-List-Received-Date: Thu, 14 Nov 2013 20:25:43 -0000

Very sorry for double-posting as for some reasons "what I was answering to"
was missed from the rest of my previous email. So here again:


On Thu, Nov 14, 2013 at 8:46 PM, Preethi Natarajan <preethi.cis@gmail.com>wrote:

>
>
> From: Naeem Khademi <naeem.khademi@gmail.com>
> Date: Thursday, November 14, 2013 5:32 AM
>
> To: Preethi Natarajan <preethi.cis@gmail.com>
> Cc: Michael Welzl <michawe@ifi.uio.no>, "aqm@ietf.org" <aqm@ietf.org>
> Subject: Re: [aqm] AQM schemes: Queue length vs. delay based
>
>
> AFAIK all three studied AQMs (CoDel, PIE, ARED) have a parameter named
> "target delay" or "target queuing". Judging based on the (param) name, they
> supposedly try to maintain the queue delay/length around/at a certain value
> although "target delay/queuing" has different semantics in each AQM and the
> response to crossing that threshold/value is also different from one AQM to
> another. Moreover, It is favorable for an AQM that tries to fix bufferbloat
> problem to have a predictable delay distribution, most favorably around
> (e.g. mean, median, max) a certain value. PIE uses it, CoDel has it and
> ARED in fixed BW setups has it too. So, I'm not quite sure, why it
> shouldn't be possible to allow a certain size of burst in such
> threshold-based mechanisms (AFAIK (A)RED's Linux implementation already has
> a "burst" param that can be tuned, although its semantics may not be
> identical to the way CoDel or PIE handle bursts).
>
>
> Of course. The question is how effective is the mechanism. As you note,
> whatever "burst" mechanism exists in Linux's ARED wasn't effective as
> evident from the low throughput in the plots.
>
>
>
>
>> Clearly delay-based ARED needs significant and careful redesign. I
>> suppose one can argue this task is as laborious as designing a new AQM
>> scheme as opposed to previous notions that delay-based ARED would work
>> right out of the box.
>>
>
> it's probably not accurate to say "redesign" as delay-based ARED doesn't
> exist as of know. AFAIK given our latest results with ARED, we raised
> the possibility of modifying ARED to use delay-based thresholds instead of
> size-based thresholds during the ICCRG talk and in the TR, and declared it
> as future work. I'm not aware of any other implementation/proposal of
> delay-based ARED to exist as of now. We never claimed or speculated on how
> this future delay-based AQM might perform and left it to when the future
> work is done and the outcome is properly investigated and therefore we
> never said that "it would work right out of the box" :-).
>
>
>
> Great. Thanks for confirming this point that ARED as is does not work well
> for the scenarios considered. What is required is new AQM design, not just
> plugging delay into ARED.
> Preethi
>

Agreed with all your points though to complete it, developing new AQMs
without looking back at and learning from what we have had available leads
to inventing things that improve utilization instead of delay for instance
:-). Again to emphasis we never recommended that people should turn on RED
or ARED (or any future variant of ARED) on their edge devices, but rather
we experimentally demonstrated that it worked better than CoDel and PIE is
most of the scenarios we studied except in very few. We thought it was
important to have more basic understandable experimental results (with
long-lived TCP flows) available before we indulge in looking at more
complex traffic scenarios.

What I would probably choose to investigate e.g. why PIE performs with a
very long upper distribution tail of RTTs with common long-lived TCP
traffic more specifically for lower target delay values even though we know
TCP isn't a very bursty type of traffic? Could that be related to the way
it handles bursts?  or any other reason?  The technical report provides a
very first insight into such issues that need to be explored like this one
but then the story doesn't stop here and ARED was only a benchmark that was
close enough to IETF-documented RED and could work with a certain target
delay (in our scenarios).