Re: [aqm] Last Call: <draft-ietf-aqm-fq-codel-05.txt> (FlowQueue-Codel) to Experimental RFC

Dave Taht <dave.taht@gmail.com> Tue, 22 March 2016 04:41 UTC

Return-Path: <dave.taht@gmail.com>
X-Original-To: ietf@ietfa.amsl.com
Delivered-To: ietf@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id C2B0C12D67F; Mon, 21 Mar 2016 21:41:49 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.7
X-Spam-Level:
X-Spam-Status: No, score=-2.7 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, FREEMAIL_FROM=0.001, RCVD_IN_DNSWL_LOW=-0.7, SPF_PASS=-0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 9FrCS1LG478O; Mon, 21 Mar 2016 21:41:47 -0700 (PDT)
Received: from mail-oi0-x22a.google.com (mail-oi0-x22a.google.com [IPv6:2607:f8b0:4003:c06::22a]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 846C612D518; Mon, 21 Mar 2016 21:41:47 -0700 (PDT)
Received: by mail-oi0-x22a.google.com with SMTP id w20so105708138oia.2; Mon, 21 Mar 2016 21:41:47 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :cc; bh=PLpdnBu/v30oPCKizLEe8xjGX9bSZc8i7KwofuY0ypk=; b=rOZ6gFN0t/xaSNhv+zp2jHW+D4fPMoOtiyAM0rB5Zuns4AaLhvlmpAsUz8jkkRvHzB EFivEmEdFzu4mJiz1/LSc0tI/NnXGQGBbLaRo4rFmkPwZikxtPqHit6ttgwgMy1SpJgp y55yhoPmo1BuEkYcaXqvlEdiG+lNxMLuN2OJRjg6jXke0/ZzWfrWIn6MdsnwuyeQqG7q yD/5r2Csl2wFtO3rGzkTDMWpV++nrvQvryEocki0niHTqGU1donlEjHTd/dHuZcKx3Ko 5fl1RCZn/OoYWB1v45Flt4iu/J4rXOl3KjXa6D6lcSuz8pwkzn/aUiBIEC5QnLgy3zfj HERw==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:mime-version:in-reply-to:references:date :message-id:subject:from:to:cc; bh=PLpdnBu/v30oPCKizLEe8xjGX9bSZc8i7KwofuY0ypk=; b=DGucxMsQTc9Ui0RLmTfgVk1aH+MCCXhgnycLmHoW25tk0GpDJjbUOZ9GWogjKf2zRx BN2QgBFqwB5hkzxmRwYhC1J8JXrLgsyXdWll7B18cm16BIPdwJjHB6mCC58gkgaPl8yQ DL1WlM9a2DO+MRIgSppN/BnHs0XxOfVULfzWGdAP9kyHMR2OHMm3KzZODRrIluJLe4Es uyWKHRsSZmIRo4vE3Vu0ddINV87rQjtF2HLLQSkpJ/Rr/1AQMbnSvyFy+jmNaqj/gkRP ftrdr16jm611d5pzjaUsbGIbiW068Uq5JnyW5mDCvJdFiO9DS1ugrzqoa4RLeKb4WsRa M+gw==
X-Gm-Message-State: AD7BkJIQz0CTbI9wPM2WPAmSevHaFMZf4U+jG85s248Fsc5KCzfHGienIozvBqH9U1pBWJCQ2ujdrM5Eq8NQ+A==
MIME-Version: 1.0
X-Received: by 10.202.178.135 with SMTP id b129mr18679094oif.139.1458621706352; Mon, 21 Mar 2016 21:41:46 -0700 (PDT)
Received: by 10.202.78.129 with HTTP; Mon, 21 Mar 2016 21:41:46 -0700 (PDT)
In-Reply-To: <56F037AE.90608@bobbriscoe.net>
References: <20160303172022.12971.79276.idtracker@ietfa.amsl.com> <56EBDA04.3020500@bobbriscoe.net> <8737rox89h.fsf@alrua-desktop.borgediget.toke.dk> <56F037AE.90608@bobbriscoe.net>
Date: Mon, 21 Mar 2016 21:41:46 -0700
Message-ID: <CAA93jw6KsAWY8cuz86JL1ZtXA-LYa2Hapw_o=eUFk3vCW7ZT0w@mail.gmail.com>
Subject: Re: [aqm] Last Call: <draft-ietf-aqm-fq-codel-05.txt> (FlowQueue-Codel) to Experimental RFC
From: Dave Taht <dave.taht@gmail.com>
To: Bob Briscoe <research@bobbriscoe.net>
Content-Type: text/plain; charset=UTF-8
Archived-At: <http://mailarchive.ietf.org/arch/msg/ietf/92Mg7Bqi3ylUgPqIXuEawIU7Am4>
X-Mailman-Approved-At: Tue, 22 Mar 2016 09:41:06 -0700
Cc: draft-ietf-aqm-fq-codel@ietf.org, =?UTF-8?B?VG9rZSBIw7hpbGFuZC1Kw7hyZ2Vuc2Vu?= <toke@toke.dk>, ietf@ietf.org, aqm-chairs@ietf.org, Martin Stiemerling <mls.ietf@gmail.com>, "aqm@ietf.org" <aqm@ietf.org>
X-BeenThere: ietf@ietf.org
X-Mailman-Version: 2.1.17
Precedence: list
List-Id: IETF-Discussion <ietf.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/ietf>, <mailto:ietf-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/ietf/>
List-Post: <mailto:ietf@ietf.org>
List-Help: <mailto:ietf-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/ietf>, <mailto:ietf-request@ietf.org?subject=subscribe>
X-List-Received-Date: Tue, 22 Mar 2016 04:41:50 -0000

I don't even know where to start bob. This part of the language has
been in the draft for 2 years, and you are the only person to object
that I can recall.

It's an experimental RFC. By "safe" we mean that deploying it, within
the guidelines, won't break anything to any huge extent to enormous
benefits. "unsafe", for example, would be promoting use of dctcp while
it still responds incorrectly to packet loss.

Verses your decades long quest for better variable rate video, we've
had over a decade of the bufferbloat problem to deal with on all
traffic, particularly along the edge, and even after solutions started
to appear in mid 2012, we haven't made a real dent in what's deployed,
except for the small select group of devs, academics, ISPs, and
manufacturers willing to try something new.  I'd like to imagine
things are shifting to the left side of the green line here, but under
load, most users are still experiencing latency orders of magnitude in
excess of what can be achieved
http://www.dslreports.com/speedtest/results/bufferbloat?up=1

I've been testing the latest generation of wifi APs of late, and the
"best" of them, under load, in a single direction, has over 2 seconds
at the lower rates. Applying any of these algorithms to wifi is
proving hard, and it's where the bottlenecks are shifting to at least
in my world, where the default download speed is hovering at around
75mbit, and wifi starts breaking down long before that is hit.

...

I tore apart that HAS experiment you cited here:
https://lists.bufferbloat.net/pipermail/bloat/2016-February/007198.html
- where I was, at least, happy
to see fq_codel handle the onslought of dctcp traffic, gracefully. (It
makes me nervous to have such tcps loose on the internet where a
configuration mistake might send that at the wrong people. fq_codel,
"safe" - not, perhaps, optimal - in the face of dctcp.)

my key objections to nearly all the experiments on your side are
non-reproducability, no competing traffic (not even bothering to
measure web PLT in
that paper, for example), no competing upload traffic, and no
inclusion of the typical things that are latency sensitive at all
(voip, dns, tcp neg, ssl neg, etc).

with competing download and upload traffic, fq_codel *dramatically*
improves the responsiveness and utilization of the link, for all
traffic. Above 5mbits pretty much the only thing that matters for web
traffic is RTT, the google cite for this is around somewhere.

I tend to weigh low latency for every other form of traffic...
today... over marginal improvements in a contrived video download
scenario someday.

As for pie vs fq_codel, despite making pie widely available (for
example, it's shipped in the ubuntu 15.10 release) and just as easily
configurable in the sqm-scripts as fq_codel and cake are, it's been
very difficult to get users to stick with it for a decent a/b test.
This is perhaps the best comparison of the two I know of:

http://burntchrome.blogspot.com/2014/05/i-dont-like-this-pie-more-bufferbloat.html

The best hope I have for a set of subjective QoE experiences, is
perhaps this summer we'll have real pie and fq_codel capable modems
and be able to do a blind "taste test". The pie that we have to play
with is not enough like the draft to trust the results from. We're
still screwed in that the CMTSes are not fixed and in most of my test
sites, download bufferbloat is experienced more often than upload, in
one by a factor of 6000x1, so we need to compare three things -
default cmts behavior down + pie/fq_codel up. And with the cmts
throttled down (as we do it today) via htb + pie/fq_codel.

My core sadness in the flowqueue codel draft is that we spend way more
time talking about the caveats than the observed benefits.

In the case of torrent traffic, utp (especially) works well for me, on
my workloads. I run through a vpn all day without bothering to do
anything special about it. GF uploads big fat images all day without
ever bothering me. The gamers on campus are generally happy. (The only
prioritization I do is dns traffic).

The HUGE benefit of having the fq-aqm tech in place is how much better
it is than anything that exists in the field. I cannot remember the
last non-wifi buffering event I've had on any streaming media, nor the
last time netflix paused. I can't remember the last time I got annoyed
in a ssh or mosh shell. I hold videoconferences on a high rate
conference bridge (60mbit down/8 up) all the time on a 75mbit link
with "cake" while testing other stuff... but that is perhaps just me
and a rag-tag group of people that have bothered to actually install
and configure the software, and their workloads.

Without more implementations of all these bufferbloat fighting
technologies, made deployable, with more people trying them, we're not
going to get anywhere. There's still billions of boxes left to deploy
and a long tail of deployed gear that will probably last 10 years or
more.