Re: [aqm] aqm Digest, Vol 24, Issue 18

dpreed@reed.com Fri, 27 February 2015 20:06 UTC

Return-Path: <dpreed@reed.com>
X-Original-To: aqm@ietfa.amsl.com
Delivered-To: aqm@ietfa.amsl.com
Received: from localhost (ietfa.amsl.com [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 0F3CC1A0058 for <aqm@ietfa.amsl.com>; Fri, 27 Feb 2015 12:06:33 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: 0.799
X-Spam-Level:
X-Spam-Status: No, score=0.799 tagged_above=-999 required=5 tests=[BAYES_50=0.8, RCVD_IN_DNSWL_NONE=-0.0001, SPF_PASS=-0.001] autolearn=ham
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id G--aaFIFknr9 for <aqm@ietfa.amsl.com>; Fri, 27 Feb 2015 12:06:31 -0800 (PST)
Received: from smtp102.iad3a.emailsrvr.com (smtp102.iad3a.emailsrvr.com [173.203.187.102]) (using TLSv1 with cipher ADH-AES256-SHA (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id C916A1A0053 for <aqm@ietf.org>; Fri, 27 Feb 2015 12:06:30 -0800 (PST)
Received: from smtp13.relay.iad3a.emailsrvr.com (localhost.localdomain [127.0.0.1]) by smtp13.relay.iad3a.emailsrvr.com (SMTP Server) with ESMTP id 153581001B6 for <aqm@ietf.org>; Fri, 27 Feb 2015 15:06:30 -0500 (EST)
Received: from app55.wa-webapps.iad3a (relay-webapps.rsapps.net [172.27.255.140]) by smtp13.relay.iad3a.emailsrvr.com (SMTP Server) with ESMTP id F2D0D1003AD for <aqm@ietf.org>; Fri, 27 Feb 2015 15:06:29 -0500 (EST)
X-Sender-Id: dpreed@reed.com
Received: from app55.wa-webapps.iad3a (relay-webapps.rsapps.net [172.27.255.140]) by 0.0.0.0:25 (trex/5.4.2); Fri, 27 Feb 2015 20:06:30 GMT
Received: from reed.com (localhost.localdomain [127.0.0.1]) by app55.wa-webapps.iad3a (Postfix) with ESMTP id DD19B80069; Fri, 27 Feb 2015 15:06:29 -0500 (EST)
Received: by apps.rackspace.com (Authenticated sender: dpreed@reed.com, from: dpreed@reed.com) with HTTP; Fri, 27 Feb 2015 15:06:29 -0500 (EST)
Date: Fri, 27 Feb 2015 15:06:29 -0500
From: dpreed@reed.com
To: aqm@ietf.org
Cc: aqm@ietf.org
MIME-Version: 1.0
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable
Importance: Normal
X-Priority: 3 (Normal)
X-Type: plain
In-Reply-To: <mailman.1502.1425065417.2864.aqm@ietf.org>
References: <mailman.1502.1425065417.2864.aqm@ietf.org>
X-Auth-ID: dpreed@reed.com
Message-ID: <1425067589.903626768@apps.rackspace.com>
X-Mailer: webmail/11.3.13-RC
Archived-At: <http://mailarchive.ietf.org/arch/msg/aqm/fDliWzdGvMQG-91hsrNsalx2er4>
Subject: Re: [aqm] aqm Digest, Vol 24, Issue 18
X-BeenThere: aqm@ietf.org
X-Mailman-Version: 2.1.15
Precedence: list
List-Id: "Discussion list for active queue management and flow isolation." <aqm.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/aqm>, <mailto:aqm-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/aqm/>
List-Post: <mailto:aqm@ietf.org>
List-Help: <mailto:aqm-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/aqm>, <mailto:aqm-request@ietf.org?subject=subscribe>
X-List-Received-Date: Fri, 27 Feb 2015 20:06:33 -0000

Having done many experiments in many locations, I can tell you it is a real problem and a severe problem, in the real world.

I can observe it every time I fly from Boston to San Jose and back (which I do once per month).  If you use Gogo (the provider that supports several airlines' network access), all you need to do is measure either the uplink or downlink queue, and you find the typical symptom: a buffer that has multiple seconds of backlog, and no packet loss whatsoever (using either ICMP or UDP).  That's pretty severe - what it means is that the probability that "ssh" times out is non zero, not because of lost packets or lost carrier, but because of the default timeout in ssh being exceeded.

Now why is this a serious problem?  Well, think about it.  If packets sit in queues for 4 seconds, what impact does that have on people using HTTP?  Well, they think a website is down, so within a couple of seconds, they switch to surfing some other web site.  Meanwhile, the downloaded packets (pictures or whatever, which are in transit because of a relatively long TCP window that has built up in HTTP 1.1/TCP/IP, which supports all consecutive HTTP requests to the same server, because the service was temporarily OK), are filling up the download queue - even though the user(s) have moved on and probably even closed their connections on their end).  Does this happen?  Of course it does.  Since I'm in the airplane with these guys, I've run wireshark on the WiFi network, recorded TCP packet streams (headers only), and observed exactly this behavior - very delayed HTTP 1.1 packets (many seconds of delay between request and response).

I don't intend to publish this data for one key reason: it does not meet the standards of academic research publication. Why? because I can't get the data that only Gogo has in its network administration.  Either they will measure and publish or they will not. I suspect they will do the latter. Only Comcast has been willing to publish at all - and that is in a CableLabs tech report by a Sr. VP I know at Comcast, who has the authority to make the decision to reveal Comcast "trade secrets" to some extent.  ATT Wireless had this problem on half of its HSPA+ access networks, but it will never publish - it's embarrassing (and would make people switch to an alternate provider, probably, too).  Instead they have blamed "wireless interference" and "badly designed iOS apps", anything other than their network operations.  (I told ATT about this, not under non-disclosure, when I first measured it.  Their network operations people actually measured the problem, I was told, but I don't know if they fixed it. All I know is that they blamed a subcontractor, without naming that subcontractor in my phone call).

So there's nothing for an academic to cite, unfortunately. Sorry.

It's actually quite hard to do research on whether a problem is real - if it is a problem the operator or vendor will not allow you to publish, or worse, will claim that you didn't measure the right thing or are lying or misleading people.  Thank goodness for Comcast's engineers, who were honest. (after their top management started an aggressive attack on those who observed that their initial solution, an attempt to disrupt "pirate traffic" by TCP RST injection, caused the US FCC to slap them on the wrist.

But the problem is real, and quite prevalent.  I carry around gear and software that I use whenever I am in a new location to detect both the "overbuffering" that may be there, and ALSO, by running tests for 24 hours every hour, how it affects usability over a 24 hour period.

I assure you it is a real problem in practice, and also that it affect others than "gamers".  HTTP based services suck badly when the problem arises. In the ATT case, when I was measuring the problem in a Chicago office building where I was doing other consulting work, every iPhone user was complaining that their iPhone data services were not working.






On Friday, February 27, 2015 2:30pm, aqm-request@ietf.org said:

> Send aqm mailing list submissions to
> 	aqm@ietf.org
> 
> To subscribe or unsubscribe via the World Wide Web, visit
> 	https://www.ietf.org/mailman/listinfo/aqm
> or, via email, send a message with subject or body 'help' to
> 	aqm-request@ietf.org
> 
> You can reach the person managing the list at
> 	aqm-owner@ietf.org
> 
> When replying, please edit your Subject line so it is more specific
> than "Re: Contents of aqm digest..."
> Today's Topics:
> 
>    1. Re: Is bufferbloat a real problem? (Curtis Villamizar)
>    2. Re: Is bufferbloat a real problem? (Daniel Havey)
>    3. Re: Is bufferbloat a real problem? (Daniel Havey)
> _______________________________________________
> aqm mailing list
> aqm@ietf.org
> https://www.ietf.org/mailman/listinfo/aqm
>