Re: [81attendees] sucky Delta hotel network (and bufferbloat)

Jim Gettys <jg@freedesktop.org> Mon, 01 August 2011 20:58 UTC

Return-Path: <gettysjim@gmail.com>
X-Original-To: 81attendees@ietfa.amsl.com
Delivered-To: 81attendees@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 2B1B01F0C47 for <81attendees@ietfa.amsl.com>; Mon, 1 Aug 2011 13:58:22 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -3.598
X-Spam-Level:
X-Spam-Status: No, score=-3.598 tagged_above=-999 required=5 tests=[BAYES_00=-2.599, HTML_MESSAGE=0.001, RCVD_IN_DNSWL_LOW=-1]
Received: from mail.ietf.org ([64.170.98.30]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 2uo7MIb3HwXU for <81attendees@ietfa.amsl.com>; Mon, 1 Aug 2011 13:58:20 -0700 (PDT)
Received: from mail-vw0-f44.google.com (mail-vw0-f44.google.com [209.85.212.44]) by ietfa.amsl.com (Postfix) with ESMTP id B35931F0C39 for <81attendees@ietf.org>; Mon, 1 Aug 2011 13:58:20 -0700 (PDT)
Received: by vws12 with SMTP id 12so5725239vws.31 for <81attendees@ietf.org>; Mon, 01 Aug 2011 13:58:27 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=sender:message-id:date:from:organization:user-agent:mime-version:to :cc:subject:references:in-reply-to:content-type; bh=duVAPInbO9wFyPr2U3qNOgl/u9EvKP0MDxa4n1NEIKI=; b=k46HX3sismRqBS5nWbFyD3EhFNlCfS1pcPwL3Sqe+4KZot3ajneBpWco5BRjnIEkjv xGIWXNRsBv6e/qF0IT1R3XTkb1gRD+11/ZvE5nR2ztysEkympSUYf7auG8KWEfXGpjwX vyAq24llwT8OmAOEMwOWu6vnIe9ZlNXWHkhoU=
Received: by 10.52.93.201 with SMTP id cw9mr5138871vdb.250.1312232307309; Mon, 01 Aug 2011 13:58:27 -0700 (PDT)
Received: from [10.0.0.3] (c-24-218-177-117.hsd1.ma.comcast.net [24.218.177.117]) by mx.google.com with ESMTPS id n21sm1993574vcn.31.2011.08.01.13.58.24 (version=SSLv3 cipher=OTHER); Mon, 01 Aug 2011 13:58:25 -0700 (PDT)
Sender: Jim Gettys <gettysjim@gmail.com>
Message-ID: <4E371370.40804@freedesktop.org>
Date: Mon, 01 Aug 2011 16:58:24 -0400
From: Jim Gettys <jg@freedesktop.org>
Organization: Bell Labs
User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:5.0) Gecko/20110627 Thunderbird/5.0
MIME-Version: 1.0
To: Maciek Konstantynowicz <maciek@juniper.net>
References: <4E2CC532.3090209@sunet.se> <4E2CCE72.3010500@sunet.se> <4E2E2236.2060808@sunet.se> <87EC2139-C7C2-4113-97E2-9EB9DA2406EA@juniper.net> <alpine.BSF.2.00.1107262259400.80739@joyce.lan> <9D00CDB7-2C04-446E-8D53-0552BAEE22EE@juniper.net> <633EAF31-607E-4FEF-B502-5FFAD89BF01A@juniper.net>
In-Reply-To: <633EAF31-607E-4FEF-B502-5FFAD89BF01A@juniper.net>
Content-Type: multipart/alternative; boundary="------------090004030609040500070106"
Cc: 81attendees@ietf.org
Subject: Re: [81attendees] sucky Delta hotel network (and bufferbloat)
X-BeenThere: 81attendees@ietf.org
X-Mailman-Version: 2.1.12
Precedence: list
List-Id: IETF 81 Attendee List <81attendees.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/81attendees>, <mailto:81attendees-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/81attendees>
List-Post: <mailto:81attendees@ietf.org>
List-Help: <mailto:81attendees-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/81attendees>, <mailto:81attendees-request@ietf.org?subject=subscribe>
X-List-Received-Date: Mon, 01 Aug 2011 20:58:22 -0000

Hotels are probably the best (worst) place to see bufferbloat (and many
other problems) in all their glory, particularly when a pile of IETF
folks arrive.  Fred Baker has been studying this slow motion disaster
for many years; he has some very entertaining plots and data
demonstrating various diseases he's found including bufferbloat.  Might
make a good transport plenary session sometime.

Wireless drivers in the operating systems have lots of issues, such as
trying to retransmit packets until the cows come home in the face of
errors, compounding problems; not all of what you see in sick wireless
networks is bufferbloat; lots of other diseases abound.  If you want to
see (and maybe help fix) such an example, look at:
https://www.bufferbloat.net/issues/216 where we are fighting such a
problem in the Linux stack, specifically the Ath9k driver.

In the hotel case, you have many places to suffer from bufferbloat,
depending on the bottleneck link:

    1) in your host operating systems you'll get bufferbloat, if the
transfer rate you are getting via wireless is below that of the
broadband link, common due to both running through multiple walls and
the sharing of the link with others.  A common ring buffer size in
drivers is 200-300 *packets* on modern hardware.

    2) In many hotels, these access points are home routers rather than
enterprise grade access points, so you get into all the trouble in those
I've noted in my talk and blog.

    3) then you get into the bloat in the back haul from the hotels. 
Many/most hotels are now (under) provisioned over broadband connections,
so that hop, anytime there are a lot of users, saturates and stays
saturated during busy hours, particularly if IETF is in town, so you get
latency and bad packet loss there.  Similar pheonomena occur in venues
like libraries.

    4) particular ISP's who specialised in connecting hotels to the
Internet appear to have been (to be) particularly clueless and do not
run any form of AQM on their networks, even though parts of those
networks are often saturated.  Those ISP's seem to have been merged into
bigger ISP's, but the same clueless operators seem still in charge of
the remains of those networks. I've seen bad problems 5 hops into such
networks.  This is classic http://www.faqs.org/rfcs/rfc2309.html mistake
of the 1990's. I won't name any names, but you can figure out who on
your own easily enough.

If you run Windows, pingplotter is probably the best tool to observe
problems; on Linux, mtr can be useful.  I've done my own work mostly
with vanilla ping and scp (to saturate the path).

And anywhere I go, I *always* run netalyzr first, as it warns me of all
sorts of illness (and tries to test for bufferbloat too).
I recommend everyone do so.  See: http://netalyzr.icsi.berkeley.edu/

So hotels and conferences are particularly fruitful places to see
Internet illnesses of all sorts.  The one exception to the conference
problems I've seen is NANOG/IETF, where cluefull people have done useful
classification and AQM setup and where there is lots of traffic via
enterprise access points that aren't over buffered, so that even RED93
algorithms can succeed at controlling the queues.  Would that someone
would write up that configuration and publish it as a BCP: it would at
least help other conferences and enterprises to do much better, while we
sort out the home router mess which needs other AQM algorithms, as yet
untested.
                        - Jim