Re: Naive question on multiple TCP/IP channels and please dont start a uS NN debate here unless you really want to.

Phillip Hallam-Baker <phill@hallambaker.com> Fri, 06 February 2015 15:31 UTC

Return-Path: <hallam@gmail.com>
X-Original-To: ietf@ietfa.amsl.com
Delivered-To: ietf@ietfa.amsl.com
Received: from localhost (ietfa.amsl.com [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 74DDE1A1B38; Fri, 6 Feb 2015 07:31:39 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: 1.423
X-Spam-Level: *
X-Spam-Status: No, score=1.423 tagged_above=-999 required=5 tests=[BAYES_50=0.8, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, FM_FORGED_GMAIL=0.622, FREEMAIL_FROM=0.001, HTML_MESSAGE=0.001, SPF_PASS=-0.001] autolearn=no
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id GAj09gvC7LZk; Fri, 6 Feb 2015 07:31:37 -0800 (PST)
Received: from mail-lb0-x22d.google.com (mail-lb0-x22d.google.com [IPv6:2a00:1450:4010:c04::22d]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 054141A1B80; Fri, 6 Feb 2015 07:31:06 -0800 (PST)
Received: by mail-lb0-f173.google.com with SMTP id p9so18630121lbv.4; Fri, 06 Feb 2015 07:31:04 -0800 (PST)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:sender:in-reply-to:references:date:message-id:subject :from:to:cc:content-type; bh=Wn/cte9ZMu1u9rGU2RJDv50K5mSX3PcQA8liuXUV95k=; b=Wt3OUyFtKJFrSGE/7xuoPbogcD1p6IyZ7jMG3qcLe1Qgwxsx0+yI2dzTw+Uqeykm9Z OkapBxDD43fUl1++LV3WLVZetDWmHge4gLIctTtrDxLAHDWHTFNR+KsW/LJ2eJuwPfTD 5E5ZfwBoj5x5RtMTwmviIlZDVcdjiVAzVAk4N1IETrmwY8V5tzuzuZtcEe6mHB91dDeM 9THf27bpqYie4rZbvUtLpMkNzJsqW65Ca7GXQUQTWlfTUiaNl+kSgSPOllEIBVN12ANn KjGjxfvbYhGvUeQEI5emJbm1a4N/0YTg2Qmx1pTwHGzZCqxgGePXvbEP44k+DHiMyocS tXUw==
MIME-Version: 1.0
X-Received: by 10.152.242.132 with SMTP id wq4mr3350896lac.79.1423236664423; Fri, 06 Feb 2015 07:31:04 -0800 (PST)
Sender: hallam@gmail.com
Received: by 10.112.147.193 with HTTP; Fri, 6 Feb 2015 07:31:04 -0800 (PST)
In-Reply-To: <CAGhGL2AAda10+YY54GJRN4Af_pGC4ZaMv=97=6aNRzqKfJBqkg@mail.gmail.com>
References: <D0F962E2.1E9B2%richard@shockey.us> <CAGhGL2AAda10+YY54GJRN4Af_pGC4ZaMv=97=6aNRzqKfJBqkg@mail.gmail.com>
Date: Fri, 06 Feb 2015 10:31:04 -0500
X-Google-Sender-Auth: UkYqLOlAK1wp3-L3C-WxJ3vPcSM
Message-ID: <CAMm+Lwj4QVKQwwMvmypf6Gv8KEThbWzTFL+OR7ytjPT+RtGwRA@mail.gmail.com>
Subject: Re: Naive question on multiple TCP/IP channels and please dont start a uS NN debate here unless you really want to.
From: Phillip Hallam-Baker <phill@hallambaker.com>
To: Jim Gettys <jg@freedesktop.org>
Content-Type: multipart/alternative; boundary="001a113417f4d07da8050e6d1b2d"
Archived-At: <http://mailarchive.ietf.org/arch/msg/ietf/b_KDLBZ8DGYI2ES-XNPIvsYKsjs>
Cc: IETF Discussion Mailing List <ietf@ietf.org>, "tsvwg@ietf.org" <tsvwg@ietf.org>, Richard Shockey <richard@shockey.us>
X-BeenThere: ietf@ietf.org
X-Mailman-Version: 2.1.15
Precedence: list
List-Id: IETF-Discussion <ietf.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/ietf>, <mailto:ietf-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/ietf/>
List-Post: <mailto:ietf@ietf.org>
List-Help: <mailto:ietf-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/ietf>, <mailto:ietf-request@ietf.org?subject=subscribe>
X-List-Received-Date: Fri, 06 Feb 2015 15:31:39 -0000

On Fri, Feb 6, 2015 at 8:47 AM, Jim Gettys <jg@freedesktop.org> wrote:

>
>
> On Thu, Feb 5, 2015 at 6:34 PM, Richard Shockey <richard@shockey.us>
> wrote:
>
>>
>>
>>>>
>> OK yes, My Netflix download is going to kill your VOIP call.
>>
>
> ​Yes, it may, though pacing traffic may help (see the sched_fq work in
> Linux).​
>
>
>>
>> RS > Well ..its always been more subtle than you think. You have to
>> distinguish between a Voice OVER IP call aka Skype etc vs a “managed
>> service” like Voice USING IP or VuIP which is an entirely different beast.
>> Modern VuIP which is all of Cable Voice in Europe and the US  VZ FIOS etc
>> and VoLTE is managed and  uses IP technologies such as SIP/IMS but the
>> routing may or may not have anything to  to do with BGP.  And BTW you have
>> to still do the first order number translation as well.  AKA RFC 6116 ENUM
>> or something new which we are actually debating in RAI.
>>
>> Its segmented managed traffic. Its not Netfilx killing Skype its
>> Microsoft Apple Android Updates as well.   We have no visibility on how the
>> OS actually queues application packets if at all.
>>
>
> ​Yup; this is a problem in the upstream direction (so is not the case you
> state above, but the inverse case, typified by events such as some one
> emailing a pile of images to your friends/family, backup and similar
> operations.
>
> One of the most common bottlenecks is the WiFi or cellular hop, and if the
> operating system does a single bloated FIFO drop tail queue discipline, you
> get into trouble.​
>
> ​ On Linux, this queue discipline has historically been one called
> PFIFO_FAST, and implemented​ a large (typically 1024 packet) FIFO drop tail
> queue (with a little bit of diffserv thrown in for good measure).
>
> ​Turning on a different queue discipline​ is a single configuration line,
> and it appears that people have been deciding to make fq_codel the default
> in various Linux distributions as of last fall (it has been the default in
> OpenWrt on routers for a while).
>
> At some point, it may make sense to use a different queue discipline
> (sched_fq) as the default, but I think more testing is needed.  That's a
> bit of a discussion better left to a different message.
>

OK we have a technical fix. But the problem is how to get it deployed by
the broadband providers.


There is a very good paper on computer security that might be relevant
here. 'Folk models of home computer security'

http://www.rickwash.com/papers/rwash-homesec-soups10-final.pdf

One of the thing we have found at Comodo is that virtually every home
computer issue is almost automatically attributed to being 'a virus'. What
the customer actually wants is for their damn machine to work. Or
alternatively, they want their relatives to stop calling them to ask them
to fix their computers.

The point is that people tend to leap to external attack as being the most
likely cause of any computer failure issue that isn't obviously dead
hardware. Which is really weird when I spent most of the 90s being told I
was an evil scaremonger for suggesting those cuddly-wuddly hackers might
actually be rather nasty thieving types.


So people see their Netflix or their Vonage suddenly sputter and the first
explanation that comes into their heads is 'my ISP is trying to kill them
to sell me their stuff instead'.


We have the potential here for a really bitter dispute. Instead of picking
up the phone to complain to their ISP, people have been picking up the
phone to complain to their Senator or the guy in the White House.

And it is going to be really difficult to explain to a lot of people who
have taken up arms on either side of this dispute that this might be the
cause of the slowdowns. One side is going to accuse us of being shills for
the corporate interests. And on the other side there are a lot of lobbyists
licking their chops at the thought of fat billable hours for as long as
they can make the fight worse.


So how do we de-escalate the situation?

One part is that we need more than a technical fix for this problem. We
need to be able to tell Joe or Jane Consumer how often these slowdowns
occur and what the cause is. The problem being that the cause of the
problem might be on the broadband provider side or the home user side of
the network.

So maybe have the residential gateway collect some data and expose it to
the consumer in some fashion. This could also help debug other connection
issues. I was having unexplained network slowdowns for a month that were
eventually found to have been caused by a falling tree snapping the fiber
but not the sheath round it. So the result was a flaky connection that had
the peculiar property of working for some frequencies but not others.

Another is to point out that just as the fact that you can't print from
your computer is because you have the wrong printer port assignment rather
than a virus does not mean viruses do not exist, the fact that buffer bloat
explains some network slowdowns does not mean it explains every slowdown.
Nor does it mean that malice is never a cause.