Re: [tsvwg] Naive question on multiple TCP/IP channels

Jim Gettys <jg@freedesktop.org> Thu, 05 February 2015 16:59 UTC

MIME-Version: 1.0
Sender: gettysjim@gmail.com
In-Reply-To: <53117D83-2F80-4CC7-97FC-8C6FAA49857C@gmail.com>
References: <CAMm+Lwgb9L9bUG6ommBDYJzQTCU1cC_zLSEf_5JPeJ+c=yrYmA@mail.gmail.com> <5DF6DC77-E476-408F-9FA5-F107DDC9F857@netapp.com> <54D2B150.3020502@gmail.com> <53117D83-2F80-4CC7-97FC-8C6FAA49857C@gmail.com>
Date: Thu, 05 Feb 2015 11:59:43 -0500
Message-ID: <CAGhGL2ByvEh2P+n6TjKnT248yw=84bM1hNcA3jU4GZYfkbg_-Q@mail.gmail.com>
Subject: Re: [tsvwg] Naive question on multiple TCP/IP channels
From: Jim Gettys <jg@freedesktop.org>
To: Piers O'Hanlon <p.ohanlon@gmail.com>
Content-Type: multipart/alternative; boundary="001a11c2e5580a12c6050e5a3b9f"
Archived-At: <http://mailarchive.ietf.org/arch/msg/ietf/9HhLPVwDHfVGoc63DweNwOAqOPw>
Cc: Phillip Hallam-Baker <phill@hallambaker.com>, IETF Discussion Mailing List <ietf@ietf.org>, tsvwg WG <tsvwg@ietf.org>
Precedence: list

On Thu, Feb 5, 2015 at 5:06 AM, Piers O'Hanlon <p.ohanlon@gmail.com> wrote:

> One of the snags with using multiple [slower] TCP streams is that although
> you might get a faster throughput you end up losing more packets as the
> equivalent TCP loss rate for multiple lower rate flows is higher than for
> one larger stream. This can be explained by looking at the TCP [simple]
> equation Xrate = 1.2*PacketLen/(RTT*sqrt(loss)) where it can be seen that
> the lower the send rate the higher the loss needs to be on a path with the
> same RTT.
>

There is sooo much buffering in most of the Internet edge devices, that
packet loss is mostly irrelevant for most web traffic.  There is typically
more than a page worth's of buffering everywhere, sometimes lots more. So
beware theory; it can be misleading...

The scale of buffering is sometimes truely incredible.  I measured
*interplanetary* scale buffering on GoGo on my way to Hawaii....

See Dave Reed's comments on where it has taken them:
http://www.reed.com/blog-dpr/?p=174



>
> I guess there were also other approaches like BEEP, and Google's QUIC
> which multiplexed multiple connections over UDP.
>
> Piers
>
> On 4 Feb 2015, at 23:54, Brian E Carpenter wrote:
>
> > On 05/02/2015 08:49, Eggert, Lars wrote:
> >> Hi,
> >>
> >> CC'ing tsvwg, which would be a better venue for this discussion.
> >>
> >> On 2015-2-4, at 20:22, Phillip Hallam-Baker <phill@hallambaker.com>
> wrote:
> >>>
> >>> Today most Web browsers attempt to optimize download of images etc. by
> opening multiple TCP/IP streams at the same time. This is actually done for
> two reasons, first to reduce load times and second to allow the browser to
> optimize page layout by getting image sizes etc up front.
> >>>
> >>> This approach first appeared round about 1994. I am not sure whether
> anyone actually did a study to see if multiple TCP/IP streams are faster
> than one but the approach has certainly stuck.
> >>
> >> There have been many studies; for example,
> http://www.aqualab.cs.northwestern.edu/publications/106-modeling-and-taming-parallel-tcp-on-the-wide-area-network
> >
> > GridFTP only exists because of lots of experience that several parallel
> FTP
> > streams achieve better throughput than a single stream, especially on
> paths
> > with a high bandwidth-delay product. I'm guessing that since buffer bloat
> > creates an artificially high BDP, that could apply pretty much anywhere.
> >
> > SCTP is not the only one-acronym answer: try MPTCP. The interesting
> thing there
> > is that because there is explicit coupling between the streams, the
> throughput
> > increases sub-linearly with the number of streams.
> >
> >    Brian
> >
> >>
> >>> But looking at the problem from the perspective of the network it is
> really hard to see why setting up five TCP/IP streams between the same
> endpoints should provide any more bandwidth than one. If the narrow waist
> is observed, then the only parts of the Internet that are taking note of
> the TCP part of the packet are the end points. So having five streams
> should not provide any more bandwidth than one unless the bandwidth
> bottleneck was at one or other endpoint.
> >>
> >> You don't get more bandwidth in stead state (well, with old Reno
> stacks, you got a little more, but not much). The real win is in getting
> more bandwidth during the first few RTTs of TCP slow-start, which is the
> crucial phase when transmitting short web objects.
> >>
> >>> Now there are some parts of the deployed Internet that do actually
> perform statefull inspection. But I would expect increasing the number of
> channels to degrade performance at a firewall or any other middle boxen.
> >>>
> >>> So we have a set of behavior that seems at odd with the theory. Has
> anyone done any experiments recently that would show which is right?
> >>
> >> I haven't seen any performance study, but another concern is that
> middleboxes obviously need to maintain state per connection, and multiple
> parallel connections eat that binding space up more quickly. (And for a
> NAT, reduce the number of clients it can serve.)
> >>
> >>> The reason it makes a difference is that it is becoming clear that
> modern applications are not best served by an application API that is
> limited to one bi-directional stream. There are two possible ways to fix
> this situation. The first is to build something on top of TCP/IP the second
> is to replace single stream TCP with multi-stream.
> >>
> >> SCTP has what you call multiple streams in your second option, and is
> designed the same way.
> >>
> >>> My preference and gut instinct is that the first is the proper
> architectural way to go regardless of the performance benefits. When
> Thompson and co were arguing that all files are flat sequences of bits,
> they were saying that was the right O/S abstraction because you could build
> anything you like on top.
> >>>
> >>> But then I started to ask what the performance benefits to a
> multi-stream TCP might be and I am pretty sure there should not be any. But
> the actual Internet does not always behave like it appears it should.
> >>
> >> See above.
> >>
> >> Also, one motivation for SPDY/HTTP2.0 is to reduce the number of
> parallel connections, since web people have noticed that more is not always
> better here.
> >>
> >>> I suspect that the preference for multiple streams probably comes from
> the threading strategies it permits. But that is an argument about where
> the boundary between the kernel and application is placed in the network
> stack rather than where multiplex should live in the stack. Microsoft
> already provides a network stack for .NET where the boundary is in the HTTP
> layer after all.
> >>>
> >>>
> >>> So anyone got hard data they could share?
> >>
> >> The TSVWG folks may have.
> >>
> >> Lars
> >>
> >
>
>

Naive question on multiple TCP/IP channels Phillip Hallam-Baker
Re: Naive question on multiple TCP/IP channels Roberto Peon
Re: Naive question on multiple TCP/IP channels Donald Eastlake
Re: Naive question on multiple TCP/IP channels David Morris
Re: Naive question on multiple TCP/IP channels Eggert, Lars
Re: Naive question on multiple TCP/IP channels Phillip Hallam-Baker
Re: Naive question on multiple TCP/IP channels Christopher Morrow
Re: Naive question on multiple TCP/IP channels Phillip Hallam-Baker
Re: Naive question on multiple TCP/IP channels Christopher Morrow
Re: Naive question on multiple TCP/IP channels Brian E Carpenter
Re: Naive question on multiple TCP/IP channels Phillip Hallam-Baker
Re: Naive question on multiple TCP/IP channels Martin Rex
Re: Naive question on multiple TCP/IP channels Jim Gettys
Re: Naive question on multiple TCP/IP channels Phillip Hallam-Baker
Re: Naive question on multiple TCP/IP channels Jim Gettys
Re: Naive question on multiple TCP/IP channels Phillip Hallam-Baker
Re: [tsvwg] Naive question on multiple TCP/IP cha… Piers O'Hanlon
Re: [tsvwg] Naive question on multiple TCP/IP cha… Jim Gettys
Re: Naive question on multiple TCP/IP channels Wesley Eddy