Re: [hybi] hum #3: Message

John Tamplin <jat@google.com> Thu, 05 August 2010 19:53 UTC

From: John Tamplin <jat@google.com>
Date: Thu, 05 Aug 2010 15:53:56 -0400
Message-ID: <AANLkTim_PzXf0r=nfhCgtxpt-=s8-51hdAe0z2bSd5B9@mail.gmail.com>
To: Ian Hickson <ian@hixie.ch>
Cc: "hybi@ietf.org" <hybi@ietf.org>
Subject: Re: [hybi] hum #3: Message

On Thu, Aug 5, 2010 at 3:31 PM, Ian Hickson <ian@hixie.ch> wrote:

> I was trying to express both.
>
> Having to split a 2GB file into a bazilion pieces and write each one out
> individually with frame headers, compared to just writing the whole thing
> at once, seems inefficient far beyond simply the concern about the
> on-the-wire overhead.


Assuming you don't care about buffering requirements on the client, you only
care about the case where you already have the entire data to send in memory,
and you don't care about compressing the output, then I agree that just doing
write(socket, buf, len) is easier.  However, it isn't much easier than:

while (len > MAX_FRAME_SIZE) {
  writeHeader(socket, true, MAX_FRAME_SIZE);  // non-final fragment ("more" flag set)
  write(socket, buf, MAX_FRAME_SIZE);
  buf += MAX_FRAME_SIZE;
  len -= MAX_FRAME_SIZE;
}
writeHeader(socket, false, len);              // final fragment
write(socket, buf, len);

Also, you have previously stated that you expect compression to always be
used once we get around to supporting it.  I have previously given pseudocode
for what the compression loop looks like -- if I have to send the length of
everything up front, I have to finish compressing the entire data before I can
send a single byte over the wire, and I need an arbitrarily large output
buffer (which means reallocating it, with multiple copies), which is
inefficient.
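
For concreteness, here is a rough sketch of that loop, assuming zlib plus the
same hypothetical writeHeader() helper and MAX_FRAME_SIZE constant as above
(error handling omitted).  With fragments, each filled output buffer goes on
the wire immediately:

#include <string.h>
#include <unistd.h>
#include <zlib.h>

#ifndef MAX_FRAME_SIZE
#define MAX_FRAME_SIZE 16384                   /* placeholder, same role as above */
#endif

void writeHeader(int socket, int more, size_t len);  /* hypothetical, as above */

void sendCompressed(int socket, const unsigned char *buf, size_t len) {
  unsigned char out[MAX_FRAME_SIZE];
  z_stream zs;
  memset(&zs, 0, sizeof(zs));
  deflateInit(&zs, Z_DEFAULT_COMPRESSION);

  zs.next_in = (unsigned char *) buf;
  zs.avail_in = len;                  // assumes len fits in zlib's uInt; a real
                                      // loop would feed the input in chunks
  int ret;
  do {
    zs.next_out = out;
    zs.avail_out = sizeof(out);
    ret = deflate(&zs, Z_FINISH);     // all input is in hand here; a true stream
                                      // would use Z_NO_FLUSH until the last chunk
    size_t produced = sizeof(out) - zs.avail_out;
    writeHeader(socket, ret != Z_STREAM_END, produced);  // more fragments follow?
    write(socket, out, produced);
  } while (ret != Z_STREAM_END);

  deflateEnd(&zs);
}

With a length-up-front format, none of those write() calls could happen until
deflate() had consumed all of the input.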

Finally, there are cases where the WebSocket server isn't the ultimate
source of the data to send, so requiring a length up front means buffering
every byte from the source before you can write a single byte on the
WebSocket connection.  If instead I have fragments, I can use a fixed-size
buffer that I never have to reallocate, read the data from the upstream
source, and write individual WebSocket frames as that buffer fills.
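
A rough sketch of that case, reusing the hypothetical writeHeader() helper and
MAX_FRAME_SIZE from above, and assuming the framing allows a zero-length final
fragment to close the message (error handling omitted):

#include <unistd.h>

void relay(int socket, int upstream) {
  unsigned char buf[MAX_FRAME_SIZE];        // fixed-size, never reallocated
  ssize_t n;
  while ((n = read(upstream, buf, sizeof(buf))) > 0) {
    writeHeader(socket, 1, (size_t) n);     // non-final fragment
    write(socket, buf, (size_t) n);
  }
  writeHeader(socket, 0, 0);                // empty final fragment ends the message
}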

Regarding the small-frame case -- a variable-length length field is only
more efficient than a fixed-length one if the payload data is between 0 and
127 bytes.  In that case, the extra byte required seems insignificant compared
to the TCP/IP overhead of a small packet, where even mobile devices don't use
IP header compression.  Again, this doesn't seem to rise to the level of
"quite inefficient".

When considering inefficiency beyond bytes on the wire, consider these two
receivers:

int len = 0;
int c;
do {
  c = readByte(socket);
  len = (len << 7) + (c & 127);  // accumulate 7 bits of length per byte
} while (c & 128);               // high bit set means another length byte follows
// (re)allocate buf to hold len bytes
read(socket, buf, len);

vs

struct WSHeader hdr;
read(socket, &hdr, sizeof(hdr));
// fix up byte order (network to host)
// buf is preallocated to the maximum frame size
read(socket, buf, hdr.len);

[ignoring error handling, of course]

The former again requires either many allocations and frees of the buffer, or
reallocating it as larger frames are seen, and it requires more system calls
(one readByte() per length byte).

-- 
John A. Tamplin
Software Engineer (GWT), Google