Re: [hybi] Multiplexing in WebSocket

Greg Wilkins <gregw@webtide.com> Sun, 25 October 2009 08:02 UTC

Message-ID: <4AE405FA.7030002@webtide.com>
Date: Sun, 25 Oct 2009 01:02:02 -0700
From: Greg Wilkins <gregw@webtide.com>
User-Agent: Thunderbird 2.0.0.23 (X11/20090817)
MIME-Version: 1.0
To: Jamie Lokier <jamie@shareable.org>, hybi@ietf.org
References: <4AD53DCA.6050304@webtide.com> <Pine.LNX.4.62.0910170203460.9145@hixie.dreamhostps.com> <4ADA7FD4.9010406@webtide.com> <4ADB6F0B.4000004@gmail.com> <Pine.LNX.4.62.0910221120380.9145@hixie.dreamhostps.com> <4AE08907.7080402@webtide.com> <Pine.LNX.4.62.0910230348470.9145@hixie.dreamhostps.com> <4AE1E659.5050507@webtide.com> <Pine.LNX.4.62.0910232154470.13521@hixie.dreamhostps.com> <4AE23D7A.2060009@webtide.com> <20091024182133.GA30762@shareable.org>
In-Reply-To: <20091024182133.GA30762@shareable.org>
Content-Type: text/plain; charset="ISO-8859-1"
Content-Transfer-Encoding: 7bit
Subject: Re: [hybi] Multiplexing in WebSocket
Precedence: list

Jamie Lokier wrote:
> Greg Wilkins wrote:
>> * UTF-16 for those that can't deal with the uncertainty and/or
>>   unpredictability of the length of a UTF-8 string
> 
> Careful.  UTF-16 is a variable length encoding too.
> 
> It is possibly worse than UTF-8 for this, because everyone knows that
> UTF-8 is variable length, but many people seem to think UTF-16 is not.

OK that's worth a Wow!

I had assigned UTF-16 was fixed length because the java implementation
for it is.  But now I see in the java documentations:

 "The char data type (and therefore the value that a Character object
  encapsulates) are based on the original Unicode specification, which
  defined characters as fixed-width 16-bit entities. The Unicode standard
  has since been changed to allow for characters whose representation
  requires more than 16 bits. The range of legal code points is now
  U+0000 to U+10FFFF, known as Unicode scalar value. (Refer to the
  definition of the U+n notation in the Unicode standard.)

  The set of characters from U+0000 to U+FFFF is sometimes referred
  to as the Basic Multilingual Plane (BMP). Characters whose code
  points are greater than U+FFFF are called supplementary characters.
  The Java 2 platform uses the UTF-16 representation in char arrays
  and in the String and StringBuffer classes. In this representation,
  supplementary characters are represented as a pair of char
  values, the first from the high-surrogates range, (\uD800-\uDBFF),
  the second from the low-surrogates range (\uDC00-\uDFFF)."

So I'll modify my original point to

 * UTF-16 for java programmers and other BMP users that don't want to
   deal with the uncertainty and/or unpredictability of the length of
   a UTF-8 string

Thanks for the learning experience!

cheers

[hybi] HyBi Design Space Salvatore Loreto
Re: [hybi] HyBi Design Space Infinity Linden (Meadhbh Hamrick)
Re: [hybi] HyBi Design Space Thomson, Martin
Re: [hybi] HyBi Design Space Salvatore Loreto
Re: [hybi] HyBi Design Space Salvatore Loreto
[hybi] Multiplexing in WebSocket (Was: HyBi Desig… Ian Hickson
Re: [hybi] Multiplexing in WebSocket (Was: HyBi D… Julian Reschke
Re: [hybi] Multiplexing in WebSocket (Was: HyBi D… Infinity Linden (Meadhbh Hamrick)
Re: [hybi] Multiplexing in WebSocket (Was: HyBi D… Greg Wilkins
Re: [hybi] Multiplexing in WebSocket (Was: HyBi D… Graham Klyne
Re: [hybi] Multiplexing in WebSocket (Was: HyBi D… Greg Wilkins
Re: [hybi] HyBi Design Space Stefano Salsano
Re: [hybi] HyBi Design Space Thomson, Martin
Re: [hybi] Multiplexing in WebSocket (Was: HyBi D… Graham Klyne
Re: [hybi] Multiplexing in WebSocket (Was: HyBi D… Ian Hickson
Re: [hybi] Multiplexing in WebSocket (Was: HyBi D… Ian Hickson
Re: [hybi] Multiplexing in WebSocket (Was: HyBi D… Greg Wilkins
Re: [hybi] Multiplexing in WebSocket (Was: HyBi D… Ian Hickson
Re: [hybi] Multiplexing in WebSocket (Was: HyBi D… Jamie Lokier
Re: [hybi] Multiplexing in WebSocket (Was: HyBi D… Ian Hickson
Re: [hybi] Multiplexing in WebSocket (Was: HyBi D… Greg Wilkins
Re: [hybi] Multiplexing in WebSocket (Was: HyBi D… Greg Wilkins
Re: [hybi] Multiplexing in WebSocket (Was: HyBi D… Julian Reschke
Re: [hybi] Multiplexing in WebSocket (Was: HyBi D… Julian Reschke
Re: [hybi] Multiplexing in WebSocket (Was: HyBi D… Michael(tm) Smith
Re: [hybi] Multiplexing in WebSocket (Was: HyBi D… Julian Reschke
Re: [hybi] Multiplexing in WebSocket (Was: HyBi D… Bjoern Hoehrmann
Re: [hybi] Multiplexing in WebSocket (Was: HyBi D… Ian Hickson
Re: [hybi] Multiplexing in WebSocket (Was: HyBi D… Julian Reschke
Re: [hybi] Multiplexing in WebSocket (Was: HyBi D… Ian Hickson
Re: [hybi] Multiplexing in WebSocket (Was: HyBi D… Julian Reschke
Re: [hybi] Multiplexing in WebSocket (Was: HyBi D… SM
Re: [hybi] Multiplexing in WebSocket (Was: HyBi D… Ian Hickson
Re: [hybi] Multiplexing in WebSocket (Was: HyBi D… Martin Tyler
Re: [hybi] Multiplexing in WebSocket (Was: HyBi D… Ian Hickson
Re: [hybi] Multiplexing in WebSocket (Was: HyBi D… Julian Reschke
Re: [hybi] Multiplexing in WebSocket (Was: HyBi D… Greg Wilkins
Re: [hybi] Multiplexing in WebSocket (Was: HyBi D… Wellington Fernando de Macedo
[hybi] new drat design-space-bidirectional Salvatore Loreto
Re: [hybi] Multiplexing in WebSocket Ian Hickson
Re: [hybi] Multiplexing in WebSocket Greg Wilkins
Re: [hybi] Multiplexing in WebSocket Ian Hickson
Re: [hybi] Multiplexing in WebSocket Julian Reschke
Re: [hybi] Multiplexing in WebSocket Ian Hickson
Re: [hybi] Multiplexing in WebSocket Peter Saint-Andre
Re: [hybi] Multiplexing in WebSocket Greg Wilkins
Re: [hybi] Multiplexing in WebSocket Ian Hickson
Re: [hybi] Multiplexing in WebSocket Greg Wilkins
Re: [hybi] Multiplexing in WebSocket Jamie Lokier
Re: [hybi] Multiplexing in WebSocket Greg Wilkins
Re: [hybi] Multiplexing in WebSocket Greg Wilkins
Re: [hybi] Multiplexing in WebSocket Ian Hickson
Re: [hybi] Multiplexing in WebSocket Greg Wilkins
Re: [hybi] Multiplexing in WebSocket Ian Hickson