Re: [hybi] Review of draft-ietf-hybi-thewebsocketprotocol-13

Alexey Melnikov <alexey.melnikov@isode.com> Tue, 06 September 2011 22:45 UTC

Message-ID: <4E66A2EA.8080203@isode.com>
Date: Tue, 06 Sep 2011 23:47:06 +0100
From: Alexey Melnikov <alexey.melnikov@isode.com>
User-Agent: Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.7.12) Gecko/20050915
To: "Richard L. Barnes" <rbarnes@bbn.com>
References: <942CCA6B-B784-441B-96CA-3506FFC439E1@bbn.com> <CALiegfmyQ5h4S2FgBnrh2VLr8+q-h0sLiGsww7T+1VwYNRo4wQ@mail.gmail.com> <72E40A0F-C923-472F-9534-538B89F7A444@bbn.com>
In-Reply-To: <72E40A0F-C923-472F-9534-538B89F7A444@bbn.com>
MIME-Version: 1.0
Content-Type: text/plain; charset="us-ascii"; format="flowed"
Content-Transfer-Encoding: 7bit
Cc: General Area Review Team <gen-art@ietf.org>, hybi@ietf.org
Subject: Re: [hybi] Review of draft-ietf-hybi-thewebsocketprotocol-13
Precedence: list

Richard L. Barnes wrote:

>>>Section 5.6, "Note that a particular text frame might include a partial UTF-8 sequence, however the whole message MUST contain valid UTF-8"
>>>This requirement is meaningless, since the concept of a "message" is not defined here.  Suggest going back to a requirement that a frame MUST contain valid UTF-8 (i.e., that it breaks at code-point boundaries).
>>>      
>>>
>>No please. This has been already discussed.
>>
>>Imagine I must send a very big WS UTF-8 message and due to max frame
>>size requeriments (still to know how such requiremente is
>>"negotiated") I need to split it in N frames. This feature would work
>>at the very transport core layer.
>>
>>Probably I have a function that splits the whole WS message into
>>chunks of N bytes (I mean "bytes" because I do know the max frame size
>>in *bytes*), so such function just counts N bytes from the WS message
>>and generates a frame. Please don't force such function to be
>>Unicode/UTF-8 aware, no please.    
>>
>
>Clearly it already has to be WebSocket aware, and it already has to read the opcode in order to distinguish data frames from control frames.  Adding on a requirement to break at code point boundaries does not seem hugely onerous.  It's three lines of C:
>
>/* 
>uint8_t *new_frame_start = *old_frame_start;
>new_frame_start += DESIRED_FRAME_LENGTH;
>*/
>if (opcode & 0x0f == 0x01) { /* If this is a text frame */
>    while (*new_frame_start & 0xc0 == 0x80) { /* While inside a code point */
>        new_frame_start--; /* Back up one octet */
>    }
>    /* new_frame_start is now at the beginning of a code point */
>}
>
>In contrast, *not* requiring breaking at UTF-8 code points means that clients can't do any meaningful validation on text frames.
>
That is not entirely true. One can check if a full Unicode character was 
received at the end of a frame and just buffer enough from the previous 
frame to continue validation.

There are also other types of possible UTF-8 brokeness, as overlong 
sequences, invalid bytes, etc. These can also be detected early.

>Which means you might as well get rid of text frames entirely.
>

[hybi] Review of draft-ietf-hybi-thewebsocketprot… Richard L. Barnes
Re: [hybi] Review of draft-ietf-hybi-thewebsocket… Iñaki Baz Castillo
Re: [hybi] Review of draft-ietf-hybi-thewebsocket… Alexey Melnikov
Re: [hybi] Review of draft-ietf-hybi-thewebsocket… Richard L. Barnes
Re: [hybi] Review of draft-ietf-hybi-thewebsocket… Willy Tarreau
Re: [hybi] Review of draft-ietf-hybi-thewebsocket… Philipp Serafin
Re: [hybi] Review of draft-ietf-hybi-thewebsocket… Richard L. Barnes
Re: [hybi] Review of draft-ietf-hybi-thewebsocket… Richard L. Barnes
Re: [hybi] Review of draft-ietf-hybi-thewebsocket… Tobias Oberstein
Re: [hybi] Review of draft-ietf-hybi-thewebsocket… Tobias Oberstein
Re: [hybi] Review of draft-ietf-hybi-thewebsocket… Richard L. Barnes
Re: [hybi] Review of draft-ietf-hybi-thewebsocket… Iñaki Baz Castillo
Re: [hybi] Review of draft-ietf-hybi-thewebsocket… Richard L. Barnes
Re: [hybi] Review of draft-ietf-hybi-thewebsocket… Richard L. Barnes
Re: [hybi] Review of draft-ietf-hybi-thewebsocket… Richard L. Barnes
Re: [hybi] Review of draft-ietf-hybi-thewebsocket… Iñaki Baz Castillo
Re: [hybi] Review of draft-ietf-hybi-thewebsocket… Iñaki Baz Castillo
Re: [hybi] Review of draft-ietf-hybi-thewebsocket… Tobias Oberstein
Re: [hybi] Review of draft-ietf-hybi-thewebsocket… Willy Tarreau
Re: [hybi] Review of draft-ietf-hybi-thewebsocket… Tobias Oberstein
Re: [hybi] Review of draft-ietf-hybi-thewebsocket… Bjoern Hoehrmann
Re: [hybi] Review of draft-ietf-hybi-thewebsocket… Tobias Oberstein
Re: [hybi] Review of draft-ietf-hybi-thewebsocket… Tobias Oberstein
Re: [hybi] Review of draft-ietf-hybi-thewebsocket… Richard L. Barnes
Re: [hybi] Review of draft-ietf-hybi-thewebsocket… Richard L. Barnes
Re: [hybi] Review of draft-ietf-hybi-thewebsocket… Richard L. Barnes
Re: [hybi] Review of draft-ietf-hybi-thewebsocket… Tobias Oberstein
Re: [hybi] Review of draft-ietf-hybi-thewebsocket… Tobias Oberstein
Re: [hybi] Review of draft-ietf-hybi-thewebsocket… Iñaki Baz Castillo
Re: [hybi] Review of draft-ietf-hybi-thewebsocket… Richard L. Barnes
Re: [hybi] Review of draft-ietf-hybi-thewebsocket… Iñaki Baz Castillo
Re: [hybi] [Gen-art] Review of draft-ietf-hybi-th… Alexey Melnikov
Re: [hybi] Review of draft-ietf-hybi-thewebsocket… Alexey Melnikov
Re: [hybi] Review of draft-ietf-hybi-thewebsocket… Richard L. Barnes
Re: [hybi] Review of draft-ietf-hybi-thewebsocket… Alexey Melnikov
Re: [hybi] Review of draft-ietf-hybi-thewebsocket… Greg Wilkins