Re: [hybi] ws: and wss: schemes

Ian Hickson <ian@hixie.ch> Fri, 04 September 2009 20:16 UTC

Return-Path: <ian@hixie.ch>
X-Original-To: hybi@core3.amsl.com
Delivered-To: hybi@core3.amsl.com
Received: from localhost (localhost [127.0.0.1]) by core3.amsl.com (Postfix) with ESMTP id 270003A6765; Fri, 4 Sep 2009 13:16:04 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -3.55
X-Spam-Level:
X-Spam-Status: No, score=-3.55 tagged_above=-999 required=5 tests=[AWL=-0.951, BAYES_00=-2.599]
Received: from mail.ietf.org ([64.170.98.32]) by localhost (core3.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 0BRPSLgXNVXm; Fri, 4 Sep 2009 13:16:02 -0700 (PDT)
Received: from looneymail-a1.g.dreamhost.com (caibbdcaaaaf.dreamhost.com [208.113.200.5]) by core3.amsl.com (Postfix) with ESMTP id CE6AA3A6783; Fri, 4 Sep 2009 13:16:02 -0700 (PDT)
Received: from hixie.dreamhostps.com (hixie.dreamhost.com [208.113.210.27]) (using TLSv1 with cipher DHE-RSA-AES256-SHA (256/256 bits)) (No client certificate requested) by looneymail-a1.g.dreamhost.com (Postfix) with ESMTP id 82CB715D56C; Fri, 4 Sep 2009 13:16:22 -0700 (PDT)
Date: Fri, 04 Sep 2009 20:19:47 +0000
From: Ian Hickson <ian@hixie.ch>
To: URI <uri@w3.org>, "hybi@ietf.org" <hybi@ietf.org>, "uri-review@ietf.org" <uri-review@ietf.org>, "public-i18n-core@w3.org" <public-i18n-core@w3.org>
In-Reply-To: <4AA17310.1090108@gmx.de>
Message-ID: <Pine.LNX.4.62.0909042013300.26930@hixie.dreamhostps.com>
References: <OF22CD1320.96C55266-ON85257610.004AB599-85257610.004BC9CA@lotus.com> <C9931C12-E123-437D-8E7D-F8C680C62397@mnot.net> <4A8CAA72.3000209@berkeley.edu> <Pine.LNX.4.62.0909040147300.6775@hixie.dreamhostps.com> <4AA14792.4020009@gmx.de> <4D25F22093241741BC1D0EEBC2DBB1DA01AD6282C2@EX-SEA5-D.ant.amazon.com> <Pine.LNX.4.62.0909041947250.26930@hixie.dreamhostps.com> <4AA17310.1090108@gmx.de>
Content-Language: en-GB-hixie
Content-Style-Type: text/css
MIME-Version: 1.0
Content-Type: TEXT/PLAIN; charset="US-ASCII"
Subject: Re: [hybi] ws: and wss: schemes
X-BeenThere: hybi@ietf.org
X-Mailman-Version: 2.1.9
Precedence: list
List-Id: Server-Initiated HTTP <hybi.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/listinfo/hybi>, <mailto:hybi-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/hybi>
List-Post: <mailto:hybi@ietf.org>
List-Help: <mailto:hybi-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/hybi>, <mailto:hybi-request@ietf.org?subject=subscribe>
X-List-Received-Date: Fri, 04 Sep 2009 20:16:04 -0000

On Fri, 4 Sep 2009, Julian Reschke wrote:
> Ian Hickson wrote:
> > >
> > > Because that's how URI and thus URLs are defined.
> > 
> > The ws: and wss: URLs are IRIs; why would we limit them to URIs? I'm 
> > not especially interested in ASCII-only URIs at this point. These URLs 
> > are only ever going to be used in contexts that accept full IRIs.
> 
> But that's not who registering an URI scheme works. Check the relevant 
> RFCs. Essentially you register the *URI* scheme, and get IRIs based on 
> the mapping rules defined in RFC 3987.

That's what I thought, but then I got feedback saying I had to register an 
IRI scheme if I wanted to use IRIs.

I've no interest in making ws: and wss: URIs. Only IRIs.

If I define the syntax to be a subset of the full URI syntax, how does it 
ever get extended to be a subset of the full IRI syntax?

What should I put in the spec to make you happy and to make the use of ws: 
and wss: IRIs fully well-defined?


> > > > > Furthermore, it still doesn't answer what the semantics of these parts
> > > > > are. What do "ihier-part" and "iquery" represent in a ws URI?
> > > > This is defined by the RFC 3987, no? Surely we wouldn't want IRI
> > > > components to have different meanings in different schemes?
> > > If you can point to a section in RFC 3987 which defines more than the
> > > syntax, and can state that that also applies to "ws", then, great...
> > 
> > Isn't what the Web Socket protocol spec now says suitable?
> 
> I haven't checked yet.
> 
> It would be helpful if you didn't abuse the IETF ID submission system as local
> storage, and only submitted new drafts when you want community review.

Release early, release often. I always want community review.


> Is now a good moment to read it?

It's always a good moment to read it.


> > > Don't use the include feature then.
> > 
> > The reference feature allows me to automatically generate the 
> > references, which is of more benefit to me than referencing STD 
> > numbers instead of RFC numbers.
> 
> The RFC reference is immutable. Just paste the content in your source 
> file, and change the anchor attribute value.

My source file is an HTML document, so I don't think that would work well.


> > > > > I've deferred to RFC3987 to sidestep this issue.
> > > > A URI is not a IRI.
> > > > 
> > > > You can refer to the mapping, but that really needs a few more words
> > > > than "See RFC3987.".
> > > It may not need many more words, but certainly a few more words.
> > 
> > Could you elaborate? Which words should I add?
> 
> You need to state how you want to encode non-ASCII characters. "See RFC3987"
> goes into the right direction but really isn't sufficient. Please see RFC
> 4395, Section 2.6:
> 
> "2.6. Internationalization and Character Encoding
> 
>    When describing URI schemes in which (some of) the elements of the
>    URI are actually representations of human-readable text, care should
>    be taken not to introduce unnecessary variety in the ways in which
>    characters are encoded into octets and then into URI characters; see
>    RFC 3987 [6] and Section 2.5 of RFC 3986 [5] for guidelines.  If URIs
>    of a scheme contain any text fields, the scheme definition MUST
>    describe the ways in which characters are encoded, and any
>    compatibility issues with IRIs of the scheme."

I've read this, but as far as I can tell, "Always UTF-8" and "See IRI" are 
both complete and accurate ways of addressing this.

Since apparently neither of these options satisfies you, could you state 
exactly what literal text would satisfy you?

-- 
Ian Hickson               U+1047E                )\._.,--....,'``.    fL
http://ln.hixie.ch/       U+263A                /,   _.. \   _\  ;`._ ,.
Things that are impossible just take longer.   `._.-(,_..'--(,_..'`-.;.'