Re: [hybi] [Uri-review] ws: and wss: schemes

Joseph A Holsten <joseph@josephholsten.com> Fri, 04 September 2009 07:23 UTC

Return-Path: <josephholsten@gmail.com>
X-Original-To: hybi@core3.amsl.com
Delivered-To: hybi@core3.amsl.com
Received: from localhost (localhost [127.0.0.1]) by core3.amsl.com (Postfix) with ESMTP id 51DB33A6866; Fri, 4 Sep 2009 00:23:26 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.599
X-Spam-Level:
X-Spam-Status: No, score=-2.599 tagged_above=-999 required=5 tests=[AWL=0.000, BAYES_00=-2.599]
Received: from mail.ietf.org ([64.170.98.32]) by localhost (core3.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id FZBNCXNZMr9A; Fri, 4 Sep 2009 00:23:25 -0700 (PDT)
Received: from mail-yw0-f204.google.com (mail-yw0-f204.google.com [209.85.211.204]) by core3.amsl.com (Postfix) with ESMTP id 04BBE3A676A; Fri, 4 Sep 2009 00:23:24 -0700 (PDT)
Received: by ywh42 with SMTP id 42so1147829ywh.30 for <multiple recipients>; Fri, 04 Sep 2009 00:20:14 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:received:received:sender:cc:message-id:from:to :in-reply-to:content-type:content-transfer-encoding:mime-version :subject:date:references:x-mailer; bh=dIkqL/XedW8/XNyiih3FJ0uEItg79mbiigcbY9+P6/Y=; b=XzToFPHC3WdKslW56sE1eS9PFjRAXohNM60rdNyd9KW8gcuu9yRovbx/MGZPYQPexy hL6eKsjnHsT/YfVaSCaJLZlP7Zj2eTv2xiX70zINeciRgm71MLlH7bkCgEtjxE+7V4nv WLV/18zQpL1pOlEC5QgRNCZSnKPpAPRScKHYs=
DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=sender:cc:message-id:from:to:in-reply-to:content-type :content-transfer-encoding:mime-version:subject:date:references :x-mailer; b=HJiVQ5XQf+u+e4sJpCL0tDxIxK38Pus4LZpIdG9YlJPme9LAsn95hWQeMvvmBafeR6 lJbY7QHLGUEOWQCrHrczUyhHBpgupdpP0fAgZoTfTD5nL3LYivj5uDkKgNwEn3RCaPNj XcGLGRZHbjx5zqeMLXG8c1im4Sv46xzTpFjxk=
Received: by 10.90.41.4 with SMTP id o4mr7935584ago.66.1252048490839; Fri, 04 Sep 2009 00:14:50 -0700 (PDT)
Received: from ?192.168.1.100? (ip70-189-108-199.ok.ok.cox.net [70.189.108.199]) by mx.google.com with ESMTPS id 2sm882499aga.18.2009.09.04.00.14.49 (version=TLSv1/SSLv3 cipher=RC4-MD5); Fri, 04 Sep 2009 00:14:50 -0700 (PDT)
Sender: Joseph Holsten <josephholsten@gmail.com>
Message-Id: <6CD25FB7-160A-4EC3-A3CB-0581D6ED3614@josephholsten.com>
From: Joseph A Holsten <joseph@josephholsten.com>
To: Ian Hickson <ian@hixie.ch>
In-Reply-To: <Pine.LNX.4.62.0909040147300.6775@hixie.dreamhostps.com>
Content-Type: text/plain; charset="US-ASCII"; format="flowed"; delsp="yes"
Content-Transfer-Encoding: 7bit
Mime-Version: 1.0 (Apple Message framework v936)
Date: Fri, 04 Sep 2009 02:14:48 -0500
References: <OF22CD1320.96C55266-ON85257610.004AB599-85257610.004BC9CA@lotus.com> <C9931C12-E123-437D-8E7D-F8C680C62397@mnot.net> <4A8CAA72.3000209@berkeley.edu> <Pine.LNX.4.62.0909040147300.6775@hixie.dreamhostps.com>
X-Mailer: Apple Mail (2.936)
X-Mailman-Approved-At: Thu, 10 Sep 2009 18:54:56 -0700
Cc: URI <uri@w3.org>, hybi@ietf.org, uri-review@ietf.org
Subject: Re: [hybi] [Uri-review] ws: and wss: schemes
X-BeenThere: hybi@ietf.org
X-Mailman-Version: 2.1.9
Precedence: list
List-Id: Server-Initiated HTTP <hybi.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/listinfo/hybi>, <mailto:hybi-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/hybi>
List-Post: <mailto:hybi@ietf.org>
List-Help: <mailto:hybi-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/hybi>, <mailto:hybi-request@ietf.org?subject=subscribe>
X-List-Received-Date: Fri, 04 Sep 2009 07:23:26 -0000

On Sep 4, 2009, at 12:33 AM, Ian Hickson wrote:

> On Fri, 14 Aug 2009, Julian Reschke wrote:
>>
>> [...] it now says:
>>
>>>   URI scheme syntax.
>>>      In ABNF terms using the terminals from the IRI specifications:
>>>      [RFC5238] [RFC3987]
>>>
>>>           "ws" ":" ihier-part [ "?" iquery ]
>>
>> That is even worse than before, because it now uses productions  
>> from the
>> IRI spec defining *URI* syntax.
>
> ws: and wss: URLs are i18n-aware; why would we want to limit them to
> ASCII?

URIs are not i18n-aware, you're thinking of IRIs. But since there is a  
standard mapping for IRIs, it's pretty clear what you actually want.  
The *URI* syntax should be:

   "ws" ":" heir-part [ "?" query ]


Then the encoding considerations should be something like:

   Because many characters are not permitted with this syntax, the
   "heir-part" and "query" elements may contain characters from the
   Unicode Character Set [UCS] as suggested by URI [RFC3986] using the
   reg-name and percent-encoding translations of IRI to URI
   mapping [RFC3937]. Translation is performed by first encoding those
   Unicode characters as octets to the UTF-8 character
   encoding [RFC3629]. Replace the reg-name part of the heir-part by
   the part converted using the ToASCII operation specified in section
   4.1 of [RFC3490] on each dot-separated label, and by using U+002E
   (FULL STOP) as a label separator, with the flag UseSTD3ASCIIRules
   set to TRUE, and with the flag AllowUnassigned set to TRUE. Then
   only those octets that do not correspond to characters in the
   unreserved set should be percent-encoded.

   By using UTF-8 encoding, there are no known compatibility issues
   with mapping Internationlized Resource Identifiers to websocket
   URIs according to [RFC3987].

>> Furthermore, it still doesn't answer what the semantics of these  
>> parts
>> are. What do "ihier-part" and "iquery" represent in a ws URI?
>
> This is defined by the RFC 3987, no? Surely we wouldn't want IRI
> components to have different meanings in different schemes?
>
>> What's the effect? How are they used?
>
> This is defined earlier in the Web Socket specification.

Section 3.1 Parsing Web Socket URLs seems to make the semantics pretty  
clear to me. How about adding "See Section 3.1" to URI scheme  
semantics portions of the IANA Considerations sections? Would that be  
sufficient?

Joseph Holsten