Re: [apps-discuss] Fwd: FW: New Version Notification for draft-kerwin-file-scheme-13.txt

Sam Ruby <rubys@intertwingly.net> Sat, 10 January 2015 23:32 UTC

Return-Path: <rubys@intertwingly.net>
X-Original-To: apps-discuss@ietfa.amsl.com
Delivered-To: apps-discuss@ietfa.amsl.com
Received: from localhost (ietfa.amsl.com [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 42B431A0231 for <apps-discuss@ietfa.amsl.com>; Sat, 10 Jan 2015 15:32:23 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.9
X-Spam-Level:
X-Spam-Status: No, score=-1.9 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, RCVD_IN_DNSWL_NONE=-0.0001] autolearn=ham
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id krVR9piagVw8 for <apps-discuss@ietfa.amsl.com>; Sat, 10 Jan 2015 15:32:21 -0800 (PST)
Received: from cdptpa-oedge-vip.email.rr.com (cdptpa-outbound-snat.email.rr.com [107.14.166.226]) by ietfa.amsl.com (Postfix) with ESMTP id 141181A0076 for <apps-discuss@ietf.org>; Sat, 10 Jan 2015 15:32:21 -0800 (PST)
Received: from [98.27.51.253] ([98.27.51.253:59312] helo=rubix) by cdptpa-oedge03 (envelope-from <rubys@intertwingly.net>) (ecelerity 3.5.0.35861 r(Momo-dev:tip)) with ESMTP id DD/A6-03035-486B1B45; Sat, 10 Jan 2015 23:32:20 +0000
Received: from [192.168.1.115] (unknown [192.168.1.115]) (using TLSv1 with cipher ECDHE-RSA-AES128-SHA (128/128 bits)) (No client certificate requested) (Authenticated sender: rubys) by rubix (Postfix) with ESMTPSA id 62943140A15; Sat, 10 Jan 2015 18:32:20 -0500 (EST)
Message-ID: <54B1B682.3070609@intertwingly.net>
Date: Sat, 10 Jan 2015 18:32:18 -0500
From: Sam Ruby <rubys@intertwingly.net>
User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:31.0) Gecko/20100101 Thunderbird/31.3.0
MIME-Version: 1.0
To: Sean Leonard <dev+ietf@seantek.com>, apps-discuss@ietf.org
References: <20140926010029.26660.82167.idtracker@ietfa.amsl.com><EAACE200D9B0224D94BF52CF2DD166A425A68A90@ex10mb6.qut.edu.au><CACweHNBEYRFAuw9-vfeyd_wf703cvM3ykZoRMqAokRFYG_O7hQ@mail.gmail.com><DM2PR0201MB09602B351692D424A49C6B0DC3650@DM2PR0201MB0960.namprd02.prod.outlook.com><CACweHNBN_Bv=jeXQ_VwXi2HzHKNEwZJ1NiF-BJJo_9-mhO60gQ@mail.gmail.com><54A557E1.6050502@intertwingly.net> <CACweHNCQZg1U1u8U=-f6h0+BPnp6Wr_T=r_wGiPAbhTbuMCGWQ@mail.gmail.com> <54A94109.5010901@intertwingly.net> <00cf01d02cc7$d5dba4c0$4001a8c0@gateway.2wire.net> <54B16C2B.9050604@seantek.com> <54B17BBE.4000900@intertwingly.net> <54B18B61.8010308@seantek.com> <54B19435.8070401@intertwingly.net> <54B1B211.3050807@seantek.com>
In-Reply-To: <54B1B211.3050807@seantek.com>
Content-Type: text/plain; charset="utf-8"; format="flowed"
Content-Transfer-Encoding: 7bit
X-RR-Connecting-IP: 107.14.168.142:25
X-Cloudmark-Score: 0
Archived-At: <http://mailarchive.ietf.org/arch/msg/apps-discuss/ZtMDvyImIZ9xrCNFSw7UyBZGUCM>
Subject: Re: [apps-discuss] Fwd: FW: New Version Notification for draft-kerwin-file-scheme-13.txt
X-BeenThere: apps-discuss@ietf.org
X-Mailman-Version: 2.1.15
Precedence: list
List-Id: General discussion of application-layer protocols <apps-discuss.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/apps-discuss>, <mailto:apps-discuss-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/apps-discuss/>
List-Post: <mailto:apps-discuss@ietf.org>
List-Help: <mailto:apps-discuss-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/apps-discuss>, <mailto:apps-discuss-request@ietf.org?subject=subscribe>
X-List-Received-Date: Sat, 10 Jan 2015 23:32:23 -0000

On 01/10/2015 06:13 PM, Sean Leonard wrote:
> On 1/10/2015 1:05 PM, Sam Ruby wrote:
>>
>> Permit me to introduce myself.  I'm one of the co-chairs of the W3C
>> HTML Working group.  I am indeed aware of this work.
>
> Pleased to meet you!

Likewise!

> [various other text]
>
> I believe that the relevant prompting text is:
> ---
> I'm working on a specification how strings that can found in the href
> attribute of HTML elements like <link>, <base>, and <a>.  Some of
> those strings may start with "file:".
>
> I think it would be a good idea for us to explore whether there is a
> relationship between those strings and RFC 3986.  I'll go further and
> state that it is in nobody's best interest for there to be overlapping
> and conflicting standards in this area.
> ---
>
> Let us agree that RFC 3986 === "URI". A URI is supposed to be made up of
> characters *entirely within* the repertoire I previously posted (taken
> from RFC 3986).

Let's not.  I believe that there needs to be a RFC 3986bis created, and 
therefore a Working Group chartered.

As a concrete example, I believe that the repertoire of characters 
available for fragment identifiers should be expanded.

And related to the original discussion, domains containing Unicode 
characters do now exist, and IDNA specifies the mapping of those 
characters to the set of characters that RFC 3986 allows.  RFC 3986 
needs to be updated to cover this.

> The issue at hand is that a lot of software uses Unicode, and in
> particular, there are protocol slots and data entry fields that are
> labeled "URI" or "URL" in some fashion, but do not limit themselves to
> the RFC 3986 repertoire. I.e., in <a href=""> etc. you can have Unicode
> characters, so the question is, what should you do with them?
>
> That is a very interesting question and one that I do not particularly
> care to wade into. So, I guess I am sidestepping the prompt. :)
>
> But the subject of this thread is about registering the file: URI scheme
> under the current rules in force, which are RFC 3986 and RFC 4395. (And,
> I suppose, RFC 3987.)
>
> So it seems to me that the first and "most important" part of
> draft-kerwin-file-scheme should clearly state how file: URIs work,
> according to the character repertoire permitted by RFC 3986.

This very question changes when the possibility of a RFC 3986bis enters 
the discussion.

> Then if the author wants to get into the other issues, i.e., dealing
> with the presence of non-URI characters (including Unicode characters
> that keep the string following IRI), that can be done in a subsequent
> section. (Which can be a "normative" section of the main text, if we can
> get agreement on all of that.)
>
> Note that RFC 4395 says:
>
>       2.6 <http://tools.ietf.org/html/rfc4395#section-2.6>.
>       Internationalization and Character Encoding
>
>     When describing URI schemes in which (some of) the elements of the
>     URI are actually representations of human-readable text, care should
>     be taken not to introduce unnecessary variety in the ways in which
>     characters are encoded into octets and then into URI characters; see
>     RFC 3987  <http://tools.ietf.org/html/rfc3987>  [6
> <http://tools.ietf.org/html/rfc4395#ref-6>] andSection 2.5 of RFC 3986
> <http://tools.ietf.org/html/rfc3986#section-2.5>  [5
> <http://tools.ietf.org/html/rfc4395#ref-5>] for guidelines.  If URIs
>     of a scheme contain any text fields, the scheme definition MUST
>     describe the ways in which characters are encoded, and any
>     compatibility issues with IRIs of the scheme.
>
> ******
>
> The text says not to introduce "unnecessary variety", but in the file:
> case, maybe some variety is natural and therefore necessary.

I actually would argue that the advice RFC 4395 provides be followed. 
And that's precisely why IDNA is relevant here.

> Sean

- Sam Ruby