Re: [whatwg] New URL Standard from Anne van Kesteren on 2012-09-24 (public-whatwg-archive@w3.org from September 2012)

Ian Hickson <ian@hixie.ch> Mon, 22 October 2012 21:47 UTC

Return-Path: <ian@hixie.ch>
X-Original-To: ietf@ietfa.amsl.com
Delivered-To: ietf@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 8492D11E80E4 for <ietf@ietfa.amsl.com>; Mon, 22 Oct 2012 14:47:01 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.599
X-Spam-Level:
X-Spam-Status: No, score=-2.599 tagged_above=-999 required=5 tests=[AWL=0.000, BAYES_00=-2.599]
Received: from mail.ietf.org ([64.170.98.30]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id EBVei4KTZCZa for <ietf@ietfa.amsl.com>; Mon, 22 Oct 2012 14:47:00 -0700 (PDT)
Received: from homiemail-a57.g.dreamhost.com (caibbdcaaaaf.dreamhost.com [208.113.200.5]) by ietfa.amsl.com (Postfix) with ESMTP id ABC141F042B for <ietf@ietf.org>; Mon, 22 Oct 2012 14:46:56 -0700 (PDT)
Received: from homiemail-a57.g.dreamhost.com (localhost [127.0.0.1]) by homiemail-a57.g.dreamhost.com (Postfix) with ESMTP id 0F793208073; Mon, 22 Oct 2012 14:46:55 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha1; c=relaxed; d=hixie.ch; h=date:from:to :cc:subject:in-reply-to:message-id:references:mime-version: content-type; s=hixie.ch; bh=fSpWlUw2oz9502BQQIxlxr2Z34w=; b=acN yEZFH48HITssQv3fc/yUw2BZ4yv1D9cDz66ylwv9yD3livYeoT8rq9XyNe53Ac0A yt1JszMe4FPUaMKHdbHmsUtZrLIb66udBCqTWsYp9PcsCis8vxgegVa3sBGT7S96 j31s2Az17qsr0J/oOhJMF/cuEoOugTzivX1WllUk=
Received: from ps20323.dreamhostps.com (ps20323.dreamhost.com [69.163.222.251]) (using TLSv1 with cipher DHE-RSA-AES256-SHA (256/256 bits)) (No client certificate requested) (Authenticated sender: internal@index.hixie.ch) by homiemail-a57.g.dreamhost.com (Postfix) with ESMTPSA id 0281F208070; Mon, 22 Oct 2012 14:46:55 -0700 (PDT)
Date: Mon, 22 Oct 2012 21:46:56 +0000
From: Ian Hickson <ian@hixie.ch>
To: Julian Reschke <julian.reschke@gmx.de>, "Roy T. Fielding" <fielding@gbiv.com>
Subject: Re: [whatwg] New URL Standard from Anne van Kesteren on 2012-09-24 (public-whatwg-archive@w3.org from September 2012)
In-Reply-To: <0DBC8A11-319C-4120-975E-7E40FD5818BF@gbiv.com>
Message-ID: <Pine.LNX.4.64.1210222137530.2471@ps20323.dreamhostps.com>
References: <50604C1A.7090901@gmx.de> <5060A964.5060001@stpeter.im> <Pine.LNX.4.64.1210172354500.2478@ps20323.dreamhostps.com> <507F5A7E.6040206@arcanedomain.com> <50856E3C.103@gmail.com> <Pine.LNX.4.64.1210221753010.2471@ps20323.dreamhostps.com> <0DBC8A11-319C-4120-975E-7E40FD5818BF@gbiv.com>
Content-Language: en-GB-hixie
Content-Style-Type: text/css
MIME-Version: 1.0
Content-Type: TEXT/PLAIN; charset="US-ASCII"
X-Mailman-Approved-At: Tue, 23 Oct 2012 09:13:03 -0700
Cc: Noah Mendelsohn <nrm@arcanedomain.com>, mnot@mnot.net, URI <uri@w3.org>, IETF Discussion <ietf@ietf.org>
X-BeenThere: ietf@ietf.org
X-Mailman-Version: 2.1.12
Precedence: list
List-Id: IETF-Discussion <ietf.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/ietf>, <mailto:ietf-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/ietf>
List-Post: <mailto:ietf@ietf.org>
List-Help: <mailto:ietf-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/ietf>, <mailto:ietf-request@ietf.org?subject=subscribe>
X-List-Received-Date: Mon, 22 Oct 2012 21:47:01 -0000

On Mon, 22 Oct 2012, Julian Reschke wrote:
> > 
> > I couldn't agree more! We've been waiting for four years for the URI 
> > working group to get their act together and fix the URL mess. Nothing 
> > has happened. We lost patience and are now doing it ourselves. ...
> 
> Clarifying: there is no URI Working Group, and as far as I can tell, 

Whoever. The people complaining that it should be done at the IETF haven't 
done any work. That's the complaint. Until they do the work, complaining 
that we're doing it instead is going to fall on deaf ears and be met with 
the rolling of eyeballs.


> there is no consensus that there is a "mess" to fix related to URIs.

The specs don't define everything that implementations have to do to be 
interoperable. If the IETF doesn't think that's a problem, then that's 
fine, but then y'all shouldn't be surprised when people who _do_ think 
that's a problem try and fix it.


On Mon, 22 Oct 2012, Roy T. Fielding wrote:
> 
> What you are insisting on defining as a "URL" is the input to the 
> process of making a hypertext reference (the arbitrary string typed into 
> a dialog or placed inside an href/src attribute)

Or placed on a command line to wget(1), or put in an RDFa triple store, or 
in transmitted in an HTTP Location: header, or...


> whereas the IETF standards define the output of that process as a 
> uniform addressing syntax for use on the Internet by every application 
> that makes use of Web addresses.

That's what Anne is specifying, because, despite your claims, STD 66 
doesn't actually define that. It only defines processing for valid 
strings, not invalid ones.


> Browsers implement both the input processing and the output URI 
> standards.  HTML must define the input processing, either within the 
> spec or by reference to a new spec.

This has nothing particularly to do with HTML, HTML is just one of many 
many contexts in which URLs are found.


> "" is not a URL.

Whether it's a valid URL or not is besides the point (it's like people 
claiming that invalid XML files aren't XML, a pointless argument). We 
still have to define the processing for such a string when software is to 
process it as a URL.


> > I'm sure Anne would love nothing more than to be able to work on 
> > something more interesting that this. But at the end of the day, 
> > someone has to do it, and y'all aren't doing it.
> > 
> > This should not come as a surprise to anyone, the IETF and W3C have 
> > been discussing this matter at last as far back as 2008.
> 
> Yes, we have been discussing it since 1994.

19 years of not fixing the problem, then. Shouldn't come as a surprise 
that someone has finally gotten around to doing it.


> It would be nice if you would take the advice already received and 
> define references in HTML, including the algorithms for converting them 
> into URI references (for DOM and network usage) and IRI references (for 
> display).

As far as HTML goes, my plan is to remove all references to STD 66 and 
rely entirely on Anne's work, so that there doesn't have to be any 
preprocessing nonsense. Then the STD 66 RFCs are entirely irrelevant.

-- 
Ian Hickson               U+1047E                )\._.,--....,'``.    fL
http://ln.hixie.ch/       U+263A                /,   _.. \   _\  ;`._ ,.
Things that are impossible just take longer.   `._.-(,_..'--(,_..'`-.;.'