Re: [whatwg] New URL Standard from Anne van Kesteren on 2012-09-24 (public-whatwg-archive@w3.org from September 2012)

Ian Hickson <ian@hixie.ch> Tue, 23 October 2012 00:05 UTC

Date: Tue, 23 Oct 2012 00:05:09 +0000
From: Ian Hickson <ian@hixie.ch>
To: Mark Nottingham <mnot@mnot.net>
Subject: Re: [whatwg] New URL Standard from Anne van Kesteren on 2012-09-24 (public-whatwg-archive@w3.org from September 2012)
In-Reply-To: <D9485C0B-E3DA-4B51-9A25-9EAB018C1951@mnot.net>
Message-ID: <Pine.LNX.4.64.1210222359080.2471@ps20323.dreamhostps.com>
References: <50604C1A.7090901@gmx.de> <5060A964.5060001@stpeter.im> <Pine.LNX.4.64.1210172354500.2478@ps20323.dreamhostps.com> <507F5A7E.6040206@arcanedomain.com> <50856E3C.103@gmail.com> <Pine.LNX.4.64.1210221753010.2471@ps20323.dreamhostps.com> <0DBC8A11-319C-4120-975E-7E40FD5818BF@gbiv.com> <Pine.LNX.4.64.1210222137530.2471@ps20323.dreamhostps.com> <5085C4BA.2030505@gmx.de> <Pine.LNX.4.64.1210222220510.2471@ps20323.dreamhostps.com> <CAHBU6is8LNZ7Rq-vwLuOm+8ThKB9c=QPwbUfQwDQD5bDPjtf7w@mail.gmail.com> <Pine.LNX.4.64.1210222320070.2471@ps20323.dreamhostps.com> <09DC68AA-2DAD-4CB1-9CA9-799AF12B7BE2@mnot.net> <Pine.LNX.4.64.1210222337520.2471@ps20323.dreamhostps.com> <D9485C0B-E3DA-4B51-9A25-9EAB018C1951@mnot.net>
Content-Language: en-GB-hixie
Content-Style-Type: text/css
MIME-Version: 1.0
Content-Type: TEXT/PLAIN; charset="US-ASCII"
X-Mailman-Approved-At: Tue, 23 Oct 2012 09:13:03 -0700
Cc: IETF Discussion <ietf@ietf.org>, Julian Reschke <julian.reschke@gmx.de>, "Roy T. Fielding" <fielding@gbiv.com>, Jan Algermissen <jan.algermissen@nordsc.com>, Noah Mendelsohn <nrm@arcanedomain.com>, URI <uri@w3.org>, Tim Bray <tbray@textuality.com>

On Tue, 23 Oct 2012, Mark Nottingham wrote:
> On 23/10/2012, at 10:40 AM, Ian Hickson <ian@hixie.ch> wrote:
> > On Tue, 23 Oct 2012, Mark Nottingham wrote:
> >> 
> >> Don't much care about the venue, as long as there's *some* 
> >> coordination / communication.
> > 
> > Everyone is welcome to participate in the WHATWG list.
> 
> As they are on the IETF list. The difference is that the WHATWG is run 
> by an unelected board of "members" - <http://www.whatwg.org/charter>.

"Run" is a bit of a strong word. There's basically no non-public activity 
from the charter members.


> > Anne's spec will define "valid URL", which addressed that need.
> 
> Why not define (or reuse) a separate term for the input stream, and 
> leave "URL" alone?

Because everyone calls these things URLs (except STD 66).


> >> Browser implementers may not care, but it's pretty obvious that lots 
> >> of other people do.
> > 
> > Browser implementors aren't particularly special here.
> 
> No, but your arguments are often coloured by your perspective -- just as 
> everyone else's are.

Which arguments in particular are we talking about here? I've mostly been 
talking about curl, wget, GoogleBot, Perl libraries, etc.


> If I believed that Anne was willing to and capable of re-specifying 
> RFC3986 in such a way that the definition, syntax and semantics of URLs 
> (or whatever they end up being called) don't change at all, I'd be 
> less concerned.
> 
> However, that doesn't seem very likely, especially when he isn't 
> engaging with the folks that wrote that spec (especially Roy).
> 
> RFC3986 is referenced by a LOT of technologies, not just Web browsers, 
> not just HTML. Replacing it unilaterally with input from the browser / 
> HTML community from an implementer perspective is very likely to break 
> most of them.

I suspect it will break nothing, but I guess we'll find out.

I don't really understand how it _could_ break anything, so long as the 
processing of IRIs and URIs as defined by the IETF is the same in the WHATWG 
spec, except where software already differs from the IETF specs.

Do you have a concrete example I could study?


> As such, they won't use your new spec, and we'll be living in a world 
> where there will be two definitions of "URL" -- the IETF one and the 
> WHATWG one [...].
> 
> That seems a pretty bad tradeoff for the benefits you're getting -- a 
> slightly easier-to-read spec for browser implementers (a relatively tiny 
> audience).

If you have any concrete concerns, please don't hesitate to e-mail the 
WHATWG list, showing the specific examples you're worried about. Browsers 
are but one of many implementation classes that are relevant.

-- 
Ian Hickson               U+1047E                )\._.,--....,'``.    fL
http://ln.hixie.ch/       U+263A                /,   _.. \   _\  ;`._ ,.
Things that are impossible just take longer.   `._.-(,_..'--(,_..'`-.;.'