Re: [whatwg] New URL Standard from Anne van Kesteren on 2012-09-24 (public-whatwg-archive@w3.org from September 2012)

mike amundsen <mamund@yahoo.com> Tue, 23 October 2012 00:33 UTC

Return-Path: <mca@amundsen.com>
X-Original-To: ietf@ietfa.amsl.com
Delivered-To: ietf@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 000471F0C49 for <ietf@ietfa.amsl.com>; Mon, 22 Oct 2012 17:33:13 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -0.379
X-Spam-Level:
X-Spam-Status: No, score=-0.379 tagged_above=-999 required=5 tests=[AWL=-0.300, BAYES_00=-2.599, FM_FORGED_GMAIL=0.622, FORGED_YAHOO_RCVD=2.297, HTML_MESSAGE=0.001, J_CHICKENPOX_38=0.6, RCVD_IN_DNSWL_LOW=-1]
Received: from mail.ietf.org ([64.170.98.30]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id pJWFvtn2P6NZ for <ietf@ietfa.amsl.com>; Mon, 22 Oct 2012 17:33:12 -0700 (PDT)
Received: from mail-vb0-f44.google.com (mail-vb0-f44.google.com [209.85.212.44]) by ietfa.amsl.com (Postfix) with ESMTP id 99EF51F0429 for <ietf@ietf.org>; Mon, 22 Oct 2012 17:33:12 -0700 (PDT)
Received: by mail-vb0-f44.google.com with SMTP id fc26so4005408vbb.31 for <ietf@ietf.org>; Mon, 22 Oct 2012 17:33:11 -0700 (PDT)
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20120113; h=mime-version:sender:in-reply-to:references:from:date :x-google-sender-auth:message-id:subject:to:cc:content-type :x-gm-message-state; bh=Fmf+3zTslB/hiqB2nteJGIPSiOdO689Uc+GtKoTErlE=; b=Mgp6HbuA7/AOpepNFOzzULal2vAynoYyMU27sInfgRaxM+wrubQ78Wuzd+LcJfDDd5 VENToSrxMkX38odx9zuPSjY8pVn8CJw8ukiOUq+unuiPsVZ5B8tEf63yBEz8dNTaWvXt +TOHE5JSSqP+5F7Aisf0dkafh2lZm7VVfZHp7xQxo8b9NiO2qWTCZ0WJ1otnazwcuFvb IGkX6pkX29O6wHuylBYxma8h/fcMQhx0LWvuKzrLrBjJaY+mxweO1QpNOPrm72xCTqgx t2b1REY3fSDMxFXStsRlYR8FcVi4eF6Z94WaKqQ3schUoWY6h2d9ohciDONTC59VVOhB BICQ==
Received: by 10.52.93.238 with SMTP id cx14mr14301659vdb.42.1350952391441; Mon, 22 Oct 2012 17:33:11 -0700 (PDT)
MIME-Version: 1.0
Sender: mca@amundsen.com
Received: by 10.58.187.228 with HTTP; Mon, 22 Oct 2012 17:32:51 -0700 (PDT)
In-Reply-To: <CAHBU6iv29xAJvgVtcNVpYX=23tydt+mCOQegRiNLV=K8sCh-Cg@mail.gmail.com>
References: <50604C1A.7090901@gmx.de> <5060A964.5060001@stpeter.im> <Pine.LNX.4.64.1210172354500.2478@ps20323.dreamhostps.com> <507F5A7E.6040206@arcanedomain.com> <50856E3C.103@gmail.com> <Pine.LNX.4.64.1210221753010.2471@ps20323.dreamhostps.com> <0DBC8A11-319C-4120-975E-7E40FD5818BF@gbiv.com> <Pine.LNX.4.64.1210222137530.2471@ps20323.dreamhostps.com> <5085C4BA.2030505@gmx.de> <Pine.LNX.4.64.1210222220510.2471@ps20323.dreamhostps.com> <CAHBU6is8LNZ7Rq-vwLuOm+8ThKB9c=QPwbUfQwDQD5bDPjtf7w@mail.gmail.com> <Pine.LNX.4.64.1210222320070.2471@ps20323.dreamhostps.com> <09DC68AA-2DAD-4CB1-9CA9-799AF12B7BE2@mnot.net> <Pine.LNX.4.64.1210222337520.2471@ps20323.dreamhostps.com> <D9485C0B-E3DA-4B51-9A25-9EAB018C1951@mnot.net> <Pine.LNX.4.64.1210222359080.2471@ps20323.dreamhostps.com> <CAHBU6iv29xAJvgVtcNVpYX=23tydt+mCOQegRiNLV=K8sCh-Cg@mail.gmail.com>
From: mike amundsen <mamund@yahoo.com>
Date: Mon, 22 Oct 2012 20:32:51 -0400
X-Google-Sender-Auth: Ev1YBH3uKkdJFLFDOyVNLxCCQzA
Message-ID: <CAPW_8m6r38A+OB40FKqJ7LwEk3iUUZb=ZQsc2wvBR1h_KV7o0g@mail.gmail.com>
Subject: Re: [whatwg] New URL Standard from Anne van Kesteren on 2012-09-24 (public-whatwg-archive@w3.org from September 2012)
To: Tim Bray <tbray@textuality.com>
Content-Type: multipart/alternative; boundary="20cf3071d090667ed004ccaf1df1"
X-Gm-Message-State: ALoCoQmVyAfJavL//e8IsqNYrcW/3siU0IcCzLxcKbcgzXnrwUetLR/8NCtWgiR1tauRAKIzUkHH
X-Mailman-Approved-At: Tue, 23 Oct 2012 09:13:03 -0700
Cc: IETF Discussion <ietf@ietf.org>, Julian Reschke <julian.reschke@gmx.de>, "Roy T. Fielding" <fielding@gbiv.com>, Jan Algermissen <jan.algermissen@nordsc.com>, URI <uri@w3.org>, Noah Mendelsohn <nrm@arcanedomain.com>, Mark Nottingham <mnot@mnot.net>, Ian Hickson <ian@hixie.ch>
X-BeenThere: ietf@ietf.org
X-Mailman-Version: 2.1.12
Precedence: list
List-Id: IETF-Discussion <ietf.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/ietf>, <mailto:ietf-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/ietf>
List-Post: <mailto:ietf@ietf.org>
List-Help: <mailto:ietf-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/ietf>, <mailto:ietf-request@ietf.org?subject=subscribe>
X-List-Received-Date: Tue, 23 Oct 2012 00:33:14 -0000

<snip>
I’d have to say that URI interoperability problems haven’t come near
getting into the list of top-20 pain points.
</snip>
I can't recall the last time i experienced "URI interoperability problems"
across various user agents/implementations on the public Internet. My
problems w/ browser implementations is another thing.

In this particular case (URIs), I applaud Anne's attempt to fix the broken
way browsers handle these strings within their own executable code (i.e
"browsers (apart from Chrome) do not unescape URL escapes."[1]).

However, I find the announcement that he plans to change the names of
things along the way (i.e. "And yes, the plan is to do away with IRI/URI
and just call them all URLs"[1]) a waste of time.

Fixing the code can happen regardless of naming. Do that and do it now.

Thanks.

[1] http://annevankesteren.nl/2012/09/url-equivalence


mca+1.859.757.1449
skype: mca.amundsen
http://amundsen.com/blog/
http://twitter.com/mamund
https://github.com/mamund
http://www.linkedin.com/in/mikeamundsen


On Mon, Oct 22, 2012 at 8:15 PM, Tim Bray <tbray@textuality.com> wrote:

> One more data point... I work on Web software all the time and have for
> many years; in recent years mostly at the REST (app-to-app HTTP
> conversations) rather than browser-wrangling level.  I’d have to say that
> URI interoperability problems haven’t come near getting into the list of
> top-20 pain points.   So either my experience is wildly untypical, or maybe
> it’s a combination of being a little bit lucky, and that the pain which
> exists is highly concentrated in the browser space.  -T
>
> On Mon, Oct 22, 2012 at 5:05 PM, Ian Hickson <ian@hixie.ch> wrote:
>
>> On Tue, 23 Oct 2012, Mark Nottingham wrote:
>> > On 23/10/2012, at 10:40 AM, Ian Hickson <ian@hixie.ch> wrote:
>> > > On Tue, 23 Oct 2012, Mark Nottingham wrote:
>> > >>
>> > >> Don't much care about the venue, as long as there's *some*
>> > >> coordination / communication.
>> > >
>> > > Everyone is welcome to participate in the WHATWG list.
>> >
>> > As they are on the IETF list. The difference is that the WHATWG is run
>> > by an unelected board of "members" - <http://www.whatwg.org/charter>.
>>
>> "Run" is a bit of a strong word. There's basically no non-public activity
>> from the charter members.
>>
>>
>> > > Anne's spec will define "valid URL", which addressed that need.
>> >
>> > Why not define (or reuse) a separate term for the input stream, and
>> > leave "URL" alone?
>>
>> Because everyone calls these things URLs (except STD 66).
>>
>>
>> > >> Browser implementers may not care, but it's pretty obvious that lots
>> > >> of other people do.
>> > >
>> > > Browser implementors aren't particularly special here.
>> >
>> > No, but your arguments are often coloured by your perspective -- just as
>> > everyone else's are.
>>
>> Which arguments in particular are we talking about here? I've mostly been
>> talking about curl, wget, GoogleBot, Perl libraries, etc.
>>
>>
>> > If I believed that Anne was willing to and capable of re-specifying
>> > RFC3986 in such a way that the definition, syntax and semantics of URLs
>> > (or whatever they ends up being called) doesn't change at all, I'd be
>> > less concerned.
>> >
>> > However, that doesn't seem very likely, especially when he isn't
>> > engaging with the folks that wrote that spec (especially, Roy).
>> >
>> > RFC3986 is referenced by a LOT of technologies, not just Web browsers,
>> > not just HTML. Replacing it unilaterally with input from the browser /
>> > HTML community from an implementer perspective is very likely to break
>> > most of them.
>>
>> I suspect it will break nothing, but I guess we'll find out.
>>
>> I don't really understand how it _could_ break anything, so long as the
>> processing of IRI and URIs as defined by IETF is the same in the WHATWG
>> spec, except where software already differs with the IETF specs.
>>
>> Do you have a concrete example I could study?
>>
>>
>> > As such, they won't use your new spec, and we'll be living in a world
>> > where there will be two definitions of "URL" -- the IETF one and the
>> > WHATWG one [...].
>> >
>> > That seems a pretty bad tradeoff for the benefits you're getting -- a
>> > slightly easier-to-read spec for browser implementers (a relatively tiny
>> > audience).
>>
>> If you have any concrete concerns, please don't hesitate to e-mail the
>> WHATWG list, showing the specific examples you're worried about. Browsers
>> are but one of many implementation classes that are relevant.
>>
>>
>> --
>> Ian Hickson               U+1047E                )\._.,--....,'``.    fL
>> http://ln.hixie.ch/       U+263A                /,   _.. \   _\  ;`._ ,.
>> Things that are impossible just take longer.   `._.-(,_..'--(,_..'`-.;.'
>>
>
>