Re: [whatwg] New URL Standard from Anne van Kesteren on 2012-09-24 (public-whatwg-archive@w3.org from September 2012)

David Sheets <kosmo.zb@gmail.com> Wed, 24 October 2012 03:49 UTC

MIME-Version: 1.0
In-Reply-To: <Pine.LNX.4.64.1210232348110.2471@ps20323.dreamhostps.com>
References: <50604C1A.7090901@gmx.de> <5060A964.5060001@stpeter.im> <Pine.LNX.4.64.1210172354500.2478@ps20323.dreamhostps.com> <507F5A7E.6040206@arcanedomain.com> <50856E3C.103@gmail.com> <Pine.LNX.4.64.1210221753010.2471@ps20323.dreamhostps.com> <0DBC8A11-319C-4120-975E-7E40FD5818BF@gbiv.com> <Pine.LNX.4.64.1210222137530.2471@ps20323.dreamhostps.com> <CA+9kkMDpEZCvcG1DJd=O1qPNV+=+GTBeN+CGndUe51Xym_A9sg@mail.gmail.com> <Pine.LNX.4.64.1210232115210.2471@ps20323.dreamhostps.com> <15E1D98B-8883-4936-81A9-174E1323683C@nordsc.com> <CAGKvQ5ZV6_GMVgjEezhR-oKqSikxR7GYgacMitbfczmNh725mw@mail.gmail.com> <Pine.LNX.4.64.1210232348110.2471@ps20323.dreamhostps.com>
Date: Tue, 23 Oct 2012 20:49:52 -0700
Message-ID: <CAAWM5Tz3NdprjqwgyoVoV9qUuiwXb2gTQ49u4a4ePGfjyusDkw@mail.gmail.com>
Subject: Re: [whatwg] New URL Standard from Anne van Kesteren on 2012-09-24 (public-whatwg-archive@w3.org from September 2012)
From: David Sheets <kosmo.zb@gmail.com>
To: Ian Hickson <ian@hixie.ch>
Content-Type: text/plain; charset="ISO-8859-1"
Cc: Christophe Lauret <clauret@weborganic.com>, Jan Algermissen <jan.algermissen@nordsc.com>, Ted Hardie <ted.ietf@gmail.com>, URI <uri@w3.org>, IETF Discussion <ietf@ietf.org>
Precedence: list

On Tue, Oct 23, 2012 at 4:51 PM, Ian Hickson <ian@hixie.ch> wrote:
> On Wed, 24 Oct 2012, Christophe Lauret wrote:
>>
>> As a Web developer who's had to write code multiple times to handle URIs
>> in very different contexts, I actually *like* the constraints in STD 66,
>> there are many instances where it is simpler to assume that the error
>> handling has been done prior and simply reject an invalid URI.
>
> I think we can agree that the error handling should be, at the option of
> the software developer, either to handle the input as defined by the
> spec's algorithms, or to abort and not handle the input at all.

Yes, input is handled according to the specs' algorithmS.

>> But why not do it as a separate spec?
>
> Having multiple specs means an implementor has to refer to multiple specs
> to implement one algorithm, which is not a way to get interoperability.
> Bugs creep in much faster when implementors have to switch between specs
> just in the implementation of one algorithm.

One algorithm? There seem to be several functions...

- URI reference parsing (parse : scheme -> string -> raw uri_ref)
- URI reference normalization (normalize : raw uri_ref -> normal uri_ref)
- absolute URI predicate (absp : normal uri_ref -> absolute uri_ref option)
- URI resolution (resolve : absolute uri_ref -> _ uri_ref -> absolute uri_ref)

Of course, some of these may be composed in any given implementation.
In the case of a/@href and img/@src, it appears to be something like
(one_algorithm = (resolve base_uri) . normalize . parse (scheme
base_uri)) is in use.

A good way to get interop is to thoroughly define each function and
supply implementors with test cases for each processing stage
(one_algorithm's test cases define some tests for parse, normalize,
and resolve as well).

Some systems use more than the simple function composition of web browsers...

>> Increasing the space of valid addresses, when the set of addressable
>> resources is not actually increasing only means more complex parsing rules.
>
> I'm not saying we should increase the space of valid addresses.

Anne's current draft increases the space of valid addresses. This
isn't obvious as Anne's draft lacks a grammar and URI component
alphabets. You support Anne's draft and its philosophy, therefore you
are saying the space of valid addresses should be expanded.

Here is an example of a grammar extension that STD 66 disallows but
WHATWGRL allows:
<http://www.rfc-editor.org/errata_search.php?rfc=3986&eid=3330>

> The de facto parsing rules are already complicated by de facto requirements for
> handling errors, so defining those doesn't increase complexity either
> (especially if such behaviour is left as optional, as discussed above.)

*parse* is separate from *normalize* is separate from checking if a
reference is absolute (*absp*) is separate from *resolve*.

Why don't we have a discussion about the functions and types involved
in URI processing?

Why don't we discuss expanding allowable alphabets and production rules?

David

Re: [whatwg] New URL Standard from Anne van Keste… Julian Reschke
Re: [whatwg] New URL Standard from Anne van Keste… Peter Saint-Andre
Re: [whatwg] New URL Standard from Anne van Keste… SM
Re: [whatwg] New URL Standard from Anne van Keste… Brian E Carpenter
Re: [whatwg] New URL Standard from Anne van Keste… John C Klensin
Re: [whatwg] New URL Standard from Anne van Keste… Ian Hickson
Re: [whatwg] New URL Standard from Anne van Keste… Noah Mendelsohn
Re: [whatwg] New URL Standard from Anne van Keste… Ian Hickson
Re: [whatwg] New URL Standard from Anne van Keste… Brian E Carpenter
Re: [whatwg] New URL Standard from Anne van Keste… Julian Reschke
Re: [whatwg] New URL Standard from Anne van Keste… Tim Bray
Re: [whatwg] New URL Standard from Anne van Keste… Julian Reschke
Re: [whatwg] New URL Standard from Anne van Keste… Mark Nottingham
Re: [whatwg] New URL Standard from Anne van Keste… Mark Nottingham
Re: [whatwg] New URL Standard from Anne van Keste… Tim Bray
Re: [whatwg] New URL Standard from Anne van Keste… Mark Nottingham
Re: [whatwg] New URL Standard from Anne van Keste… Mark Nottingham
Re: [whatwg] New URL Standard from Anne van Keste… Tim Bray
Re: [whatwg] New URL Standard from Anne van Keste… Julian Reschke
Re: [whatwg] New URL Standard from Anne van Keste… Ian Hickson
Re: [whatwg] New URL Standard from Anne van Keste… Noah Mendelsohn
Re: [whatwg] New URL Standard from Anne van Keste… Roy T. Fielding
Re: [whatwg] New URL Standard from Anne van Keste… Ian Hickson
Re: [whatwg] New URL Standard from Anne van Keste… Jan Algermissen
Re: [whatwg] New URL Standard from Anne van Keste… Ian Hickson
Re: [whatwg] New URL Standard from Anne van Keste… James M Snell
Re: [whatwg] New URL Standard from Anne van Keste… Ian Hickson
Re: [whatwg] New URL Standard from Anne van Keste… Mark Nottingham
Re: [whatwg] New URL Standard from Anne van Keste… Ian Hickson
Re: [whatwg] New URL Standard from Anne van Keste… Ian Hickson
Re: [whatwg] New URL Standard from Anne van Keste… Ian Hickson
Re: [whatwg] New URL Standard from Anne van Keste… Mark Nottingham
Re: [whatwg] New URL Standard from Anne van Keste… Ian Hickson
Re: [whatwg] New URL Standard from Anne van Keste… Ian Hickson
Re: [whatwg] New URL Standard from Anne van Keste… mike amundsen
Re: [whatwg] New URL Standard from Anne van Keste… Ian Hickson
Re: [whatwg] New URL Standard from Anne van Keste… Graham Klyne
Re: [whatwg] New URL Standard from Anne van Keste… John Cowan
websockets in the IETF, was: [whatwg] New URL Sta… Julian Reschke
Re: [whatwg] New URL Standard from Anne van Keste… Ted Hardie
Re: [whatwg] New URL Standard from Anne van Keste… Ted Hardie
Re: [whatwg] New URL Standard from Anne van Keste… Ted Hardie
Re: [whatwg] New URL Standard from Anne van Keste… Jari Arkko
Re: [whatwg] New URL Standard from Anne van Keste… Brian E Carpenter
Re: [whatwg] New URL Standard from Anne van Keste… Stephen Farrell
Re: [whatwg] New URL Standard from Anne van Keste… Ian Hickson
Re: websockets in the IETF, was: [whatwg] New URL… Ian Hickson
Re: websockets in the IETF, was: [whatwg] New URL… James M Snell
Re: [whatwg] New URL Standard from Anne van Keste… Ian Hickson
Re: websockets in the IETF, was: [whatwg] New URL… Peter Saint-Andre
Re: [whatwg] New URL Standard from Anne van Keste… Jan Algermissen
Re: [whatwg] New URL Standard from Anne van Keste… Ian Hickson
Re: [whatwg] New URL Standard from Anne van Keste… Jan Algermissen
Re: [whatwg] New URL Standard from Anne van Keste… Christophe Lauret
Re: [whatwg] New URL Standard from Anne van Keste… Ian Hickson
Re: [whatwg] New URL Standard from Anne van Keste… Ian Hickson
Re: [whatwg] New URL Standard from Anne van Keste… Ian Hickson
RE: [whatwg] New URL Standard from Anne van Keste… Manger, James H
Re: [whatwg] New URL Standard from Anne van Keste… David Sheets
Re: [whatwg] New URL Standard from Anne van Keste… John Cowan
Re: [whatwg] New URL Standard from Anne van Keste… David Sheets
RE: [whatwg] New URL Standard from Anne van Keste… Ian Hickson
RE: [whatwg] New URL Standard from Anne van Keste… Manger, James H
Re: [whatwg] New URL Standard from Anne van Keste… Anne van Kesteren
Re: [whatwg] New URL Standard from Anne van Keste… Jan Algermissen
Re: [whatwg] New URL Standard from Anne van Keste… John C Klensin
Re: [whatwg] New URL Standard from Anne van Keste… Carsten Bormann
Re: [whatwg] New URL Standard from Anne van Keste… Melinda Shore
Re: [whatwg] New URL Standard from Anne van Keste… Ian Hickson
Re: [whatwg] New URL Standard from Anne van Keste… Roy T. Fielding
Re: [whatwg] New URL Standard from Anne van Keste… David Morris
Re: [whatwg] New URL Standard from Anne van Keste… David Sheets
Re: [whatwg] New URL Standard from Anne van Keste… Anne van Kesteren
Re: [whatwg] New URL Standard from Anne van Keste… Roberto Peon