Re: I18N Concensus - Generic Syntax Document
"Roy T. Fielding" <fielding@kiwi.ics.uci.edu> Fri, 07 March 1997 16:30 UTC
To: "Martin J. Duerst" <mduerst@ifi.unizh.ch>
Cc: URI List <uri@bunyip.com>
Subject: Re: I18N Concensus - Generic Syntax Document
In-Reply-To: Your message of "Fri, 07 Mar 1997 14:50:36 +0100." <Pine.SUN.3.95q.970307134328.245D-100000@enoshima>
Date: Fri, 07 Mar 1997 07:28:58 -0800
From: "Roy T. Fielding" <fielding@kiwi.ics.uci.edu>
Message-Id: <9703070729.aa02583@paris.ics.uci.edu>
Sender: owner-uri@bunyip.com
Precedence: bulk
>> >+ It is recommended that UTF-8 [RFC 2044] be used to represent characters
>> >+ with octets in URLs, wherever possible.
>> >
>> >+ For schemes where no single character->octet encoding is specified,
>> >+ a gradual transition to UTF-8 can be made by servers make resources
>> >+ available with UTF-8 names on their own, on a per-server or a
>> >+ per-resource basis. Schemes and mechanisms that use a well-
>> >+ defined character->octet encoding which is however not UTF-8 should
>> >+ define the mapping between this encoding and UTF-8, because generic
>> >+ URL software is unlikely to be aware of and to be able to handle
>> >+ such specific conventions.
>>
>> Here is where you lose me.
>
>Don't worry. I hope we will have you back soon again :-).
>
>> I have no desire to add a UTF-8 character
>> mapping table to our server.
>
>There is no need to do so. The above is only a *recommendation*.

Sorry, I misread the paragraph. It would be clearer to say

   URL creation mechanisms that generate the URL from a source which
   is not restricted to a single character->octet encoding are
   encouraged, but not required, to transition resource names toward
   using UTF-8 exclusively.

   URL creation mechanisms that generate the URL from a source which
   is restricted to a single character->octet encoding should use
   UTF-8 exclusively. If the source encoding is not UTF-8, then a
   mapping between the source encoding and UTF-8 should be used.

And please cut the self-righteous crap in your replies. I am fully aware
of why people want to localize their URLs, and I am in a better position
to know what the implementation issues are when doing filename<->URL
mapping. I have yet to see a memory+time efficient mapping from arbitrary
charset to UTF-8, and I have a lot more faith in standards based on
running code than on supposition.

.....Roy
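
For readers following the thread, the mapping step in the proposed wording above (source encoding to UTF-8, then octets into the URL) might look roughly like the sketch below. It is only an illustration under stated assumptions: the function name `filename_to_url_path` is hypothetical, ISO-8859-1 is picked arbitrarily as the single source encoding, and nothing here reflects how any particular server actually does its filename<->URL mapping.

```python
from urllib.parse import quote


def filename_to_url_path(raw_name: bytes, source_encoding: str = "iso-8859-1") -> str:
    """Map a filename stored in one known encoding to a UTF-8 based URL path.

    Hypothetical helper: assumes the whole source uses a single, known
    character->octet encoding (ISO-8859-1 by default, chosen for illustration).
    """
    # Step 1: source octets -> characters, using the source encoding.
    text = raw_name.decode(source_encoding)
    # Step 2: characters -> UTF-8 octets.
    utf8_octets = text.encode("utf-8")
    # Step 3: percent-escape the octets that may not appear literally in a URL.
    return quote(utf8_octets, safe="/")


if __name__ == "__main__":
    # "résumé.html" as stored on an ISO-8859-1 filesystem.
    print(filename_to_url_path(b"r\xe9sum\xe9.html"))
```

For a source that is already ASCII the result is unchanged, so only names containing characters outside ASCII are affected by the transition.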