Re: I18N Concensus - Generic Syntax Document

"Roy T. Fielding" <fielding@kiwi.ics.uci.edu> Fri, 07 March 1997 15:34 UTC

Received: from cnri by ietf.org id aa03327; 7 Mar 97 10:34 EST
Received: from services.Bunyip.Com by CNRI.Reston.VA.US id aa09529; 7 Mar 97 10:34 EST
Received: (from daemon@localhost) by services.bunyip.com (8.8.5/8.8.5) id JAA10163 for uri-out; Fri, 7 Mar 1997 09:59:38 -0500 (EST)
Received: from mocha.bunyip.com (mocha.Bunyip.Com [192.197.208.1]) by services.bunyip.com (8.8.5/8.8.5) with SMTP id JAA10156 for <uri@services.bunyip.com>; Fri, 7 Mar 1997 09:59:34 -0500 (EST)
Received: from paris.ics.uci.edu by mocha.bunyip.com with SMTP (5.65a/IDA-1.4.2b/CC-Guru-2b) id AA21076 (mail destined for uri@services.bunyip.com); Fri, 7 Mar 97 09:59:31 -0500
Received: from kiwi.ics.uci.edu by paris.ics.uci.edu id aa00482; 7 Mar 97 6:57 PST
To: Rich Salz <rsalz@osf.org>
Cc: uri@bunyip.com
Subject: Re: I18N Concensus - Generic Syntax Document
In-Reply-To: Your message of "Fri, 07 Mar 1997 08:14:42 EST." <199703071314.IAA26595@sulphur.osf.org>
Date: Fri, 07 Mar 1997 06:57:23 -0800
From: "Roy T. Fielding" <fielding@kiwi.ics.uci.edu>
Message-Id: <9703070657.aa00482@paris.ics.uci.edu>
Sender: owner-uri@bunyip.com
Precedence: bulk

>>  An HTTP server doesn't need one -- its URLs are
>>either composed by computation (in which case knowing the charset is not
>>possible) or by derivation from the filesystem (in which case it will use
>>whatever charset the filesystem uses, and in any case has no way of
>>determining whether or not that charset is UTF-8).
>
>I don't know if you can just rule out filesystems just like that.
>I can imagine networked filesystems that span hosts that would have,
>or need to have, the locale stored at the mountpoint.

I am sure it is possible on some file systems to determine the charset.
It just isn't possible on all of the file systems for which you can
use an Apache server, nor is it possible for us to distribute code
that maps from any possible filesystem charset into UTF-8 and back
again, nor is it desirable for us to build a server that does it in
the first place because, as I said in a message a while back, I don't
think it is a good idea for http URLs to contain (or be displayed)
as anything other than ASCII characters, regardless of the locale.

.....Roy