Re: http charset labelling
Gavin Nicol <gtn@ebt.com> Tue, 06 February 1996 16:41 UTC
Received: from ietf.nri.reston.va.us by IETF.CNRI.Reston.VA.US id aa15304; 6 Feb 96 11:41 EST
Received: from CNRI.Reston.VA.US by IETF.CNRI.Reston.VA.US id aa15299; 6 Feb 96 11:41 EST
Received: from services.Bunyip.COM by CNRI.Reston.VA.US id aa09302; 6 Feb 96 11:41 EST
Received: (from daemon@localhost) by services.bunyip.com (8.6.10/8.6.9) id KAA27470 for uri-out; Tue, 6 Feb 1996 10:26:23 -0500
Received: from mocha.bunyip.com (mocha.Bunyip.Com [192.197.208.1]) by services.bunyip.com (8.6.10/8.6.9) with SMTP id KAA27461 for <uri@services.bunyip.com>; Tue, 6 Feb 1996 10:26:19 -0500
Received: from ebt-inc.ebt.com by mocha.bunyip.com with SMTP (5.65a/IDA-1.4.2b/CC-Guru-2b) id AA00297 (mail destined for uri@services.bunyip.com); Tue, 6 Feb 96 10:26:16 -0500
Received: (from gtn@localhost) by ebt-inc.ebt.com (8.6.12/8.6.9) id KAA13675; Tue, 6 Feb 1996 10:04:55 -0500
Date: Tue, 06 Feb 1996 10:04:55 -0500
Sender: ietf-archive-request@IETF.CNRI.Reston.VA.US
From: Gavin Nicol <gtn@ebt.com>
Message-Id: <199602061504.KAA13675@ebt-inc.ebt.com>
To: mohta@necom830.cc.titech.ac.jp
Cc: masinter@parc.xerox.com, keld@dkuug.dk, uri@bunyip.com
In-Reply-To: <199602060213.LAA15162@necom830.cc.titech.ac.jp> (message from Masataka Ohta on Tue, 6 Feb 96 11:12:57 JST)
Subject: Re: http charset labelling
X-Orig-Sender: owner-uri@bunyip.com
Precedence: bulk
>> Or fix the problem by allowing specification of the encoding used for >> the URL's. > >That's no fix. > >If you allow specification of the encoding, what we can see on paper >is resulting lengthy specification of the encoding concatenated with >lengthy 7bit encoding of the URL body. Don't be silly. On paper, people will be looking at glyphs, and thereby associat them with characters (one way of decoding information). On the Internet, computers will be looking at a set of octets, and mapping them to characters by using some information about the encoding used for the characters. The end result is the same (a mapping to characters), but the process is entirely different, and rightly so. The point is simply this: if I give a business card to someone, and it has a URL pointing to something with kanji in it, then if that person goes to a SJIS systems and types in the URL, the server needs to know how to map that set of octets to a resource. The results might vary widely depending on whether the data was transmitted as SJIS, EUC or UTF-8, if there is no encoding information. I agree that such URL's are not very useful in an international setting, but that does not mean they should be dissallowed entirely. That is like saying that Japanese should only use romanji. >> Yes, there is an Internet directory put out by Gakken (I forget the >> name) that had such an article last month. > >Then, Gakken should be wrong. Or, you may be confusing URL and >text content. I most certainly did not confuse a URL with content (very difficult to do, especially as I can read Japanese). I guess you, I, and a lot of other people, think that if people really want to be global, they should avoid using kanji, or whatever, in URL's. However, as a persoan at Astec said, and I agree, people *will* put kanji into resource names, and they *will* expect it to work. As such, I think it better to design a system that can handle *all* cases, as users expect them to be handled.
- Re: URN to URC resolution scenario Mitra
- Re: Library Standards and URIs Terry Allen
- Re: Library Standards and URIs Terry Allen
- Re: URC formats vs interfaces Ronald E. Daniel
- URC formats vs interfaces Daniel LaLiberte
- no child-of-URI groups at Dallas IETF? Larry Masinter
- Re: no child-of-URI groups at Dallas IETF? Ronald E. Daniel
- Re: html, http, urls and internationalisation Keld J|rn Simonsen
- Re: http charset labelling Larry Masinter
- Re: http charset labelling Keld J|rn Simonsen
- Re: http charset labelling Keld J|rn Simonsen
- Re: http charset labelling Larry Masinter
- Re: http charset labelling Keld J|rn Simonsen
- Re: http charset labelling Masataka Ohta
- Re: http charset labelling Gavin Nicol
- Re: http charset labelling Masataka Ohta
- Re: http charset labelling Gavin Nicol
- Re: http charset labelling Masataka Ohta
- Re: http charset labelling Gavin Nicol
- Re: http charset labelling Masataka Ohta
- Re: http charset labelling Masataka Ohta
- Re: http charset labelling Gavin Nicol
- Re: http charset labelling Gavin Nicol
- Re: http charset labelling Masataka Ohta
- Re: http charset labelling Gavin Nicol
- Re: 8 bit characters in DNS names (and URNs?) Keld J|rn Simonsen
- Re: Typeable characters Keld J|rn Simonsen
- Re: UTF-8 and URLs Keld J|rn Simonsen