Re: Using UTF-8 for non-ASCII Characters in URLs
"Martin J. Duerst" <mduerst@ifi.unizh.ch> Mon, 05 May 1997 10:37 UTC
Date: Mon, 05 May 1997 12:12:27 +0200
From: "Martin J. Duerst" <mduerst@ifi.unizh.ch>
To: Edward Cherlin <cherlin@newbie.net>
cc: uri@bunyip.com
Subject: Re: Using UTF-8 for non-ASCII Characters in URLs
In-Reply-To: <v0300783faf8f314b10e6@[206.245.192.60]>
Message-ID: <Pine.SUN.3.96.970505115640.245B-100000@enoshima>
MIME-Version: 1.0
Content-Type: TEXT/PLAIN; charset="US-ASCII"
Sender: owner-uri@bunyip.com
Precedence: bulk
Hello Edward,

Thanks for the corrections and additions to the draft. In some cases, I just had some "???" where I didn't yet have time to look things up or write them out. I didn't mean that others would have to complete it.

On Thu, 1 May 1997, Edward Cherlin wrote:

> That could be taken to apply to math and APL characters, which would be
> unfortunate. There are strong reasons for allowing math and APL expressions
> in identifiers for math and APL pages. I published a book, "The
> Encyclopedia of APL" which was indexed in APL as well as in English names
> of APL symbols, functions, and operators. It would have been a useful Web
> site.

I can see the point in the case of APL, which is a computer-based, alphabet-like collection with a well-established and accessible keyboard mapping that is very familiar to its community. I don't see that much of a point for math in general, because the codepoints available in Unicode often leave open questions about exactly how a character should look and what it should mean, so that consistent transcription is not at all guaranteed. Also, in the above APL example, the APL characters would probably appear further down the path hierarchy, and would not be that important as entry points.

> All codepoints can be entered from standard keyboards. There are keyboards
> and other entry methods for almost all Unicode characters implemented in
> some software, and all can be used in keyboard layouts of standard form.

We don't want the average user of a URL to have to compose the URL with complicated operations. That's a fallback for those cases where nothing else is available, or may be okay for a single character, but shouldn't be encouraged too much.

Regards,
Martin.
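To make the APL case concrete, here is a minimal sketch of how an APL character further down a path could be carried as percent-encoded UTF-8, which is the scheme this thread is discussing. It assumes Python's standard `urllib.parse` module; the path `/apl/functions/⍴` and the helper name `encode_path` are hypothetical and chosen only for illustration.

```python
from urllib.parse import quote, unquote


def encode_path(path: str) -> str:
    """Percent-encode the UTF-8 bytes of every non-ASCII character in a path.

    The "/" separators are left untouched so the hierarchy stays readable.
    """
    # quote() encodes to UTF-8 by default before escaping each byte.
    return quote(path, safe="/")


# Hypothetical path: a plain ASCII entry point, with the APL symbol
# U+2374 (APL FUNCTIONAL SYMBOL RHO) appearing only in the last segment.
path = "/apl/functions/\u2374"

encoded = encode_path(path)
decoded = unquote(encoded)  # unquote() also assumes UTF-8 by default

print(encoded)           # /apl/functions/%E2%8D%B4
print(decoded == path)   # True
```

The ASCII part of the path stays typeable on any keyboard, and only the final segment needs the three escaped bytes, so the character can still be transcribed when no APL keyboard mapping is available.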