Re: UTF-8 URL for testing

"Martin J. Duerst" <mduerst@ifi.unizh.ch> Mon, 14 April 1997 11:57 UTC

Received: from cnri by ietf.org id aa25429; 14 Apr 97 7:57 EDT
Received: from services.Bunyip.Com by CNRI.Reston.VA.US id aa08141; 14 Apr 97 7:57 EDT
Received: (from daemon@localhost) by services.bunyip.com (8.8.5/8.8.5) id HAA17843 for uri-out; Mon, 14 Apr 1997 07:12:42 -0400 (EDT)
Received: from mocha.bunyip.com (mocha.Bunyip.Com [192.197.208.1]) by services.bunyip.com (8.8.5/8.8.5) with SMTP id HAA17837 for <uri@services.bunyip.com>; Mon, 14 Apr 1997 07:12:40 -0400 (EDT)
Received: from josef.ifi.unizh.ch by mocha.bunyip.com with SMTP (5.65a/IDA-1.4.2b/CC-Guru-2b) id AA02536 (mail destined for uri@services.bunyip.com); Mon, 14 Apr 97 07:09:34 -0400
Received: from enoshima.ifi.unizh.ch by josef.ifi.unizh.ch with SMTP (PP) id <26522-0@josef.ifi.unizh.ch>; Mon, 14 Apr 1997 12:52:26 +0200
Date: Mon, 14 Apr 1997 12:52:25 +0200
From: "Martin J. Duerst" <mduerst@ifi.unizh.ch>
To: Harald.T.Alvestrand@uninett.no
Cc: Francois Yergeau <yergeau@alis.com>, uri@bunyip.com
Subject: Re: UTF-8 URL for testing
In-Reply-To: <12614.861001021@munken.uninett.no>
Message-Id: <Pine.SUN.3.96.970414122919.245B-100000@enoshima>
Mime-Version: 1.0
Content-Type: TEXT/PLAIN; charset="US-ASCII"
Sender: owner-uri@bunyip.com
Precedence: bulk

On Mon, 14 Apr 1997 Harald.T.Alvestrand@uninett.no wrote:

> Result of testing of Francois' page with Netscape 3.0:

Result of testing Francois' page with Netscape 4.0 (Preview Release 2,
on Solaris (with DISPLAY on SunOS):

- The character encoding of the document, as announced by the server,
	was recoginzed by the browser, and the characters appeared
	correctly (co^te' with the ^ and ' over their preceeding character).
- The same is true if you have a look at the source (congratulations to
	Netscape! I wouldn't have expected that to work), but not in the
	location field or at the bottom of the page. In those places,
	the data is interpreted as Latin-1. One can type in accented
	characters into the Location field using the "compose" key on the
	Sun keyboard, but these again are encoded as Latin-1.
- The %-encoded versions are displayed with percent signs.


My conclusion: Raw UTF-8 URLs are partially supported, and for the rest
are "mostly harmless" (that is, they just display bizarrely); %HH-encoded
UTF-8 URLs are supported and displayed with encoding.
 
> Pair of fact beats house of theory - thanks, Francois!

Thanks from my side, too.	Martin.