Re: revised "generic syntax" internet draft

"Martin J. Duerst" <mduerst@ifi.unizh.ch> Mon, 21 April 1997 13:18 UTC

Date: Mon, 21 Apr 1997 14:43:07 +0200
From: "Martin J. Duerst" <mduerst@ifi.unizh.ch>
To: Keld Jørn Simonsen <keld@dkuug.dk>
Cc: John C Klensin <klensin@mci.net>, Dan Oscarsson <Dan.Oscarsson@trab.se>, Harald.T.Alvestrand@uninett.no, uri@bunyip.com, fielding@kiwi.ics.uci.edu
Subject: Re: revised "generic syntax" internet draft
In-Reply-To: <199704152232.AAA29896@dkuug.dk>
Message-Id: <Pine.SUN.3.96.970421142501.245H-100000@enoshima>

On Wed, 16 Apr 1997, Keld Jørn Simonsen wrote:

> John Klensin writes about use of UTF-8 and penalties in size 
> and readability for various user communities. Some remarks:

> Maybe John wants to be able to use other charsets for encoding
> a URL. I actually proposed some time ago a solution labelling
> the encoding of the URL in a "URL-charset:" header and
> having UTF-8 as default, and I remember somebody else also proposing
> charset labelling - on the URL line. I have not at this time evaluated
> such proposals compared to Martin and François's proposals, but it
> is clear that the intended functionality is the same - and my old
> proposal could be seen as an extension to Martin/François - but I
> am not sure it is necessary.

In particular, the "FORM-UTF8: Yes" I proposed is very similar
to your proposal. Being able to label arbitrary "charset"s is
an extension, but I don't think it is needed at this stage of
ISO 10646 and Internet development. The way I usually put it
is that we currently have "chaos". There is no need to proceed
to "labeled chaos" when we can proceed to "order" directly.
The Universal Character Set shows its strength most
directly in short and widely used strings such as URLs.
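
To make "chaos" versus "order" concrete, here is a minimal sketch in
Python. It assumes a hypothetical path segment "Jørn" and uses the
standard urllib.parse functions; it is only an illustration, not part
of any proposal in this thread. The same characters escape to
different bytes depending on a charset the URL itself does not carry,
whereas a fixed UTF-8 convention gives one form that every receiver
can decode without a label.

```python
from urllib.parse import quote, unquote

# Hypothetical path segment containing one non-ASCII character (assumption).
segment = "Jørn"

# "Chaos": the escaped form depends on the sender's local charset,
# and nothing in the URL says which charset that was.
latin1_form = quote(segment, encoding="iso-8859-1")  # 'J%F8rn'
utf8_form = quote(segment, encoding="utf-8")         # 'J%C3%B8rn'

# A receiver that guesses the wrong charset does not recover the name.
wrong_guess = unquote(latin1_form, encoding="utf-8", errors="replace")

# "Order": if everybody escapes UTF-8, decoding needs no label at all.
round_trip = unquote(utf8_form, encoding="utf-8")

print(latin1_form, utf8_form, repr(wrong_guess), round_trip)
```

"Labeled chaos" would correspond to shipping the charset name next to
latin1_form (for example in a "URL-charset:" header) so that the
receiver can pick the right decoder; the UTF-8-only convention makes
that extra field unnecessary.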

Regards,	Martin.