Re: http charset labelling

dupuy@cs.columbia.edu Fri, 09 February 1996 23:13 UTC

Received: from ietf.nri.reston.va.us by IETF.CNRI.Reston.VA.US id aa29804; 9 Feb 96 18:13 EST
Received: from CNRI.Reston.VA.US by IETF.CNRI.Reston.VA.US id aa29799; 9 Feb 96 18:13 EST
Received: from services.Bunyip.COM by CNRI.Reston.VA.US id aa16352; 9 Feb 96 18:13 EST
Received: (from daemon@localhost) by services.bunyip.com (8.6.10/8.6.9) id RAA12376 for uri-out; Fri, 9 Feb 1996 17:35:24 -0500
Received: from mocha.bunyip.com (mocha.Bunyip.Com [192.197.208.1]) by services.bunyip.com (8.6.10/8.6.9) with SMTP id RAA12371 for <uri@services.bunyip.com>; Fri, 9 Feb 1996 17:35:22 -0500
Received: from versed.smarts.com by mocha.bunyip.com with SMTP (5.65a/IDA-1.4.2b/CC-Guru-2b) id AA29330 (mail destined for uri@services.bunyip.com); Fri, 9 Feb 96 17:35:17 -0500
Received: from just.smarts.com (dupuy@just.smarts.com [198.49.114.138]) by versed.smarts.com (8.6.12/8.6.12) with SMTP id RAA05955; Fri, 9 Feb 1996 17:34:01 -0500
Received: by just.smarts.com (5.x/SMI-SVR4) id AA18323; Fri, 9 Feb 1996 17:35:08 -0500
Date: Fri, 9 Feb 1996 17:35:08 -0500
Message-Id: <9602092235.AA18323@just.smarts.com>
To: mohta@necom830.cc.titech.ac.jp, gtn@ebt.com
Subject: Re: http charset labelling
Cc: uri@bunyip.com
Sender: ietf-archive-request@IETF.CNRI.Reston.VA.US
From: dupuy@cs.columbia.edu
X-Sun-Charset: US-ASCII
X-Orig-Sender: owner-uri@bunyip.com
Precedence: bulk

Gavin wrote:
> >> Nor should there be. People map from a glyph to a *character*, which
> >> is quite different to a character code. I have enough trouble mapping
> >> some glyphs to characters.

Masataka replied:
> >Then, why do you think you can force people to perform the
> >troublesome mapping?

Gavin replied:
> I'm not proposing to force anyone to do anything, but rather, simply
> give them a choice. You *are* trying to force people to use a
> non-natural representation.

You're giving them a choice between using a "non-natural" representation which
will always work, and using a "natural" representation which may not work in
some cases, e.g. in Greece where a user might enter a Greek A rather than a
Latin A when typing in a URL from a business card.

The convenience of extended character set for URL may outweigh the
inconvenience of less reliable ability to type them in from printed
representation.  But, we should be aware of this tradeoff if we decide to make
it.  This is Masataka's point, which you seem to be missing.

Note that this issue of duplicated code points is independent of the code
mapping issue; unless code mapping is dealt with, extended character set URLs
are unlikely to work in many cases when typing them in from printed
representation.  There have been a few proposals for dealing with code
mapping; I'm not totally convinced that they all of them will work well in
practice with existing browsers that are unaware of character codes.

@alex
--
inet: dupuy@smarts.com
Member of the League for Programming Freedom -- write to lpf@uunet.uu.net
GCS d?@ H s++: !g p? !au a w v US+++$ C++$ P+ 3 L E++ N+(!N) K- W M V- po- Y+
     t+ !5 j R G? tv-- b++ !D B- e>* u+(**)@ h--- f+ r++ n+ y+*