Re: http charset labelling

Masataka Ohta <mohta@necom830.cc.titech.ac.jp> Wed, 07 February 1996 03:23 UTC

Received: from ietf.nri.reston.va.us by IETF.CNRI.Reston.VA.US id aa28462; 6 Feb 96 22:23 EST
Received: from CNRI.Reston.VA.US by IETF.CNRI.Reston.VA.US id aa28458; 6 Feb 96 22:23 EST
Received: from services.Bunyip.COM by CNRI.Reston.VA.US id aa18706; 6 Feb 96 22:23 EST
Received: (from daemon@localhost) by services.bunyip.com (8.6.10/8.6.9) id WAA16811 for uri-out; Tue, 6 Feb 1996 22:00:22 -0500
Received: from mocha.bunyip.com (mocha.Bunyip.Com [192.197.208.1]) by services.bunyip.com (8.6.10/8.6.9) with SMTP id WAA16806 for <uri@services.bunyip.com>; Tue, 6 Feb 1996 22:00:19 -0500
Received: from necom830.cc.titech.ac.jp by mocha.bunyip.com with SMTP (5.65a/IDA-1.4.2b/CC-Guru-2b) id AA06515 (mail destined for uri@services.bunyip.com); Tue, 6 Feb 96 22:00:13 -0500
Received: by necom830.cc.titech.ac.jp (8.6.11/necom-mx-rg); Wed, 7 Feb 1996 11:46:54 +0859
Sender: ietf-archive-request@IETF.CNRI.Reston.VA.US
From: Masataka Ohta <mohta@necom830.cc.titech.ac.jp>
Message-Id: <199602070247.LAA18548@necom830.cc.titech.ac.jp>
Subject: Re: http charset labelling
To: Gavin Nicol <gtn@ebt.com>
Date: Wed, 07 Feb 1996 11:46:52 -0000
Cc: masinter@parc.xerox.com, keld@dkuug.dk, uri@bunyip.com
In-Reply-To: <199602061504.KAA13675@ebt-inc.ebt.com>; from "Gavin Nicol" at Feb 6, 96 10:04 am
X-Mailer: ELM [version 2.3 PL11]
X-Orig-Sender: owner-uri@bunyip.com
Precedence: bulk

> >> Or fix the problem by allowing specification of the encoding used for
> >> the URL's.
> > 
> >That's no fix.
> > 
> >If you allow specification of the encoding, what we can see on paper
> >is resulting lengthy specification of the encoding concatenated with
> >lengthy 7bit encoding of the URL body.
> 
> Don't be silly.

You don't be silly.

> The results might
> vary widely depending on whether the data was transmitted as SJIS,
> EUC or UTF-8, if there is no encoding information.

Because of duplicated shape of 'A' for Latin and Greek capital
letter 'A' and alpha, and because of duplicated encoding of Big5,
encoding information, in general, is no fix for unique conversion
from shape on a paper to internal code.

Don't try to do something proven to be impossible.

PERIOD.

							Masataka Ohta