RE: Unicode progress

Masataka Ohta <mohta@necom830.cc.titech.ac.jp> Wed, 27 October 1993 05:36 UTC

Received: from ietf.nri.reston.va.us by IETF.CNRI.Reston.VA.US id aa27596; 27 Oct 93 1:36 EDT
Received: from CNRI.RESTON.VA.US by IETF.CNRI.Reston.VA.US id aa27588; 27 Oct 93 1:36 EDT
Received: from ucdavis.ucdavis.edu by CNRI.Reston.VA.US id aa09573; 27 Oct 93 1:37 EDT
Received: by ucdavis.ucdavis.edu (4.1/UCD2.05) id AA09578; Tue, 26 Oct 93 21:44:35 PDT
X-Orig-Sender: ietf-wnils-request@ucdavis.edu
Received: from necom830.cc.titech.ac.jp by ucdavis.ucdavis.edu (4.1/UCD2.05) id AA09386; Tue, 26 Oct 93 21:39:15 PDT
Received: by necom830.cc.titech.ac.jp (5.65+/necom-mx-rg); Wed, 27 Oct 93 13:35:46 +0859
Sender: ietf-archive-request@IETF.CNRI.Reston.VA.US
From: Masataka Ohta <mohta@necom830.cc.titech.ac.jp>
Return-Path: <mohta@necom830.cc.titech.ac.jp>
Message-Id: <9310270436.AA01885@necom830.cc.titech.ac.jp>
Subject: RE: Unicode progress
To: Borka Jerman-Blazic <jerman-blazic@ijs.si>
Date: Wed, 27 Oct 93 13:35:44 JST
Cc: ietf-charsets@innosoft.com, ietf-wnils@ucdavis.edu
In-Reply-To: <211*/S=jerman-blazic/O=ijs/PRMD=ac/ADMD=mail/C=si/@MHS>; from "Borka Jerman-Blazic" at Oct 25, 93 12:09 pm
X-Mailer: ELM [version 2.3 PL11]

> I would say that they have different code points because they belong to
> different scripts!.

Good. So, CJK Hans should have different code points.

Hiragana-Katakana-Han script for Japanese and Hangul-Han script for Korean
and pur Han script for Chinese are, of course, completely different.

> You can order all latin characters from ISO 10 646  
> in one collation string for many different languages (it is 
> difficult because the ordering rules
> differ from language to language, some already done work is
> around)

It is well known that such collation (so called phone book order) is
impossible, as the ordering rules are often contradictory diffferent
language by language even within those with Latin characters ('v' and
'w', and 'a' with ring above are famous examples).

> I agree completly.

Agree with what? I'm afraid you misunderstand the issue.

> Glyphs and typography is not related issue here.

But, as for "whois", if the data received could not be displayed because
"Glyphs and typography is not related issue here", it is just an abstract
nonsense.

> Characters in one coded character set are supposed to be unique i.e
> one character is coded only once in one character set table.

I don't mind what your difinition of "character" is, but you confuse the
uniqueness of a character code and the uniqueness of a character.

> p.s but we agreed what are the problems  to be solved over Internet,
> did we??

Yes, we have already agreed to be able to handle mixed multilingual text,
which means we need a code which separate CJK Han.

I don't mind if you do not call it "a character code".

							Masataka Ohta