Re: 8 bit characters in DNS names (and URNs?)

Masataka Ohta <mohta@necom830.hpcl.titech.ac.jp> Mon, 11 March 1996 03:17 UTC

Received: from ietf.cnri.reston.va.us by IETF.CNRI.Reston.VA.US id aa14211; 10 Mar 96 22:17 EST
Received: from CNRI.Reston.VA.US by IETF.CNRI.Reston.VA.US id aa14207; 10 Mar 96 22:17 EST
Received: from services.Bunyip.COM by CNRI.Reston.VA.US id aa12314; 10 Mar 96 22:17 EST
Received: (from daemon@localhost) by services.bunyip.com (8.6.10/8.6.9) id VAA23223 for uri-out; Sun, 10 Mar 1996 21:53:27 -0500
Received: from mocha.bunyip.com (mocha.Bunyip.Com [192.197.208.1]) by services.bunyip.com (8.6.10/8.6.9) with SMTP id VAA23206 for <uri@services.bunyip.com>; Sun, 10 Mar 1996 21:52:48 -0500
Received: from [131.112.32.132] by mocha.bunyip.com with SMTP (5.65a/IDA-1.4.2b/CC-Guru-2b) id AA15414 (mail destined for uri@services.bunyip.com); Sun, 10 Mar 96 21:52:33 -0500
Sender: ietf-archive-request@IETF.CNRI.Reston.VA.US
From: Masataka Ohta <mohta@necom830.hpcl.titech.ac.jp>
Message-Id: <199603110239.LAA05204@necom830.hpcl.titech.ac.jp>
Received: by necom830.hpcl.titech.ac.jp (8.6.11/TM2.1) id LAA05204; Mon, 11 Mar 1996 11:39:40 +0900
Subject: Re: 8 bit characters in DNS names (and URNs?)
To: Peter Paul Sint <sint@oeaw.ac.at>
Date: Mon, 11 Mar 96 11:39:39 JST
Cc: masinter@parc.xerox.com, keld@dkuug.dk, martin@terena.nl, wg-i18n@terena.nl, uri@bunyip.com
In-Reply-To: <v02130505ad668ac95a8d@[193.170.88.66]>; from "Peter Paul Sint" at Mar 9, 96 2:34 am
X-Mailer: ELM [version 2.3 PL11]
X-Orig-Sender: owner-uri@bunyip.com
Precedence: bulk

> >> While uppercase mapping is culturally sensitive, can we not make a
> >> culturally independent 'character matching' algorithm that is good
> >> enough for directory services.

> >Theoretically, it is a union of all the matching rules of all
> >the culture. But, in practice, it is hard especially because
> >the expected degree of matching differs service by service.

It's a union. OK?

> German has a lower case letter
> (looks like a beta

Of course.

> You would never write umlaut A as an A. (only aliens do so - and software).

Of course.

> The back transformation is not unique!

Of course.

But, it is not a problem if, with some internationalized non-strict
directory service, a pattern of umlaut 'A' matches both 'a', 'A',
umlaut 'a', umlaut 'A', 'ae' and 'AE'.

							Masataka Ohta