Re: 8 bit characters in DNS names (and URNs?)

Keld J|rn Simonsen <keld@dkuug.dk> Thu, 07 March 1996 07:16 UTC

Message-Id: <199603070643.HAA02413@dkuug.dk>
Sender: ietf-archive-request@IETF.CNRI.Reston.VA.US
From: Keld J|rn Simonsen <keld@dkuug.dk>
Date: Thu, 07 Mar 1996 07:43:45 +0100
In-Reply-To: Larry Masinter <masinter@parc.xerox.com> "Re: 8 bit characters in DNS names (and URNs?)" (Mar 7, 6:13)
Mime-Version: 1.0
Content-Type: Text/Plain; Charset="ISO-8859-1"
Content-Transfer-Encoding: 8bit
Mnemonic-Intro: 29
To: Larry Masinter <masinter@parc.xerox.com>
Subject: Re: 8 bit characters in DNS names (and URNs?)
Cc: martin@terena.nl, wg-i18n@terena.nl, uri@bunyip.com
Precedence: bulk

Larry Masinter writes:

> While in ASCII you can define 'case independent match' by
> performing 'translate to upper case and then use string equality',
> this does not work for other character repertoires, e.g., JIS might
> have separate codes for single and double-wide codes yet want to treat
> them equivalent for matching.
> 
> While uppercase mapping is culturally sensitive, can we not make a
> culturally independent 'character matching' algorithm that is good
> enough for directory services. Perhaps it means treating accented and
> unaccented versions of French initial capitals equivalent, even though
> this equivalence is not determined by 'canonicalization'?
> 
ISO/IEC JTC1/SC22/WG20 is producing a sorting/comparison standard
that may be used for this purpose. It has a number of levels that
the comparison may be done at, for exmaple level 1 would equivalence
all "A"s and the level could also equivalence single and double-
width encodings (of the latin letters, mostly). The standard is
ISO 14651 now appearing as WD3 and going to CD stage in May 1996.

Keld

Re: URN to URC resolution scenario Mitra
Re: Library Standards and URIs Terry Allen
Re: Library Standards and URIs Terry Allen
Re: URC formats vs interfaces Ronald E. Daniel
URC formats vs interfaces Daniel LaLiberte
no child-of-URI groups at Dallas IETF? Larry Masinter
Re: no child-of-URI groups at Dallas IETF? Ronald E. Daniel
Re: html, http, urls and internationalisation Keld J|rn Simonsen
Re: http charset labelling Larry Masinter
Re: http charset labelling Keld J|rn Simonsen
Re: http charset labelling Keld J|rn Simonsen
Re: http charset labelling Larry Masinter
Re: http charset labelling Keld J|rn Simonsen
Re: http charset labelling Masataka Ohta
Re: http charset labelling Gavin Nicol
Re: http charset labelling Masataka Ohta
Re: http charset labelling Gavin Nicol
Re: http charset labelling Masataka Ohta
Re: http charset labelling Gavin Nicol
Re: http charset labelling Masataka Ohta
Re: http charset labelling Masataka Ohta
Re: http charset labelling Gavin Nicol
Re: http charset labelling Gavin Nicol
Re: http charset labelling Masataka Ohta
Re: http charset labelling Gavin Nicol
Re: 8 bit characters in DNS names (and URNs?) Keld J|rn Simonsen
Re: Typeable characters Keld J|rn Simonsen
Re: UTF-8 and URLs Keld J|rn Simonsen