Re: 8 bit characters in DNS names (and URNs?)
Keld J|rn Simonsen <keld@dkuug.dk> Tue, 05 March 1996 17:36 UTC
Received: from ietf.cnri.reston.va.us by IETF.CNRI.Reston.VA.US id aa12447;
5 Mar 96 12:36 EST
Received: from CNRI.Reston.VA.US by IETF.CNRI.Reston.VA.US id aa12443;
5 Mar 96 12:36 EST
Received: from services.Bunyip.COM by CNRI.Reston.VA.US id aa09200;
5 Mar 96 12:36 EST
Received: (from daemon@localhost) by services.bunyip.com (8.6.10/8.6.9) id
LAA04201 for uri-out; Tue, 5 Mar 1996 11:33:54 -0500
Received: from mocha.bunyip.com (mocha.Bunyip.Com [192.197.208.1]) by
services.bunyip.com (8.6.10/8.6.9) with SMTP id LAA04193 for
<uri@services.bunyip.com>; Tue, 5 Mar 1996 11:33:51 -0500
Received: from dkuug.dk by mocha.bunyip.com with SMTP
(5.65a/IDA-1.4.2b/CC-Guru-2b)
id AA04598 (mail destined for uri@services.bunyip.com);
Tue, 5 Mar 96 11:33:47 -0500
Received: (from keld@localhost) by dkuug.dk (8.6.12/8.6.12) id RAA27148;
Tue, 5 Mar 1996 17:32:48 +0100
Message-Id: <199603051632.RAA27148@dkuug.dk>
Sender: ietf-archive-request@IETF.CNRI.Reston.VA.US
From: Keld J|rn Simonsen <keld@dkuug.dk>
Date: Tue, 5 Mar 1996 17:32:40 +0100
In-Reply-To: martin@terena.nl (John Martin)
"8 bit characters in DNS names (and URNs?)" (Mar 5, 10:44)
X-Charset: ISO-8859-1
X-Char-Esc: 29
Mime-Version: 1.0
Content-Type: Text/Plain; Charset=ISO-8859-1
Content-Transfer-Encoding: 8bit
Mnemonic-Intro: 29
X-Mailer: Mail User's Shell (7.2.2 4/12/91)
To: John Martin <martin@terena.nl>, wg-i18n@terena.nl
Subject: Re: 8 bit characters in DNS names (and URNs?)
Cc: uri@bunyip.com
X-Orig-Sender: owner-uri@bunyip.com
Precedence: bulk
Alexander Dupuy writes a note on 8-bit DNS entries. He states that the biggest problem is that DNS entries are case insensitive, and that this is not well defined beyond ASCII. I am the editor of an ISO standard where we are defining a format for cultural conventions building on the POSIX locales and charmaps. Included will be a standard locale with mapping tables between lower and upper case for the whole of 10646. This locale will be freely available on the net together with charmaps more than 100 coded character sets. Data is already available that is similar to this, but not complete yet over full 10646. Alexander also writes that the upercase mapping is culturally sensitive. This is correct, but there is a great majority of cultures that have the same toupper() specifications. In most cultures a latin small e with acute is capitalized into a capital e with acute. Likewise with a small greek omega - it is capitalized into a capital greek omega. The only exception I can think of is in Turkish <i without dot> with uppercase <I>, and <i> capitalized into <I with dot>. Then some say that in french they never use capitalized accented letters, but that seems not to be the rule, according to official French sources. I am confident that the uppercase mapping should not be a problem. But I am not sure that we should do this just as an enhancement in DNS. Anyway one way to do it would be to say that the entry should be in UTF-8, and we could define a new RR type to do this. URLs could then first look there and if not found look in the normal RRs. I am not sure it is the right time to make such specifications, though. Keld
- 8 bit characters in DNS names (and URNs?) Alexander Dupuy
- Re: 8 bit characters in DNS names (and URNs?) Keld J|rn Simonsen
- Re: 8 bit characters in DNS names (and URNs?) Masataka Ohta
- Re: 8 bit characters in DNS names (and URNs?) Alexander Dupuy
- Re: 8 bit characters in DNS names (and URNs?) Larry Masinter
- Re: 8 bit characters in DNS names (and URNs?) Larry Masinter
- Re: 8 bit characters in DNS names (and URNs?) Masataka Ohta
- Re: 8 bit characters in DNS names (and URNs?) Patrik Faltstrom
- Re: 8 bit characters in DNS names (and URNs?) Peter Paul Sint
- Re: 8 bit characters in DNS names (and URNs?) Masataka Ohta