Re: [idn] Re: character tables

"Adam M. Costello" <idn.amc+0@nicemice.net.RemoveThisWord.cnri.reston.va.us> Thu, 03 March 2005 06:41 UTC

Received: from psg.com (mailnull@psg.com [147.28.0.62]) by ietf.org (8.9.1a/8.9.1a) with ESMTP id BAA18719 for <idn-archive@lists.ietf.org>; Thu, 3 Mar 2005 01:41:48 -0500 (EST)
Received: from majordom by psg.com with local (Exim 4.44 (FreeBSD)) id 1D6jtR-000GSX-So for idn-data@psg.com; Thu, 03 Mar 2005 06:33:05 +0000
Received: from [128.32.132.165] (helo=nicemice.net) by psg.com with esmtps (TLSv1:DES-CBC3-SHA:168) (Exim 4.44 (FreeBSD)) id 1D6jtO-000GS1-M1 for idn@ops.ietf.org; Thu, 03 Mar 2005 06:33:02 +0000
Received: from amc by nicemice.net with local (Exim 3.35 #1 (Debian)) id 1D6jtL-0000lt-00 for <idn@ops.ietf.org>; Wed, 02 Mar 2005 22:32:59 -0800
Date: Thu, 03 Mar 2005 06:32:58 +0000
From: "Adam M. Costello" <idn.amc+0@nicemice.net.RemoveThisWord.cnri.reston.va.us>
To: idn@ops.ietf.org
Subject: Re: [idn] Re: character tables
Message-ID: <20050303063258.GA2271~@nicemice.net>
Reply-To: IETF idn working group <idn@ops.ietf.org>
References: <421FCBD7.8000805@vanderpoel.org> <42227EBF.9040703@vanderpoel.org> <45781B7428C6AA07C3B283BD@scan.jck.com> <42229BBC.8020608@vanderpoel.org> <p0621021ebe484f52c0c5@[10.20.30.249]> <4225ABAB.60002@mozilla.org> <p0621022dbe4ab4b8a3fa@[10.20.30.249]> <42251B80.5050503@vanderpoel.org> <Pine.LNX.4.61.0503020759240.17184@nic.museum> <42261AC2.3020004@vanderpoel.org>
Mime-Version: 1.0
Content-Type: text/plain; charset="us-ascii"
Content-Disposition: inline
In-Reply-To: <42261AC2.3020004@vanderpoel.org>
User-Agent: Mutt/1.5.6+20040722i
X-Spam-Checker-Version: SpamAssassin 3.0.1 (2004-10-22) on psg.com
X-Spam-Status: No, score=-2.6 required=5.0 tests=BAYES_00 autolearn=ham version=3.0.1
Sender: owner-idn@ops.ietf.org
Precedence: bulk

Erik van der Poel <erik@vanderpoel.org> wrote:

> Other communities have other needs. I've been told that some
> communities use a set of letters that are currently encoded in two
> different ranges of the Unicode space (e.g. Latin and Cyrillic).
> Today, my idea is that these communities can "occupy" their "own" part
> of the DNS space, for example a .tld or a .2ld.tld.  They can publish
> the rules that they enforce in their registries, and then the browsers
> can either allow any character sequence in those labels or check them
> to see if the rules were indeed followed.

I've also thought along these lines, but I rejected this approach.  The
domain hierarchy is ultimately based on delegation of naming authority,
and trying to use it for any other purpose will run into conflicting
constraints.  Suppose country X wants to support language Y, which is
used in many countries around the world.  Who would be the registry for
the Y domain, and how would you get worldwide agreement on that?  Would
country X be delegated a subdomain of Y?  Would registrants accept X.Y
as a legitimate country X domain, or would they demand to be in an X
top-level domain?  Would users of language Y not get annoyed at seeing Y
at the end of almost every domain name they use?  It's bad enough that
so many domains end in .com, imagine if they all ended in .com.lat (for
"Latin").

I still like the idea of allowing every TLD to have one synonym-TLD per
script, although we might need to recognize some scripts in addition to
the Unicode scripts, for example, the subset-of-(Latin-plus-Cyrillic)
script that you allude to.

AMC