[idn] Re: character tables

Cary Karp <ck@nic.museum> Wed, 02 March 2005 16:11 UTC

Received: from psg.com (mailnull@psg.com [147.28.0.62]) by ietf.org (8.9.1a/8.9.1a) with ESMTP id LAA27264 for <idn-archive@lists.ietf.org>; Wed, 2 Mar 2005 11:11:20 -0500 (EST)
Received: from majordom by psg.com with local (Exim 4.44 (FreeBSD)) id 1D6WMe-000CMX-8K for idn-data@psg.com; Wed, 02 Mar 2005 16:06:20 +0000
Received: from [130.242.24.5] (helo=nic.museum) by psg.com with esmtp (Exim 4.44 (FreeBSD)) id 1D6WMa-000CLq-HS for idn@ops.ietf.org; Wed, 02 Mar 2005 16:06:16 +0000
Received: from nic.museum (nic.museum [127.0.0.1]) by nic.museum (8.13.1/8.13.1) with ESMTP id j22G6jhT026147; Wed, 2 Mar 2005 17:06:45 +0100
Received: from localhost (ck@localhost) by nic.museum (8.13.1/8.13.1/Submit) with ESMTP id j22G6ion026144; Wed, 2 Mar 2005 17:06:44 +0100
Date: Wed, 02 Mar 2005 17:06:44 +0100
From: Cary Karp <ck@nic.museum>
To: idn@ops.ietf.org
Subject: [idn] Re: character tables
In-Reply-To: <42251B80.5050503@vanderpoel.org>
Message-ID: <Pine.LNX.4.61.0503020759240.17184@nic.museum>
References: <421B8484.3070802@vanderpoel.org> <20050223072837.GA21463~@nicemice.net> <D872CCF059514053ECF8A198@scan.jck.com> <421D8411.9030006@vanderpoel.org> <p06210208be4390618c81@[192.168.0.101]> <421E0D0C.2000309@vanderpoel.org> <p06210202be43c3888991@[192.168.0.101]> <E07CE813AD23B2D95DA0C740@scan.jck.com> <421E30F2.1040408@vanderpoel.org> <0E7F74C71945B923C52211F3@scan.jck.com> <421EA0C9.1010500@vanderpoel.org> <00a401c51af3$7863aae0$030aa8c0@DEWELL> <A574CA1BE87BFDA3C2A1AC0E@scan.jck.com> <421FA55B.9000308@vanderpoel.org> <421FCBD7.8000805@vanderpoel.org> <42227EBF.9040703@vanderpoel.org> <45781B7428C6AA07C3B283BD@scan.jck.com> <42229BBC.8020608@vanderpoel.org> <p0621021ebe484f52c0c5@[10.20.30.249]> <4225ABAB.60002@mozilla.org> <p0621022dbe4ab4b8a3fa@[10.20.30.249]> <42251B80.5050503@vanderpoel.org>
MIME-Version: 1.0
Content-Type: TEXT/PLAIN; charset="US-ASCII"; format="flowed"
X-Spam-Checker-Version: SpamAssassin 3.0.1 (2004-10-22) on psg.com
X-Spam-Status: No, score=-2.6 required=5.0 tests=BAYES_00 autolearn=ham version=3.0.1
Sender: owner-idn@ops.ietf.org
Precedence: bulk

Quoting Erik van der Poel:

> I note that this particular conversation is between a browser 
> developer (Gervase) and one of the IDNA authors (Paul), neither of 
> which is a registry representative, so why exactly are you 2 
> having this conversation? :-)
>
> Sorry, I'm half joking. Half, because you two have every right to 
> discuss whatever you wish. The other half because I believe 
> browser developers can afford to focus more on their end of 
> things.

Under the assumption that the discussion might be furthered by a 
registry rep describing that end of things --

I'm responsible for the development and maintenance of the policies 
for .museum. This is a sponsored gTLD (sTLD), which means that there 
are eligibility requirements for name holders, and restrictions on 
the way names may be structured and used. The policies specific to 
IDN are stated in detail at http://about.museum/idn/idnpolicy.html, 
which includes links to both the general policy statement and the 
listing of permitted characters and code points. Although 
prospective name holders are unlikely to read the fine print, the 
detailed statement provides an unequivocal reference in the 
case-to-case discussions about policy instantiation that we 
frequently conduct with individual museums. That dialogue provides 
an effective barrier to deliberate abuse of the .museum namespace 
(which we are contractually obligated to prevent) and is also a 
safeguard against inadvertent homographic confusion.

There are significant differences in the operation of an 
unrestricted gTLD (uTLD), where there may be a far greater volume of 
registration traffic, and direct contact between the registry and 
registrant is not a part of the registration process. In this 
situation, such things as restriction on IDN registration need to be 
fully automated, with sentient review being a remedial device when 
it is noted that something the algorithmic filter is intended to 
stop has nonetheless passed through.

The gTLD registries are as concerned by and with the present 
homograph alert as is any other group attempting to curb the risk 
for such damage. In addition to the range of operating conditions 
implied above, there are differences in the way the registries have 
interpreted and implemented ICANN's IDN Guidelines. That document 
states, "As the deployment of IDNs proceeds, ICANN and the IDN 
registries will review these Guidelines at regular intervals, and 
revise them as necessary based on experience." We are now clearly at 
such a juncture, and the revision process is already being 
initiated. Reducing the latitude for interpretation of the 
Guidelines will bring the registries one important step closer to 
being able to establish a best practice that can bring an end to the 
concern about point-of-registration control that is currently being 
expressed.

The development of these best practices would be further abetted by 
the candidate IDN repertoire being purged of the graphic symbols and 
other signs that are not needed in a naming system (using that term 
in a sense that a linguist might regard as reasonable) but clearly 
exacerbate the risk for malicious exploitation. There is similar 
utility in freeing IDNA from its current lock to Unicode 3.2. Since 
both of these concerns can only be addressed through revision of 
nameprep and stringprep, that action is of priority concern to the 
registries.

There is more to be said about gTLD perspectives on the role IDN 
plays in the communities that we serve, but I'll save that for 
separate communication.

/Cary