Re: [DNSOP] draft-liman-tld-names-04

Eric Brunner-Williams <ebw@abenaki.wabanaki.net> Wed, 24 November 2010 12:18 UTC

Return-Path: <ebw@abenaki.wabanaki.net>
X-Original-To: dnsop@core3.amsl.com
Delivered-To: dnsop@core3.amsl.com
Received: from localhost (localhost [127.0.0.1]) by core3.amsl.com (Postfix) with ESMTP id 188BD3A6A30 for <dnsop@core3.amsl.com>; Wed, 24 Nov 2010 04:18:39 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.682
X-Spam-Level:
X-Spam-Status: No, score=-2.682 tagged_above=-999 required=5 tests=[AWL=-0.083, BAYES_00=-2.599]
Received: from mail.ietf.org ([64.170.98.32]) by localhost (core3.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id Ezgv8Yo+wNzt for <dnsop@core3.amsl.com>; Wed, 24 Nov 2010 04:18:38 -0800 (PST)
Received: from abenaki.wabanaki.net (abenaki.wabanaki.net [65.99.1.133]) by core3.amsl.com (Postfix) with ESMTP id 11F2B3A6A38 for <dnsop@ietf.org>; Wed, 24 Nov 2010 04:18:37 -0800 (PST)
Received: from limpet.local (cpe-67-255-5-237.twcny.res.rr.com [67.255.5.237]) by abenaki.wabanaki.net (8.14.4/8.14.4) with ESMTP id oAOAsSjA012757 for <dnsop@ietf.org>; Wed, 24 Nov 2010 05:54:28 -0500 (EST) (envelope-from ebw@abenaki.wabanaki.net)
Message-ID: <4CED02CD.4090408@abenaki.wabanaki.net>
Date: Wed, 24 Nov 2010 07:19:25 -0500
From: Eric Brunner-Williams <ebw@abenaki.wabanaki.net>
Organization: wampumpeag
User-Agent: Mozilla/5.0 (Macintosh; U; Intel Mac OS X 10.5; en-US; rv:1.9.2.12) Gecko/20101027 Thunderbird/3.1.6
MIME-Version: 1.0
To: dnsop@ietf.org
References: <B35360B6-0DB9-49CB-B68E-09DFFFB1ACA0@icann.org> <31FCAB67-9E3E-4E2B-957F-1A1F628AA8FB@hopcount.ca> <20101117091928.GA30093@nic.fr> <4CE9E942.20906@dougbarton.us> <0E561274-43FE-4657-951E-74C8FF0FD307@hopcount.ca> <4CEC43DC.1060709@dougbarton.us> <E7796748-6880-4928-B96D-0024E27E98D5@hopcount.ca> <4CEC69C5.3040209@dougbarton.us> <A109DC85-2F26-4F52-A11D-EA9C26DF3597@frobbit.se>
In-Reply-To: <A109DC85-2F26-4F52-A11D-EA9C26DF3597@frobbit.se>
Content-Type: text/plain; charset="ISO-8859-1"; format="flowed"
Content-Transfer-Encoding: quoted-printable
Subject: Re: [DNSOP] draft-liman-tld-names-04
X-BeenThere: dnsop@ietf.org
X-Mailman-Version: 2.1.9
Precedence: list
List-Id: IETF DNSOP WG mailing list <dnsop.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/listinfo/dnsop>, <mailto:dnsop-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/dnsop>
List-Post: <mailto:dnsop@ietf.org>
List-Help: <mailto:dnsop-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/dnsop>, <mailto:dnsop-request@ietf.org?subject=subscribe>
X-List-Received-Date: Wed, 24 Nov 2010 12:18:39 -0000

On 11/24/10 5:50 AM, Patrik Fältström wrote:
> On 24 nov 2010, at 02.26, Doug Barton wrote:
>
>> no _technical_ reason that TLD labels should be all-alphabetic
>
>
> FWIW, when you display internationalized domain names, and mixed RTL and LTR contexts (overall, in a label etc), you can get "interesting" results when characters that have not directionality (like numbers) are displayed adjacent to punctuation.

digits in the U0030..U0039 (latin script digits), U0660..U669 (arabic 
script arabic-indic digits) and U06F0..U06F9 (arabic script eastern 
arabic-indic digits) have directionality properties. i know this was a 
casual error and paf is caffine-free slovenia at the moment.

> See http://stupid.domain.name/node/681 for an example.

the example shows that the bidi algorithm used, correctly, if the 
sequence of encoded values is "text", incorrectly, if the sequence of 
encoded values is a dns label, render a directionality property of "."

restated, "." has a property which is dependent upon the 
directionality of the proximal character(s), according to the authors 
of the idn bidi algorithm. this may come as a surprise to applications 
that assume that "." is a separator, semantically unaffected by the 
properties of adjacent character(s).

the net of the unicadette usage advice to the dns weenies is don't put 
digits in identifiers, um, don't put digits in identifiers composed 
from characters with a rtl directionality property, um, don't mix 
scripts with differing directionality properties, at least, not in 
sight of dots or children, not because these are not "proper words" in 
(some language), the point of view advanced by an important, but 
mistaken on this point, group of arabic script and arabic language 
(but not arabic script and farsi/urdu/... language) users, but because 
Ox2E in US-ASCII, a standard published in 1963, has a property unknown 
to the the authors of that sequence of standards.

one could summarize the unicadette's point of view that the choice of 
"." as a label separator was an error in 952 and that ken, mary and 
jake blew it, but the real blame should go to jon who blew it in 921 
(required reading for the no-digits advacates). we're all lucky that 
820's use of 0x2E has not yet been discovered by the unicadettes.

-e