Re: [DNSOP] draft-liman-tld-names-04

Masataka Ohta <mohta@necom830.hpcl.titech.ac.jp> Thu, 25 November 2010 01:22 UTC

Return-Path: <mohta@necom830.hpcl.titech.ac.jp>
X-Original-To: dnsop@core3.amsl.com
Delivered-To: dnsop@core3.amsl.com
Received: from localhost (localhost [127.0.0.1]) by core3.amsl.com (Postfix) with ESMTP id 059E73A6AA2 for <dnsop@core3.amsl.com>; Wed, 24 Nov 2010 17:22:54 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: 1.87
X-Spam-Level: *
X-Spam-Status: No, score=1.87 tagged_above=-999 required=5 tests=[BAYES_00=-2.599, HELO_EQ_JP=1.244, HOST_EQ_JP=1.265, RCVD_IN_BL_SPAMCOP_NET=1.96]
Received: from mail.ietf.org ([64.170.98.32]) by localhost (core3.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id aWoQ+zEOFfYq for <dnsop@core3.amsl.com>; Wed, 24 Nov 2010 17:22:52 -0800 (PST)
Received: from necom830.hpcl.titech.ac.jp (necom830.hpcl.titech.ac.jp [131.112.32.132]) by core3.amsl.com (Postfix) with SMTP id 5F6773A6A9C for <dnsop@ietf.org>; Wed, 24 Nov 2010 17:22:52 -0800 (PST)
Received: (qmail 85314 invoked from network); 25 Nov 2010 01:59:47 -0000
Received: from ccnetptb005.nns.ne.jp (HELO ?172.16.0.18?) (61.193.140.5) by necom830.hpcl.titech.ac.jp with SMTP; 25 Nov 2010 01:59:47 -0000
Message-ID: <4CEDBA88.6060306@necom830.hpcl.titech.ac.jp>
Date: Thu, 25 Nov 2010 10:23:20 +0900
From: Masataka Ohta <mohta@necom830.hpcl.titech.ac.jp>
User-Agent: Mozilla/5.0 (Windows; U; Windows NT 5.1; ja; rv:1.9.2.12) Gecko/20101027 Thunderbird/3.1.6
MIME-Version: 1.0
To: dnsop@ietf.org
References: <B35360B6-0DB9-49CB-B68E-09DFFFB1ACA0@icann.org> <31FCAB67-9E3E-4E2B-957F-1A1F628AA8FB@hopcount.ca> <20101117091928.GA30093@nic.fr> <4CE9E942.20906@dougbarton.us> <0E561274-43FE-4657-951E-74C8FF0FD307@hopcount.ca> <4CEC43DC.1060709@dougbarton.us> <E7796748-6880-4928-B96D-0024E27E98D5@hopcount.ca> <4CEC69C5.3040209@dougbarton.us> <A109DC85-2F26-4F52-A11D-EA9C26DF3597@frobbit.se> <4CED02CD.4090408@abenaki.wabanaki.net>
In-Reply-To: <4CED02CD.4090408@abenaki.wabanaki.net>
Content-Type: text/plain; charset="ISO-2022-JP"
Content-Transfer-Encoding: 7bit
Subject: Re: [DNSOP] draft-liman-tld-names-04
X-BeenThere: dnsop@ietf.org
X-Mailman-Version: 2.1.9
Precedence: list
List-Id: IETF DNSOP WG mailing list <dnsop.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/listinfo/dnsop>, <mailto:dnsop-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/dnsop>
List-Post: <mailto:dnsop@ietf.org>
List-Help: <mailto:dnsop-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/dnsop>, <mailto:dnsop-request@ietf.org?subject=subscribe>
X-List-Received-Date: Thu, 25 Nov 2010 01:22:54 -0000

Eric Brunner-Williams wrote:

>>> no _technical_ reason that TLD labels should be all-alphabetic

>> FWIW, when you display internationalized domain names, and mixed RTL 
>> and LTR contexts (overall, in a label etc), you can get "interesting" 
>> results when characters that have not directionality (like numbers) 
>> are displayed adjacent to punctuation.

>> See http://stupid.domain.name/node/681 for an example.

> the example shows that the bidi algorithm used, correctly, if the 
> sequence of encoded values is "text", incorrectly, if the sequence of 
> encoded values is a dns label, render a directionality property of "."

The only way to handle bidi with *PLAIN* text, which is what
we use in UNIX command lines and DNS names, is to do it with
finite state.

Otherwise, you can't search, because search with push down
automaton is prohibitively inefficient, which means nested
directionality can not be supported.

I warned so about 20 years ago.

Then, bidi support with plain text is just simple. Characters
do not have any bidi property and are displayed right to left
in RTL lines and left to right in LRT lines (words are often
spelled backward).

However, as Unicode plain text includes control characters for
nested  bidi, and is not plain, its bidi support specification
is totally confused and is unusable to handle real plain text.

We can't move forward until we have character encoding
with rational support for bidi.

						Masataka Ohta