RE: Single-letter names (was: Re: Update of RFC 2606 based on the recent ICANN changes?)

<michael.dillon@bt.com> Mon, 07 July 2008 16:25 UTC

Subject: RE: Single-letter names (was: Re: Update of RFC 2606 based on the recent ICANN changes?)
Date: Mon, 07 Jul 2008 17:25:28 +0100
Message-ID: <C0F2465B4F386241A58321C884AC7ECC0706DD8A@E03MVZ2-UKDY.domain1.systemhost.net>
In-Reply-To: <b789c2f00807041201k689e6f1dn4fc6647776d94fbc@mail.gmail.com>
From: michael.dillon@bt.com
To: ietf@ietf.org
Cc: idna-update@alvestrand.no

> Alphabetic scripts such as Latin mostly represent sounds used 
> to make up words. While one can certainly find some 
> legitimate single-character words (such as the article "a" or 
> the personal pronoun "i") 

And lest anyone think that this curiosity of single-character
words applies only to vowel sounds: in Russian, the Cyrillic
equivalents of the letters v, k and s are also single-letter
words.

> On the other hand, characters in ideographic scripts such as 
> Han are not mere sounds or glyphs; they represent one or more 
> concepts.

Some people might dispute that and say that they represent
syllables. Since the various Chinese dialects tend to have
monosyllabic words, almost every possible syllable also represents
a word or concept. However, many concepts in modern Chinese
dialects require multiple syllables to express them, and
therefore multiple characters to write them. So there isn't
really a one-to-one mapping between word, syllable and concept,
as many people suppose.

It would be more defensible to disallow single codepoint labels
where the code point represents a single consonant sound or a single
vowel sound. That still leaves a grey area of syllabic symbol systems
such as Hiragana, Inuit syllabics, etc. However, the number of people
affected by a rule on syllabics is small enough that one could
reasonably poll representatives of these language communities to see
if a rule prohibiting single-syllable TLDs would cause hardship.
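
To make the distinction concrete, here is a rough sketch of how such a
rule might be checked mechanically. It is only an illustration: the
function name, the three-way result and the grouping of scripts by
Unicode character-name prefix are my own assumptions, not anything
defined by ICANN or by an RFC, and a real rule would need a proper
script table rather than name prefixes.

```python
import unicodedata

# Assumed groupings, keyed on Unicode character-name prefixes.
# These lists are illustrative and far from complete.
ALPHABETIC = ("LATIN", "CYRILLIC", "GREEK")
SYLLABIC = ("HIRAGANA", "KATAKANA", "CANADIAN SYLLABICS")
IDEOGRAPHIC = ("CJK UNIFIED IDEOGRAPH",)


def classify_label(label: str) -> str:
    """Classify a label as alphabetic, syllabic, ideographic,
    multi-codepoint or unknown."""
    if len(label) != 1:  # len() counts code points in Python 3
        return "multi-codepoint"
    name = unicodedata.name(label, "")
    if name.startswith(ALPHABETIC):
        return "alphabetic"
    if name.startswith(SYLLABIC):
        return "syllabic"
    if name.startswith(IDEOGRAPHIC):
        return "ideographic"
    return "unknown"


def single_codepoint_policy(label: str) -> str:
    """Hypothetical policy: disallow single consonant or vowel letters,
    flag syllabics as a grey area, allow everything else."""
    kind = classify_label(label)
    if kind == "alphabetic":
        return "disallow"
    if kind == "syllabic":
        return "grey area"
    return "allow"


if __name__ == "__main__":
    # Latin a, Cyrillic ve, Hiragana a, Canadian syllabics i, Han "middle"
    for label in ["a", "\u0432", "\u3042", "\u1403", "\u4e2d", "to"]:
        print(ascii(label), classify_label(label), single_codepoint_policy(label))
```

Note that a two-letter ASCII label such as "to" passes untouched, since
the sketch only looks at labels consisting of one code point.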

Note that, when ASCII characters are used, the current system allows
both single-syllable TLDs such as .to and single-ideograph TLDs such
as .sing. Or, if you want to include tones, then .sing4 would be a
single ideographic codepoint. I think that it would be a good thing to
update RFC 2606 to collect the various arguments and reasoning so that
the ICANN experts have some guidance to work from. If we can't deal
with all the corner cases in an updated RFC, then at least the ICANN
experts will have a point of reference from which to depart, or not.

--Michael Dillon