RE: Pinyin

Lang Gérard <gerard.lang@insee.fr> Thu, 25 September 2008 06:31 UTC

Return-Path: <gerard.lang@insee.fr>
X-Original-To: ietf-languages@alvestrand.no
Delivered-To: ietf-languages@alvestrand.no
Received: from localhost (localhost [127.0.0.1]) by eikenes.alvestrand.no (Postfix) with ESMTP id CA30F39E476 for <ietf-languages@alvestrand.no>; Thu, 25 Sep 2008 08:31:47 +0200 (CEST)
X-Virus-Scanned: Debian amavisd-new at eikenes.alvestrand.no
Received: from eikenes.alvestrand.no ([127.0.0.1]) by localhost (eikenes.alvestrand.no [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id IAd4MJJFZmox for <ietf-languages@alvestrand.no>; Thu, 25 Sep 2008 08:31:46 +0200 (CEST)
X-Greylist: from auto-whitelisted by SQLgrey-1.6.8
Received: from pechora2.lax.icann.org (pechora2.icann.org [208.77.188.37]) by eikenes.alvestrand.no (Postfix) with ESMTPS id 6AB1439E40A for <ietf-languages@alvestrand.no>; Thu, 25 Sep 2008 08:31:46 +0200 (CEST)
Received: from hermes.insee.fr (hermes.insee.fr [194.254.38.66]) by pechora2.lax.icann.org (8.13.8/8.13.8) with ESMTP id m8P6Vs4e021247 for <ietf-languages@iana.org>; Wed, 24 Sep 2008 23:32:15 -0700
Received: from evariste.insee.fr (unknown [194.254.38.143]) by hermes.insee.fr (Insee Mail server) with ESMTP id CB4325DC027; Thu, 25 Sep 2008 08:31:53 +0200 (CEST)
Received: from localhost (unknown [127.0.0.1]) by evariste.insee.fr (Postfix) with ESMTP id AFA3579401F; Thu, 25 Sep 2008 08:31:53 +0200 (CEST)
X-Virus-Scanned: ClamAV version 0.93.3, clamav-milter version 0.93.3 on pechora2.lax.icann.org
X-Virus-Scanned: amavisd-new at insee.fr
Received: from evariste.insee.fr ([127.0.0.1]) by localhost (evariste.insee.fr [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id WZw2ZuJjL0cw; Thu, 25 Sep 2008 08:31:53 +0200 (CEST)
Received: from s90x2smtp.ad.insee.intra (unknown [194.254.38.144]) by evariste.insee.fr (Postfix) with ESMTP id 8061E79401E; Thu, 25 Sep 2008 08:31:53 +0200 (CEST)
Received: from S90X2HUB1.ad.insee.intra ([10.90.200.51]) by s90x2smtp.ad.insee.intra with Microsoft SMTPSVC(6.0.3790.3959); Thu, 25 Sep 2008 08:31:53 +0200
X-MimeOLE: Produced By Microsoft Exchange V6.5
Content-class: urn:content-classes:message
MIME-Version: 1.0
Content-Type: text/plain; charset="iso-8859-1"
Content-Transfer-Encoding: quoted-printable
Subject: RE: Pinyin
Date: Thu, 25 Sep 2008 08:31:53 +0200
Message-ID: <68723E6B2E0EDC4999504D17DDE8F94904AD07AA@S90X2HUB1.ad.insee.intra>
In-Reply-To: <DDB6DE6E9D27DD478AE6D1BBBB835795633BC6BE71@NA-EXMSG-C117.redmond.corp.microsoft.com>
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
Thread-Topic: Pinyin
thread-index: Ackehhh+0WThYeiwSmOJ+m4ss2ZElwAHgCZQAAwwlRA=
References: <83C5E5CB-FE27-47BA-A98F-F5003F586A64@evertype.com><006e01c91e68$9e4abce0$6801a8c0@oemcomputer><20080924172101.GU19886@mercury.ccil.org><6d99d1fd0809241042g44eba0e8q613989437a958ee@mail.gmail.com><20080924190502.GD11053@mercury.ccil.org><6d99d1fd0809241226k384e9a11h41ebae090bb1b8d6@mail.gmail.com><4D25F22093241741BC1D0EEBC2DBB1DA014C26B041@EX-SEA5-D.ant.amazon.com><6d99d1fd0809241250k18a51b12p7b13d313d3eb41a3@mail.gmail.com><4D25F22093241741BC1D0EEBC2DBB1DA014C26B0EF@EX-SEA5-D.ant.amazon.com><001201c91e86$12691fa0$6801a8c0@oemcomputer> <DDB6DE6E9D27DD478AE6D1BBBB835795633BC6BE71@NA-EXMSG-C117.redmond.corp.microsoft.com>
From: Lang Gérard <gerard.lang@insee.fr>
To: Peter Constable <petercon@microsoft.com>, ietf-languages@iana.org, Lang Gérard <gerard.lang@insee.fr>
X-OriginalArrivalTime: 25 Sep 2008 06:31:53.0665 (UTC) FILETIME=[66684B10:01C91ED8]
X-Virus-Status: Clean
X-Greylist: IP, sender and recipient auto-whitelisted, not delayed by milter-greylist-4.0 (pechora2.lax.icann.org [208.77.188.37]); Wed, 24 Sep 2008 23:32:15 -0700 (PDT)
X-BeenThere: ietf-languages@alvestrand.no
X-Mailman-Version: 2.1.9
Precedence: list
List-Id: IETF Language tag discussions <ietf-languages.alvestrand.no>
List-Unsubscribe: <http://www.alvestrand.no/mailman/listinfo/ietf-languages>, <mailto:ietf-languages-request@alvestrand.no?subject=unsubscribe>
List-Archive: <http://www.alvestrand.no/pipermail/ietf-languages>
List-Post: <mailto:ietf-languages@alvestrand.no>
List-Help: <mailto:ietf-languages-request@alvestrand.no?subject=help>
List-Subscribe: <http://www.alvestrand.no/mailman/listinfo/ietf-languages>, <mailto:ietf-languages-request@alvestrand.no?subject=subscribe>
X-List-Received-Date: Thu, 25 Sep 2008 06:31:47 -0000

Because we are tagging (almost) only with the letters of the Latin script, it is a complete illusion to think to encourage only "non-mnemonic" registration:
1-It would be a complete reversion of the past politic (even if some will try to deny this, it is sufficient to inspect the stock to be sure of this; maybe it is not written in the rules, but it was done so); and why would the future tagged languages not have the benefice of a "mnemonic" registration ?
2-Almost every string of Latin letters has or will have connotations with cultural, geographical, historical or political facts or ideas. So, the better is to choose a string whose most evident connotations are really intended, like a mnemonic link with some version of the name of the tagged language.
3-If you want a non-mnemonic, non-significant tagging, then use numeric-tagging. And the other advantage is that, much more that Latin script, numeric arab digital script is almost universal !
 4- In fact, even digital script can also have undesired connotations if you let them have partial significance..
For example, the social security identifier in France is a numeric-13 number, the fisrt one being 1 for men and 2 for wemen. Andd wemen organizations have protested because 1 is before 2. So, I proposed to choose a digitalization only using bits (0/1), so that the fist digit could be 0 for wemen and 1 for men, and so we have 0 before 1 in the natural order, but this was not accepted.
 But this choice that can give non-mnemonic, non-significant strings has a price: the strings are far longer with an "alphabet" having only 2 characters that with one having 10 characters or one having (more that) 26 characters.
Gérard LANG
 

-----Message d'origine-----
De : ietf-languages-bounces@alvestrand.no [mailto:ietf-languages-bounces@alvestrand.no] De la part de Peter Constable
Envoyé : jeudi 25 septembre 2008 02:20
À : ietf-languages@iana.org
Objet : RE: Pinyin

From: ietf-languages-bounces@alvestrand.no [mailto:ietf-languages-bounces@alvestrand.no] On Behalf Of Randy Presuhn

> A large percentage of the traffic on this list seems to be a direct 
> result of folks reading too much into the mnemonic value of subtags.

Indeed.


> Perhaps 4646-bis should encourage only non-mnemonic registrations, to 
> avoid repeated waste of time.  But that would be a discussion for 
> ltru@ietf.org, not here, and it's really too late to be opening up 
> that kind of discussion anyway.

Even if not spec'd in BCP 47, this kind of thing is also something that this list could adopt as informal policy -- assuming a consensus can be formed and maintained.



Peter
_______________________________________________
Ietf-languages mailing list
Ietf-languages@alvestrand.no
http://www.alvestrand.no/mailman/listinfo/ietf-languages