RE: Pinyin

"Tracey, Niall" <niall.tracey@logica.com> Fri, 26 September 2008 09:50 UTC

Return-Path: <niall.tracey@logica.com>
X-Original-To: ietf-languages@alvestrand.no
Delivered-To: ietf-languages@alvestrand.no
Received: from localhost (localhost [127.0.0.1]) by eikenes.alvestrand.no (Postfix) with ESMTP id BA43839E47A for <ietf-languages@alvestrand.no>; Fri, 26 Sep 2008 11:50:16 +0200 (CEST)
X-Virus-Scanned: Debian amavisd-new at eikenes.alvestrand.no
Received: from eikenes.alvestrand.no ([127.0.0.1]) by localhost (eikenes.alvestrand.no [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id oRyBG9kcUF8H for <ietf-languages@alvestrand.no>; Fri, 26 Sep 2008 11:50:16 +0200 (CEST)
X-Greylist: from auto-whitelisted by SQLgrey-1.6.8
Received: from pechora4.lax.icann.org (pechora4.icann.org [208.77.188.39]) by eikenes.alvestrand.no (Postfix) with ESMTPS id 788DE39E46F for <ietf-languages@alvestrand.no>; Fri, 26 Sep 2008 11:50:15 +0200 (CEST)
Received: from WA4EHSOBE006.bigfish.com (outbound-wa4.frontbridge.com [216.32.181.16]) by pechora4.lax.icann.org (8.13.8/8.13.8) with ESMTP id m8Q9oPWm014863 for <ietf-languages@iana.org>; Fri, 26 Sep 2008 02:50:45 -0700
Received: from mail17-wa4-R.bigfish.com (10.8.14.244) by WA4EHSOBE006.bigfish.com (10.8.40.26) with Microsoft SMTP Server id 8.1.240.5; Fri, 26 Sep 2008 09:50:24 +0000
Received: from mail17-wa4 (localhost.localdomain [127.0.0.1]) by mail17-wa4-R.bigfish.com (Postfix) with ESMTP id 8D4DE7701EA; Fri, 26 Sep 2008 09:50:24 +0000 (UTC)
X-BigFish: VPS2(zzzzzzz2fh6bh43j61h)
X-Spam-TCS-SCL: 0:0
X-FB-SS: 5,
Received: by mail17-wa4 (MessageSwitch) id 1222422622892097_13096; Fri, 26 Sep 2008 09:50:22 +0000 (UCT)
Received: from c1-ex002.groupinfra.com (unknown [213.86.115.109]) by mail17-wa4.bigfish.com (Postfix) with ESMTP id 944E21948058; Fri, 26 Sep 2008 09:50:22 +0000 (UTC)
Received: from uk-ex003.groupinfra.com ([158.234.38.247]) by c1-ex002.groupinfra.com with Microsoft SMTPSVC(6.0.3790.1830); Fri, 26 Sep 2008 10:50:19 +0100
Received: from uk-ex007.groupinfra.com ([158.234.24.97]) by uk-ex003.groupinfra.com with Microsoft SMTPSVC(6.0.3790.3959); Fri, 26 Sep 2008 10:50:19 +0100
X-MimeOLE: Produced By Microsoft Exchange V6.5
Content-Class: urn:content-classes:message
MIME-Version: 1.0
Content-Type: text/plain; charset="iso-8859-1"
Content-Transfer-Encoding: quoted-printable
Subject: RE: Pinyin
Date: Fri, 26 Sep 2008 10:51:40 +0100
Message-ID: <A07ECA086F63AC488A70CCC5DB1CEF780B951CBB@uk-ex007.groupinfra.com>
In-Reply-To: <DDB6DE6E9D27DD478AE6D1BBBB835795633BC6C2EC@NA-EXMSG-C117.redmond.corp.microsoft.com>
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
Thread-Topic: Pinyin
Thread-Index: AckfM3bOKVcM/WVbTw+QF5QVc5/K4QAGTkYgABu58iA=
References: <mailman.5976.1222283002.6324.ietf-languages@alvestrand.no><7B1C8ACAE1994C49B8A417F457B32083@DGBP7M81><000601c91eb6$274cde40$6801a8c0@oemcomputer><20080925030358.GD30848@mercury.ccil.org><001c01c91ec8$3e3dcda0$6801a8c0@oemcomputer><DDB6DE6E9D27DD478AE6D1BBBB835795633BC6C00F@NA-EXMSG-C117.redmond.corp.microsoft.com><003a01c91f33$722b4000$6801a8c0@oemcomputer> <DDB6DE6E9D27DD478AE6D1BBBB835795633BC6C2EC@NA-EXMSG-C117.redmond.corp.microsoft.com>
From: "Tracey, Niall" <niall.tracey@logica.com>
To: Peter Constable <petercon@microsoft.com>, Randy Presuhn <randy_presuhn@mindspring.com>, ietf-languages@iana.org
X-OriginalArrivalTime: 26 Sep 2008 09:50:19.0324 (UTC) FILETIME=[49256FC0:01C91FBD]
X-Virus-Scanned: ClamAV 0.93.3/8342/Fri Sep 26 00:48:35 2008 on pechora4.lax.icann.org
X-Virus-Status: Clean
X-Greylist: IP, sender and recipient auto-whitelisted, not delayed by milter-greylist-4.0 (pechora4.lax.icann.org [208.77.188.39]); Fri, 26 Sep 2008 02:50:45 -0700 (PDT)
X-BeenThere: ietf-languages@alvestrand.no
X-Mailman-Version: 2.1.9
Precedence: list
List-Id: IETF Language tag discussions <ietf-languages.alvestrand.no>
List-Unsubscribe: <http://www.alvestrand.no/mailman/listinfo/ietf-languages>, <mailto:ietf-languages-request@alvestrand.no?subject=unsubscribe>
List-Archive: <http://www.alvestrand.no/pipermail/ietf-languages>
List-Post: <mailto:ietf-languages@alvestrand.no>
List-Help: <mailto:ietf-languages-request@alvestrand.no?subject=help>
List-Subscribe: <http://www.alvestrand.no/mailman/listinfo/ietf-languages>, <mailto:ietf-languages-request@alvestrand.no?subject=subscribe>
X-List-Received-Date: Fri, 26 Sep 2008 09:50:16 -0000

After my last post, I've realised there's another technical/design concern to be addressed:

Most Chinese computers use pinyin as their main textual input method, correct? If computers are capable of converting pinyin to Chinese script, then we can assume that a lot of systems designers will chose to display content tagged as Hanyu pinyin in the Chinese script by default, as it is generally easier for a native to read that way.

If the pinyin is going to be converted to Chinese script, then the uniformity of the phonetic conventions across the various pinyins should not be our main concern -- it should be the consistency of the mapping of pinyin<->script. Here we suddenly have to concern ourselves not only with minor variations in conventions, but also with quite significant variations in the form and pronunciation of words.

The original request was for a tag for Hanyu Pinyin, and anything marked with a tag unambiguously representing Hanyu Pinyin should be able to be mapped to Madarin in the Chinese script, with the exception of typos, misspellings and the odd non-Mandarin name. As I understand it, at present we are not able to mandate a prefix that would give us that unambiguous definition, so introducing a generic "-pinyin" would introduce ambiguity.

So as far as I can see, the only computer-friendly option is to have a tag specifically for Hanyu Pinyin, whatever name you choose to give it.

Níall.

This e-mail and any attachment is for authorised use by the intended recipient(s) only. It may contain proprietary material, confidential information and/or be subject to legal privilege. It should not be copied, disclosed to, retained or used by, any other party. If you are not an intended recipient then please promptly delete this e-mail and any attachment and all copies and inform the sender. Thank you.