Re: Pinyin

Michael Everson <everson@evertype.com> Fri, 26 September 2008 11:12 UTC

Message-Id: <899D1E82-2633-4FC2-AE5B-788259AC013D@evertype.com>
From: Michael Everson <everson@evertype.com>
To: ietflang IETF Languages Discussion <ietf-languages@iana.org>
In-Reply-To: <A07ECA086F63AC488A70CCC5DB1CEF780B951CBB@uk-ex007.groupinfra.com>
Subject: Re: Pinyin
Date: Fri, 26 Sep 2008 12:12:53 +0100
References: <mailman.5976.1222283002.6324.ietf-languages@alvestrand.no><7B1C8ACAE1994C49B8A417F457B32083@DGBP7M81><000601c91eb6$274cde40$6801a8c0@oemcomputer><20080925030358.GD30848@mercury.ccil.org><001c01c91ec8$3e3dcda0$6801a8c0@oemcomputer><DDB6DE6E9D27DD478AE6D1BBBB835795633BC6C00F@NA-EXMSG-C117.redmond.corp.microsoft.com><003a01c91f33$722b4000$6801a8c0@oemcomputer> <DDB6DE6E9D27DD478AE6D1BBBB835795633BC6C2EC@NA-EXMSG-C117.redmond.corp.microsoft.com> <A07ECA086F63AC488A70CCC5DB1CEF780B951CBB@uk-ex007.groupinfra.com>

On 26 Sep 2008, at 10:51, Tracey, Niall wrote:

> After my last post, I've realised there's another technical/design  
> concern to be addressed:
>
> Most Chinese computers use pinyin as their main textual input  
> method, correct?

No; a variety of input methods are in current use. One is
stroke-based.

> If the pinyin is going to be converted to Chinese script.

It is impossible to do this. Even if you write the tones (and there
are options as to how to do that), a given pinyin syllable (say, yín or
its equivalent yin2) can refer to one of five different characters
(taken from a small 600-page dictionary). And if the tone isn't
marked, there are 18 different possible characters for "yin".
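
To make the ambiguity concrete, here is a minimal Python sketch. The table SAMPLE_READINGS and the function candidates() are hypothetical names invented for illustration, and the characters listed are only a small hand-picked sample of common readings, not the contents of the dictionary cited above, so the counts for the untoned case will not match the figure of 18.

```python
# Illustrative sample only: a few common characters per reading of "yin".
# This is an assumption for demonstration, not the dictionary cited above.
SAMPLE_READINGS = {
    "yin1": ["因", "音", "阴"],
    "yin2": ["银", "吟", "淫", "寅", "龈"],
    "yin3": ["引", "饮", "隐"],
    "yin4": ["印"],
}


def candidates(syllable):
    """Return every sample character the given pinyin syllable could stand for.

    A syllable ending in a tone digit (e.g. "yin2") is looked up directly.
    A syllable without a tone digit (e.g. "yin") matches all of its tones.
    """
    if syllable[-1:].isdigit():
        return list(SAMPLE_READINGS.get(syllable, []))
    found = []
    for key, chars in SAMPLE_READINGS.items():
        # Strip the trailing tone digit from the key before comparing.
        if key[:-1] == syllable:
            found.extend(chars)
    return found


# Even with the tone written, one syllable maps to several characters.
print(candidates("yin2"))
# Without the tone, the candidate set grows further.
print(candidates("yin"))
```

Even in this tiny sample, the lookup returns a list rather than a single character, so choosing among the candidates needs context or user selection; the syllable alone does not determine the character.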

Michael Everson * http://www.evertype.com