RE: Pinyin

Mark Crispin <markrcrispin@live.com> Fri, 26 September 2008 15:24 UTC

Return-Path: <markrcrispin@live.com>
X-Original-To: ietf-languages@alvestrand.no
Delivered-To: ietf-languages@alvestrand.no
Received: from localhost (localhost [127.0.0.1]) by eikenes.alvestrand.no (Postfix) with ESMTP id BC93139E47C for <ietf-languages@alvestrand.no>; Fri, 26 Sep 2008 17:24:28 +0200 (CEST)
X-Virus-Scanned: Debian amavisd-new at eikenes.alvestrand.no
Received: from eikenes.alvestrand.no ([127.0.0.1]) by localhost (eikenes.alvestrand.no [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id BngnjG1GbI7d for <ietf-languages@alvestrand.no>; Fri, 26 Sep 2008 17:24:28 +0200 (CEST)
X-Greylist: domain auto-whitelisted by SQLgrey-1.6.8
Received: from pechora2.lax.icann.org (pechora2.icann.org [208.77.188.37]) by eikenes.alvestrand.no (Postfix) with ESMTPS id 9849939E46F for <ietf-languages@alvestrand.no>; Fri, 26 Sep 2008 17:24:27 +0200 (CEST)
Received: from blu0-omc2-s1.blu0.hotmail.com (blu0-omc2-s1.blu0.hotmail.com [65.55.111.76]) by pechora2.lax.icann.org (8.13.8/8.13.8) with ESMTP id m8QFOb0G008296 for <ietf-languages@iana.org>; Fri, 26 Sep 2008 08:24:57 -0700
Received: from BLU126-W2 ([65.55.111.71]) by blu0-omc2-s1.blu0.hotmail.com with Microsoft SMTPSVC(6.0.3790.3959); Fri, 26 Sep 2008 08:24:35 -0700
Message-ID: <BLU126-W248B4263EC518F14E0845B8470@phx.gbl>
X-Originating-IP: [206.124.149.114]
From: Mark Crispin <markrcrispin@live.com>
To: "Tracey, Niall" <niall.tracey@logica.com>, Peter Constable <petercon@microsoft.com>, Randy Presuhn <randy_presuhn@mindspring.com>, ietf-languages@iana.org
Subject: RE: Pinyin
Date: Fri, 26 Sep 2008 08:24:35 -0700
Importance: Normal
In-Reply-To: <A07ECA086F63AC488A70CCC5DB1CEF780B951CBB@uk-ex007.groupinfra.com>
References: <mailman.5976.1222283002.6324.ietf-languages@alvestrand.no><7B1C8ACAE1994C49B8A417F457B32083@DGBP7M81><000601c91eb6$274cde40$6801a8c0@oemcomputer><20080925030358.GD30848@mercury.ccil.org><001c01c91ec8$3e3dcda0$6801a8c0@oemcomputer><DDB6DE6E9D27DD478AE6D1BBBB835795633BC6C00F@NA-EXMSG-C117.redmond.corp.microsoft.com><003a01c91f33$722b4000$6801a8c0@oemcomputer> <DDB6DE6E9D27DD478AE6D1BBBB835795633BC6C2EC@NA-EXMSG-C117.redmond.corp.microsoft.com> <A07ECA086F63AC488A70CCC5DB1CEF780B951CBB@uk-ex007.groupinfra.com>
Content-Type: text/plain; charset="iso-8859-1"
Content-Transfer-Encoding: quoted-printable
MIME-Version: 1.0
X-OriginalArrivalTime: 26 Sep 2008 15:24:35.0403 (UTC) FILETIME=[FB8039B0:01C91FEB]
X-Virus-Scanned: ClamAV version 0.93.3, clamav-milter version 0.93.3 on pechora2.lax.icann.org
X-Virus-Status: Clean
X-Greylist: IP, sender and recipient auto-whitelisted, not delayed by milter-greylist-4.0 (pechora2.lax.icann.org [208.77.188.37]); Fri, 26 Sep 2008 08:24:57 -0700 (PDT)
X-BeenThere: ietf-languages@alvestrand.no
X-Mailman-Version: 2.1.9
Precedence: list
List-Id: IETF Language tag discussions <ietf-languages.alvestrand.no>
List-Unsubscribe: <http://www.alvestrand.no/mailman/listinfo/ietf-languages>, <mailto:ietf-languages-request@alvestrand.no?subject=unsubscribe>
List-Archive: <http://www.alvestrand.no/pipermail/ietf-languages>
List-Post: <mailto:ietf-languages@alvestrand.no>
List-Help: <mailto:ietf-languages-request@alvestrand.no?subject=help>
List-Subscribe: <http://www.alvestrand.no/mailman/listinfo/ietf-languages>, <mailto:ietf-languages-request@alvestrand.no?subject=subscribe>
X-List-Received-Date: Fri, 26 Sep 2008 15:24:28 -0000


> Most Chinese computers use pinyin as their main textual input
> method, correct? If computers are capable of converting pinyin
> to Chinese script, then we can assume that a lot of systems
> designers will chose to display content tagged as Hanyu pinyin
> in the Chinese script by default, as it is generally easier for a
> native to read that way.

That idea sounds attractive, but in real life it does not work well.

The conversion of Latin script to Han characters involves an
intelligent human interacting with the conversion software to
choose the correct Han character from multiple choices.

Some software attempts to make intelligent decisions based upon
context.  There is considerable controversy as to whether that
intelligence is helpful or harmful.

The bottom line is that it is not safe to assume that any automated
process can convert Hanyu pinyin into Han without human
intervention.
_________________________________________________________________
See how Windows connects the people, information, and fun that are part of your life.
http://clk.atdmt.com/MRT/go/msnnkwxp1020093175mrt/direct/01/