Re: [Ltru] Consensus call: extlang

"Phillips, Addison" <addison@amazon.com> Wed, 28 May 2008 16:41 UTC

Return-Path: <ltru-bounces@ietf.org>
X-Original-To: ltru-archive@megatron.ietf.org
Delivered-To: ietfarch-ltru-archive@core3.amsl.com
Received: from [127.0.0.1] (localhost [127.0.0.1]) by core3.amsl.com (Postfix) with ESMTP id F1F373A6C8E; Wed, 28 May 2008 09:41:44 -0700 (PDT)
X-Original-To: ltru@core3.amsl.com
Delivered-To: ltru@core3.amsl.com
Received: from localhost (localhost [127.0.0.1]) by core3.amsl.com (Postfix) with ESMTP id 293B03A6AE4 for <ltru@core3.amsl.com>; Wed, 28 May 2008 09:41:44 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -106.599
X-Spam-Level:
X-Spam-Status: No, score=-106.599 tagged_above=-999 required=5 tests=[BAYES_00=-2.599, RCVD_IN_DNSWL_MED=-4, USER_IN_WHITELIST=-100]
Received: from mail.ietf.org ([64.170.98.32]) by localhost (core3.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id GK0X3hAQN90g for <ltru@core3.amsl.com>; Wed, 28 May 2008 09:41:43 -0700 (PDT)
Received: from smtp-fw-6101.amazon.com (smtp-fw-6101.amazon.com [72.21.208.25]) by core3.amsl.com (Postfix) with ESMTP id E567E28C0CF for <ltru@ietf.org>; Wed, 28 May 2008 09:41:42 -0700 (PDT)
X-IronPort-AV: E=Sophos;i="4.27,556,1204502400"; d="scan'208";a="316091712"
Received: from smtp-in-1105.vdc.amazon.com ([10.140.9.24]) by smtp-border-fw-out-6101.iad6.amazon.com with ESMTP/TLS/DHE-RSA-AES256-SHA; 28 May 2008 16:41:48 +0000
Received: from ex-hub-4103.ant.amazon.com (ex-hub-4103.sea5.amazon.com [10.248.163.24]) by smtp-in-1105.vdc.amazon.com (8.12.11/8.12.11) with ESMTP id m4SGflWl011859 (version=TLSv1/SSLv3 cipher=RC4-MD5 bits=128 verify=FAIL); Wed, 28 May 2008 16:41:48 GMT
Received: from EX-SEA5-D.ant.amazon.com ([10.248.163.28]) by ex-hub-4103.ant.amazon.com ([10.248.163.24]) with mapi; Wed, 28 May 2008 09:41:48 -0700
From: "Phillips, Addison" <addison@amazon.com>
To: Peter Constable <petercon@microsoft.com>, LTRU Working Group <ltru@ietf.org>
Date: Wed, 28 May 2008 09:41:46 -0700
Thread-Topic: [Ltru] Consensus call: extlang
Thread-Index: Aci/pjqrriDDSRZfTWWe42oHcWg9DQAnJslAACVkLKAAAh658A==
Message-ID: <4D25F22093241741BC1D0EEBC2DBB1DA013A766B5C@EX-SEA5-D.ant.amazon.com>
References: <01c301c8bbe5$8c2810c0$6801a8c0@oemcomputer> <008a01c8bedc$72b97b20$6801a8c0@oemcomputer> <30b660a20805252132g28ff50b0kd5b04d6f47ca35d2@mail.gmail.com> <002001c8bef3$e0497520$6801a8c0@oemcomputer> <30b660a20805262003j21fff6c4tf20d59be11f28633@mail.gmail.com> <E19FDBD7A3A7F04788F00E90915BD36C13C251B2E2@USSDIXMSG20.spe.sony.com> <DDB6DE6E9D27DD478AE6D1BBBB835795633304E1F5@NA-EXMSG-C117.redmond.corp.microsoft.com>
In-Reply-To: <DDB6DE6E9D27DD478AE6D1BBBB835795633304E1F5@NA-EXMSG-C117.redmond.corp.microsoft.com>
Accept-Language: en-US
Content-Language: en-US
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
acceptlanguage: en-US
MIME-Version: 1.0
Subject: Re: [Ltru] Consensus call: extlang
X-BeenThere: ltru@ietf.org
X-Mailman-Version: 2.1.9
Precedence: list
List-Id: Language Tag Registry Update working group discussion list <ltru.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/listinfo/ltru>, <mailto:ltru-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/pipermail/ltru>
List-Post: <mailto:ltru@ietf.org>
List-Help: <mailto:ltru-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/ltru>, <mailto:ltru-request@ietf.org?subject=subscribe>
Content-Type: text/plain; charset="us-ascii"
Content-Transfer-Encoding: 7bit
Sender: ltru-bounces@ietf.org
Errors-To: ltru-bounces@ietf.org

Peter wrote:

> Rather than filtering, it's *lookup* for which a request for zh-cmn
> would match existing zh content. But I think it's fair to say that
> lookup is most typically used for UI resource matching or similar
> scenarios in which the options for available content are likely to be
> fairly limited and controlled, with users being presented with the set
> of specific options available. In that kind of controlled environment,
> fallback from (zh-)cmn to zh is less likely to be needed.
>

In fact, in my experiments at least, lookup requires some modification in order to produce the best results when working with extlang. For my applications, at least, it seems that treating the primary-extlang pair as "atomic" for at least the first pass makes the most sense. That is, the fallback pattern works better when it goes like this:

zh-cmn-Hans-HK
zh-cmn-Hans
zh-cmn
zh-Hans-HK
zh-Hans
zh
(default)

... then when it goes according to the current rules:

zh-cmn-Hans-HK
zh-cmn-Hans
zh-cmn
zh
(default)

This is especially true when one has mixed content regimes (mixed scripts, regions, dialects, languages, etc.) in which 'zh' can represent only one variety (and frequently the wrong value).

Extlangistas will note that you have to make the same modification if you omit extlang to obtain the *exact* same results:

cmn-Hans-HK
cmn-Hans
cmn
zh-Hans-HK (presumably from Macrolanguage field??)
zh-Hans
zh
(default)

Addison

Addison Phillips
Globalization Architect -- Lab126

Internationalization is not a feature.
It is an architecture.


_______________________________________________
Ltru mailing list
Ltru@ietf.org
https://www.ietf.org/mailman/listinfo/ltru