RE: [Ltru] Extended language tags (long reply)

Shawn Steele <Shawn.Steele@microsoft.com> Wed, 10 October 2007 18:02 UTC

Return-path: <ltru-bounces@ietf.org>
Received: from [127.0.0.1] (helo=stiedprmman1.va.neustar.com) by megatron.ietf.org with esmtp (Exim 4.43) id 1Iffsl-0002E6-EB; Wed, 10 Oct 2007 14:02:07 -0400
Received: from ltru by megatron.ietf.org with local (Exim 4.43) id 1Iffsk-000242-2A for ltru-confirm+ok@megatron.ietf.org; Wed, 10 Oct 2007 14:02:06 -0400
Received: from [10.90.34.44] (helo=chiedprmail1.ietf.org) by megatron.ietf.org with esmtp (Exim 4.43) id 1Iffsj-00021w-LU for ltru@ietf.org; Wed, 10 Oct 2007 14:02:05 -0400
Received: from mailc.microsoft.com ([131.107.115.214] helo=smtp.microsoft.com) by chiedprmail1.ietf.org with esmtp (Exim 4.43) id 1Iffsi-00057Z-Uh for ltru@ietf.org; Wed, 10 Oct 2007 14:02:05 -0400
Received: from tk5-exhub-c103.redmond.corp.microsoft.com (157.54.70.186) by TK5-EXGWY-E803.partners.extranet.microsoft.com (10.251.56.169) with Microsoft SMTP Server (TLS) id 8.1.177.2; Wed, 10 Oct 2007 11:02:03 -0700
Received: from NA-EXMSG-C116.redmond.corp.microsoft.com ([157.54.62.39]) by tk5-exhub-c103.redmond.corp.microsoft.com ([157.54.70.186]) with mapi; Wed, 10 Oct 2007 11:02:03 -0700
From: Shawn Steele <Shawn.Steele@microsoft.com>
To: Mark Davis <mark.davis@icu-project.org>, Addison Phillips <addison@yahoo-inc.com>
Date: Wed, 10 Oct 2007 11:02:01 -0700
Subject: RE: [Ltru] Extended language tags (long reply)
Thread-Topic: [Ltru] Extended language tags (long reply)
Thread-Index: AcgLV9bSZPjEpSv6Q221PL+RneS7+AADmQkQ
Message-ID: <C9BF0238EED3634BA1866AEF14C7A9E55A598803B7@NA-EXMSG-C116.redmond.corp.microsoft.com>
References: <E1IdT7z-0001vv-Ly@megatron.ietf.org> <C9BF0238EED3634BA1866AEF14C7A9E55A597AC370@NA-EXMSG-C116.redmond.corp.microsoft.com> <4709146F.6020504@yahoo-inc.com> <9d70cb000710071715p398a669fhd06326843d9d9390@mail.gmail.com> <30b660a20710071740ma6d39a3u61c8543c70125847@mail.gmail.com> <4709A420.80508@yahoo-inc.com> <30b660a20710100855g5130486awf10f33d3d31fb891@mail.gmail.com>
In-Reply-To: <30b660a20710100855g5130486awf10f33d3d31fb891@mail.gmail.com>
Accept-Language: en-US
Content-Language: en-US
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
acceptlanguage: en-US
MIME-Version: 1.0
X-Spam-Score: 0.0 (/)
X-Scan-Signature: 03169bfe4792634a390035a01a6c6d2f
Cc: "ltru@ietf.org" <ltru@ietf.org>
X-BeenThere: ltru@ietf.org
X-Mailman-Version: 2.1.5
Precedence: list
List-Id: Language Tag Registry Update working group discussion list <ltru.ietf.org>
List-Unsubscribe: <https://www1.ietf.org/mailman/listinfo/ltru>, <mailto:ltru-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www1.ietf.org/pipermail/ltru>
List-Post: <mailto:ltru@ietf.org>
List-Help: <mailto:ltru-request@ietf.org?subject=help>
List-Subscribe: <https://www1.ietf.org/mailman/listinfo/ltru>, <mailto:ltru-request@ietf.org?subject=subscribe>
Content-Type: multipart/mixed; boundary="===============1795232004=="
Errors-To: ltru-bounces@ietf.org

================

What we don't want to do is make recommendations that if implemented, are harder for people to control and get the right answer. And baking extlang into the tags is even worse -- since it introduces backwards incompatibilities that require old code to be modified to work around.

================

cmn is completely incompatible with existing practice anyway, so you can’t claim that it solves the problem.  Existing clients ask for zh-HK (or whatever) and code is tagged as zh-HK (or whatever).  Those won’t match cmn using RFC 4647.

So for backwards compatibility zh-cmn is no worse than cmn.  And if you don’t like the inference of the zh, then you can ignore that part, but at least the data’s there if people do want it.

For some macro languages the strict fallback is probably inappropriate, however I don’t expect to find matches in that case (because I don’t expect correctly tagged data to be “zh”).  If this is a concern, those are easily filtered out.

From the discussion I don’t think the bigger problem is whether or not we go with zh-cmn or just cmn.  The bigger issue seems to be how to modify RFC 4647 to provide meaningful fallback with whichever mode is used.  Since some applications may (or may not) want to consider zh-HK or other legacy behaviors, such recommendations aren’t trivial.  Given the differing requirements amongst us, I suspect a certain flexibility of the applications will be necessary, perhaps several suggestions rather than the strict behavior of RFC 4647 (which everyone seems to modify for their purposes anyway)

- Shawn
_______________________________________________
Ltru mailing list
Ltru@ietf.org
https://www1.ietf.org/mailman/listinfo/ltru