Re: [Ltru] Macrolanguage usage
"Mark Davis" <mark.davis@icu-project.org> Fri, 16 May 2008 21:15 UTC
Return-Path: <ltru-bounces@ietf.org>
X-Original-To: ltru-archive@megatron.ietf.org
Delivered-To: ietfarch-ltru-archive@core3.amsl.com
Received: from [127.0.0.1] (localhost [127.0.0.1]) by core3.amsl.com (Postfix) with ESMTP id 1C01328C29B; Fri, 16 May 2008 14:15:39 -0700 (PDT)
X-Original-To: ltru@core3.amsl.com
Delivered-To: ltru@core3.amsl.com
Received: from localhost (localhost [127.0.0.1]) by core3.amsl.com (Postfix) with ESMTP id 5261D28C29B for <ltru@core3.amsl.com>; Fri, 16 May 2008 14:15:37 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.981
X-Spam-Level:
X-Spam-Status: No, score=-1.981 tagged_above=-999 required=5 tests=[AWL=-0.005, BAYES_00=-2.599, FM_FORGED_GMAIL=0.622, HTML_MESSAGE=0.001]
Received: from mail.ietf.org ([64.170.98.32]) by localhost (core3.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 3OiqIw3KVRDc for <ltru@core3.amsl.com>; Fri, 16 May 2008 14:15:36 -0700 (PDT)
Received: from yw-out-2324.google.com (yw-out-2324.google.com [74.125.46.28]) by core3.amsl.com (Postfix) with ESMTP id B9C3F28C1FC for <ltru@ietf.org>; Fri, 16 May 2008 14:15:35 -0700 (PDT)
Received: by yw-out-2324.google.com with SMTP id 3so625100ywj.49 for <ltru@ietf.org>; Fri, 16 May 2008 14:15:26 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:received:received:message-id:date:from:sender:to:subject:cc:in-reply-to:mime-version:content-type:references:x-google-sender-auth; bh=McKf+H6h1J48M0pQrFETkf72b8ZzTFvUemeyi4Q8zQo=; b=BtGBAuFzPqm2vYnozYJuuBpFEyDsfaPK6WE/ePKFaZs8obP5Pdf97WMgQhB7BWJaMh/VJ5eOt4B7C1Q+JjCbACWdRZOnvp3IqAhkLxqSyRBxOK7K1bbURBcsa6LgMmT3nJ3uwJ71eqlHfgH9RFP3PwbB09MlrcPQSZD/gihDcV8=
DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=message-id:date:from:sender:to:subject:cc:in-reply-to:mime-version:content-type:references:x-google-sender-auth; b=L6B9cwENj3RQEkOF8MpFKk64Xhe5r0XqvF7/GjlUi7BTT/0ZvH8+LRlmfF1PJg4/OyjozgEuhLbhFNHyPbDeh/doxaXFkXblI1/c89Lu2G0jCULbiLhWcU8SoDI23L4taqtFVW6A6Bf93eARivTQChcPWPeb3sG2SPq79ulgsXI=
Received: by 10.150.83.41 with SMTP id g41mr4035143ybb.193.1210972526918; Fri, 16 May 2008 14:15:26 -0700 (PDT)
Received: by 10.150.206.3 with HTTP; Fri, 16 May 2008 14:15:26 -0700 (PDT)
Message-ID: <30b660a20805161415kb1172f0xa6c4dea251344bb6@mail.gmail.com>
Date: Fri, 16 May 2008 14:15:26 -0700
From: Mark Davis <mark.davis@icu-project.org>
To: Shawn Steele <Shawn.Steele@microsoft.com>
In-Reply-To: <C9BF0238EED3634BA1866AEF14C7A9E561554BEB09@NA-EXMSG-C116.redmond.corp.microsoft.com>
MIME-Version: 1.0
References: <mailman.494.1210865385.5128.ltru@ietf.org> <00a901c8b6f5$c04529a0$e6f5e547@DGBP7M81> <30b660a20805161108w578b6cf9g11933ca34996a596@mail.gmail.com> <005901c8b787$930f98c0$6801a8c0@oemcomputer> <30b660a20805161309u67158b6arcb3b2df1c46db6a7@mail.gmail.com> <C9BF0238EED3634BA1866AEF14C7A9E561554BEB09@NA-EXMSG-C116.redmond.corp.microsoft.com>
X-Google-Sender-Auth: 3582270d863e0213
Cc: LTRU Working Group <ltru@ietf.org>
Subject: Re: [Ltru] Macrolanguage usage
X-BeenThere: ltru@ietf.org
X-Mailman-Version: 2.1.9
Precedence: list
List-Id: Language Tag Registry Update working group discussion list <ltru.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/listinfo/ltru>, <mailto:ltru-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/pipermail/ltru>
List-Post: <mailto:ltru@ietf.org>
List-Help: <mailto:ltru-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/ltru>, <mailto:ltru-request@ietf.org?subject=subscribe>
Content-Type: multipart/mixed; boundary="===============1438753212=="
Sender: ltru-bounces@ietf.org
Errors-To: ltru-bounces@ietf.org
On Fri, May 16, 2008 at 1:35 PM, Shawn Steele <Shawn.Steele@microsoft.com> wrote: > > That lets people like Google, which has a very serious and important > issue > > with backwards compatibility, support the vast amount of > Mandarin/Arabic/... > > that is tagged with "zh", and continue to tag Mandarin content with "zh". > It > > also lets people like Shawn's group at Microsoft, having no backwards > > compatibility issues with "cmn" and "zh", to shift over immediately. > This seems to be oversimplifying the issue. I presume that Google would > also be interested in recognizing cmn tagged data and cmn requests when they > see them? > Yes, clearly. Although even that takes time and effort to do; there is no magic wand. (As I've said, the addition of non-predominant encompassed language subtags like "yue" is important and useful for us. The addition of the predominant encompassed language subtags just means a lot of work for no additional benefit to us or our users. Following the de/gsw paradigm would have been *much* simpler. But that is water under the bridge; we just have to deal with the situation as it is, and try to give people guidance as to the best way to handle them.) > If its "just" some internal database of languages and you have to map > between the actual request/content names anyway, then I don't know why this > working group would care if Google called their internal data "zh" or "abc" > or whatever. > The working group might not, but we have a goal of using BCP 47 not only externally, but in communication *within* Google among the many different products and programs. I assume that's not a bad thing ;-) It the intent is interchange, then continuing to use "zh" when what is > really meant is a large subset of zh (Mandarin) seems to perpetuate the > existing ambiguity. > The issue is on output. If a program switches to "cmn" from "zh" on output, then an external party who doesn't recognize "cmn" breaks. So that program needs to output "zh" until it is certain that the recipients would all recognize "cmn". We could remain silent on this issue in the spec, but that would just be withholding useful advice for people in terms of "tagging wisely", advice that would allow people to interoperate more effectively. > Microsoft is also concerned with the backwards compatibility issues, > however we recognize that existing "zh" tagged data is not necessarily > Mandarin (even though its likely). We don't have language detection tools > to try to guess what the application's resources or a web page actually has > for a language. For back-compat, you'll recall that I thought zh-cmn > helped, which would seem to solve both problems. (Google'd have the zh it > wants and I'd have the specificity that I'm looking for.) > No, it doesn't solve the problem at all. There are many, many circumstances where you want precise communication, not lookup&fallback. If recipient expecting "zh" (and who doesn't know about the new codes) will break when they get "zh-cmn" just like they will break when they get "cmn": extlang or not makes no difference. (Extlang only really has an effect with lookup, you think positive, I think negative, but let's not discuss that here -- we've all agreed to look at that issue *after* this round.) > > Presumably (unless its only for internal use), Microsoft and Google can't > have different definitions of "zh" for interchange. If the definitions > differ, then indexing of IIS served pages or requests from IE browsers would > not necessarily provide the expected results. > Once both parties can handle both, it is not a problem. And where there is a handshaking protocol it is not a problem. But there will be a *long* transition period where only "zh" can be depended on to work. > What happens for other language tags that change? Like serbian (serbian > remains the same, but data tagged at the region level will change.) Needing > to support zh + cmn isn't that different than other common scenarios. > Right, I've been saying all along that this is the same issue with any predominant encompassed language. > > - Shawn > > -- Mark
_______________________________________________ Ltru mailing list Ltru@ietf.org https://www.ietf.org/mailman/listinfo/ltru
- [Ltru] Macrolanguage usage Mark Davis
- Re: [Ltru] Macrolanguage usage Shawn Steele
- Re: [Ltru] Macrolanguage usage Mark Davis
- Re: [Ltru] Macrolanguage usage Shawn Steele
- Re: [Ltru] Macrolanguage usage Mark Davis
- Re: [Ltru] Macrolanguage usage Shawn Steele
- Re: [Ltru] Macrolanguage usage Randy Presuhn
- Re: [Ltru] Macrolanguage usage [OT?] Phillips, Addison
- Re: [Ltru] Macrolanguage usage Doug Ewell
- Re: [Ltru] Macrolanguage usage Debbie Garside
- Re: [Ltru] Macrolanguage usage Peter Constable
- Re: [Ltru] Macrolanguage usage Phillips, Addison
- Re: [Ltru] Macrolanguage usage Shawn Steele
- Re: [Ltru] Macrolanguage usage Phillips, Addison
- Re: [Ltru] Macrolanguage usage Peter Constable
- Re: [Ltru] Macrolanguage usage Phillips, Addison
- Re: [Ltru] Macrolanguage usage Mark Davis
- Re: [Ltru] Macrolanguage usage Randy Presuhn
- Re: [Ltru] Macrolanguage usage Phillips, Addison
- Re: [Ltru] Macrolanguage usage Shawn Steele
- Re: [Ltru] Macrolanguage usage Phillips, Addison
- Re: [Ltru] Macrolanguage usage Phillips, Addison
- Re: [Ltru] Macrolanguage usage Mark Davis
- Re: [Ltru] Macrolanguage usage Shawn Steele
- Re: [Ltru] Macrolanguage usage Mark Davis
- Re: [Ltru] Macrolanguage usage Leif Halvard Silli
- Re: [Ltru] Macrolanguage usage Leif Halvard Silli
- Re: [Ltru] Macrolanguage usage Don Osborn
- Re: [Ltru] Macrolanguage usage Randy Presuhn
- Re: [Ltru] Macrolanguage usage Mark Davis
- Re: [Ltru] Macrolanguage usage Stephane Bortzmeyer
- Re: [Ltru] Macrolanguage usage John Cowan
- Re: [Ltru] Macrolanguage usage Leif Halvard Silli
- Re: [Ltru] Macrolanguage usage Leif Halvard Silli
- Re: [Ltru] Macrolanguage usage Mark Davis
- Re: [Ltru] Macrolanguage usage John Cowan
- Re: [Ltru] Macrolanguage usage Leif Halvard Silli
- Re: [Ltru] Macrolanguage usage Doug Ewell
- Re: [Ltru] my technical position on extlang Doug Ewell
- Re: [Ltru] Macrolanguage usage Doug Ewell
- Re: [Ltru] Macrolanguage usage Randy Presuhn
- Re: [Ltru] Macrolanguage usage John Cowan
- Re: [Ltru] Macrolanguage usage Peter Constable
- Re: [Ltru] Macrolanguage usage Peter Constable
- Re: [Ltru] Macrolanguage usage Leif Halvard Silli
- Re: [Ltru] Macrolanguage usage Leif Halvard Silli
- Re: [Ltru] Macrolanguage usage Leif Halvard Silli
- Re: [Ltru] Macrolanguage usage Leif Halvard Silli
- Re: [Ltru] Macrolanguage usage Kent Karlsson
- Re: [Ltru] Macrolanguage usage Peter Constable
- Re: [Ltru] Macrolanguage usage Doug Ewell
- Re: [Ltru] Macrolanguage usage Peter Constable
- Re: [Ltru] Macrolanguage usage Nicolas Krebs
- Re: [Ltru] Macrolanguage usage Kent Karlsson
- Re: [Ltru] Macrolanguage usage Leif Halvard Silli
- Re: [Ltru] Macrolanguage usage Leif Halvard Silli
- Re: [Ltru] Macrolanguage usage Leif Halvard Silli
- Re: [Ltru] Macrolanguage usage Peter Constable
- Re: [Ltru] Macrolanguage usage Leif Halvard Silli
- Re: [Ltru] Macrolanguage usage Leif Halvard Silli
- Re: [Ltru] Macrolanguage usage Doug Ewell
- Re: [Ltru] Macrolanguage usage Leif Halvard Silli
- Re: [Ltru] Macrolanguage usage Kent Karlsson
- Re: [Ltru] Macrolanguage usage Stephane Bortzmeyer
- Re: [Ltru] Macrolanguage usage Leif Halvard Silli
- Re: [Ltru] Macrolanguage usage Peter Constable
- Re: [Ltru] Macrolanguage usage Peter Constable
- Re: [Ltru] Macrolanguage usage John Cowan
- Re: [Ltru] Macrolanguage usage Leif Halvard Silli
- Re: [Ltru] Macrolanguage usage Leif Halvard Silli