Re: [Ltru] my technical position on extlang

John Cowan <cowan@ccil.org> Sun, 25 May 2008 02:16 UTC

Return-Path: <ltru-bounces@ietf.org>
X-Original-To: ltru-archive@megatron.ietf.org
Delivered-To: ietfarch-ltru-archive@core3.amsl.com
Received: from [127.0.0.1] (localhost [127.0.0.1]) by core3.amsl.com (Postfix) with ESMTP id 3E2CA3A69F5; Sat, 24 May 2008 19:16:53 -0700 (PDT)
X-Original-To: ltru@core3.amsl.com
Delivered-To: ltru@core3.amsl.com
Received: from localhost (localhost [127.0.0.1]) by core3.amsl.com (Postfix) with ESMTP id AE87E3A69F3 for <ltru@core3.amsl.com>; Sat, 24 May 2008 19:16:51 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.426
X-Spam-Level:
X-Spam-Status: No, score=-2.426 tagged_above=-999 required=5 tests=[AWL=0.173, BAYES_00=-2.599]
Received: from mail.ietf.org ([64.170.98.32]) by localhost (core3.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id LuBFd-zj2nTF for <ltru@core3.amsl.com>; Sat, 24 May 2008 19:16:50 -0700 (PDT)
Received: from earth.ccil.org (earth.ccil.org [192.190.237.11]) by core3.amsl.com (Postfix) with ESMTP id 8320F3A695E for <ltru@ietf.org>; Sat, 24 May 2008 19:16:50 -0700 (PDT)
Received: from cowan by earth.ccil.org with local (Exim 4.63) (envelope-from <cowan@ccil.org>) id 1K05mq-0007rn-Ix; Sat, 24 May 2008 22:16:40 -0400
Date: Sat, 24 May 2008 22:16:40 -0400
To: Mark Davis <mark.davis@icu-project.org>
Message-ID: <20080525021640.GE13854@mercury.ccil.org>
References: <20080523044305.GB7960@mercury.ccil.org> <30b660a20805230851r519f5d14wd93a92494d1db1c9@mail.gmail.com> <20080523160905.GD21554@mercury.ccil.org> <30b660a20805231405q56b156c4vbb3b6abda4af3893@mail.gmail.com> <20080523225400.GB13152@mercury.ccil.org> <30b660a20805231639w1de0fda8w116662738f8c5d6a@mail.gmail.com> <20080523234427.GC13152@mercury.ccil.org> <30b660a20805231655r34486205m9362e8fe65193ae6@mail.gmail.com> <20080524001151.GD13152@mercury.ccil.org> <30b660a20805240943o44a5719r50eb8f0eaf721dca@mail.gmail.com>
MIME-Version: 1.0
Content-Disposition: inline
In-Reply-To: <30b660a20805240943o44a5719r50eb8f0eaf721dca@mail.gmail.com>
User-Agent: Mutt/1.5.13 (2006-08-11)
From: John Cowan <cowan@ccil.org>
Cc: LTRU Working Group <ltru@ietf.org>
Subject: Re: [Ltru] my technical position on extlang
X-BeenThere: ltru@ietf.org
X-Mailman-Version: 2.1.9
Precedence: list
List-Id: Language Tag Registry Update working group discussion list <ltru.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/listinfo/ltru>, <mailto:ltru-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/pipermail/ltru>
List-Post: <mailto:ltru@ietf.org>
List-Help: <mailto:ltru-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/ltru>, <mailto:ltru-request@ietf.org?subject=subscribe>
Content-Type: text/plain; charset="us-ascii"
Content-Transfer-Encoding: 7bit
Sender: ltru-bounces@ietf.org
Errors-To: ltru-bounces@ietf.org

Mark Davis scripsit:

> Just to make sure that we're talking about content negotiation, eg using
> Accept-Language to get the language for a website, where "zh" means "zh" or
> anything starting with it. 

Check.

> Here's an example. Let's suppose that content tagged with "fr", "gsw", and
> "yue" (or "zh-yue") are available, and nothing else. Let's suppose also
> that I understand German, French, and Mandarin, but not [anything else].

> Both models
> 
>    - If I specify "de fr", I'll get "fr" under either model. Fine,
>    I can understand "fr".
>    - If I wanted to also request Swiss German as well as German,
>    I would use "gsw de fr", and get "gsw".
>    - I only get "gsw" if I request it. That's good, because I don't
>    understand "gsw".

Check.

> No Extlang Model
> 
>    - If I specify "zh cmn fr", I'll get "fr" under either model. Fine,
>    I can understand "fr".
>    - If I wanted to also request Cantonese as well as Mardarin,
>    I would use "yue cmn zh fr" and get "yue"

That would only work if the Cantonese content was labeled "yue", which is
not possible until 4646bis goes into effect.  So for practical purposes
you'd need to say "yue zh-yue cmn zh fr", since "zh-yue" is currently
the *only* way to tag Cantonese.

>    - I only get "yue" if I request it. That's good, because I don't
>    understand "yue".

That's literally true, but you *will* get Cantonese content tagged "zh-yue".

>    - I need to include "zh", and will for the indefinite future because
>    the vast majority of Mandarin content is tagged that way.

Check.

>    - I might get content tagged "zh" but which is actually "yue" but the
>    chances of that are extremely remote.

Not so.  If the Cantonese content is tagged "zh-yue" (and again, that is the
*only* way to tag Cantonese today), you will get it.

> Extlang Model
> 
>    - If I specify "zh zh-cmn fr", I would get "zh-yue". Broken, I don't
>    understand Cantonese!

Check.

>    - In order to specify that I want "zh" or "cmn" but no other languages, I
>    have to use "zh-cjy;q=0, zh-cpx;q=0, zh-czh;q=0, zh-czo;q=0, zh-gan;q=0,
>    zh-hak;q=0, zh-hsn;q=0, zh-mnp;q=0, zh-nan;q=0, zh-wuu;q=0, zh-yue;q=0 zh
>    zh-cmn fr".

Check.  And yet:

If most users can handle most of the encompassed languages, then the
no-extlang model is painful (you have to specify too many positive-q
tags).  If most users can only handle a few encompassed languages, then
the extlang model is painful (you have to specify too many zero-q tags).
The situation is symmetrical.

>    - Moreover, as you said, if another encompassed language shows up
>    for zh, I could get that inadvertently unless I add this to my list.

Again, it's symmetrical: if another encompassed language shows up, I
could miss that inadvertently (in the no-extlang model) unless I add
it to my list.

>    - Again, I need to include "zh", and will for the indefinite future
>    because the vast majority of Mandarin content is tagged that way.

Check.

-- 
You know, you haven't stopped talking           John Cowan
since I came here. You must have been           http://www.ccil.org/~cowan
vaccinated with a phonograph needle.            cowan@ccil.org
        --Rufus T. Firefly
_______________________________________________
Ltru mailing list
Ltru@ietf.org
https://www.ietf.org/mailman/listinfo/ltru