Re: extlang (was Re: Suggested language for "mis" (Re: [Ltru] RE: ISO 639-2 decision: "mis"))

"Doug Ewell" <dewell@roadrunner.com> Tue, 19 June 2007 06:07 UTC

Return-path: <ltru-bounces@ietf.org>
Received: from [127.0.0.1] (helo=stiedprmman1.va.neustar.com) by megatron.ietf.org with esmtp (Exim 4.43) id 1I0Ws4-0005KO-7Y; Tue, 19 Jun 2007 02:07:20 -0400
Received: from ltru by megatron.ietf.org with local (Exim 4.43) id 1I0Ws2-0005KG-Ko for ltru-confirm+ok@megatron.ietf.org; Tue, 19 Jun 2007 02:07:18 -0400
Received: from [10.91.34.44] (helo=ietf-mx.ietf.org) by megatron.ietf.org with esmtp (Exim 4.43) id 1I0Ws1-0005K5-W7 for ltru@ietf.org; Tue, 19 Jun 2007 02:07:17 -0400
Received: from mta11.adelphia.net ([68.168.78.205]) by ietf-mx.ietf.org with esmtp (Exim 4.43) id 1I0Ws0-0001Sd-Lx for ltru@ietf.org; Tue, 19 Jun 2007 02:07:17 -0400
Received: from DGBP7M81 ([76.167.184.182]) by mta11.adelphia.net (InterMail vM.6.01.05.02 201-2131-123-102-20050715) with SMTP id <20070619060716.FTZD3934.mta11.adelphia.net@DGBP7M81>; Tue, 19 Jun 2007 02:07:16 -0400
Message-ID: <003701c7b238$16124fc0$6401a8c0@DGBP7M81>
From: Doug Ewell <dewell@roadrunner.com>
To: LTRU Working Group <ltru@ietf.org>
References: <30b660a20706171252l3c61d451p464b96e864d1a515@mail.gmail.com> <007f01c7b166$8ef7bf10$6401a8c0@DGBP7M81> <30b660a20706181006x3efbf772t9a0751feb070a6cb@mail.gmail.com> <20070619013433.GA15048@mercury.ccil.org>
Subject: Re: extlang (was Re: Suggested language for "mis" (Re: [Ltru] RE: ISO 639-2 decision: "mis"))
Date: Mon, 18 Jun 2007 23:07:15 -0700
MIME-Version: 1.0
Content-Type: text/plain; format="flowed"; charset="iso-8859-1"; reply-type="original"
Content-Transfer-Encoding: 7bit
X-Priority: 3
X-MSMail-Priority: Normal
X-Mailer: Microsoft Outlook Express 6.00.2900.3138
X-MIMEOLE: Produced By Microsoft MimeOLE V6.00.2900.3138
X-Spam-Score: 0.0 (/)
X-Scan-Signature: f60d0f7806b0c40781eee6b9cd0b2135
Cc:
X-BeenThere: ltru@ietf.org
X-Mailman-Version: 2.1.5
Precedence: list
List-Id: Language Tag Registry Update working group discussion list <ltru.ietf.org>
List-Unsubscribe: <https://www1.ietf.org/mailman/listinfo/ltru>, <mailto:ltru-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www1.ietf.org/pipermail/ltru>
List-Post: <mailto:ltru@ietf.org>
List-Help: <mailto:ltru-request@ietf.org?subject=help>
List-Subscribe: <https://www1.ietf.org/mailman/listinfo/ltru>, <mailto:ltru-request@ietf.org?subject=subscribe>
Errors-To: ltru-bounces@ietf.org

John did a fine job articulating most of my position.  Anything not 
explicitly responded to here is +1.

John Cowan <cowan at ccil dot org> wrote:

> Mark Davis scripsit:
>
>> We added extlang to allow ourselves the freedom to make choices when 
>> 639-3 came along. We *very clearly did not define its meaning*, because 
>> we didn't know what 639-3 was finally going to look like,
>
> Not so much.  We had an excellent idea of what 639-3 would both look and 
> actually be like when 4646 was finalized.  We couldn't include 639-3 or 
> extlangs because 639-3 itself was not yet final.

In particular, we knew there would be macrolanguages and encompassed 
languages.  This was spelled out by Peter Constable on 2004-01-30 in a post 
which laid out the extlang concept for the first time:

http://www.alvestrand.no/pipermail/ietf-languages/2004-January/001658.html

>> nor did we have agreement on what we should actually do.
>
> We had at least the consensus of silence; at least, I don't remember any 
> complaints at the time.  Remember that the development of 4646 started at 
> least a year before LTRU was formally created.

I thought it was stronger than that, that just about everyone's expectation 
and understanding was that extlangs were intended for encompassed languages, 
and that encompassed languages would be represented by extlangs.  Tags like 
"zh-cmn" were registered at ietf-languages specifically so they would 
conform to this well-understood model.

>> Matching "zh" and "yue" is not something you want to do automatically.

That's only half true.  All Cantonese is Chinese, and since it's only been 
possible up to this point to tag Cantonese as "zh-something", it's 
reasonable to assume that much of it will continue to be tagged as 
"zh-something" in the future.

>> B. (optional) Add a field Macrolanguage: to the language subtag registry.
>
> I am not opposed to this, precisely because encompassed languages and the 
> corresponding macrolanguage cannot be identified syntactically.

So, for example:

Type: language
Subtag: qvc
Description: Cajamarca Quechua
Description: Quechua, Cajamarca
Added: 20xx-xx-xx
Macrolanguage: que

Would content in Cajamarca Quechua be required to be tagged as "qvc", or 
would "qu" also be acceptable?  (We all know "qu" would be widely used.) 
Would matching engines be permitted to perform one- or two-way matching 
between "qvc" and "qu"?  What about users who expect we will go through with 
the "qu-qvc" model, and the Web pages that have already started listing such 
tags?  (See www.udhrinunicode.org, for instance.)  Would all this require an 
update to 4647?

--
Doug Ewell  *  Fullerton, California, USA  *  RFC 4645  *  UTN #14
http://users.adelphia.net/~dewell/
http://www1.ietf.org/html.charters/ltru-charter.html
http://www.alvestrand.no/mailman/listinfo/ietf-languages



_______________________________________________
Ltru mailing list
Ltru@ietf.org
https://www1.ietf.org/mailman/listinfo/ltru