Re: Flagging macrolanguages in the LSR (Was: extlang (was Re: Suggested language for "mis" (Re: [Ltru] RE: ISO 639-2 decision: "mis"))

Addison Phillips <addison@yahoo-inc.com> Thu, 21 June 2007 15:59 UTC

Return-path: <ltru-bounces@ietf.org>
Received: from [127.0.0.1] (helo=stiedprmman1.va.neustar.com) by megatron.ietf.org with esmtp (Exim 4.43) id 1I1P3m-0003Go-Hl; Thu, 21 Jun 2007 11:59:02 -0400
Received: from ltru by megatron.ietf.org with local (Exim 4.43) id 1I1P3l-0003Gj-Fk for ltru-confirm+ok@megatron.ietf.org; Thu, 21 Jun 2007 11:59:01 -0400
Received: from [10.91.34.44] (helo=ietf-mx.ietf.org) by megatron.ietf.org with esmtp (Exim 4.43) id 1I1P3l-0003Gb-6B for ltru@ietf.org; Thu, 21 Jun 2007 11:59:01 -0400
Received: from rsmtp2.corp.yahoo.com ([207.126.228.150]) by ietf-mx.ietf.org with esmtp (Exim 4.43) id 1I1P3j-0003Or-Qy for ltru@ietf.org; Thu, 21 Jun 2007 11:59:01 -0400
Received: from [10.72.72.36] (snvvpn1-10-72-72-c36.corp.yahoo.com [10.72.72.36]) (authenticated bits=0) by rsmtp2.corp.yahoo.com (8.13.8/8.13.6/y.rout) with ESMTP id l5LFwtTq082084 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=NO); Thu, 21 Jun 2007 08:58:56 -0700 (PDT)
DomainKey-Signature: a=rsa-sha1; s=serpent; d=yahoo-inc.com; c=nofws; q=dns; h=message-id:date:from:user-agent:mime-version:to:cc:subject: references:in-reply-to:content-type:content-transfer-encoding; b=tecT9Z/W7y8m/7PGoab3qpiFuHDAdZCfRSuaO+mUui1xFrxpdzVHxh+amhVpbdZ5
Message-ID: <467AA03E.5070506@yahoo-inc.com>
Date: Thu, 21 Jun 2007 08:58:54 -0700
From: Addison Phillips <addison@yahoo-inc.com>
User-Agent: Thunderbird 2.0.0.4 (Windows/20070604)
MIME-Version: 1.0
To: Stephane Bortzmeyer <bortzmeyer@nic.fr>
Subject: Re: Flagging macrolanguages in the LSR (Was: extlang (was Re: Suggested language for "mis" (Re: [Ltru] RE: ISO 639-2 decision: "mis"))
References: <30b660a20706171252l3c61d451p464b96e864d1a515@mail.gmail.com> <007f01c7b166$8ef7bf10$6401a8c0@DGBP7M81> <30b660a20706181006x3efbf772t9a0751feb070a6cb@mail.gmail.com> <20070619013433.GA15048@mercury.ccil.org> <003701c7b238$16124fc0$6401a8c0@DGBP7M81> <20070619140547.GB30227@mercury.ccil.org> <4677F589.3090003@yahoo-inc.com> <20070619205855.GA19853@sources.org> <46784742.9080009@yahoo-inc.com> <20070621154104.GA25638@sources.org>
In-Reply-To: <20070621154104.GA25638@sources.org>
Content-Type: text/plain; charset="UTF-8"; format="flowed"
Content-Transfer-Encoding: 7bit
X-Spam-Score: -14.4 (--------------)
X-Scan-Signature: fb6060cb60c0cea16e3f7219e40a0a81
Cc: LTRU Working Group <ltru@ietf.org>
X-BeenThere: ltru@ietf.org
X-Mailman-Version: 2.1.5
Precedence: list
List-Id: Language Tag Registry Update working group discussion list <ltru.ietf.org>
List-Unsubscribe: <https://www1.ietf.org/mailman/listinfo/ltru>, <mailto:ltru-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www1.ietf.org/pipermail/ltru>
List-Post: <mailto:ltru@ietf.org>
List-Help: <mailto:ltru-request@ietf.org?subject=help>
List-Subscribe: <https://www1.ietf.org/mailman/listinfo/ltru>, <mailto:ltru-request@ietf.org?subject=subscribe>
Errors-To: ltru-bounces@ietf.org

Stephane Bortzmeyer wrote:
> 
> It seems reasonable to enumerate a macrolanguage "members "in the
> macrolanguage itself. But I see some issues with this proposal:
> 
> 1) What will be the stability rules for Encloses? 

It's informative and thus could be changed. However, extlangs are 
permanent and stable (including their Prefix relationship), so they 
cannot be removed. Note that Encloses describes the function of the 
subtag, not necessarily the linguistic relationship (although in 
practice we hope that both are true).

> Can an extlang be
> dropped from a Encloses field? (if advances in linguistics show that
> it was wrongly added here, for instance.)

I would assume that an extlang can't be dropped from Encloses, since 
stability rules prevent the Prefix or status of an extlang from changing.


  And can an extlang be added
> to an Encloses? If yes (which seems reasonable), it means that the
> macrolanguage entry can change quite often (consider Zapotec with its
> dozens of enclosed extlangs).

You mean a new extlang? Of course new extlangs could be added as ISO 
639-3 assigns new codes. But there is no such thing as an existing 
extlang acquiring a new Prefix (and thus a new Encloses relationship). 
It is banned by stability rules.

> 
> 2) It introduces a redundancy in the registry, an information which
> was already there and could be computed from other records. Randy
> Presuhn noticed it can be seen as a good thing ("it should be flagged
> in its record, rather than requiring it to be computed by examining
> all other records"). But more redundancy means more possibilities to
> introduce contradictions.

The extlang records have a computable relationship. The main problem 
will be grandfathered primary language subtags that are enclosed by 
other subtags linguistically but not in terms of tagging practice. That 
is, not all possible macrolanguage relationships are described by or can 
be computed from entries in the registry.

> 
> 3) 4646bis does not indicate an upper bound for the length of a field

There isn't one.

> ("The content of this field is not restricted, except by [...]
> reasonable practical size limitations."). Zapotec's "Encloses:" will
> be at least 232 bytes long and sign langauges will be worse. While
> there is a long discussion of length issues for tags, it seems there
> is none for registry-parsing applications. I suspect that some of them
> may have hard-wired limits for field lengths.

Note that some comments approach or even exceed the sizes you note above.

I would tend to propose that we make Encloses a single field with 
structure simply because I find the multiple repetition of the same 
field unaesthetic. Of course, we went the other direction with Prefix, 
so there is some precedent for repeating the field.

Addison

-- 
Addison Phillips
Globalization Architect -- Yahoo! Inc.
Chair -- W3C Internationalization Core WG

Internationalization is an architecture.
It is not a feature.


_______________________________________________
Ltru mailing list
Ltru@ietf.org
https://www1.ietf.org/mailman/listinfo/ltru