RE: [Ltru] Extended language tags

Peter Constable <petercon@microsoft.com> Sun, 07 October 2007 06:50 UTC

Return-path: <ltru-bounces@ietf.org>
Received: from [127.0.0.1] (helo=stiedprmman1.va.neustar.com) by megatron.ietf.org with esmtp (Exim 4.43) id 1IePxp-0006OU-Uf; Sun, 07 Oct 2007 02:50:09 -0400
Received: from ltru by megatron.ietf.org with local (Exim 4.43) id 1IePxo-0006OP-T1 for ltru-confirm+ok@megatron.ietf.org; Sun, 07 Oct 2007 02:50:08 -0400
Received: from [10.91.34.44] (helo=ietf-mx.ietf.org) by megatron.ietf.org with esmtp (Exim 4.43) id 1IePxk-0006JK-GG for ltru@ietf.org; Sun, 07 Oct 2007 02:50:04 -0400
Received: from mail2.microsoft.com ([131.107.115.215] helo=smtp.microsoft.com) by ietf-mx.ietf.org with esmtp (Exim 4.43) id 1IePxe-0001zb-AD for ltru@ietf.org; Sun, 07 Oct 2007 02:50:04 -0400
Received: from tk1-exhub-c101.redmond.corp.microsoft.com (157.56.116.111) by TK5-EXGWY-E802.partners.extranet.microsoft.com (10.251.56.168) with Microsoft SMTP Server (TLS) id 8.1.177.2; Sat, 6 Oct 2007 23:49:27 -0700
Received: from NA-EXMSG-C117.redmond.corp.microsoft.com ([157.54.62.44]) by tk1-exhub-c101.redmond.corp.microsoft.com ([157.56.116.111]) with mapi; Sat, 6 Oct 2007 23:49:27 -0700
From: Peter Constable <petercon@microsoft.com>
To: "ltru@ietf.org" <ltru@ietf.org>
Date: Sat, 06 Oct 2007 23:49:27 -0700
Subject: RE: [Ltru] Extended language tags
Thread-Topic: [Ltru] Extended language tags
Thread-Index: AcgG650WUXKq4oH3SwSFINwonCmBrABwa93w
Message-ID: <DDB6DE6E9D27DD478AE6D1BBBB83579561AC50D5BE@NA-EXMSG-C117.redmond.corp.microsoft.com>
References: <E1IdT7z-0001vv-Ly@megatron.ietf.org> <C9BF0238EED3634BA1866AEF14C7A9E55A597AC370@NA-EXMSG-C116.redmond.corp.microsoft.com> <9d70cb000710041801k44ee5016u7374af1e1a17f6bc@mail.gmail.com>
In-Reply-To: <9d70cb000710041801k44ee5016u7374af1e1a17f6bc@mail.gmail.com>
Accept-Language: en-US
Content-Language: en-US
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
acceptlanguage: en-US
MIME-Version: 1.0
X-Spam-Score: 0.0 (/)
X-Scan-Signature: f4c2cf0bccc868e4cc88dace71fb3f44
X-BeenThere: ltru@ietf.org
X-Mailman-Version: 2.1.5
Precedence: list
List-Id: Language Tag Registry Update working group discussion list <ltru.ietf.org>
List-Unsubscribe: <https://www1.ietf.org/mailman/listinfo/ltru>, <mailto:ltru-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www1.ietf.org/pipermail/ltru>
List-Post: <mailto:ltru@ietf.org>
List-Help: <mailto:ltru-request@ietf.org?subject=help>
List-Subscribe: <https://www1.ietf.org/mailman/listinfo/ltru>, <mailto:ltru-request@ietf.org?subject=subscribe>
Content-Type: multipart/mixed; boundary="===============1136085055=="
Errors-To: ltru-bounces@ietf.org

The Dinka language development activities sound like there are similarities to what has been going on in the Philippines, where it was decided to engineer a pan-Philippine language "Filipino" as an amalgam with greatest dependence initially on Tagalog.

In that case, a new language name was created -- and a new ISO 639 ID. One could apply a similar approach in the Dinka case, except that the sociolinguistics appear to be different: from your description, it sounds like the speaker community has a conception of an encompassing linguistic unity, and 639-2 has had an ID that is comparable in scope.


Peter

-----Original Message-----
From: Andrew Cunningham [mailto:lang.support@gmail.com]
Sent: Thursday, October 04, 2007 6:02 PM
To: Shawn Steele
Cc: ltru@ietf.org
Subject: Re: [Ltru] Extended language tags

Throwing in my two cents worth:

One aspect of language tagging is the preference for the tags to be as
specific as possible. In some cases it is desirable to be less
specific.

An example i mentioned on the teleconference was the case of Dinka. In
ISO-639-2 this is represented by the language code "din".

While ISO-639-3 have the following language codes:

dib     South Central Dinka
dik     Southwestern Dinka
dip     Northeastern Dinka
diw     Northwestern Dinka
dks     Southeastern Dinka

If I was applying a langauge tag to a Rek grammar, then i'd use "dik",
for a collection of Ciec folktales I'd use "dib". For a collection of
Bor proverbs I'd use "dks".

To describe the literacy materials and class room materials being
developed in Australia by the Dinka community, I'd use "din". Within
the diaspora and in Australia specifically the literacy and language
teachers, translators and interpreters are discussing a standardized
approach to written Dinka.

The original SPLA/M education policies highlighted Rek as a standard
for written Dinka. What seems to be occurring is an amalgam based on
Rek, but including aspects and vocab from other Dinka dialects. There
will be a locally hosted conference next year to thrash out some of
the issues.

In this context "din": would be the most appropriate way of tagging
the new educational material, while keeping the existing five
iso-639-3 language tags to more accurately describe information and
data written in one of the 20 odd specific dialects.

from the perspective of the user community, an extlang approach would
make more sense, i.e. Rek labeled as "din-dik" makes more sense than
"dik". To the community Rek is a Dinka langauge, Dinak Rek, not a
separate language called Rek.

In this sense extlang reflects the communities understanding of their language.

This is just an observation, i'm neither for or against extlang.

Although from the perspective of web development and how CSS and web
browsers handle psuedo langauge selectors and attribute selectors, I'd
suggest that extlang approach simplifies things for those rare
individuals amongst us that use these selectors.

Andrew
--
Andrew Cunningham
State Library of Victoria, Australia

andrewc@vicnet.net.au
lang.support@gmail.com


_______________________________________________
Ltru mailing list
Ltru@ietf.org
https://www1.ietf.org/mailman/listinfo/ltru
_______________________________________________
Ltru mailing list
Ltru@ietf.org
https://www1.ietf.org/mailman/listinfo/ltru