Re: [Ltru] Proposal: include new "Language-Type:" field

Addison Phillips <addison@yahoo-inc.com> Wed, 04 October 2006 15:33 UTC

Received: from [127.0.0.1] (helo=stiedprmman1.va.neustar.com) by megatron.ietf.org with esmtp (Exim 4.43) id 1GV8kV-0004Wz-C7; Wed, 04 Oct 2006 11:33:31 -0400
Received: from [10.91.34.44] (helo=ietf-mx.ietf.org) by megatron.ietf.org with esmtp (Exim 4.43) id 1GV8kU-0004Wo-6b for ltru@ietf.org; Wed, 04 Oct 2006 11:33:30 -0400
Received: from rsmtp2.corp.yahoo.com ([207.126.228.150]) by ietf-mx.ietf.org with esmtp (Exim 4.43) id 1GV8kQ-0006Ou-OR for ltru@ietf.org; Wed, 04 Oct 2006 11:33:30 -0400
Received: from [10.72.77.34] (snvvpn2-10-72-77-c34.corp.yahoo.com [10.72.77.34]) (authenticated bits=0) by rsmtp2.corp.yahoo.com (8.13.6/8.13.6/y.rout) with ESMTP id k94FXFqD012300 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=NO); Wed, 4 Oct 2006 08:33:15 -0700 (PDT)
DomainKey-Signature: a=rsa-sha1; s=serpent; d=yahoo-inc.com; c=nofws; q=dns; h=message-id:date:from:user-agent:mime-version:to:cc:subject: references:in-reply-to:content-type:content-transfer-encoding; b=Q7UJ4tMELt1g9moTPcYAcOXwllqObimpToTbnnGp1SL8XjvZLExN7TTgn9JdR6KW
Message-ID: <4523D43B.8050608@yahoo-inc.com>
Date: Wed, 04 Oct 2006 08:33:15 -0700
From: Addison Phillips <addison@yahoo-inc.com>
User-Agent: Thunderbird 1.5.0.7 (Windows/20060909)
MIME-Version: 1.0
To: John Cowan <cowan@ccil.org>
Subject: Re: [Ltru] Proposal: include new "Language-Type:" field
References: <20061004134200.GC15633@ccil.org>
In-Reply-To: <20061004134200.GC15633@ccil.org>
Content-Type: text/plain; charset="UTF-8"; format="flowed"
Content-Transfer-Encoding: 7bit
X-Spam-Score: -15.0 (---------------)
X-Scan-Signature: 0fa76816851382eb71b0a882ccdc29ac
Cc: ltru@ietf.org
X-BeenThere: ltru@ietf.org
X-Mailman-Version: 2.1.5
Precedence: list
List-Id: Language Tag Registry Update working group discussion list <ltru.ietf.org>
List-Unsubscribe: <https://www1.ietf.org/mailman/listinfo/ltru>, <mailto:ltru-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www1.ietf.org/pipermail/ltru>
List-Post: <mailto:ltru@ietf.org>
List-Help: <mailto:ltru-request@ietf.org?subject=help>
List-Subscribe: <https://www1.ietf.org/mailman/listinfo/ltru>, <mailto:ltru-request@ietf.org?subject=subscribe>
Errors-To: ltru-bounces@ietf.org

This sounds like a reasonable proposal and a good way to deal with 
providing information related to subtag choice (especially for the 
vexing and vague collections).

Addison

John Cowan wrote:
> A proposal:
> 
> The 4646bis registry should capture the language type information from
> ISO 639-3.  Each language, including macrolanguages, is labeled in -3
> as either living, extinct, ancient, historic, or constructed.  These
> terms are defined precisely at http://www.sil.org/iso639-3/types.asp .
> Though informative rather than normative, this is very useful to help
> shrink the large number of languages to a more manageable size.
> 
> The current draft of 639-3 contains 6989 living languages, 417
> extinct languages, 114 ancient languages, 53 historic languages,
> and 24 constructed languages.  The codes 'mul', 'und', and 'zxx' are
> special cases.
> 
> The registry should also capture the individual language vs.  language
> collection information from ISO 639-2.  If a code element appears in
> -2 but not in -3, it is a language collection; there are 68 such code
> elements.  Now that we have 639-3 code elements for essentially every
> language on the planet, language-collection subtags are extremely vague
> and provide little guidance to the recipient.
> 
> (I'm not really happy with the vagueness of "Language-Type", and would
> prefer "Language-Status", but it's the term used in the FDIS.)
> 
> 
> I propose the following language for 4646bis section 3.1.2:
> 
> o Language-Type
>         o Language-Type's field-body contains one of the values
>           'collection', 'extinct', 'ancient', 'historic', 'constructed',
>           or 'special'.  This field MUST NOT appear except in records
>           of type 'language'.
> 
> 
> And here's a draft of the new section 3.1.3.8:
> 
> 3.1.3.8.  Language-Type field
> 
>         The field 'Language-Type' MUST only appear in records whose
>         'Type' field-body is 'language'. This field MUST NOT appear
>         more than once in a record.  Most of the language records in
>         the registry represent individual living languages.  This field
>         indicates those which are not.
> 
>         The value 'collection' indicates a language collection appearing
>         in ISO 639-2 but not ISO 639-3.  The values 'extinct', 'ancient',
>         'historic', and 'constructed' indicate languages which are so
>         designated in ISO 639-3; precise definitions of these terms can
>         be found in that standard.  The value 'special' is used for the
>         three subtags 'mul', 'und', and 'zxx', which do not actually
>         designate languages at all.
> 
> 
> Finally, here's a rule for section 4.1:
> 
>         8.  Language subtags with a 'Language-Type' field of 'collection'
>         do not represent specific languages, and SHOULD NOT be used
>         unless more specific information is unavailable.
> 
> Appropriate adjustments would be needed to 3.3, 3.4, and 3.5 as well.
> We should be able to set this field if and when we ever register
> a language subtag directly, and change it when 639-3 changes.
> 

-- 
Addison Phillips
Globalization Architect -- Yahoo! Inc.

Internationalization is an architecture.
It is not a feature.

_______________________________________________
Ltru mailing list
Ltru@ietf.org
https://www1.ietf.org/mailman/listinfo/ltru