Re: Anomaly in upcoming registry

"Doug Ewell" <doug@ewellic.org> Mon, 06 July 2009 03:57 UTC

Return-Path: <doug@ewellic.org>
X-Original-To: ietf-languages@alvestrand.no
Delivered-To: ietf-languages@alvestrand.no
Received: from localhost (localhost [127.0.0.1]) by eikenes.alvestrand.no (Postfix) with ESMTP id 322CC39E676 for <ietf-languages@alvestrand.no>; Mon, 6 Jul 2009 05:57:14 +0200 (CEST)
X-Virus-Scanned: Debian amavisd-new at eikenes.alvestrand.no
Received: from eikenes.alvestrand.no ([127.0.0.1]) by localhost (eikenes.alvestrand.no [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 7lpy4+An+LBv for <ietf-languages@alvestrand.no>; Mon, 6 Jul 2009 05:57:10 +0200 (CEST)
X-Greylist: from auto-whitelisted by SQLgrey-1.6.8
Received: from pechora2.lax.icann.org (pechora2.icann.org [208.77.188.37]) by eikenes.alvestrand.no (Postfix) with ESMTPS id 1BE6139E1DD for <ietf-languages@alvestrand.no>; Mon, 6 Jul 2009 05:57:09 +0200 (CEST)
Received: from smtpauth19.prod.mesa1.secureserver.net (smtpauth19.prod.mesa1.secureserver.net [64.202.165.30]) by pechora2.lax.icann.org (8.13.8/8.13.8) with SMTP id n663ulWC004069 for <ietf-languages@iana.org>; Sun, 5 Jul 2009 20:57:08 -0700
Received: (qmail 8508 invoked from network); 6 Jul 2009 03:56:47 -0000
Received: from unknown (67.166.27.148) by smtpauth19.prod.mesa1.secureserver.net (64.202.165.30) with ESMTP; 06 Jul 2009 03:56:46 -0000
Message-ID: <73EDDA54297144D596F9A9D4B8128F5C@DGBP7M81>
From: Doug Ewell <doug@ewellic.org>
To: ietf-languages@iana.org
Subject: Re: Anomaly in upcoming registry
Date: Sun, 05 Jul 2009 21:56:39 -0600
MIME-Version: 1.0
Content-Type: text/plain; format="flowed"; charset="utf-8"; reply-type="original"
Content-Transfer-Encoding: 8bit
X-Priority: 3
X-MSMail-Priority: Normal
X-Mailer: Microsoft Outlook Express 6.00.2900.5512
X-MimeOLE: Produced By Microsoft MimeOLE V6.00.2900.5579
X-Greylist: Sender DNS name whitelisted, not delayed by milter-greylist-4.0 (pechora2.lax.icann.org [208.77.188.37]); Sun, 05 Jul 2009 20:57:08 -0700 (PDT)
X-BeenThere: ietf-languages@alvestrand.no
X-Mailman-Version: 2.1.9
Precedence: list
List-Id: IETF Language tag discussions <ietf-languages.alvestrand.no>
List-Unsubscribe: <http://www.alvestrand.no/mailman/listinfo/ietf-languages>, <mailto:ietf-languages-request@alvestrand.no?subject=unsubscribe>
List-Archive: <http://www.alvestrand.no/pipermail/ietf-languages>
List-Post: <mailto:ietf-languages@alvestrand.no>
List-Help: <mailto:ietf-languages-request@alvestrand.no?subject=help>
List-Subscribe: <http://www.alvestrand.no/mailman/listinfo/ietf-languages>, <mailto:ietf-languages-request@alvestrand.no?subject=subscribe>
X-List-Received-Date: Mon, 06 Jul 2009 03:57:14 -0000

CE Whitehead <cewcathar at hotmail dot com> wrote:

> Hi, I am not sure the code should have been 'retired'; that was done I 
> assume since it was a macrolanguage and a three-letter code was 
> preferred??, but I really do not know why it was 'retired'.  However, 
> as far as I can tell, a mixture of three-letter and two-letter codes 
> ([ar], [zh]) are used for macrolanguages too!

This is total speculation, and totally wrong.

As has been stated already on this list, "sh" was deprecated/withdrawn 
from ISO 639-1 back in 2000 because the individual code elements "sr" 
and "hr" overlapped it, and were considered preferable by the RA.  ISO 
639-3 and its concept of macrolanguages had not been created yet.

See http://www.loc.gov/standards/iso639-2/php/code_changes.php for the 
real explanation.  Search about 3/4 of the way down the page for the 
term "Serbo-Croatian."

Two-letter code elements are assigned in ISO 639-1; three-letter code 
elements are assigned in ISO 639-2.  All two-letter code elements 
assigned in 639-1 have a corresponding three-letter code element in 
639-2, except for "sh".  There would be no intrinsic reason to "prefer" 
a three-letter code element over a two-letter one.

Macrolanguages in draft-4646bis and draft-4645bis are taken directly 
from ISO 639-3, which assigns three-letter code elements compatible with 
those in 639-2, but also cross-references the corresponding two-letter 
code elements.  Extended language subtags (extlangs) are taken from a 
small subset of these, plus "sgn".  Each of the languages involved was 
chosen irrespective of whether a two-letter code element exists for that 
language.

Sorry to say it, but with all of these errors in interpretation and 
understanding, it might have been better if your post had stayed 
undeliverable.

--
Doug Ewell  *  Thornton, Colorado, USA  *  RFC 4645  *  UTN #14
http://www.ewellic.org
http://www1.ietf.org/html.charters/ltru-charter.html
http://www.alvestrand.no/mailman/listinfo/ietf-languages  ˆ