Re: [Ltru] Re: Remove extlang from ABNF?

Addison Phillips <addison@yahoo-inc.com> Wed, 12 December 2007 16:23 UTC

Message-ID: <47600AD4.6000500@yahoo-inc.com>
Date: Wed, 12 Dec 2007 08:22:44 -0800
From: Addison Phillips <addison@yahoo-inc.com>
User-Agent: Thunderbird 2.0.0.9 (Windows/20071031)
MIME-Version: 1.0
To: Felix Sasaki <fsasaki@w3.org>
Subject: Re: [Ltru] Re: Remove extlang from ABNF?
References: <E1J01vI-0003cW-Rd@megatron.ietf.org> <019601c83818$b06c3070$6601a8c0@DGBP7M81> <DDB6DE6E9D27DD478AE6D1BBBB83579561E51429AA@NA-EXMSG-C117.redmond.corp.microsoft.com> <6.0.0.20.2.20071211163740.0a090850@localhost> <475E8342.1080206@w3.org> <DDB6DE6E9D27DD478AE6D1BBBB83579561E52A6F79@NA-EXMSG-C117.redmond.corp.microsoft.com> <475F2439.6020007@w3.org> <6.0.0.20.2.20071212153653.0ae521d0@localhost> <475FE141.8010601@w3.org>
In-Reply-To: <475FE141.8010601@w3.org>
Content-Type: text/plain; charset="UTF-8"; format="flowed"
Content-Transfer-Encoding: 7bit
Cc: LTRU Working Group <ltru@ietf.org>

Felix Sasaki wrote:
>>
>> In RFC 4646, we defined some tags as well-formed. In RFC 4646bis, we
>> suddenly say that some of these tags are not well-formed. We never
>> would do this for valid tags, so I'm really not sure it is a good
>> thing to do for well-formed tags.
>>   
> +1
> 

I agree.... except....

This is exactly what we did in RFC 4646. We made a vast array of 
"well-formed" but invalid tags illegal (by narrowing the ABNF). In 
4646bis, one could say that we were doing the same thing--making a (much 
smaller) array of tags (which were never valid) illegal. And we *have* 
changed the ABNF in a manner that narrows it in 4646bis already. We have 
narrowed the grandfathered production substantially.

I have no problem with XML Schema or others referencing the 4646 ABNF 
instead of the 4646bis ABNF for well-formedness checking. It won't 
introduce anything particularly bad. And I am somewhat allergic to 
changing the ABNF, because I have personally felt that we should resist 
tampering with it.

As an implementer, though, I really hate supporting extlang, now that it 
does nothing. So I'd propose:

1. Remove extlang from the ABNF in 4646bis. That would make the language 
production:

language      = (2*3ALPHA)             ; shortest ISO 639 code
              / 4ALPHA                 ; reserved for future use
              / 5*8ALPHA               ; registered language subtag


2. In the section on conformance, permit 4646bis well-formedness to 
reference either the current ABNF or an "obs-language" production that 
looks like:

obs-language  = (2*3ALPHA [ extlang ]) ; shortest ISO 639 code
              / 4ALPHA                 ; reserved for future use
              / 5*8ALPHA               ; registered language subtag

extlang       = *3("-" 3ALPHA)         ; removed in this version

3. Add a note saying that no tags were ever valid under obs-language, 
but that some processors may permit them. Also note that 3066 
well-formedness differed substantially from 4646/4646bis well-formedness 
and provide the 3066 production in that section for completeness.
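
To make the difference between the two productions concrete, here is a 
rough Python sketch that transcribes "language" and "obs-language" into 
regular expressions. The function and constant names are made up for 
illustration, and it checks only the language production in isolation, 
not a complete tag:

```python
import re

# Hypothetical helper names; the patterns are a direct transcription of
# the "language" and "obs-language" productions proposed in this message.
ALPHA = "[A-Za-z]"

# language = 2*3ALPHA / 4ALPHA / 5*8ALPHA
LANGUAGE = re.compile(
    "(?:" + ALPHA + "{2,3}|" + ALPHA + "{4}|" + ALPHA + "{5,8})"
)

# extlang = *3("-" 3ALPHA)
EXTLANG = "(?:-" + ALPHA + "{3}){0,3}"

# obs-language = (2*3ALPHA [ extlang ]) / 4ALPHA / 5*8ALPHA
OBS_LANGUAGE = re.compile(
    "(?:" + ALPHA + "{2,3}" + EXTLANG + "|" + ALPHA + "{4}|" + ALPHA + "{5,8})"
)


def is_language(text):
    """True if text matches the proposed 4646bis language production."""
    return LANGUAGE.fullmatch(text) is not None


def is_obs_language(text):
    """True if text matches the obs-language production (extlang allowed)."""
    return OBS_LANGUAGE.fullmatch(text) is not None


if __name__ == "__main__":
    for candidate in ("en", "zh-yue", "abcd-yue"):
        print(candidate, is_language(candidate), is_obs_language(candidate))
```

Under this sketch, a string such as "zh-yue" fails the narrowed 
production but still passes obs-language, which is the gap item 2 is 
meant to cover for processors that want to stay lenient.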

Addison

-- 
Addison Phillips
Globalization Architect -- Yahoo! Inc.
Chair -- W3C Internationalization Core WG

Internationalization is an architecture.
It is not a feature.

