Re: Language Subtag Registration

Felix Sasaki <fsasaki@w3.org> Thu, 29 October 2015 20:27 UTC

Subject: Re: Language Subtag Registration
From: Felix Sasaki <fsasaki@w3.org>
In-Reply-To: <B2CA9A39-FEE0-492D-A846-91CA6364CC4B@evertype.com>
Date: Fri, 30 Oct 2015 05:27:23 +0900
Message-Id: <B83C5195-DD81-4376-B157-16569F8D6293@w3.org>
References: <55F61E82.8030106@moisan.ca> <1BF02550-02CA-463D-B011-445966506C49@evertype.com> <FD0AA4FB-FB59-4AB8-8BD7-A5C6776CF750@w3.org> <D6BFD1B7-4D93-4458-AEC2-C29153D030FA@evertype.com> <3B678307-6F04-4FA0-B80C-E7FA4E86550A@w3.org> <FE62AC5A-E6DE-4802-A8FC-F759207813B5@evertype.com> <19366746-228E-4039-BF48-6EE87B8FE890@w3.org> <B2CA9A39-FEE0-492D-A846-91CA6364CC4B@evertype.com>
To: Michael Everson <everson@evertype.com>
Cc: ietflang IETF Languages Discussion <ietf-languages@iana.org>, amir.aharoni@mail.huji.ac.il

> On 29.10.2015 at 14:24, Michael Everson <everson@evertype.com> wrote:
> 
> On 28 Oct 2015, at 22:45, Felix Sasaki <fsasaki@w3.org> wrote:
>> 
>>> Perhaps this can be finessed. Either we say “Wikipedia Simple Language Version” with en as the prefix adding fr or de or ru later, or to keep “Wikipedia Simple English” and add “Wikipedia Simple French” etc later at need. 
>> 
>> The issue is that the notion of simple language may differ severely among different Wikipedia language versions.
> 
> What, in morphology, vocabulary, and syntax? Sure: they’d be forms of distinct languages. Hence the prefix. 
> 
>> So if the purpose of the extension is to cover Wikipedia Simple English, this should be made explicit in the subtag itself and not in the prefix. And there may then be a need later to create other subtags for other Wikipedia language versions.
> 
> The prefix proposed is en, since as yet there are no fr, de, or ru Simple Wikipedias, though these have been discussed. The subtag proposed is wpsimple because for any Simple Wikipedia there will be house-style guidelines which define the content. 
> 
> Less precise than Basic English, for example. But nevertheless, defined and implemented. 
> 
>> My point is that each community behind a given Wikipedia language version will likely say: we want our own language (subtag) identifier. 
> 
> I don’t see why you make this assumption.

This comes from experience with the accessibility community, which is keen to avoid providing machine-readable identifiers for simple languages. See the recent related subtag discussion on this list. The accessibility community has good reasons for avoiding such identifiers, given the variety of simple languages. Against this background, I think defining a general wpsimple subtag is a bad idea, or at least we should make sure that the accessibility community has been heard first. My impression is that they account for many of the content creators in the simple language realm, whether they edit in Wikipedia or in other contexts.

Best,

Felix

> If an eventual Simple French Wikipedia were implemented, the Language Committee would simply tell them “Your prefix will be fr-wpsimple.” There would be no need for such a community to apply for a subtag. 
> 
>> The generalization of simple language to cover several language versions is problematic, like the generalization of sign languages was problematic (and is now an approach of the past).
> 
> Scouse differs from standard English by having a set of lexical and phonological differences. Simple English differs from standard English by being defined and implemented according to certain defined strictures. Both should have en- and both a subtag. 
> 
> The sign language generalization is handy for librarians attempting to catalogue a class of items. That’s a different thing from what we’re doing with data, true.
> 
> Michael
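
For readers following the prefix question, here is a minimal sketch of how a variant subtag bound to a list of prefixes might be checked. Everything in it is an assumption for illustration: "wpsimple" is only the subtag proposed in this thread, the PROPOSED_VARIANTS table and the uses_listed_prefix function are hypothetical names, and the matching rule (the part of the tag before the variant equals a listed prefix or starts with it) is a simplification rather than a full BCP 47 implementation.

```python
# Hypothetical table: variant subtag -> set of prefixes it is listed with.
# "wpsimple" is the subtag proposed in this thread, with "en" as its prefix.
PROPOSED_VARIANTS = {"wpsimple": {"en"}}


def uses_listed_prefix(tag, registry=PROPOSED_VARIANTS):
    """Return True if every known variant in `tag` follows a listed prefix.

    Simplified rule (an assumption): the part of the tag before the variant
    must equal a listed prefix or begin with that prefix plus a hyphen.
    """
    subtags = tag.lower().split("-")
    for i, sub in enumerate(subtags):
        if sub in registry:
            before = "-".join(subtags[:i])
            if not any(
                before == prefix or before.startswith(prefix + "-")
                for prefix in registry[sub]
            ):
                return False
    return True


if __name__ == "__main__":
    print(uses_listed_prefix("en-wpsimple"))
    print(uses_listed_prefix("fr-wpsimple"))
    # Modelling a later "fr" prefix for the same subtag:
    print(uses_listed_prefix("fr-wpsimple", {"wpsimple": {"en", "fr"}}))
```

In this sketch, adding "fr" to the prefix set corresponds to reusing one wpsimple subtag across languages, while the alternative raised in the thread would be a separate table entry per Wikipedia language version.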