Re: [Ietf-languages] BCP47 violation in the recent extlang ajp change

Doug Ewell <doug@ewellic.org> Tue, 28 March 2023 21:47 UTC

From: Doug Ewell <doug@ewellic.org>
To: Hugh Paterson III <sil.linguist@gmail.com>
CC: "Martin J. Dürst" <duerst@it.aoyama.ac.jp>, Christian Despres <christian.j.j.despres@gmail.com>, "ietf-languages@ietf.org" <ietf-languages@ietf.org>
Date: Tue, 28 Mar 2023 21:46:50 +0000
Message-ID: <SJ0PR03MB6598669274C23AA002C9E324CA889@SJ0PR03MB6598.namprd03.prod.outlook.com>
References: <871qlbvbgo.fsf@gmail.com> <PH0PR03MB6606F4AFF9773C419EF09401CA8A9@PH0PR03MB6606.namprd03.prod.outlook.com> <7f1e4b72-fc25-bec8-9e36-cbffbdd6eeda@it.aoyama.ac.jp> <CAE=3Ky_fi5ixQp+gYJeV4kYAtr0BmTbRgiaLDUJLDAk8AE-xjA@mail.gmail.com> <SJ0PR03MB65981CE49906B746C69BCD03CA889@SJ0PR03MB6598.namprd03.prod.outlook.com> <CAE=3Ky8R3mKWn+JRnPo-V5m+g+W=jsQ+_T2g3xBS6Ct-p26MNg@mail.gmail.com>
In-Reply-To: <CAE=3Ky8R3mKWn+JRnPo-V5m+g+W=jsQ+_T2g3xBS6Ct-p26MNg@mail.gmail.com>
Archived-At: <https://mailarchive.ietf.org/arch/msg/ietf-languages/ekAVrDOkLkQZ6TNdYfyFna-5vaM>
Subject: Re: [Ietf-languages] BCP47 violation in the recent extlang ajp change

There is a basic difference. -x- is a “reserved entity” defined by BCP 47. It is part of the definition of BCP 47 language tags; it is accounted for in the ABNF.

-t- and -u- are not reserved in BCP 47, but rather are registered in the Language Tag Extensions Registry:

https://www.iana.org/assignments/language-tag-extensions-registry

which is much less well-known than the Language Subtag Registry, but is equally a part of BCP 47. Calling -t- and -u- reserved entities would be like saying that the language subtag ‘en’ is a reserved entity in BCP 47.
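
To make the distinction concrete, here is a minimal sketch in Python. It is an illustration only, not a validating BCP 47 parser: the function name, the status labels, and the hard-coded set of registered singletons are all assumptions made for this example. The point is that 'x' can be recognized from the grammar alone, while every other singleton has to be looked up in registry data.

```python
# Assumption: a hand-maintained snapshot of the singletons registered in the
# Language Tag Extensions Registry. Only -t- and -u- are named in this thread;
# a real implementation would load the set from the registry itself.
REGISTERED_EXTENSIONS = {"t", "u"}


def classify_singletons(tag):
    """Return (singleton, status) pairs for the singletons found in a tag.

    Hypothetical helper: it does not validate the tag, it only shows where
    the knowledge about each singleton comes from.
    """
    results = []
    for position, subtag in enumerate(tag.lower().split("-")):
        if len(subtag) != 1:
            continue
        if subtag == "x":
            # Known from the BCP 47 grammar itself; no registry lookup needed.
            results.append(("x", "private-use"))
            break  # everything after -x- is private use, so stop scanning
        if position == 0:
            continue  # e.g. a grandfathered "i-..." tag, not an extension
        if subtag in REGISTERED_EXTENSIONS:
            results.append((subtag, "registered"))
        else:
            results.append((subtag, "unregistered"))
    return results


print(classify_singletons("en-u-ca-gregory-x-foo"))
print(classify_singletons("en-a-bbb"))
```

In this sketch the only knowledge baked into the code is the private-use singleton; the status of everything else depends on the contents of REGISTERED_EXTENSIONS.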

A parser does not need to link to any RFC documents, but a fully compliant parser that supports extensions should link to both registries — the well-known LSR and the lesser-known LTER — as well as the data files used by the respective extensions, which in the case of -t- and -u- happen to be CLDR data files. Of course, someone building such a parser would need to refer to the separate RFCs in order to know this.
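
As a rough sketch of what linking to the LTER could look like, the Python below reads registry text in the record-jar layout (records separated by "%%", one "Field: value" per line). The sample text is an abbreviated illustration written for this example, not a verbatim copy of the IANA file; its File-Date is a placeholder, and the function name is hypothetical. A real parser would fetch the current registry rather than embed a copy.

```python
# Abbreviated, illustrative sample in the registry's record-jar layout.
# The File-Date value is a placeholder, and most fields are omitted.
SAMPLE_REGISTRY = """\
File-Date: 2000-01-01
%%
Identifier: t
Description: Specifying Transformed Content
RFC: 6497
%%
Identifier: u
Description: Unicode Locale
RFC: 6067
%%
"""


def parse_extension_registry(text):
    """Map each registered singleton to a dict of its record's fields."""
    records = {}
    for block in text.split("%%"):
        fields = {}
        last_field = None
        for line in block.strip().splitlines():
            if line[:1].isspace() and last_field:
                # Folded continuation line: append to the previous field.
                fields[last_field] += " " + line.strip()
                continue
            name, separator, value = line.partition(":")
            if separator:
                last_field = name.strip()
                fields[last_field] = value.strip()
        # The leading File-Date record has no Identifier and is skipped.
        if "Identifier" in fields:
            records[fields["Identifier"].lower()] = fields
    return records


registry = parse_extension_registry(SAMPLE_REGISTRY)
for singleton, record in sorted(registry.items()):
    print(singleton, "-> RFC", record["RFC"])
```

The defining RFC listed in each record is where an implementer would go next to find the data files that the extension depends on.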

The existence of the LTER is what prevents the Open Language Advocacy Community from registering a singleton that collides with -t- or -u-. The RFC process for such an extension would hopefully attract review from someone who understands how BCP 47 works, and who would object to the collision. Perhaps the LTER is the “database of singletons” you are referring to. It has existed since 2005, although for the first five years it was a stub, with no registered extensions until -u- came along.

--
Doug Ewell, CC, ALB | Lakewood, CO, US | ewellic.org


From: Hugh Paterson III <sil.linguist@gmail.com> 
Sent: Tuesday, March 28, 2023 15:26
To: Doug Ewell <doug@ewellic.org>
Cc: Martin J. Dürst <duerst@it.aoyama.ac.jp>; Christian Despres <christian.j.j.despres@gmail.com>; ietf-languages@ietf.org
Subject: Re: [Ietf-languages] BCP47 violation in the recent extlang ajp change

If -t and -u are not to be included in BCP-47 (as reserved entities), because they ought to be independently manageable (which I assume is the purpose for a separate RFC), then should the -x extension be kicked out to its own RFC? Why should it be included in RFC5646? Wouldn't the same reasons for inclusion or exclusion exist for all singletons? (§2.2/3.7).

Without linking and registration of the various RFC documents defining singletons, doesn't it mean that the only singleton a parser can depend on by implementing BCP-47 is -x-, as that is the only one acknowledged? Different communities could then define their own singletons, and those singletons could clash. For example, I could create an organization called the Open Language Advocacy Community and define a set of singletons, some of which may clash with RFC6067 and RFC6497. Presumably nothing is stopping some of these other singletons from becoming registered as RFCs. This situation may create confusion for parsers that expect -t and -u to be related to RFC6067 and RFC6497.

Maybe the more pertinent architectural approach is to have a database of singletons where registration is required, like the IANA database. On my first read of §2.2 and §3.7, I understood there to be only a single universe of singletons. Should I instead understand the current architecture as a multiverse of infinite options with regard to the semantics of any singleton other than -x-, with the caveat that in each world of the multiverse -x- will in fact have the same semantics?

Kind Regards,
Hugh