Re: [Ietf-languages] BCP47 violation in the recent extlang ajp change

Doug Ewell <doug@ewellic.org> Sun, 26 March 2023 19:16 UTC

From: Doug Ewell <doug@ewellic.org>
To: Christian Despres <christian.j.j.despres@gmail.com>, "ietf-languages@ietf.org" <ietf-languages@ietf.org>
Thread-Topic: [Ietf-languages] BCP47 violation in the recent extlang ajp change
Thread-Index: AQHZX/oCI288uzwZC0it7pmfXCzLQq8NYxrA
Date: Sun, 26 Mar 2023 19:16:42 +0000
Message-ID: <PH0PR03MB6606F4AFF9773C419EF09401CA8A9@PH0PR03MB6606.namprd03.prod.outlook.com>
References: <871qlbvbgo.fsf@gmail.com>
In-Reply-To: <871qlbvbgo.fsf@gmail.com>
Archived-At: <https://mailarchive.ietf.org/arch/msg/ietf-languages/5Z-LSHYhuwd0zov4Oj9Hbeisp3I>

Christian Despres wrote:

> In the recent IANA subtag registry update, the extlang "ajp" was
> deprecated, and its preferred value was changed from "ajp" to "apc".
> I believe this is in violation of the RFC5646 component of BCP47,
> specifically item (3) of
> <https://www.rfc-editor.org/rfc/rfc5646.html#section-2.2.2>, which
> states that the values of the subtag and preferred value fields of an
> extlang registry entry MUST be the same. This should be reverted.

Christian is right, and I will post forms shortly to correct this situation.

The same scenario occurred four years ago, when I noticed the violation of Section 2.2.2 in the deprecation of extlang subtags for ‘lsg’ (Lyons Sign Language), ‘rsi’ (Rennellese Sign Language), and ‘yds’ (Yiddish Sign Language). These subtags were deprecated in separate ISO 639-3 change cycles between 2015 and 2018. The story can be read at https://mailarchive.ietf.org/arch/msg/ietf-languages/_uxUBEppvHL1Xkq1JKWb2lpk_Fo/ .

The conflict is between Section 2.2.2, which requires an extlang’s Preferred-Value (P-V) to match its Subtag value (as Christian noted, and as I noted in 2019), and Section 3.1.7, which says that the P-V of a deprecated subtag should be updated as necessary to avoid “chaining” of subtags (‘xxx’ → ‘yyy’ → ‘zzz’). In 2019, it was agreed (at least by silent assent) that Section 2.2.2 took precedence, and we need to do the same here, at the very least for consistency.
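
To illustrate what “chaining” means in practice, here is a minimal Python sketch, not taken from RFC 5646 or any existing tool. The function name collapse_chains and the mapping are hypothetical, and ‘xxx’, ‘yyy’, ‘zzz’ are the placeholders used above, not real subtags. It rewrites a Preferred-Value mapping so that each deprecated subtag points directly at its final replacement:

```python
def collapse_chains(preferred: dict[str, str]) -> dict[str, str]:
    """Return a copy in which every subtag maps to the end of its chain.

    `preferred` is a hypothetical {subtag: Preferred-Value} mapping.
    """
    collapsed = {}
    for subtag, target in preferred.items():
        seen = {subtag}
        # Follow the chain until the target is no longer a key in the
        # mapping, or until a loop (e.g. a self-reference) is detected.
        while target in preferred and target not in seen:
            seen.add(target)
            target = preferred[target]
        collapsed[subtag] = target
    return collapsed


chained = {"xxx": "yyy", "yyy": "zzz"}
print(collapse_chains(chained))  # {'xxx': 'zzz', 'yyy': 'zzz'}
```

An extlang whose P-V equals its own Subtag is a self-reference in this model and is left unchanged, which is why the Section 2.2.2 requirement leaves a two-step lookup for processors to perform.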

As far as I can tell, this situation has never occurred before with a spoken language (where the Preferred-Value and Prefix match), only with a sign language (where the Prefix is always “sgn”). Nor has it ever occurred when the deprecated subtag represented a language that was merged into another, instead of declared non-existent (so that the primary language subtag has its P-V changed, instead of removed). The same rules for retaining the extlang’s P-V apply, though.

Note that this correction affects only the extlang, not the primary language subtag for ‘ajp’, which correctly had its P-V changed to “apc”.
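
The Section 2.2.2 requirement is easy to check mechanically. The following Python sketch is only an illustration: the Record class and extlang_violations function are hypothetical names, and the records are trimmed to the fields discussed in this thread (Type, Subtag, Preferred-Value, Prefix), not copied from the registry.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Record:
    type: str                              # "language" or "extlang"
    subtag: str
    preferred_value: Optional[str] = None
    prefix: Optional[str] = None


def extlang_violations(records):
    """Return extlang records whose Preferred-Value differs from their Subtag."""
    return [r for r in records
            if r.type == "extlang" and r.preferred_value != r.subtag]


# State described in this thread, before the correction.
before_fix = [
    Record("language", "ajp", preferred_value="apc"),
    Record("extlang", "ajp", preferred_value="apc", prefix="ar"),
]
# State after the correction: only the extlang record changes.
after_fix = [
    Record("language", "ajp", preferred_value="apc"),
    Record("extlang", "ajp", preferred_value="ajp", prefix="ar"),
]

print([r.subtag for r in extlang_violations(before_fix)])  # ['ajp']
print(extlang_violations(after_fix))                       # []
```

The check deliberately ignores records of Type “language”, since the rule applies only to extlangs.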

> The way I believe this sort of situation is supposed to be handled in
> tag normalization is that a compliant processor will first convert the
> tag "ar-ajp" to simply "ajp", and then notice that the "ajp" is itself
> deprecated, at which point it will replace "ajp" with "apc". (The spec
> itself could probably be modified to permit an extlang's preferred
> value to differ from the value of the subtag itself without any issue,
> but that's beside the point).
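
The two-step handling described in the quoted text can be sketched as follows. This is a minimal Python illustration under stated assumptions, not a full RFC 5646 canonicalizer: the tables EXTLANG and LANGUAGE_PREFERRED and the function normalize are hypothetical, contain only the ‘ajp’ entries discussed in this thread, and assume well-formed input whose language and extlang subtags are already lowercase.

```python
# Hypothetical, heavily trimmed view of the registry (assumption: the
# extlang record keeps Preferred-Value "ajp", as Section 2.2.2 requires).
EXTLANG = {"ajp": {"prefix": "ar", "preferred": "ajp"}}
LANGUAGE_PREFERRED = {"ajp": "apc"}


def normalize(tag: str) -> str:
    subtags = tag.split("-")
    # Step 1: replace "prefix-extlang" with the extlang's Preferred-Value.
    if len(subtags) >= 2:
        entry = EXTLANG.get(subtags[1])
        if entry is not None and entry["prefix"] == subtags[0]:
            subtags = [entry["preferred"]] + subtags[2:]
    # Step 2: replace a deprecated primary language subtag, if any.
    subtags[0] = LANGUAGE_PREFERRED.get(subtags[0], subtags[0])
    return "-".join(subtags)


print(normalize("ar-ajp"))     # apc
print(normalize("ar-ajp-JO"))  # apc-JO
print(normalize("ajp"))        # apc
```

Because step 2 always runs after step 1, the extlang record can keep its own Subtag as its Preferred-Value and the tag still ends up as “apc”.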

In a perfect world, we could keep RFC 5646 and publish a short addendum clarifying how to resolve the conflict between Sections 2.2.2 and 3.1.7.

In the IETF, however, I believe it would be necessary to publish a new RFC that replaces RFC 5646 in its entirety, all 84 pages. That would not only restart the whole long, painful RFC process that many of us hope to avoid, but would also open up all of RFC 5646 to being reevaluated and criticized. For example, there would likely be calls to replace or supplement ISO 639-3–based subtags with Glottolog, which would be a severe setback for stability and usability of BCP 47.

> Sorry if this is not the appropriate place to raise this concern.

It is exactly the right place. Thank you for bringing it here.

Issues like this are one of the main reasons we have a two-week review period for all change proposals, during which all list members are encouraged to check the proposals for all types of problems, including but not limited to BCP 47 violations.

--
Doug Ewell, CC, ALB | Lakewood, CO, US | ewellic.org