[Ietf-languages] ISO 639 changes (was: RE: Latin Sub tags)

Doug Ewell <doug@ewellic.org> Fri, 01 December 2023 23:06 UTC

Return-Path: <doug@ewellic.org>
X-Original-To: ietf-languages@ietfa.amsl.com
Delivered-To: ietf-languages@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id E6DFAC14F60D for <ietf-languages@ietfa.amsl.com>; Fri, 1 Dec 2023 15:06:52 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -6.905
X-Spam-Level:
X-Spam-Status: No, score=-6.905 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, RCVD_IN_DNSWL_HI=-5, RCVD_IN_ZEN_BLOCKED_OPENDNS=0.001, SPF_FAIL=0.001, SPF_HELO_NONE=0.001, T_SCC_BODY_TEXT_LINE=-0.01, URIBL_DBL_BLOCKED_OPENDNS=0.001, URIBL_ZEN_BLOCKED_OPENDNS=0.001] autolearn=ham autolearn_force=no
Received: from mail.ietf.org ([50.223.129.194]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 7OPs4_DjlB0O for <ietf-languages@ietfa.amsl.com>; Fri, 1 Dec 2023 15:06:48 -0800 (PST)
Received: from out.mail.icann.org (out.mail.icann.org [64.78.33.6]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 6C0EBC14F5F1 for <ietf-languages@ietf.org>; Fri, 1 Dec 2023 15:06:48 -0800 (PST)
Received: from MBX112-W2-CO-1.pexch112.icann.org (10.226.41.128) by MBX112-W2-CO-2.pexch112.icann.org (10.226.41.130) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.2.1258.27; Fri, 1 Dec 2023 15:06:47 -0800
Received: from aesmt112-co-1-1.serverpod.net (10.224.74.75) by MBX112-W2-CO-1.pexch112.icann.org (10.226.41.129) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.2.1258.27 via Frontend Transport; Fri, 1 Dec 2023 15:06:47 -0800
Received: from aesc112-co-1-2.serverpod.net (aesc112-co-1-2.serverpod.net [10.224.76.91]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange ECDHE (P-256) server-signature RSA-PSS (4096 bits) server-digest SHA256) (No client certificate requested) by aesmt112-co-1.serverpod.net (Postfix) with ESMTPS id 1B5F140002 for <ietf-languages@ex.icann.org>; Fri, 1 Dec 2023 15:06:47 -0800 (PST)
Received: from exmx112-co-1-2.serverpod.net (exmx112-co-1-2.serverpod.net [10.224.72.74]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange ECDHE (P-256) server-signature RSA-PSS (4096 bits) server-digest SHA256) (No client certificate requested) by aesmt112-co-1.serverpod.net (Postfix) with ESMTPS id E81F7120002 for <ietf-languages@ex.icann.org>; Fri, 1 Dec 2023 15:06:46 -0800 (PST)
Received: from pechora5.dc.icann.org (pechora5.icann.org [192.0.46.71]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange ECDHE (P-256) server-signature RSA-PSS (4096 bits) server-digest SHA256) (No client certificate requested) by west.smtp.mx.icann.org (Postfix) with ESMTPS id 86F19140002 for <ietf-languages@ex.icann.org>; Fri, 1 Dec 2023 15:06:46 -0800 (PST)
Received: from NAM02-BN1-obe.outbound.protection.outlook.com (mail-bn1nam02on20612.outbound.protection.outlook.com [IPv6:2a01:111:f400:7eb2::612]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by pechora5.dc.icann.org (Postfix) with ESMTPS id 7157D7000343 for <ietf-languages@iana.org>; Fri, 1 Dec 2023 23:06:45 +0000 (UTC)
ARC-Seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=fSZGjSuJ/Peb4YsNulKZ/dyDpxU5Ehp2ysq0HWo3/bTjEukfWqH8ZE4NnsKMCAXaT5ZTedqEljzuGERjEgRpEBAa9kV47APgYyyxgrsIw5+eC/XHebILr+Zi+YzkgOScvTpYarxA5MxJzV9I25Pc2FGlry54+mFQQL/y+R7oybcAPEjQN35LbzcbPk5mdkc+Bv/wpMzMjS86hJdrDM4B4K1RTgP+/bPV3Z2itOGZnP0h3na/Jpv5eBZadw299vlyfTaTrWTxGvZVGdTLSr9ZQlFM0TVbJXOoweleGxRGozgfGUPRAbrlpOQWU6mYCb1lY8VEwmj3+NMqdGpaCKlwHQ==
ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-AntiSpam-MessageData-ChunkCount:X-MS-Exchange-AntiSpam-MessageData-0:X-MS-Exchange-AntiSpam-MessageData-1; bh=vvzRNf9a9BAkO0XYkXbNQzFl4U+NugKGjBfS09c1wOM=; b=N6aCbKDWfe7XhK/S4X5fDjtrqSm+cQpYdsNOSGhmA2AQoj2hwTu4VoTd937Yxw0dc6apcfw1xHRQHF+uwLPnzDmE0smNhgeYHQx8+MInyLeQCMqRbRqd4zFjSZPUbfDLX+29OW0s42OZ5wVF9hQyafqGHoY5bVxnRprkH3xRR+jtjIt2UvNMnbjijpX7Cc1WL9I783qlC6SMbv1at3JgBOL1yoeHTmwLqoe9EVGccwnUG/3tGsg0hn2O+MkXUjMsQXHW/+7EDwXHoMPqaffYw7cPvmpHd8ouHuRYGQ8g8I58u3cyvwg0q6E81kxJeUgPJzf5KB971GBWoJpL6Q65ng==
ARC-Authentication-Results: i=1; mx.microsoft.com 1; spf=pass smtp.mailfrom=ewellic.org; dmarc=pass action=none header.from=ewellic.org; dkim=pass header.d=ewellic.org; arc=none
Received: from SJ0PR03MB6598.namprd03.prod.outlook.com (2603:10b6:a03:38a::21) by SA1PR03MB7053.namprd03.prod.outlook.com (2603:10b6:806:326::20) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.7068.15; Fri, 1 Dec 2023 23:06:18 +0000
Received: from SJ0PR03MB6598.namprd03.prod.outlook.com ([fe80::747c:c94a:1920:ba25]) by SJ0PR03MB6598.namprd03.prod.outlook.com ([fe80::747c:c94a:1920:ba25%4]) with mapi id 15.20.7068.012; Fri, 1 Dec 2023 23:06:17 +0000
From: Doug Ewell <doug@ewellic.org>
To: Hugh Paterson III <sil.linguist@gmail.com>
CC: "drude@xs4all.nl" <drude@xs4all.nl>, IETF Languages Discussion <ietf-languages@iana.org>
Thread-Topic: ISO 639 changes (was: RE: [Ietf-languages] Latin Sub tags)
Thread-Index: AQHaJKr9YYOc5GSMSkikqw/eukwKeQ==
Date: Fri, 01 Dec 2023 23:06:17 +0000
Message-ID: <SJ0PR03MB659804784A2F3D418ABDB216CA81A@SJ0PR03MB6598.namprd03.prod.outlook.com>
References: <CAE=3Ky-swzNn1hXba=muJF_radLugdKxhJ=u-_DcLiysDbothw@mail.gmail.com> <SJ0PR03MB6598925FCBA24238F417BE25CA82A@SJ0PR03MB6598.namprd03.prod.outlook.com> <CAE=3Ky_g3Sd7eBNq7H7_5BvkP-2qCbyYzQdO_eCSDzc70kQbJQ@mail.gmail.com> <012401da23b4$a9a291a0$fce7b4e0$@xs4all.nl> <SJ0PR03MB6598D48CD059645543661CA4CA81A@SJ0PR03MB6598.namprd03.prod.outlook.com> <018b01da2497$5094ed90$f1bec8b0$@xs4all.nl> <SJ0PR03MB65988A4F7614E585DF2BA6CBCA81A@SJ0PR03MB6598.namprd03.prod.outlook.com> <CAE=3Ky-jSJPg4pdKFbeabzHjOHzppasFV88J_v22RxKhATzxHA@mail.gmail.com>
In-Reply-To: <CAE=3Ky-jSJPg4pdKFbeabzHjOHzppasFV88J_v22RxKhATzxHA@mail.gmail.com>
Accept-Language: en-US
Content-Language: en-US
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
authentication-results: dkim=none (message not signed) header.d=none;dmarc=none action=none header.from=ewellic.org;
x-ms-publictraffictype: Email
x-ms-traffictypediagnostic: SJ0PR03MB6598:EE_|SA1PR03MB7053:EE_
x-ms-office365-filtering-correlation-id: 9c9a0623-de38-436e-474c-08dbf2c21fb3
x-ms-exchange-senderadcheck: 1
x-ms-exchange-antispam-relay: 0
x-microsoft-antispam: BCL:0;
x-microsoft-antispam-message-info: ysEnpKxBJo0gHX7d7B/sMV/GdhHsqaJFVSO6FrwXoV7h+Eb9SwPeY3P7YbvL2GZpDRIyjlhwOXiGHHt/IU7GmjKv0Vqh4m0C3SsaumfOvQSzA7oXsjE9dwOeqhLhxiH14YDSY5ZTeiNpy04c8eOfrlqErxzUMo4HL5rmophVb41hCoZXzNR7MfLW/L0EquFv6ihJia2ehIEwMtlSEq72Keddsgur0p2UQPdrkEUJUwb9RO8Gp84GWoLnZOSCTm9cLmrdivUMtg1F/MBXwF9JPvY83FwNAaCbCntOszyU2CBzt5I/tKJMUuikROkOiHuwBD0xGmf8WNo1bdzSNSazQX7PljHsEBbx7B7VeomuP2LINtaa73kW0AwbS/FpIasL1Y33NUAWGL2d5ZdSrYPN209O+LgAqPeEABxirg+f63okxRI/bkhkV0/MsUBX78I3v0PchxsZvlmxXiEYUtZxatjBLd4vCR8AhlMSrZVEo4waoRSQrfPQkwbGZSfc6Zp3zu9qRRma9/WUqcTOazlx9xxXLEjUMIKAxJGcqHngy5zJYutkFzpS9apuNVJnhsWBr0jAbTZ/fiRtVkq5Cw9dmEXaGgoVMNjwtKYRlubwStckjRWA9Yi3WEuBN0eCqcrl
x-forefront-antispam-report: CIP:255.255.255.255; CTRY:; LANG:en; SCL:1; SRV:; IPV:NLI; SFV:NSPM; H:SJ0PR03MB6598.namprd03.prod.outlook.com; PTR:; CAT:NONE; SFS:(13230031)(396003)(136003)(366004)(39830400003)(346002)(376002)(230922051799003)(451199024)(64100799003)(186009)(1800799012)(55016003)(478600001)(6506007)(71200400001)(7696005)(9686003)(122000001)(38070700009)(38100700002)(86362001)(33656002)(66946007)(5660300002)(66446008)(64756008)(54906003)(76116006)(41300700001)(66476007)(2906002)(66556008)(83380400001)(52536014)(8676002)(8936002)(6916009)(316002)(4326008); DIR:OUT; SFP:1101;
x-ms-exchange-antispam-messagedata-chunkcount: 1
x-ms-exchange-antispam-messagedata-0: fKETU2LMzj/kPfXz7OXNfcqscwuuMa01z6ueKvvEuYDjNbuRLqj1nBhFx1ofFCBsyXaii4AByCrmLvy4iQMBQYmsGXxZO0hyUgB0zbwEoVcFRny55LIx4vyd55GcBTw1Y9FIfBy7inJcccnMGyQYJUxKUV3R6T9oF1dCnngy5XOzuhjwXpaY1iOfZuzauK0wsD3fGKZHDY9xHNRyMjE+VA1BqwYB+8coQPhqtSA7lH4HOqAXgSEpyjFmpPy1ICQX9RNWqjg6TlbB2tcTbhIJEwHlkqLHvi+AVY3Wl508ptbwkb4LQmdFlBQAuT704TP9tPAkM6s329tOURRs+1dN7CFzcrHBh/JKmqHNdpINEYnVcd9+CrhVCRz2z5uRZhWH6w/uVgtGKO7RmJlvuRnjwolB5OPErfS8pQ1WTw1/VBff1yC/+kEiOcH1X1AObCm0ahpCy5g2+fk1v7gJju5P/ktioqMM+Mp8lQSHd8Olfk5JmEUvN2bZ5fMnwFSi2007EOCoPe2YfiVM+QZV4wN2zphFzcdQU5IOvw52byTTaLypCsuAIkY7W6HjPQXOXQoEAqHJaNnNbatzO8E2u5eY6m92ooBbmVgKpPA7XJwd8BjRyHq7q6Xvmi+bYmnHTo5m7r1WHoQE+5fAbhSLIDkPfeKx0haWkpCJjIfRh1fsMm2ky3EReXZ8vHvs1o05ha3Uar9Fvj/4swvDAOrAuQTznrXGbzSnKhtzjZxgQfVVdiV7Tf+bqOj9kU1bbEJZaJHBgLOQ0Tv5Kr1PVF4OoMcWNmhOpMrqw6V3cIC/vTb5s8O3LW5xsk9H4s5E5ReK84GV61PTPlECTUGQWTufjNA1vLBKjz6vEKMCg7bcZ0seU7j8HCAX+lUeWpLe5Rh6fJpPrJqddstdLQ0heYYKMMwdD7kVukmI251FKHJ6HtoCosisZqduU2CC1sUniepVH7ReBbq82rF7RZ1G1iIjCid9y10hx9e7Mn73YSThbQ3rRVm69qPPf/FjWu5ZOFlOfue9bx7TGu5NHeAGjHCtCkXi7Ll/0uw3R/3T39te3BA+CluE0Fg5vJ/bvB1x+NEBMDA6/tyRftwIsjLYEE8hLsAj4/Z6jtO90uFEEwBd2VY3IC3W7ib0QG0h7T9Z9P1/uUdP6b2R3yLUZyDPYvKG43U7Z+EA9xgSFjhSoY7b3Jgu5whEdMCUVGQGfjPHRHoZ+5zw7LL2zI0DLCgkWsFye+oCFKFHn8Cyv3mr5G6u98tQ7urcwbZyBM3eUMZ+P//JTun8cQFtU6F2/z+0D383i/tHgl0xFImyyg6OqygVwpeE3tZkXwtOonuNTezK30awmKvVqjXeF9ihPMVtoezE7D2+QWdj1ZZrTdFWKj/I+9QMn5POAbMhsCxKRJWbnTGJ7Il5TidMQRYpze+yESszdPO01CC9VBcy0MwMSCcXtm6hn+cQnTqbrjLSN5HHR1TPWB6eHRS4D7Ohv683I7S3IFKUzH9df9Ti1Tct+DqXpW5EL730HL/k86lR+8B97xDxTmGc1jUWACBmfO7ephxfHT8roZ99iaAhG+gOkn1+A6luqvdSfB89H0L4BBxXRzfo2JVWzVPIsXhZz4f7AGRz40RJzezWHxUWdEbvvllsmS7+Iiw=
Content-Type: text/plain; charset="utf-8"
Content-Transfer-Encoding: base64
MIME-Version: 1.0
X-MS-Exchange-CrossTenant-AuthAs: Internal
X-MS-Exchange-CrossTenant-AuthSource: SJ0PR03MB6598.namprd03.prod.outlook.com
X-MS-Exchange-CrossTenant-Network-Message-Id: 9c9a0623-de38-436e-474c-08dbf2c21fb3
X-MS-Exchange-CrossTenant-originalarrivaltime: 01 Dec 2023 23:06:17.2058 (UTC)
X-MS-Exchange-CrossTenant-fromentityheader: Hosted
X-MS-Exchange-CrossTenant-id: af914547-9fbe-40e1-a852-1a58e1f247dc
X-MS-Exchange-CrossTenant-mailboxtype: HOSTED
X-MS-Exchange-CrossTenant-userprincipalname: +G1FaBBmJl5ndxgg4qlD+ZwdR0JruXh6L7OleRvfAjdb3Q23tg8vakWs/vRFLNUemruO3UxaBvQkHmFNVDIwYA==
X-MS-Exchange-Transport-CrossTenantHeadersStamped: SA1PR03MB7053
X-CMAE-Score: 0
X-CMAE-Analysis: v=2.4 cv=a9IjSGeF c=1 sm=1 tr=0 ts=656a6706 a=SRSNG2tq5TuPtBd2c0+Vew==:117 a=SRSNG2tq5TuPtBd2c0+Vew==:17 a=xqWC_Br6kY4A:10 a=IkcTkHD0fZMA:10 a=e2cXIFwxEfEA:10 a=NdtbqBCgZM0A:10 a=nORFd0-XAAAA:8 a=4xV5D755uwok-75AUCIA:9 a=QEXdDO2ut3YA:10 a=AYkXoqVYie-NGRFAsbO8:22
X-SOURCE-IP: 192.0.46.71
X-SPF-STATUS: hard_fail
X-SPF-FROM-STATUS: not_checked
X-RDNS-STATUS: pass
X-HELO-STRING: pechora5.dc.icann.org
Spam-Stopper-Id: 82dd1c31-3ed1-427a-964d-0a399eb1e490
Spam-Stopper-v2: Yes
X-Envelope-Mail-From: doug@ewellic.org
X-AES-Category: LEGIT
X-AES-Analytics-Data: eyJ0aW1lc3RhbXAiOiAiMjAyMy0xMi0wMVQyMzowNjo0Ny4wMTVaIiwgIm1lc3NhZ2VUcmFja2luZyI6IHsiaGFuZGxpbmciOiBbIlRISVJEIFBBUlRZIEJZUEFTUyJdLCAidW5pZmllZENhdGVnb3J5IjogIlVOQ0FURUdPUklTRUQifSwgImVuZ2luZXMiOiB7fX0=
X-Spam-Category: None
X-Spam-Reasons: None
X-Auto-Response-Suppress: DR, OOF, AutoReply
Archived-At: <https://mailarchive.ietf.org/arch/msg/ietf-languages/o7DT4RqMY3y2ALtulWQFv9HtEcw>
Subject: [Ietf-languages] ISO 639 changes (was: RE: Latin Sub tags)
X-BeenThere: ietf-languages@ietf.org
X-Mailman-Version: 2.1.39
Precedence: list
List-Id: "Review of requests for language tag registration according to BCP 47 \(RFC 4646\)" <ietf-languages.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/ietf-languages>, <mailto:ietf-languages-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/ietf-languages/>
List-Post: <mailto:ietf-languages@ietf.org>
List-Help: <mailto:ietf-languages-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/ietf-languages>, <mailto:ietf-languages-request@ietf.org?subject=subscribe>
X-List-Received-Date: Fri, 01 Dec 2023 23:06:53 -0000

Hugh Paterson III wrote:

> As a matter of practicality, it would be helpful for my processes if
> the format of the data files would remain tab delimited or CSV.

In my perfect Barbie world, the data files would look identical to the current 639-3 files, as I have built many tools that assume that format. The main one takes the 639-3 files as input and builds a mock, partial LSR, from which I extract proposed records that are used in the manual process of creating registration forms. Much of my overall process is manual, which is why it takes me days or weeks to get the BCP 47 changes onto this list, but the tools help a lot.

Of course, we are not in a perfect world, but software can always be updated or rewritten to accommodate new input formats. The important thing is to know in detail what changes will be made, if any, and when, so that we can continue to propagate data from the core standards into the LSR accurately and efficiently.

> Sebastian mentions a "database format". I can imagine that as an SQL
> dump or JSON file, which are valid and in some cases useful formats.

I didn’t see anything about a “database format” — perhaps I missed that — but he did mention databases several times. That doesn’t necessarily mean the publicly available format has to reflect the database architecture. I don’t know what database 639-3/RA is currently using, and thankfully I don’t have to.

> I am curious how the MA will approach the matter of data versioning.
> This is something that some of my colleagues and I have often desired
> from downloaded data (from various sources) in the BCP-47 space.
> Versions allow us to quickly look at the state of our systems and
> determine if we need to update data.

The first line of the LSR is the “File-Date” record, which is intended as a version number of sorts, and should make it clear whether you are up to date. This value has been duplicated eight times since 2006, when there have been errors in the LSR that required an immediate fix. Possibly a “Version” record could be added to further disambiguate, but with only eight such short-lived anomalies out of the 159 versions of the LSR generated since October 2005, this has not been reported to cause any problems.

I do have a local copy of all 159 versions, and all four versions of the LTER, but I almost never refer to them except to look up “fun facts” as in this post.

--
Doug Ewell, CC, ALB | Lakewood, CO, US | ewellic.org