Re: [Ietf-languages] Suggestion to update Urdu Script Designation in the subtag registry

Doug Ewell <doug@ewellic.org> Thu, 13 August 2020 16:20 UTC

From: Doug Ewell <doug@ewellic.org>
To: 'Hugh Paterson III' <sil.linguist@gmail.com>, 'r12a' <ishida@w3.org>, richard.wordingham@ntlworld.com
Cc: 'Daniel LaVon Billings' <daniel=40ChurchofJesusChrist.org@dmarc.ietf.org>, ietf-languages@ietf.org
Date: Thu, 13 Aug 2020 10:20:17 -0600
Archived-At: <https://mailarchive.ietf.org/arch/msg/ietf-languages/NcwIfqSpEqL_LG3RPCRiDAfvEZM>

Hugh Paterson III wrote:

> In Nigeria Hausa can also be written with the Latin script. Where can
> I go to find what the basic default assumptions are for a language
> tag? Is the default always Latin?

I realize you know most of what follows, because SIL. Much of this explanation is for others.

When I want to find out details like this about a language, I first go to the ISO 639-3/RA site:
https://iso639-3.sil.org/code/hau

That points me to four sites for more information:
Ethnologue:	https://www.ethnologue.com/language/hau
Glottolog:	https://glottolog.org/glottolog?iso=hau#12/11.1513/8.7804
Multitree:	http://www.multitree.org/codes/hau.html
Wikipedia:	https://en.wikipedia.org/wiki/Hausa_language

Unfortunately I can no longer recommend Ethnologue to anyone except professionals in the field, because it now requires a minimal subscription at USD 480 per year (more capable subscriptions cost much, much more) to see basic information like speaker population, location, and writing system.

Glottolog is focused on language family assignments, on which experts often disagree, but it does include location and alternate names, as well as a ton of references.

Multitree appears to be down at present, so I can't say anything about it except that. My understanding is that it is also focused on relationships between languages.

Wikipedia is what it always is: There's a ton of information; the question is whether you feel you can trust the expertise and accuracy of the contributors.

The IANA Language Subtag Registry was never intended to be an encyclopedic reference to languages, their names, or their usage. It lists the subtags corresponding to languages, scripts, regions, and language varieties that are necessary to construct good and meaningful tags for identifying and locating content.

There is no "default" writing system expressed or implied in either the Registry or BCP 47. Suppress-Script, as stated earlier, is a mechanism to help people avoid using script subtags that are unlikely to add useful information. But to find out what script is most common for any given language, or other important information about the language, encyclopedic references are often needed. And the absence of a Suppress-Script field for a language subtag is no indication of anything.
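To make that last point concrete, here is a minimal sketch of reading the Suppress-Script field out of registry records. It assumes the record-jar layout the Registry uses (records separated by "%%" lines, "Field: value" pairs, folded continuation lines). The function names and the two abbreviated sample records are my own illustration, not an official tool, and the records should be checked against the live Registry before being relied on.

```python
import re
from typing import Dict, List, Optional

# Abbreviated sample records for illustration only (an assumption of this
# sketch); the real Registry holds many more records and fields.
SAMPLE = """\
Type: language
Subtag: en
Description: English
Added: 2005-10-16
Suppress-Script: Latn
%%
Type: language
Subtag: zh
Description: Chinese
Added: 2005-10-16
Scope: macrolanguage
"""


def parse_records(text: str) -> List[Dict[str, str]]:
    """Split record-jar text into one dict per record.

    Simplification: a repeated field (e.g. several Description lines)
    keeps only its last value.
    """
    records = []
    for chunk in re.split(r"^%%\s*$", text, flags=re.MULTILINE):
        record: Dict[str, str] = {}
        last_field = None
        for line in chunk.splitlines():
            if not line.strip():
                continue
            if line[0].isspace() and last_field is not None:
                # Folded continuation of the previous field's value.
                record[last_field] += " " + line.strip()
                continue
            name, _, value = line.partition(":")
            last_field = name.strip()
            record[last_field] = value.strip()
        if record:
            records.append(record)
    return records


def suppress_script(records: List[Dict[str, str]], subtag: str) -> Optional[str]:
    """Return the Suppress-Script value for a language subtag, or None.

    None means only that the field is absent (or the subtag was not found);
    it says nothing about which script the language is usually written in.
    """
    for record in records:
        if record.get("Type") == "language" and record.get("Subtag") == subtag.lower():
            return record.get("Suppress-Script")
    return None


records = parse_records(SAMPLE)
print(suppress_script(records, "en"))
print(suppress_script(records, "zh"))
```

A result of None tells a tagger nothing more than "no hint available here"; choosing whether to include a script subtag still depends on knowing the language from other references.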

Richard Wordingham replied:

> Unless there is a suppress-script tag for the language, the default is
> not normatively defined.

There is nothing normative about Suppress-Script.

--
Doug Ewell | Thornton, CO, US | ewellic.org