Re: [Ietf-languages] Suggestion to update Urdu Script Designation in the subtag registry

Doug Ewell <doug@ewellic.org> Wed, 12 August 2020 20:40 UTC

Return-Path: <doug@ewellic.org>
X-Original-To: ietf-languages@ietfa.amsl.com
Delivered-To: ietf-languages@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id A47C43A0B63 for <ietf-languages@ietfa.amsl.com>; Wed, 12 Aug 2020 13:40:33 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.797
X-Spam-Level:
X-Spam-Status: No, score=-1.797 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, HTML_MESSAGE=0.001, HTTPS_HTTP_MISMATCH=0.1, RCVD_IN_MSPIKE_H2=-0.001, SPF_HELO_NONE=0.001, SPF_NONE=0.001, URIBL_BLOCKED=0.001] autolearn=no autolearn_force=no
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id wVpNnB55SPGB for <ietf-languages@ietfa.amsl.com>; Wed, 12 Aug 2020 13:40:32 -0700 (PDT)
Received: from p3plsmtpa11-09.prod.phx3.secureserver.net (p3plsmtpa11-09.prod.phx3.secureserver.net [68.178.252.110]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id ECB043A0B5F for <ietf-languages@ietf.org>; Wed, 12 Aug 2020 13:40:31 -0700 (PDT)
Received: from DESKTOPLPOB1E4 ([73.229.14.229]) by :SMTPAUTH: with ESMTPSA id 5xXykbH1UC7Kr5xXzkaBwA; Wed, 12 Aug 2020 13:40:31 -0700
X-CMAE-Analysis: v=2.3 cv=Sb+JicZu c=1 sm=1 tr=0 a=9XGd8Ajh92evfb2NHZFWmw==:117 a=9XGd8Ajh92evfb2NHZFWmw==:17 a=DAwyPP_o2Byb1YXLmDAA:9 a=nORFd0-XAAAA:8 a=48vgC7mUAAAA:8 a=I0CVDw5ZAAAA:8 a=UqCG9HQmAAAA:8 a=2j2T3sEYAAAA:8 a=wW1_WCu_HD3j8vjvnDgA:9 a=QEXdDO2ut3YA:10 a=yMhMjlubAAAA:8 a=SSmOFEACAAAA:8 a=y-K1f1D7qHam0E4o84UA:9 a=dj-M7QBaLV6cO1ir:21 a=1T8W08O0ELdP9tgF:21 a=YIeUfpDp3pVkOUSD:21 a=gKO2Hq4RSVkA:10 a=UiCQ7L4-1S4A:10 a=hTZeC7Yk6K0A:10 a=frz4AuCg-hUA:10 a=AYkXoqVYie-NGRFAsbO8:22 a=w1C3t2QeGrPiZgrLijVG:22 a=YdXdGVBxRxTCRzIkH2Jn:22 a=ZTv5dIgTOczADi9TCs5s:22
X-SECURESERVER-ACCT: doug@ewellic.org
From: Doug Ewell <doug@ewellic.org>
To: 'Daniel LaVon Billings' <daniel=40ChurchofJesusChrist.org@dmarc.ietf.org>, ietf-languages@ietf.org
References: <CY4PR0401MB36203305BEFEBF938B654E8FC6420@CY4PR0401MB3620.namprd04.prod.outlook.com>
In-Reply-To: <CY4PR0401MB36203305BEFEBF938B654E8FC6420@CY4PR0401MB3620.namprd04.prod.outlook.com>
Date: Wed, 12 Aug 2020 14:40:31 -0600
Message-ID: <000201d670e8$d25e7e60$771b7b20$@ewellic.org>
MIME-Version: 1.0
Content-Type: multipart/alternative; boundary="----=_NextPart_000_0003_01D670B6.87C8F060"
X-Mailer: Microsoft Outlook 16.0
Thread-Index: AQH5rA+CwpVeTdY2lU0+kS8AwUmZoajub9BQ
Content-Language: en-us
X-CMAE-Envelope: MS4wfC+aHCw4GeTpDt+XpSDyMqaA2Fr1w8G7U4wDaZp7CmQLSRVGc4WThLYpJOfV+W2/wWlwVreJdKTyWvey4DzkzTmaudwHKF+7Vumehuf3J563Ml+ruF1F TQ50xznBjfm9dvKvzQAQ0C88u248wuVw9VTNAO1EmUBjnveFWkl3XmqJG5I1c3HtMbcq6EdWP+HbAer2x8R+2B9wEaZnmybvUNIVftPftmagwQPKAH1jJtNw bYSMeucVqAOrw0wN+e15BA==
Archived-At: <https://mailarchive.ietf.org/arch/msg/ietf-languages/-oCR8s8klsau5T0gbnPT852bqaA>
Subject: Re: [Ietf-languages] Suggestion to update Urdu Script Designation in the subtag registry
X-BeenThere: ietf-languages@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: <ietf-languages.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/ietf-languages>, <mailto:ietf-languages-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/ietf-languages/>
List-Post: <mailto:ietf-languages@ietf.org>
List-Help: <mailto:ietf-languages-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/ietf-languages>, <mailto:ietf-languages-request@ietf.org?subject=subscribe>
X-List-Received-Date: Wed, 12 Aug 2020 20:40:34 -0000

Hi Daniel,

 �

The purpose of Suppress-Script in BCP 47 is to improve compatibility between BCP 47 applications and those written to older specifications, which did not support script subtags.

 �

There was a great deal of concern, in the mid-’00s when RFC 4646 was being developed, that the new script subtags would be overused, so that, for example, users who previously tagged English content as “en” or “en-US” would start tagging it as “en-Latn” or “en-Latn-US” instead. This would add virtually no information to the tag, because English is normally written in the Latin script; but it could cause compatibility problems with processes that did not understand the script subtag. Suppress-Script was devised as a way to discourage users from adding unnecessary script subtags like this. It is optional, pragmatic, and suggestive in nature; it does not attempt to provide a scholarly reference about the language.

 �

By changing the Suppress-Script for Urdu from ‘Arab’ to ‘Aran’, we would be essentially saying that the tag “ur-Arab” does add significant information beyond the tag “ur”, which is not true (most Urdu content is indeed written in the Arabic script) and in my opinion would be a step backward. Note that there is no corresponding script subtag for “Arabic script (Naskh variant).”

 �

I suspect, somewhat echoing Peter, that most users do not even know ‘Aran’ exists or why it is separately encoded. While I understand some of the thought process behind this proposal, I agree that the change should not be made.

 �

--

Doug Ewell | Thornton, CO, US | ewellic.org

 �

 �

From: Ietf-languages <ietf-languages-bounces@ietf.org> On Behalf Of Daniel LaVon Billings
Sent: Wednesday, August 12, 2020 11:56
To: ietf-languages@ietf.org
Subject: [Ietf-languages] Suggestion to update Urdu Script Designation in the subtag registry

 �

Urdu is listed as Arab script reference in the subtag registry when it should have the newer approved Aran designation:

 �

https://www.iana.org/assignments/language-subtag-registry/language-subtag-registry <https://nam10.safelinks.protection.outlook.com/?url=https%3A%2F%2Fwww.iana.org%2Fassignments%2Flanguage-subtag-registry%2Flanguage-subtag-registry&data=02%7C01%7Cdaniel%40churchofjesuschrist.org%7C37060a5c0a8a4fb04c8c08d83ee334fe%7C61e6eeb35fd74aaaae3c61e8deb09b79%7C0%7C1%7C637328492850938946&sdata=6uXT5tBTUTGojI%2BkP2HtSYHJ%2BRzqKYTdnvvCZG79jhk%3D&reserved=0> 

 �

Urdu should be using the Aran script, not the Arab script:

 �

%%

Type: language

Subtag: ur

Description: Urdu

Added: 2005-10-16

Suppress-Script: Arab

%%

 �