Re: [Ietf-languages] Suggestion to update Urdu Script Designation in the subtag registry

Doug Ewell <doug@ewellic.org> Thu, 13 August 2020 19:57 UTC

Return-Path: <doug@ewellic.org>
X-Original-To: ietf-languages@ietfa.amsl.com
Delivered-To: ietf-languages@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id E593D3A10D9 for <ietf-languages@ietfa.amsl.com>; Thu, 13 Aug 2020 12:57:46 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.898
X-Spam-Level:
X-Spam-Status: No, score=-1.898 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, RCVD_IN_MSPIKE_H2=-0.001, SPF_HELO_NONE=0.001, SPF_NONE=0.001, URIBL_BLOCKED=0.001] autolearn=unavailable autolearn_force=no
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id MoG-NvL-xuRH for <ietf-languages@ietfa.amsl.com>; Thu, 13 Aug 2020 12:57:45 -0700 (PDT)
Received: from p3plsmtpa11-05.prod.phx3.secureserver.net (p3plsmtpa11-05.prod.phx3.secureserver.net [68.178.252.106]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id E5A0C3A10E4 for <ietf-languages@ietf.org>; Thu, 13 Aug 2020 12:57:45 -0700 (PDT)
Received: from DESKTOPLPOB1E4 ([73.229.14.229]) by :SMTPAUTH: with ESMTPSA id 6JM8kplKHe84G6JM8kQJwJ; Thu, 13 Aug 2020 12:57:45 -0700
X-CMAE-Analysis: v=2.3 cv=BJrNU2YG c=1 sm=1 tr=0 a=9XGd8Ajh92evfb2NHZFWmw==:117 a=9XGd8Ajh92evfb2NHZFWmw==:17 a=IkcTkHD0fZMA:10 a=nORFd0-XAAAA:8 a=zLzpCSqz5uLsK2FuTRcA:9 a=QEXdDO2ut3YA:10 a=AYkXoqVYie-NGRFAsbO8:22
X-SECURESERVER-ACCT: doug@ewellic.org
From: Doug Ewell <doug@ewellic.org>
To: 'Richard Wordingham' <richard.wordingham=40ntlworld.com@dmarc.ietf.org>, ietf-languages@ietf.org
References: <CY4PR0401MB36203305BEFEBF938B654E8FC6420@CY4PR0401MB3620.namprd04.prod.outlook.com> <000201d670e8$d25e7e60$771b7b20$@ewellic.org> <CY4PR0401MB362045E1E4D11D92E1F89443C6420@CY4PR0401MB3620.namprd04.prod.outlook.com> <001a01d670ed$9c868530$d5938f90$@ewellic.org> <f4fa9f5c-3bb6-6b27-f294-7df9e0afa3d4@w3.org> <MWHPR1301MB21120388068B8E68EB6C8DE586430@MWHPR1301MB2112.namprd13.prod.outlook.com> <20200813202922.7afc52bc@JRWUBU2>
In-Reply-To: <20200813202922.7afc52bc@JRWUBU2>
Date: Thu, 13 Aug 2020 13:57:45 -0600
Message-ID: <001901d671ac$036bd340$0a4379c0$@ewellic.org>
MIME-Version: 1.0
Content-Type: text/plain; charset="utf-8"
Content-Transfer-Encoding: quoted-printable
X-Mailer: Microsoft Outlook 16.0
Thread-Index: AQH5rA+CwpVeTdY2lU0+kS8AwUmZoQHtL+XRAeUUEP4B+wQcGwHAtBlyATjuoXIBu73LNKib5S2w
Content-Language: en-us
X-CMAE-Envelope: MS4wfIqmREKBKcuTCrMohHl09pB/9RzaNKLiqIT9zUjmEegFGd2A5IZNpFSPkC/H/aCJNDGVvVW4TtZb9okUsdJRJR3gt3BIuTp+qUrhW/M6puSzWof7yzC3 CPb4xrxYWbXj4BQwgxu4QxvzQv9G3aVcFulKsH325iPC9Jffox3f8nrFBPKk3sRsH1z/PwuSo3JN1xBSPe0205wgiGdaXbrkmboVbk2ABOy3LUywsWWmoXtI Ft+N3COGpPCDnihMPXIIwQ==
Archived-At: <https://mailarchive.ietf.org/arch/msg/ietf-languages/W9054QSbOSltSK-yq-Joa_gM--c>
Subject: Re: [Ietf-languages] Suggestion to update Urdu Script Designation in the subtag registry
X-BeenThere: ietf-languages@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: <ietf-languages.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/ietf-languages>, <mailto:ietf-languages-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/ietf-languages/>
List-Post: <mailto:ietf-languages@ietf.org>
List-Help: <mailto:ietf-languages-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/ietf-languages>, <mailto:ietf-languages-request@ietf.org?subject=subscribe>
X-List-Received-Date: Thu, 13 Aug 2020 19:57:54 -0000

Richard Wordingham wrote:

>> But aside from
>> characterizing font resources, it’s not clear to me where is it would
>> be beneficial to use ‘Aran’, either in a language tag or some other
>> context.
>
> That rather raises the question of why Aran is registered in BCP 47!
>
> However, I think Aran makes sense in catalogues, though my reasoning
> is based on the apparently analogous situation of Latn, Latf and Latg,
> but where Latn is not used if Latf or Latg were appropriate. [...]

The subtag 'Aran' is included in the Registry because the code element [Aran] is in ISO 15924, and we have to be extremely careful with the notion of cherry-picking code elements out of a core standard and leaving others behind. We don't know all the reasons why someone would want to use BCP 47, except that we all know (I hope) that web pages aren't the only use case. A tag or subtag that might not make any sense for one context might be very appropriate for another.

Initially, Daniel Billings simply asked for the Suppress-Script for 'ur' to be changed from 'Arab' to 'Aran', without stating the presumed benefit. Subsequent posts showed that he has document-processing applications which select a font based on the script subtag of the content, or based on the Suppress-Script of the language if no explicit script subtag is provided (I hope I have that right).

Well, specifying 'Aran' instead of assuming 'Arab' might make a lot of sense in that case, for printing documents in Urdu where the app doesn't already know what font to pick for Urdu. It might also be appropriate for cataloguing library holdings, as Richard said. It would make no sense at all for tagging plain text, because in plain text there is no difference between calligraphic styles. All of these are potential applications for BCP 47.

--
Doug Ewell | Thornton, CO, US | ewellic.org