Re: [Ietf-languages] Suggestion to update Urdu Script Designation in the subtag registry

Doug Ewell <doug@ewellic.org> Thu, 13 August 2020 17:53 UTC

Return-Path: <doug@ewellic.org>
X-Original-To: ietf-languages@ietfa.amsl.com
Delivered-To: ietf-languages@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id F137B3A1062 for <ietf-languages@ietfa.amsl.com>; Thu, 13 Aug 2020 10:53:22 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.895
X-Spam-Level:
X-Spam-Status: No, score=-1.895 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, RCVD_IN_MSPIKE_H3=0.001, RCVD_IN_MSPIKE_WL=0.001, SPF_HELO_NONE=0.001, SPF_NONE=0.001, URIBL_BLOCKED=0.001] autolearn=ham autolearn_force=no
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id QX5Nnziv-M-0 for <ietf-languages@ietfa.amsl.com>; Thu, 13 Aug 2020 10:53:21 -0700 (PDT)
Received: from p3plsmtpa07-05.prod.phx3.secureserver.net (p3plsmtpa07-05.prod.phx3.secureserver.net [173.201.192.234]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id E99423A0FEE for <ietf-languages@ietf.org>; Thu, 13 Aug 2020 10:53:10 -0700 (PDT)
Received: from DESKTOPLPOB1E4 ([73.229.14.229]) by :SMTPAUTH: with ESMTPSA id 6HPZkSPWxkxIh6HPakaFzJ; Thu, 13 Aug 2020 10:53:10 -0700
X-CMAE-Analysis: v=2.3 cv=SPU8q9nH c=1 sm=1 tr=0 a=9XGd8Ajh92evfb2NHZFWmw==:117 a=9XGd8Ajh92evfb2NHZFWmw==:17 a=IkcTkHD0fZMA:10 a=nORFd0-XAAAA:8 a=48vgC7mUAAAA:8 a=SSmOFEACAAAA:8 a=xkP2vSgTXxwPQptJYjsA:9 a=rN-QoP3FR2cQ4e1s:21 a=rEH1WP8pzlkh4BkM:21 a=QEXdDO2ut3YA:10 a=AYkXoqVYie-NGRFAsbO8:22 a=w1C3t2QeGrPiZgrLijVG:22
X-SECURESERVER-ACCT: doug@ewellic.org
From: Doug Ewell <doug@ewellic.org>
To: 'Peter Constable' <pgcon6@msn.com>, 'r12a' <ishida@w3.org>, ietf-languages@ietf.org
References: <CY4PR0401MB36203305BEFEBF938B654E8FC6420@CY4PR0401MB3620.namprd04.prod.outlook.com> <000201d670e8$d25e7e60$771b7b20$@ewellic.org> <CY4PR0401MB362045E1E4D11D92E1F89443C6420@CY4PR0401MB3620.namprd04.prod.outlook.com> <001a01d670ed$9c868530$d5938f90$@ewellic.org> <f4fa9f5c-3bb6-6b27-f294-7df9e0afa3d4@w3.org> <MWHPR1301MB21120388068B8E68EB6C8DE586430@MWHPR1301MB2112.namprd13.prod.outlook.com>
In-Reply-To: <MWHPR1301MB21120388068B8E68EB6C8DE586430@MWHPR1301MB2112.namprd13.prod.outlook.com>
Date: Thu, 13 Aug 2020 11:53:10 -0600
Message-ID: <000001d6719a$9c3c7b40$d4b571c0$@ewellic.org>
MIME-Version: 1.0
Content-Type: text/plain; charset="utf-8"
Content-Transfer-Encoding: quoted-printable
X-Mailer: Microsoft Outlook 16.0
Thread-Index: AQH5rA+CwpVeTdY2lU0+kS8AwUmZoQHtL+XRAeUUEP4B+wQcGwHAtBlyATjuoXKoqZ5BgA==
Content-Language: en-us
X-CMAE-Envelope: MS4wfDP3dW7wyFH7y5V3kkC8FtCt2w/lkpWGqBElxhIL/EZFMDBwNHYvWRTcwOe5DMNEn5S+1nepPEkdM/37aBzTqZqZdNtg5l+CWceqIJnzptOVlU7SNIgV auBXElDZf24eJp+1Tb6GIZei0KjiMfxPNjQRcuISRqS4iKZp4mj3x7J2hXQB8MBjYq5t+SkDCGTw98Bk9njgZ60dv3lEuQ5h5pL1C6Ka6wdNB8hHX/1qxgZz
Archived-At: <https://mailarchive.ietf.org/arch/msg/ietf-languages/NENbu0pU033qoWW1-vafQV8-F9w>
Subject: Re: [Ietf-languages] Suggestion to update Urdu Script Designation in the subtag registry
X-BeenThere: ietf-languages@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: <ietf-languages.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/ietf-languages>, <mailto:ietf-languages-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/ietf-languages/>
List-Post: <mailto:ietf-languages@ietf.org>
List-Help: <mailto:ietf-languages-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/ietf-languages>, <mailto:ietf-languages-request@ietf.org?subject=subscribe>
X-List-Received-Date: Thu, 13 Aug 2020 17:53:25 -0000

For some of the styling scenarios Peter describes, I could imagine creating a BCP 47 extension, so taggers could write something like "fr-s-plaincap" or "fr-s-accntcap".

For the OP’s original scenario of Urdu in Nastaliq, I don’t have any issue with specifying “ur-Aran” when the choice of a Nastaliq font is considered important, especially since 'Aran' already exists. It's just not what Suppress-Script is for.

--
Doug Ewell | Thornton, CO, US | ewellic.org


From: Ietf-languages <ietf-languages-bounces@ietf.org> On Behalf Of Peter Constable
Sent: Thursday, August 13, 2020 10:30
To: r12a <ishida@w3.org>; ietf-languages@ietf.org
Subject: Re: [Ietf-languages] Suggestion to update Urdu Script Designation in the subtag registry

If typographic styles are to be indicated in a tag that is primarily intended to declare the _language_ of content, then what are the limits to that? 

For some languages written with Latin script, certain styles of fonts are preferred. For example, the Puyallup nature just south of where I live use fonts with a distinctive style. Should Puyallup content be marked up with a tag that includes something indicating that distinct style? If another language doesn’t normally use the Puyallup style fonts but some documents do, should those documents be tagged to indicate that style?

Or within a single language, there may be different typographic preferences. I’m sure you’re familiar with issues regarding accented capitals in French: one convention allows for them, while another convention would never have accented capitals. Do we need variant subtags to indicate such distinctions? Or take this a step further: there are lots of Latin ligatures that _could_ be used in a document but aren’t required. Should language tags reflect levels of ligation that are used?

Certainly there are stylistic preferences associated with some languages, such as Nastaliq for Urdu. (Mind, I suspect Urdu speakers wouldn’t expect or want a Nastaliq font to be used for UI strings on a phone.) I could perhaps see some benefit in having a way to qualify fonts for such distinctions, and to capture that language/style association as a default in something like CLDR. But aside from characterizing font resources, it’s not clear to me where is it would be beneficial to use ‘Aran’, either in a language tag or some other context.


Peter