Re: [Ietf-languages] Suggestion to update Urdu Script Designation in the subtag registry

Mark Davis ☕️ <mark@macchiato.com> Thu, 13 August 2020 18:11 UTC

Return-Path: <mark.edward.davis@gmail.com>
X-Original-To: ietf-languages@ietfa.amsl.com
Delivered-To: ietf-languages@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id D634B3A0F53 for <ietf-languages@ietfa.amsl.com>; Thu, 13 Aug 2020 11:11:14 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.884
X-Spam-Level:
X-Spam-Status: No, score=-1.884 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, FREEMAIL_FORGED_FROMDOMAIN=0.001, FREEMAIL_FROM=0.001, HEADER_FROM_DIFFERENT_DOMAINS=0.001, HTML_FONT_FACE_BAD=0.001, HTML_MESSAGE=0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, T_KAM_HTML_FONT_INVALID=0.01, URIBL_BLOCKED=0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=macchiato-com.20150623.gappssmtp.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id Ijs4xD8GlfPi for <ietf-languages@ietfa.amsl.com>; Thu, 13 Aug 2020 11:11:12 -0700 (PDT)
Received: from mail-qt1-x836.google.com (mail-qt1-x836.google.com [IPv6:2607:f8b0:4864:20::836]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 98DBB3A0F41 for <ietf-languages@ietf.org>; Thu, 13 Aug 2020 11:11:12 -0700 (PDT)
Received: by mail-qt1-x836.google.com with SMTP id c12so5062274qtn.9 for <ietf-languages@ietf.org>; Thu, 13 Aug 2020 11:11:12 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=macchiato-com.20150623.gappssmtp.com; s=20150623; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=jIvNEMJ8yAaRF3W9Ews/oWsmxFLMUiYxL1tzWhFOd3U=; b=QonFZBAYXOdHamiBvKk+ajDm+wA1/dCMV8m7g4p3dNsOyO/cx0QRNO0klEwDKjTgM3 uJP5HWfGIGsmOryGQe2y/zOIkDmGtWy8WRO2HArH71t8N9S03ejiCr9lqPPeVVEWX2r2 dEdJ3SCKapYPNK433kNsW98L3s1DhxMRKfhf//6kmnQdfuEEhmeQq1EMPaSJNgHdXl4z wflZJqcm8mD+cmfkQeqOzjhO9URVnlaoqkd8jgbiO3Yn4o+yt3p+zqEKNM0S9jSfZZuZ PAGW78ED2ATKqigvwvr7hAv6bZVy0Pq3chd7kul3bY7DSYMMmwSUyt2TroJDZaocvhMp IA0A==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=jIvNEMJ8yAaRF3W9Ews/oWsmxFLMUiYxL1tzWhFOd3U=; b=Ht93hPqNXySiNgJddwsZdnB/jbCmbj4GPwp/FCEoSdWFDPwE2dYEB7lcq6hAVblMlH KhrXr4Js0E4/9yWGwYwCFEEemC/5Nd0OXMeu0+P210gdhMCnXEcO8IpyaZz9B3ZIXX7C NnuznoG9ZENRgZsDTSboxnhBw32lrhjIRzrIKoYgZNIFxDKzslE1bkItwDuM0YW7b721 anXSPqBAIuvMIZiyKWqFvtYdrG4v5ld2rNVX6IJrw25XXXMWxiL6V65n531frWMAWdwC 2EeXdKpXo+3Xe3bziQdyNce/Kmvx2/NJLxDUdQGcSg8XfFNnUaq2hFejo87r4kn1seHW IrXA==
X-Gm-Message-State: AOAM53236rCPJBQUfpygvbV/uOk7Gz80Rm/ptu6/Wj9EC7K8Gc3jQY8D uQAQ6lD00/MxpX5EQbXVhHTFTQQr8qe/Q1FKwQA=
X-Google-Smtp-Source: ABdhPJwdvuk+95rMncAWx/pRyru26HLfIaQLzzIT9/cu20UCgxcJzpfGCz4rI6Y1JLsuPM/ZFEHdJONL7S82VkUzu7o=
X-Received: by 2002:ac8:6901:: with SMTP id e1mr6622169qtr.352.1597342271359; Thu, 13 Aug 2020 11:11:11 -0700 (PDT)
MIME-Version: 1.0
References: <CY4PR0401MB36203305BEFEBF938B654E8FC6420@CY4PR0401MB3620.namprd04.prod.outlook.com> <000201d670e8$d25e7e60$771b7b20$@ewellic.org> <CY4PR0401MB362045E1E4D11D92E1F89443C6420@CY4PR0401MB3620.namprd04.prod.outlook.com> <001a01d670ed$9c868530$d5938f90$@ewellic.org> <f4fa9f5c-3bb6-6b27-f294-7df9e0afa3d4@w3.org> <MWHPR1301MB21120388068B8E68EB6C8DE586430@MWHPR1301MB2112.namprd13.prod.outlook.com> <000001d6719a$9c3c7b40$d4b571c0$@ewellic.org>
In-Reply-To: <000001d6719a$9c3c7b40$d4b571c0$@ewellic.org>
From: Mark Davis ☕️ <mark@macchiato.com>
Date: Thu, 13 Aug 2020 11:10:59 -0700
Message-ID: <CAJ2xs_G8R+AsTmfbwqvWt2856Asqjn=2WBgWER2PTrXYPWodYw@mail.gmail.com>
To: Doug Ewell <doug@ewellic.org>
Cc: Peter Constable <pgcon6@msn.com>, r12a <ishida@w3.org>, ietf-languages@ietf.org
Content-Type: multipart/alternative; boundary="000000000000ab511d05acc63a73"
Archived-At: <https://mailarchive.ietf.org/arch/msg/ietf-languages/yTu7_Vbm5jJBD7Q-X-VT-irgNYU>
Subject: Re: [Ietf-languages] Suggestion to update Urdu Script Designation in the subtag registry
X-BeenThere: ietf-languages@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: <ietf-languages.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/ietf-languages>, <mailto:ietf-languages-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/ietf-languages/>
List-Post: <mailto:ietf-languages@ietf.org>
List-Help: <mailto:ietf-languages-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/ietf-languages>, <mailto:ietf-languages-request@ietf.org?subject=subscribe>
X-List-Received-Date: Thu, 13 Aug 2020 18:11:15 -0000

Mark


On Thu, Aug 13, 2020 at 10:53 AM Doug Ewell <doug@ewellic.org> wrote:

> For some of the styling scenarios Peter describes, I could imagine
> creating a BCP 47 extension, so taggers could write something like
> "fr-s-plaincap" or "fr-s-accntcap".
>
> For the OP’s original scenario of Urdu in Nastaliq, I don’t have any issue
> with specifying “ur-Aran” when the choice of a Nastaliq font is considered
> important, especially since 'Aran' already exists.


It is certainly legal to specify ur-Aran. However, as I remarked before, I
don't think it will accomplish what Daniel would intend with it — and in
the worst case could cause very significant degradation in quality (such a
request for a localized data for ur-Aran actually returning English
instead, even though ur is available).

If people don't understand the ramifications of using variant script tags,
they definitely shouldn't be using them.


> It's just not what Suppress-Script is for.
>

Definitely agreed.


> --
> Doug Ewell | Thornton, CO, US | ewellic.org
>
>
> From: Ietf-languages <ietf-languages-bounces@ietf.org> On Behalf Of Peter
> Constable
> Sent: Thursday, August 13, 2020 10:30
> To: r12a <ishida@w3.org>; ietf-languages@ietf.org
> Subject: Re: [Ietf-languages] Suggestion to update Urdu Script Designation
> in the subtag registry
>
> If typographic styles are to be indicated in a tag that is primarily
> intended to declare the _language_ of content, then what are the limits to
> that?
>
> For some languages written with Latin script, certain styles of fonts are
> preferred. For example, the Puyallup nature just south of where I live use
> fonts with a distinctive style. Should Puyallup content be marked up with a
> tag that includes something indicating that distinct style? If another
> language doesn’t normally use the Puyallup style fonts but some documents
> do, should those documents be tagged to indicate that style?
>
> Or within a single language, there may be different typographic
> preferences. I’m sure you’re familiar with issues regarding accented
> capitals in French: one convention allows for them, while another
> convention would never have accented capitals. Do we need variant subtags
> to indicate such distinctions? Or take this a step further: there are lots
> of Latin ligatures that _could_ be used in a document but aren’t required.
> Should language tags reflect levels of ligation that are used?
>
> Certainly there are stylistic preferences associated with some languages,
> such as Nastaliq for Urdu. (Mind, I suspect Urdu speakers wouldn’t expect
> or want a Nastaliq font to be used for UI strings on a phone.) I could
> perhaps see some benefit in having a way to qualify fonts for such
> distinctions, and to capture that language/style association as a default
> in something like CLDR. But aside from characterizing font resources, it’s
> not clear to me where is it would be beneficial to use ‘Aran’, either in a
> language tag or some other context.
>
>
> Peter
>
> _______________________________________________
> Ietf-languages mailing list
> Ietf-languages@ietf.org
> https://www.ietf.org/mailman/listinfo/ietf-languages
>