Re: [Ietf-languages] Fwd: I-D Action: draft-msporny-d-langtag-ext-00.txt

Martin J. Dürst <duerst@it.aoyama.ac.jp> Wed, 29 May 2019 07:20 UTC

Return-Path: <duerst@it.aoyama.ac.jp>
X-Original-To: ietf-languages@ietfa.amsl.com
Delivered-To: ietf-languages@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 784A5120108 for <ietf-languages@ietfa.amsl.com>; Wed, 29 May 2019 00:20:20 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.555
X-Spam-Level:
X-Spam-Status: No, score=-2.555 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, FROM_EXCESS_BASE64=0.979, RCVD_IN_DNSWL_MED=-2.3, SPF_HELO_NONE=0.001, SPF_SOFTFAIL=0.665] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (1024-bit key) header.d=itaoyama.onmicrosoft.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id zolfjNbMlaC8 for <ietf-languages@ietfa.amsl.com>; Wed, 29 May 2019 00:20:17 -0700 (PDT)
Received: from mork.alvestrand.no (mork.alvestrand.no [158.38.152.117]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 55013120044 for <ietf-languages@ietf.org>; Wed, 29 May 2019 00:20:17 -0700 (PDT)
Received: by mork.alvestrand.no (Postfix) id 3C46C7C07D0; Wed, 29 May 2019 09:20:15 +0200 (CEST)
Delivered-To: ietf-languages@alvestrand.no
Received: from localhost (localhost [127.0.0.1]) by mork.alvestrand.no (Postfix) with ESMTP id 25E0F7C079D for <ietf-languages@alvestrand.no>; Wed, 29 May 2019 09:20:15 +0200 (CEST)
X-Virus-Scanned: Debian amavisd-new at alvestrand.no
Authentication-Results: mork.alvestrand.no (amavisd-new); dkim=pass (1024-bit key) header.d=itaoyama.onmicrosoft.com
Received: from mork.alvestrand.no ([127.0.0.1]) by localhost (mork.alvestrand.no [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id dYU4nMcveYXY for <ietf-languages@alvestrand.no>; Wed, 29 May 2019 09:20:10 +0200 (CEST)
X-Greylist: from auto-whitelisted by SQLgrey-1.8.0
X-Greylist: from auto-whitelisted by SQLgrey-1.8.0
X-Comment: SPF skipped for whitelisted relay - client-ip=192.0.46.74; helo=pechora8.dc.icann.org; envelope-from=duerst@it.aoyama.ac.jp; receiver=ietf-languages@alvestrand.no
Received: from pechora8.dc.icann.org (pechora8.icann.org [192.0.46.74]) by mork.alvestrand.no (Postfix) with ESMTPS id 99BEF7C078D for <ietf-languages@alvestrand.no>; Wed, 29 May 2019 09:20:10 +0200 (CEST)
Received: from JPN01-OS2-obe.outbound.protection.outlook.com (mail-eopbgr1410128.outbound.protection.outlook.com [40.107.141.128]) (using TLSv1.2 with cipher AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by pechora8.dc.icann.org (Postfix) with ESMTPS id 7910FC026E for <ietf-languages@iana.org>; Wed, 29 May 2019 07:20:06 +0000 (UTC)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=itaoyama.onmicrosoft.com; s=selector1-itaoyama-onmicrosoft-com; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=lkwRL2OEfyPJbCFPsEbr2xrazgC3zFajNI1qCC8Dgwk=; b=S/98z/VltqdyH0RbLVNvI99H5HWIH7I2tMfE6YRZUxYpVde7RXpeWQtkT/yjxICTzU/aNKHJd3bBGmjB8TmQrJ8Apu82pAbulb1I1xiII/2L5H6XIrza5dyGoDzNoRg/pXwRG36r/4apfA2d4FcOsf3muWtI3tHrs5zHgNRfxLg=
Received: from OSAPR01MB1683.jpnprd01.prod.outlook.com (52.134.232.142) by OSAPR01MB3284.jpnprd01.prod.outlook.com (52.134.248.142) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.1922.17; Wed, 29 May 2019 07:19:42 +0000
Received: from OSAPR01MB1683.jpnprd01.prod.outlook.com ([fe80::dc8b:100a:3e3f:b226]) by OSAPR01MB1683.jpnprd01.prod.outlook.com ([fe80::dc8b:100a:3e3f:b226%4]) with mapi id 15.20.1922.021; Wed, 29 May 2019 07:19:42 +0000
From: "Martin J. Dürst" <duerst@it.aoyama.ac.jp>
To: Manu Sporny <msporny@digitalbazaar.com>, Mark Davis ☕️ <mark@macchiato.com>, Doug Ewell <doug@ewellic.org>, "Phillips, Addison" <addison@lab126.com>
CC: IETF Languages Discussion <ietf-languages@iana.org>
Thread-Topic: [Ietf-languages] Fwd: I-D Action: draft-msporny-d-langtag-ext-00.txt
Thread-Index: AQHVEz6iwfqaq4Xs40KUuCBo/SSZZA==
Date: Wed, 29 May 2019 07:19:42 +0000
Message-ID: <123ae48a-2b3f-cd4e-774e-a465cc740e13@it.aoyama.ac.jp>
References: <155881874982.30992.4869767614014356043@ietfa.amsl.com> <49b6a1de-e016-514f-90e4-24703b5818d2@it.aoyama.ac.jp> <63b4f786-8b44-ecdf-ed33-ff0567ecc839@digitalbazaar.com> <000001d51425$a48ac140$eda043c0$@ewellic.org> <CAJ2xs_EwKg3Tu5etk-ELXXd0u2Go-6TZbGm3QsBxV1upKTa8_g@mail.gmail.com> <0819b68a-56a1-d11d-db36-7e5510a8e971@digitalbazaar.com>
In-Reply-To: <0819b68a-56a1-d11d-db36-7e5510a8e971@digitalbazaar.com>
Accept-Language: en-US
Content-Language: en-US
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
x-clientproxiedby: TY2PR01CA0025.jpnprd01.prod.outlook.com (2603:1096:404:ce::13) To OSAPR01MB1683.jpnprd01.prod.outlook.com (2603:1096:603:2b::14)
authentication-results: spf=none (sender IP is ) smtp.mailfrom=duerst@it.aoyama.ac.jp;
x-ms-exchange-messagesentrepresentingtype: 1
x-originating-ip: [133.2.210.64]
x-ms-publictraffictype: Email
x-ms-office365-filtering-correlation-id: 2f1a86e3-5cc9-476c-2f4f-08d6e4060493
x-microsoft-antispam: BCL:0; PCL:0; RULEID:(2390118)(7020095)(4652040)(7021145)(8989299)(4534185)(7022145)(4603075)(4627221)(201702281549075)(8990200)(7048125)(7024125)(7025125)(7027125)(7023125)(5600148)(711020)(4605104)(1401327)(2017052603328)(7193020); SRVR:OSAPR01MB3284;
x-ms-traffictypediagnostic: OSAPR01MB3284:
x-ms-exchange-purlcount: 3
x-microsoft-antispam-prvs: <OSAPR01MB32841D17416E96E55A779535CA1F0@OSAPR01MB3284.jpnprd01.prod.outlook.com>
x-ms-oob-tlc-oobclassifiers: OLM:9508;
x-forefront-prvs: 0052308DC6
x-forefront-antispam-report: SFV:NSPM; SFS:(10019020)(376002)(366004)(346002)(396003)(39840400004)(136003)(189003)(199004)(26005)(2906002)(4326008)(85202003)(66446008)(386003)(71200400001)(11346002)(2616005)(66946007)(64756008)(786003)(6512007)(52116002)(74482002)(66066001)(71190400001)(86362001)(446003)(256004)(14444005)(66476007)(68736007)(476003)(73956011)(561944003)(8936002)(14454004)(6486002)(6306002)(110136005)(5660300002)(66556008)(53936002)(486006)(102836004)(85182001)(508600001)(6506007)(81156014)(81166006)(53546011)(76176011)(31696002)(316002)(305945005)(186003)(31686004)(6436002)(6116002)(7736002)(25786009)(99286004)(3846002)(8676002)(229853002)(6246003)(966005); DIR:OUT; SFP:1102; SCL:1; SRVR:OSAPR01MB3284; H:OSAPR01MB1683.jpnprd01.prod.outlook.com; FPR:; SPF:None; LANG:en; PTR:InfoNoRecords; MX:1; A:0;
received-spf: None (protection.outlook.com: it.aoyama.ac.jp does not designate permitted sender hosts)
x-ms-exchange-senderadcheck: 1
x-microsoft-antispam-message-info: F8sgx4qTwc4P6CeWdjwv0Tx9VyymtphgdYK8flPq9oF/nPv8Bklc5Zrmkr65NSCSewQ6SElrOTL5EX2NKYgyglESb5VScG63qV42n9yq8W8P1TJ7Cex6YE4UFHv19okAtytoB/1mABw4EcODUTg5hqZsxa/9JVOrcAPZEYRkedfekWM82Ao6neCuwH9ly6VSky+mHG63cd7zLbe+Jp3hXOn10F2Ghd8Fd+7Gm9lhqP3Dh9YxQBGmaTybN4RVS2hvP7j2pHZbO4mMJuV7cf6qslM96vYR/UkYkHZHZ0fB/osTNG4w4dujKhjTlbm/MZ5DbMQYPaZIC2BIn1ojEKaXlxhtetvxqTXQgokELYTeNkVb9pNXy0H3vZOZwVBrOYH3XGSqyaDsNxcxLxpYGQhzbesNKrizaiVjGzs/3iClH8s=
Content-Type: text/plain; charset="utf-8"
Content-ID: <F861AEF6BED24B458A4052A2051C4206@jpnprd01.prod.outlook.com>
Content-Transfer-Encoding: base64
MIME-Version: 1.0
X-OriginatorOrg: it.aoyama.ac.jp
X-MS-Exchange-CrossTenant-Network-Message-Id: 2f1a86e3-5cc9-476c-2f4f-08d6e4060493
X-MS-Exchange-CrossTenant-originalarrivaltime: 29 May 2019 07:19:42.2322 (UTC)
X-MS-Exchange-CrossTenant-fromentityheader: Hosted
X-MS-Exchange-CrossTenant-id: e02030e7-4d45-463e-a968-0290e738c18e
X-MS-Exchange-CrossTenant-mailboxtype: HOSTED
X-MS-Exchange-CrossTenant-userprincipalname: duerst@it.aoyama.ac.jp
X-Greylist: Sender passed SPF test, not delayed by milter-greylist-4.6.2 (pechora8.dc.icann.org [192.0.46.74]); Wed, 29 May 2019 07:20:06 +0000 (UTC)
X-MS-Exchange-Transport-CrossTenantHeadersStamped: OSAPR01MB3284
Archived-At: <https://mailarchive.ietf.org/arch/msg/ietf-languages/FNDxsLMAPdb8I1Bw6iyGlWZvuGM>
Subject: Re: [Ietf-languages] Fwd: I-D Action: draft-msporny-d-langtag-ext-00.txt
X-BeenThere: ietf-languages@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: <ietf-languages.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/ietf-languages>, <mailto:ietf-languages-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/ietf-languages/>
List-Post: <mailto:ietf-languages@ietf.org>
List-Help: <mailto:ietf-languages-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/ietf-languages>, <mailto:ietf-languages-request@ietf.org?subject=subscribe>
X-List-Received-Date: Wed, 29 May 2019 07:20:20 -0000

Hello Manu, others,

On 2019/05/28 05:10, Manu Sporny wrote:
> On 5/27/19 9:52 AM, Mark Davis ☕️ wrote:
>> I think what they are trying to do is shoehorn in a parameter that
>> lets them set the paragraph embedding level
>> (https://unicode.org/reports/tr9/#BD4) for the Bidi Algorithm.
> 
> Hmm, no, I don't think that's it...

It's pretty much it. To be more precise, maybe change it to "(paragraph) 
embedding level", because it could be intended for sub-paragraph 
embeddings, too.


>> 1. So from the tag "ar-Arab", we get the script "Arab". Then use
>> https://github.com/unicode-org/cldr/blob/master/common/properties/scriptMetadata.txt,

> which has a mapping from script to direction (RTL=YES). (I'm pointing to
>> trunk, just so people can read the file easily; one would use the
>> latest release.)
> 
> What about for something like this, where BiDi doesn't work?
> 
> HTML و CSS: تصميم و إنشاء مواقع الويب

What does not work here?
[Please be aware of the fact that not all email software will render 
this the same.]

Similar to Mark's "rindfleisch..." example, it's probably intended to be 
an Arabic sentence that starts with a word in Latin script and therefore 
risks to be displayed with overall (paragraph) direction of LTR. 
Unfortunately, the only Arabic word I can read here is "و" (and), so I'm 
not totally sure I'm right. It would be good to have explanations for 
examples.

>> It isn't that Arabic would be displayed left to right, it is what
>> establishes the paragraph ordering. The problem arises when you have
>> mixed text. Look at the following example, using the convention that
>> lowercase = English and uppercase=Arabic. The majority of the text
>> and the first strong character are both English, but the sentence is
>>   meant to be used in an Arabic environment, so the default paragraph
>>   embedding level needs to be RTL.
> 
> Yep.

Why are you agreeing here when this is just an explanation of the term 
"paragraph embedding direction" that Mark used at the start of the mail, 
where you disagreed to this?

>> 3. I also agree with Martin that the definition "automatically
>> detected" for subtag 'auto' is not adequate. How does it differ from
>> leaving off the D extension altogether?

I think there might be an important difference. Leaving off the D 
extension may mean one of two things:
1) Determine the embedding direction by some kind of "auto" algorithm
    (ideally clearly specified, desirably by reference)
2) Use the intrinsic directionality derived from the language (and maybe
    script and region) subtags

These two are not the same. If anything, we seem to move into the 
direction of 2), which would mean that "auto" may still be desirable.

> Folks have argued against `auto`, happy to remove it if that's what
> folks in this group think we should do.

It's good to know that you're flexible. But this is just a small detail, 
it could go either way. What we have to make sure is that we get the
big issues right (or "as right as possible").

> It was meant to achieve the same thing this achieves:
> 
> https://www.w3.org/TR/string-meta/#dom-localizable-dir

I'll note that this seems to prefer alternative 1), and we should 
clearly align one way or another if this is going to get deployed.

>> While this is true, for the fast majority of cases, LTR and RTL are
>> the important issues. Most computer systems don't really handle
>> vertical natively; one needs to have more specialized text processing
>> systems, and that is not, I imagine, the target for this syntax.
> 
> We're happy to add other directionalities that folks in this group think
> we should add.

Again a small issue. My personal understanding is that directionality is 
needed only for mixed directional stuff, and that's as in mixing LTR and 
RTL. Vertical writing is at a higher level.

>> 5. Given #4, the lack of a registry for the proposed extension, or
>> even the mention of one, is a significant problem. The set of exactly
>> 3 values associated with this extension ('ltr', 'rtl', and 'auto')
>> would be fixed; adding to it would require updating the RFC, which is
>> much more work than updating a registry.

> Sure, we can add a registry, I can make that change in the next version
> once it becomes clear that the proposal has merit and won't be rejected
> by this or the W3C i18n community.

This is again a small issue. I think it's good to have a registry if 
additions can be easily envisioned, but I can't imagine the need for any 
additions soon. And please note that an updating RFC could essentially 
only contain the new tags. It doesn't have to repeat the text in the 
original RFC.


>> Without these issues being addressed in a satisfactory way, I would
>> lobby IETF not to approve this I-D.
>>
>> I don't see that there is any reason to approve it, given that it is,
>> as far as I can tell, completely unnecessary and would just
>> complicate implementer's lives to no good end.
> 
> Given the new information above (links to use cases, background
> discussion), are you still of the opinion that there is no need for the
> extension?

The extension might be okay if your community can convince us that this 
extension will be strictly limited to those formats (and instances) 
where it's really needed, and that it's stripped or transferred to other 
syntactic elements (e.g. dir attribute for HTML) when that's more 
appropriate.

Regards,   Martin.