Re: [Ietf-languages] Fwd: I-D Action: draft-msporny-d-langtag-ext-00.txt

Manu Sporny <msporny@digitalbazaar.com> Mon, 27 May 2019 20:10 UTC

To: Mark Davis ☕️ <mark@macchiato.com>, Doug Ewell <doug@ewellic.org>, "Phillips, Addison" <addison@lab126.com>
Cc: "Martin J. Dürst" <duerst@it.aoyama.ac.jp>, IETF Languages Discussion <ietf-languages@iana.org>
References: <155881874982.30992.4869767614014356043@ietfa.amsl.com> <49b6a1de-e016-514f-90e4-24703b5818d2@it.aoyama.ac.jp> <63b4f786-8b44-ecdf-ed33-ff0567ecc839@digitalbazaar.com> <000001d51425$a48ac140$eda043c0$@ewellic.org> <CAJ2xs_EwKg3Tu5etk-ELXXd0u2Go-6TZbGm3QsBxV1upKTa8_g@mail.gmail.com>
From: Manu Sporny <msporny@digitalbazaar.com>
Message-ID: <0819b68a-56a1-d11d-db36-7e5510a8e971@digitalbazaar.com>
Date: Mon, 27 May 2019 16:10:22 -0400
Archived-At: <https://mailarchive.ietf.org/arch/msg/ietf-languages/Yxu_gy4rqiDiy4e-ZR1NtTdWcHs>
Subject: Re: [Ietf-languages] Fwd: I-D Action: draft-msporny-d-langtag-ext-00.txt

On 5/27/19 9:52 AM, Mark Davis ☕️ wrote:
> I think what they are trying to do is shoehorn in a parameter that 
> lets them set the paragraph embedding level 
> (https://unicode.org/reports/tr9/#BD4) for the Bidi Algorithm.

Hmm, no, I don't think that's it... Here's some background, Mark:

https://github.com/w3c/rdf-dir-literal/issues/3#issuecomment-496004819

... and why we're having this discussion:

https://github.com/w3c/rdf-dir-literal/issues/3#issuecomment-496006350

... and the currently proposed i18n text in the Verifiable
Credentials Data Model specification:

https://pr-preview.s3.amazonaws.com/w3c/vc-data-model/pull/641.html#internationalization-considerations

> 1. So from the tag "ar-Arab", we get the script "Arab". Then use
> https://github.com/unicode-org/cldr/blob/master/common/properties/scriptMetadata.txt,
> which has a mapping from script to direction (RTL=YES). (I'm pointing to
> trunk, just so people can read the file easily; one would use the
> latest release.)
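
For readers following along, the script-to-direction lookup described in the quoted text might be sketched roughly as below. This is only an illustration: the function name `direction_from_tag` and the small hard-coded table are assumptions made for this example, standing in for the full data in CLDR's scriptMetadata.txt, and the subtag scan is a simplification rather than a complete BCP 47 parser.

```python
# Illustrative subset only (assumption); a real implementation would load
# the full script-to-direction mapping from a CLDR scriptMetadata.txt release.
SCRIPT_DIRECTION = {
    "Arab": "rtl", "Hebr": "rtl", "Syrc": "rtl", "Thaa": "rtl",
    "Latn": "ltr", "Cyrl": "ltr", "Grek": "ltr",
}


def direction_from_tag(tag):
    """Return 'rtl' or 'ltr' for the script subtag in `tag`, or None.

    Hypothetical helper: looks for a four-letter alphabetic subtag after
    the primary language subtag and before any singleton.
    """
    subtags = tag.split("-")
    for subtag in subtags[1:]:
        if len(subtag) == 1:
            # A singleton starts an extension or private-use sequence,
            # so no script subtag can follow it.
            break
        if len(subtag) == 4 and subtag.isalpha():
            return SCRIPT_DIRECTION.get(subtag.title())
    return None


print(direction_from_tag("ar-Arab"))  # prints rtl
```

A tag with no explicit script subtag, such as plain "ar", yields None here; resolving that case would need additional data that this sketch leaves out.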

What about something like this, where BiDi doesn't work?

HTML و CSS: تصميم و إنشاء مواقع الويب
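
To make the problem with a string like this concrete, here is a minimal sketch of a first-strong heuristic, loosely following rules P2 and P3 of UAX #9 but ignoring directional isolates. The function name `first_strong_direction` is made up for this example; only `unicodedata.bidirectional` is a real standard-library call.

```python
import unicodedata


def first_strong_direction(text):
    """Guess the base direction from the first strongly directional character.

    Simplified: skips the isolate handling that UAX #9 rule P2 requires.
    Returns 'ltr', 'rtl', or None if no strong character is found.
    """
    for ch in text:
        bidi_class = unicodedata.bidirectional(ch)
        if bidi_class == "L":
            return "ltr"
        if bidi_class in ("R", "AL"):
            return "rtl"
    return None


title = "HTML و CSS: تصميم و إنشاء مواقع الويب"
print(first_strong_direction(title))  # prints ltr
```

The heuristic reports left-to-right because the string begins with the Latin letters of "HTML", even though the title is otherwise Arabic and would presumably be laid out right-to-left. That is the kind of case where the base direction has to come from metadata rather than from the text itself.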

> It isn't that Arabic would be displayed left to right, it is what 
> establishes the paragraph ordering. The problem arises when you have 
> mixed text. Look at the following example, using the convention that 
> lowercase = English and uppercase=Arabic. The majority of the text 
> and the first strong character are both English, but the sentence is
>  meant to be used in an Arabic environment, so the default paragraph
>  embedding level needs to be RTL.

Yep.

> 3. I also agree with Martin that the definition "automatically 
> detected" for subtag 'auto' is not adequate. How does it differ from 
> leaving off the D extension altogether?
> 
> Agreed, not well specified. But -d- is not needed in the first place,
> so moot.

Folks have argued against `auto`; I'm happy to remove it if that's what
this group thinks we should do.

It was meant to achieve the same thing this achieves:

https://www.w3.org/TR/string-meta/#dom-localizable-dir

> While this is true, for the vast majority of cases, LTR and RTL are 
> the important issues. Most computer systems don't really handle 
> vertical natively; one needs to have more specialized text processing
> systems, and that is not, I imagine, the target for this syntax.

We're happy to add other directionalities that folks in this group think
we should add.

> 5. Given #4, the lack of a registry for the proposed extension, or 
> even the mention of one, is a significant problem. The set of exactly
> 3 values associated with this extension ('ltr', 'rtl', and 'auto')
> would be fixed; adding to it would require updating the RFC, which is
> much more work than updating a registry.
> 
> Agreed, that would be a major drawback.  But -d- is not needed in
> the first place, so moot.

Sure, we can add a registry. I can make that change in the next version
once it becomes clear that the proposal has merit and won't be rejected
by this group or the W3C i18n community.
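
As a rough illustration of what the fixed value set means for implementers, the sketch below pulls a direction out of a tag that uses the proposed extension. The tag shape "ar-d-rtl" is assumed from ordinary BCP 47 extension syntax rather than quoted from the draft, and `parse_d_extension` and `ALLOWED_DIRECTIONS` are names invented for this example.

```python
# The three values named in this thread; with a registry, this set would
# be loaded from registry data instead of being hard-coded (assumption).
ALLOWED_DIRECTIONS = {"ltr", "rtl", "auto"}


def parse_d_extension(tag):
    """Return the direction carried by a hypothetical '-d-' extension.

    Returns None when the tag has no 'd' singleton, and raises ValueError
    when the subtag after 'd' is missing or not an allowed direction.
    """
    subtags = tag.lower().split("-")
    for i, subtag in enumerate(subtags[1:], start=1):
        if subtag == "x":
            # Everything after 'x' is private use, not an extension.
            break
        if subtag == "d":
            value = subtags[i + 1] if i + 1 < len(subtags) else ""
            if value not in ALLOWED_DIRECTIONS:
                raise ValueError(f"unsupported direction subtag: {value!r}")
            return value
    return None


print(parse_d_extension("ar-d-rtl"))  # prints rtl
```

With a hard-coded set like this, supporting a new directionality means changing code to track a revised RFC; a registry would turn that into a data update.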

> Without these issues being addressed in a satisfactory way, I would 
> lobby IETF not to approve this I-D.
> 
> I don't see that there is any reason to approve it, given that it is,
> as far as I can tell, completely unnecessary and would just 
> complicate implementers' lives to no good end.

Given the new information above (links to use cases, background
discussion), are you still of the opinion that there is no need for the
extension?

-- manu

-- 
Manu Sporny (skype: msporny, twitter: manusporny)
Founder/CEO - Digital Bazaar, Inc.
blog: Veres One Decentralized Identifier Blockchain Launches
https://tinyurl.com/veres-one-launches