Re: [Ietf-languages] Fwd: I-D Action: draft-msporny-d-langtag-ext-00.txt

"Doug Ewell" <doug@ewellic.org> Mon, 27 May 2019 21:48 UTC

Return-Path: <doug@ewellic.org>
X-Original-To: ietf-languages@ietfa.amsl.com
Delivered-To: ietf-languages@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 706EF120189 for <ietf-languages@ietfa.amsl.com>; Mon, 27 May 2019 14:48:51 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.898
X-Spam-Level:
X-Spam-Status: No, score=-1.898 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, SPF_HELO_NONE=0.001, SPF_NONE=0.001] autolearn=ham autolearn_force=no
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id CD7fGvAad0jS for <ietf-languages@ietfa.amsl.com>; Mon, 27 May 2019 14:48:50 -0700 (PDT)
Received: from mork.alvestrand.no (mork.alvestrand.no [IPv6:2001:700:1:2::117]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id C5FAE120099 for <ietf-languages@ietf.org>; Mon, 27 May 2019 14:48:49 -0700 (PDT)
Received: by mork.alvestrand.no (Postfix) id 9508B7C0434; Mon, 27 May 2019 23:48:46 +0200 (CEST)
Delivered-To: ietf-languages@alvestrand.no
Received: from localhost (localhost [127.0.0.1]) by mork.alvestrand.no (Postfix) with ESMTP id 5ABD57C37CE for <ietf-languages@alvestrand.no>; Mon, 27 May 2019 23:48:46 +0200 (CEST)
X-Virus-Scanned: Debian amavisd-new at alvestrand.no
Received: from mork.alvestrand.no ([127.0.0.1]) by localhost (mork.alvestrand.no [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id E9PXvujEs8hv for <ietf-languages@alvestrand.no>; Mon, 27 May 2019 23:48:45 +0200 (CEST)
X-Greylist: from auto-whitelisted by SQLgrey-1.8.0
X-Greylist: from auto-whitelisted by SQLgrey-1.8.0
X-Comment: SPF skipped for whitelisted relay - client-ip=192.0.33.71; helo=pechora1.lax.icann.org; envelope-from=doug@ewellic.org; receiver=ietf-languages@alvestrand.no
Received: from pechora1.lax.icann.org (pechora1.icann.org [192.0.33.71]) by mork.alvestrand.no (Postfix) with ESMTPS id D36B27C0434 for <ietf-languages@alvestrand.no>; Mon, 27 May 2019 23:48:44 +0200 (CEST)
Received: from p3plsmtpa12-07.prod.phx3.secureserver.net (p3plsmtpa12-07.prod.phx3.secureserver.net [68.178.252.236]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by pechora1.lax.icann.org (Postfix) with ESMTPS id 4E7F81E027F for <ietf-languages@iana.org>; Mon, 27 May 2019 21:48:43 +0000 (UTC)
Received: from DESKTOPLPOB1E4 ([73.229.14.229]) by :SMTPAUTH: with ESMTPSA id VNTih7OohEUBsVNTihlMn7; Mon, 27 May 2019 14:48:23 -0700
From: Doug Ewell <doug@ewellic.org>
To: 'Manu Sporny' <msporny@digitalbazaar.com>, 'Mark Davis ☕️' <mark@macchiato.com>
Cc: "'Phillips, Addison'" <addison@lab126.com>, 'IETF Languages Discussion' <ietf-languages@iana.org>, "'Martin J. Dürst'" <duerst@it.aoyama.ac.jp>
References: <155881874982.30992.4869767614014356043@ietfa.amsl.com> <49b6a1de-e016-514f-90e4-24703b5818d2@it.aoyama.ac.jp> <63b4f786-8b44-ecdf-ed33-ff0567ecc839@digitalbazaar.com> <000001d51425$a48ac140$eda043c0$@ewellic.org> <CAJ2xs_EwKg3Tu5etk-ELXXd0u2Go-6TZbGm3QsBxV1upKTa8_g@mail.gmail.com> <000001d514ae$b15bdbf0$141393d0$@ewellic.org> <CAJ2xs_GiwkqHPxsoW91ZbA82o1oosXNb=Hm2XOuKuEkMMcNBhA@mail.gmail.com> <000d01d514ce$98e0de70$caa29b50$@ewellic.org> <f6c02ae8-84ce-3670-7b76-658fb761fc2f@digitalbazaar.com>
In-Reply-To: <f6c02ae8-84ce-3670-7b76-658fb761fc2f@digitalbazaar.com>
Date: Mon, 27 May 2019 15:48:22 -0600
Message-ID: <001001d514d5$e866abd0$b9340370$@ewellic.org>
MIME-Version: 1.0
Content-Type: text/plain; charset="utf-8"
Content-Transfer-Encoding: quoted-printable
X-Mailer: Microsoft Outlook 16.0
Content-Language: en-us
Thread-Index: AQKLCtGP2z0dTCPwWpJCCMnlEYhybwL64n7IAmGOKmIBrGprnQHBCMpOAbXN25wCHv9nrAIONTERApM8wGakiZKtEA==
X-Greylist: Sender DNS name whitelisted, not delayed by milter-greylist-4.6.2 (pechora1.lax.icann.org [192.0.33.71]); Mon, 27 May 2019 21:48:43 +0000 (UTC)
X-CMAE-Envelope: MS4wfCsbC4BRAGvqGY4rjwJWzKdA/OPHy/P/OaImnhjyYVc6jq0MBFQfg+3A44Cl++l46290Uc2GjEAb8K8bCEKooRtHjkS7G3JkJTZdYvFqutqUG3y5tZ6e CZIHg5BnmrcwbcrrirL1eVnfoxtibv+t0TLnajndWcCkQOYAykSV5RkXnPyn/bC9ACNa84y+i5xf5c+kcNHWyhfyhDhNE5Ft6xKyUGFGQU2BFHfRUSOAWO/U 7yUFqDFxW2ZsM4o9UbdrHWK5lsGYngzdWQ50z+xkk4KP3VBVKhw/cKHgvNlnR70OADXrVtqL45xYNeamQgJgkA==
Archived-At: <https://mailarchive.ietf.org/arch/msg/ietf-languages/hMo0oAvPhEOMsmCTwxGrO40zzqo>
Subject: Re: [Ietf-languages] Fwd: I-D Action: draft-msporny-d-langtag-ext-00.txt
X-BeenThere: ietf-languages@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: <ietf-languages.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/ietf-languages>, <mailto:ietf-languages-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/ietf-languages/>
List-Post: <mailto:ietf-languages@ietf.org>
List-Help: <mailto:ietf-languages-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/ietf-languages>, <mailto:ietf-languages-request@ietf.org?subject=subscribe>
X-List-Received-Date: Mon, 27 May 2019 21:48:52 -0000

Manu Sporny wrote:

>> If so, I contend that a plain old language tag should still be a
>> sufficient indicator of that:
>
> Yes, but that was rejected as a solution a long time ago. Some of our
> implementers are running in constrained environments and do not want
> to require a full blown HTML processor to understand language and base
> direction. There are use cases where you can't escape that (such as
> Ruby markup), but for the simple ones... they want a simpler solution
> than "include this gigantic HTML processor in your solution".

Wait, now I'm very lost.

How does the simple language tag "ar" "require a full-blown HTML processor" in a way that the more complex language tag "ar-d-rtl" does not?

>> <span lang="ar">
>> rindfleischetikettierungsüberwachungsaufgabenübertragungsgesetz
>> IS A LONG WORD.</span>
>> (example is HTML, substitute your favorite text format that uses
> language tags)
>
> This is not a solution as it would require a full blown HTML processor
> to understand the variations in an interoperable way

Do you mean that the processor you envision would pick up a BCP 47 language tag, ignore the language part ("ar") altogether, and look only at the direction extension ("-d-rtl")?

If so, this is very much an abuse of the concept of language tagging.

If not, this is more complex than processing "ar", not less.

> (like, for example, what do you do when the tags mismatch, or there is
> a syntax error)?

I don't see how you can avoid solving problems like these, regardless of the approach you adopt.

--
Doug Ewell | Thornton, CO, US | ewellic.org