Re: [Ietf-languages] Recommendation not to register variant subtags of the form 0nnn

"Phillips, Addison" <addison@lab126.com> Wed, 29 July 2020 15:49 UTC

Return-Path: <prvs=472685a45=addison@lab126.com>
X-Original-To: ietf-languages@ietfa.amsl.com
Delivered-To: ietf-languages@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 9086B3A0BD8 for <ietf-languages@ietfa.amsl.com>; Wed, 29 Jul 2020 08:49:17 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.118
X-Spam-Level:
X-Spam-Status: No, score=-1.118 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, HTML_MESSAGE=0.001, SPF_HELO_NONE=0.001, SPF_NEUTRAL=0.779, URIBL_BLOCKED=0.001] autolearn=no autolearn_force=no
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id t-jCQNoW3inF for <ietf-languages@ietfa.amsl.com>; Wed, 29 Jul 2020 08:49:15 -0700 (PDT)
Received: from mork.alvestrand.no (mork.alvestrand.no [158.38.152.117]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 7FAC43A0BD5 for <ietf-languages@ietf.org>; Wed, 29 Jul 2020 08:49:15 -0700 (PDT)
Received: by mork.alvestrand.no (Postfix) id DBEAC7C5B4F; Wed, 29 Jul 2020 17:49:13 +0200 (CEST)
Delivered-To: ietf-languages@alvestrand.no
X-Comment: SPF skipped for whitelisted relay - client-ip=192.0.33.71; helo=pechora1.lax.icann.org; envelope-from=prvs=472685a45=addison@lab126.com; receiver=ietf-languages@alvestrand.no
Received: from pechora1.lax.icann.org (pechora1.icann.org [192.0.33.71]) by mork.alvestrand.no (Postfix) with ESMTPS id 607FC7C5B2D for <ietf-languages@alvestrand.no>; Wed, 29 Jul 2020 17:49:13 +0200 (CEST)
Received: from smtp-fw-6002.amazon.com (smtp-fw-6002.amazon.com [52.95.49.90]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by pechora1.lax.icann.org (Postfix) with ESMTPS id D0ED27003167 for <ietf-languages@iana.org>; Wed, 29 Jul 2020 15:49:10 +0000 (UTC)
IronPort-SDR: x84wfhzx4ufQHLwhn388xne+zxCUypqGQvW3qJ2nx3IjGWa4CKi9x4cFBks4qM0u97Gcblb2T5 5zVfYnMpo1Vg==
X-IronPort-AV: E=Sophos; i="5.75,410,1589241600"; d="scan'208,217"; a="44851772"
Received: from iad12-co-svc-p1-lb1-vlan3.amazon.com (HELO email-inbound-relay-2a-e7be2041.us-west-2.amazon.com) ([10.43.8.6]) by smtp-border-fw-out-6002.iad6.amazon.com with ESMTP; 29 Jul 2020 15:48:42 +0000
Received: from EX13MTAUWB001.ant.amazon.com (pdx4-ws-svc-p6-lb7-vlan2.pdx.amazon.com [10.170.41.162]) by email-inbound-relay-2a-e7be2041.us-west-2.amazon.com (Postfix) with ESMTPS id 3ADC7A1F25; Wed, 29 Jul 2020 15:48:40 +0000 (UTC)
Received: from EX13D08UWB003.ant.amazon.com (10.43.161.186) by EX13MTAUWB001.ant.amazon.com (10.43.161.249) with Microsoft SMTP Server (TLS) id 15.0.1497.2; Wed, 29 Jul 2020 15:48:40 +0000
Received: from EX13D08UWB002.ant.amazon.com (10.43.161.168) by EX13D08UWB003.ant.amazon.com (10.43.161.186) with Microsoft SMTP Server (TLS) id 15.0.1497.2; Wed, 29 Jul 2020 15:48:40 +0000
Received: from EX13D08UWB002.ant.amazon.com ([10.43.161.168]) by EX13D08UWB002.ant.amazon.com ([10.43.161.168]) with mapi id 15.00.1497.006; Wed, 29 Jul 2020 15:48:40 +0000
From: "Phillips, Addison" <addison@lab126.com>
To: John Cowan <cowan@ccil.org>, "Martin J. Dürst" <duerst@it.aoyama.ac.jp>
CC: IETF Languages Discussion <ietf-languages@iana.org>
Thread-Topic: [Ietf-languages] Recommendation not to register variant subtags of the form 0nnn
Thread-Index: AdZlvkn+5OWk5ZOaTiOSsQE2b8zaug==
Date: Wed, 29 Jul 2020 15:48:40 +0000
Message-ID: <a66144d8bae24114a1b1e64144ca1088@EX13D08UWB002.ant.amazon.com>
Accept-Language: en-US
Content-Language: en-US
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
x-ms-exchange-transport-fromentityheader: Hosted
x-originating-ip: [10.43.160.26]
Content-Type: multipart/alternative; boundary="_000_a66144d8bae24114a1b1e64144ca1088EX13D08UWB002antamazonc_"
MIME-Version: 1.0
X-Greylist: Sender passed SPF test, not delayed by milter-greylist-4.6.2 (pechora1.lax.icann.org [0.0.0.0]); Wed, 29 Jul 2020 15:49:11 +0000 (UTC)
Archived-At: <https://mailarchive.ietf.org/arch/msg/ietf-languages/V5Cva1mHzIawY4iC3x4QnkKdTQQ>
Subject: Re: [Ietf-languages] Recommendation not to register variant subtags of the form 0nnn
X-BeenThere: ietf-languages@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: <ietf-languages.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/ietf-languages>, <mailto:ietf-languages-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/ietf-languages/>
List-Post: <mailto:ietf-languages@ietf.org>
List-Help: <mailto:ietf-languages-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/ietf-languages>, <mailto:ietf-languages-request@ietf.org?subject=subscribe>
X-List-Received-Date: Wed, 29 Jul 2020 15:49:18 -0000

That would not happen with a 4-digit variant starting with a digit: they would always come at the end of a language tag. I tend to agree that we should avoid confusables and thus should be careful not to register such subtags just generally. But never is a long time and I could see someone (attempting to) register e.g. Gregorian year based subtags for well-attested years pre-1000, e.g. “ar-0661” (or maybe “ar-0072” ;-))

As I said, I agree in general principle, but wouldn’t we discuss it in specific at the time?

Addison

Addison Phillips
Sr. Principal SDE – I18N (Amazon)
Chair (W3C I18N WG)

Internationalization is not a feature.
It is an architecture.



From: Ietf-languages [mailto:ietf-languages-bounces@ietf.org] On Behalf Of John Cowan
Sent: Wednesday, July 29, 2020 8:09 AM
To: Martin J. Dürst <duerst@it.aoyama.ac.jp>
Cc: IETF Languages Discussion <ietf-languages@iana.org>
Subject: RE: [EXTERNAL] [Ietf-languages] Recommendation not to register variant subtags of the form 0nnn





I don't think so.  But I want to avoid in such contexts as spreadsheets the "U.S. postal code problem", whereby the code 07104 becomes 7104 because Excel thinks it's a number rather than a string of digits.

On Wed, Jul 29, 2020 at 1:08 AM Martin J. Dürst <duerst@it.aoyama.ac.jp<mailto:duerst@it.aoyama.ac.jp>> wrote:


On 29/07/2020 11:43, John Cowan wrote:
> While BCP 47 does not forbid 4-digit variant subtags that begin with 0, I
> am hereby suggesting to the Tyrant and his Good Right Arm that such tags
> never be registered.  It would be very easy to confuse '0029' with '029'
> (Caribbean).
>
> What say you all?

Sounds good. Has such a case actually come up?

Regards,   Martin.

>
> John Cowan          http://vrici.lojban.org/~cowan        cowan@ccil.org<mailto:cowan@ccil.org>
> A poetical purist named Cowan                   [that's me]
> Once put the rest of us dowan.                  [on xml-dev]
> "Your verse would be sweeter / If it only had metre
> And rhymes that didn't force me to frowan."     [overpacked line!]
> --Michael Kay