Re: [I18nrp] draft-faltstrom-unicode11-04.txt

Nico Williams <nico@cryptonector.com> Thu, 11 October 2018 21:17 UTC

Return-Path: <nico@cryptonector.com>
X-Original-To: i18nrp@ietfa.amsl.com
Delivered-To: i18nrp@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 5ADD2130DEC for <i18nrp@ietfa.amsl.com>; Thu, 11 Oct 2018 14:17:44 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2
X-Spam-Level:
X-Spam-Status: No, score=-2 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1] autolearn=unavailable autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (1024-bit key) header.d=cryptonector.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 9h0YcQlefnPd for <i18nrp@ietfa.amsl.com>; Thu, 11 Oct 2018 14:17:42 -0700 (PDT)
Received: from pdx1-sub0-mail-a32.g.dreamhost.com (pop.dreamhost.com [64.90.62.162]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 0F70F12777C for <i18nrp@ietf.org>; Thu, 11 Oct 2018 14:17:42 -0700 (PDT)
Received: from pdx1-sub0-mail-a32.g.dreamhost.com (localhost [127.0.0.1]) by pdx1-sub0-mail-a32.g.dreamhost.com (Postfix) with ESMTP id A3B6D80013; Thu, 11 Oct 2018 14:17:41 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha1; c=relaxed; d=cryptonector.com; h=date :from:to:cc:subject:message-id:references:mime-version :content-type:in-reply-to:content-transfer-encoding; s= cryptonector.com; bh=GLMkfzS+JHx0aLRoi7OvuoRaT7I=; b=uyZQ6DbyH8R 0j0uWsT1TP23gT3uApqELikHAHyTGcynGpvy7Gu1RY0IKDnx09D7Wd0yHO1vcVNV vScwhpupVjhShgCC4kLibMq4dZIIpQasbtBba2mFu857iI/PEAsXbAnG0kko6mDi p7K4Zzegg2+18pTcb/n5tyC0rirRKYDQ=
Received: from localhost (unknown [24.28.108.183]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) (Authenticated sender: nico@cryptonector.com) by pdx1-sub0-mail-a32.g.dreamhost.com (Postfix) with ESMTPSA id D8AA480012; Thu, 11 Oct 2018 14:17:39 -0700 (PDT)
Date: Thu, 11 Oct 2018 16:17:37 -0500
X-DH-BACKEND: pdx1-sub0-mail-a32
From: Nico Williams <nico@cryptonector.com>
To: Patrik =?iso-8859-1?B?RuRsdHN0cvZt?= <paf=40frobbit.se@dmarc.ietf.org>
Cc: i18nrp@ietf.org, iab@iab.org
Message-ID: <20181011211736.GA2486@localhost>
References: <A23EC543-5DEB-4044-96B7-C983A1BF1E23@frobbit.se>
MIME-Version: 1.0
Content-Type: text/plain; charset=iso-8859-1
Content-Disposition: inline
In-Reply-To: <A23EC543-5DEB-4044-96B7-C983A1BF1E23@frobbit.se>
User-Agent: Mutt/1.5.24 (2015-08-30)
X-VR-OUT-STATUS: OK
X-VR-OUT-SCORE: 0
X-VR-OUT-SPAMCAUSE: gggruggvucftvghtrhhoucdtuddrgedtkedrudekgdduheeiucetufdoteggodetrfdotffvucfrrhhofhhilhgvmecuggftfghnshhusghstghrihgsvgdpffftgfetoffjqffuvfenuceurghilhhouhhtmecufedttdenucenucfjughrpeffhffvuffkfhggtggugfgjfgesthekredttderudenucfhrhhomheppfhitghoucghihhllhhirghmshcuoehnihgtohestghrhihpthhonhgvtghtohhrrdgtohhmqeenucfkphepvdegrddvkedruddtkedrudekfeenucfrrghrrghmpehmohguvgepshhmthhppdhhvghloheplhhotggrlhhhohhsthdpihhnvghtpedvgedrvdekrddutdekrddukeefpdhrvghtuhhrnhdqphgrthhhpefpihgtohcuhghilhhlihgrmhhsuceonhhitghosegtrhihphhtohhnvggtthhorhdrtghomheqpdhmrghilhhfrhhomhepnhhitghosegtrhihphhtohhnvggtthhorhdrtghomhdpnhhrtghpthhtohepnhhitghosegtrhihphhtohhnvggtthhorhdrtghomhenucevlhhushhtvghrufhiiigvpedt
Content-Transfer-Encoding: quoted-printable
Archived-At: <https://mailarchive.ietf.org/arch/msg/i18nrp/zxrdfU6MnOUTkALUgxd1hpyG-fw>
Subject: Re: [I18nrp] draft-faltstrom-unicode11-04.txt
X-BeenThere: i18nrp@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: Internationalization Review Procedures <i18nrp.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/i18nrp>, <mailto:i18nrp-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/i18nrp/>
List-Post: <mailto:i18nrp@ietf.org>
List-Help: <mailto:i18nrp-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/i18nrp>, <mailto:i18nrp-request@ietf.org?subject=subscribe>
X-List-Received-Date: Thu, 11 Oct 2018 21:17:44 -0000

On Sun, Oct 07, 2018 at 07:53:51AM +0200, Patrik Fältström wrote:
> Background: As we all know IDNA2008 is an algorithm that applied to
> Unicode gives a subset of code points which are permissible for in
> turn registries and others to create subsets from to be used in their
> respective policies.
> 
> Being the appointed expert for the IANA registry I have flagged for
> IAB changes made to Unicode that I feel must be discussed in IETF, and
> here is the document with my proposal (details are in the draft).
> 
> In short the draft proposes (just like RFC 6452) IETF continue to
> follow Unicode Standard without adding exceptions to the IDNA2008
> algorithm, EVEN IF changes are made that are incompatible to earlier
> versions of Unicode.

+1

We're talking specifically about future incompatible changes in
either case mappings or normalization.  Correct?

And to confirm the IDNA2008 / UTS#46 context:

 - IDNA2008 is hear-no-evil-see-no-evil, says to do no case mappings,
   relies on users entering domainnames in the correct case whatever
   that is

 - UTS#46 says to lower-case first, then apply the rest of IDNA2008

Correct?

> The choices for IETF when things like this happens are:
> 
> 1. Keep IDNA2008 with no exceptions
> 
> 2. Keep IDNA2008 with exceptions
> 
> 3. Stop referring (directly) to Unicode as it is not stable enough

(3) has got to be out as it amounts to hard-forking Unicode.  A fork of
Unicode would be momentous and extraordinary, and not to be undertaken
lightly!  It might be useful to think about how we'd go about forking
Unicode process-wise, but I think any analysis will convince us that we
should not actually do it.

(2) is a light fork of Unicode.

I understand that there have been a few backwards-incompatible changes
in Unicode, but that's just life, and it is their problem (sure, our
users' too, but forking Unicode is not going to be satisfying at the
very very least).

We could do a flavor of (2) that I describe below, but I'd rather not.

For me that leaves (1), which is what I'm for.

We should, however, use our liason to communicate to the UC our
displeasure as to backwards-incompatible changes, and that we'd like the
UC to at least consider adding a stability attribute to every codepoint.
If there were a stability attribute, then we could exclude codepoints
whose stability the UC will not guarantee.

If there were codepoint stability attributes, then filtering out
unstable codepoints would be (2') and likely desirable.

Future codepoint assignments could all begin life as unstable and gain
stability with experience.

We could, perhaps, implement codepoint stability attributes at the IETF
alone if the UC balks.  I would be OK with that provided it was simple
and simple to automate -- I don't think we want to be in the business of
inspecting every new codepoint assignment and guessing a stability level
for it, so nothing much more complicated than time-since-assignment
should be used.

> Probably more choices than these...
> 
> My proposal is [1], together with a more forceful push to strict
> IDNA2008 adoption. No IDNA2003, no UTS#46, no homebrew mixes.

I would much prefer that we adopt UTS#46.

> Including that registries really do a careful conservative selection
> of code points to be used in whatever context it is to be used.

Yes.

> The problem though is that this is *my* view. Sure, a few others have
> randomly and "by pure luck" told me "this is ok".
> 
> But is this a stable informed decision by IETF?

It should be after the standard IETF sturm und drang :)

> This very fast turn into a process issue where IAB have asked for
> progress, and IETF is to deliver.
> 
> And this is where I take a step back, and want IESG and IAB to make up
> their mind. This group was created for I18N issues, we have a charter,
> but then what? Is this where drafts like these should be discussed?

I believe we need IETF consensus on these matters, but the IAB must be
involved.  Any notion of forking Unicode should require buy-in from the
IAB.

Nico
--