Re: [I18nrp] Last Call: <draft-faltstrom-unicode11-05.txt> (IDNA2008 and Unicode 11.0.0) to Informational RFC

"Patrik Fältström " <paf@frobbit.se> Tue, 04 December 2018 07:07 UTC

Return-Path: <paf@frobbit.se>
X-Original-To: i18nrp@ietfa.amsl.com
Delivered-To: i18nrp@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 82655130DF7 for <i18nrp@ietfa.amsl.com>; Mon, 3 Dec 2018 23:07:59 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.722
X-Spam-Level:
X-Spam-Status: No, score=-1.722 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, FROM_EXCESS_BASE64=0.979, RCVD_IN_DNSWL_LOW=-0.7, SPF_PASS=-0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (1024-bit key) header.d=frobbit.se
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id WEtFyy-poIMx for <i18nrp@ietfa.amsl.com>; Mon, 3 Dec 2018 23:07:56 -0800 (PST)
Received: from mail.frobbit.se (mail.frobbit.se [85.30.129.185]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 2DCB0129C6A for <i18nrp@ietf.org>; Mon, 3 Dec 2018 23:07:55 -0800 (PST)
Received: from [169.254.186.213] (unknown [IPv6:2a01:3f0:1:0:29da:870d:c8f1:83f4]) by mail.frobbit.se (Postfix) with ESMTPSA id 88D1127500; Tue, 4 Dec 2018 08:07:49 +0100 (CET)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=frobbit.se; s=mail; t=1543907269; bh=b7uDdChl8CHfo84xHyTVajGfIrsZv3Zh3VxBum0rjRE=; h=From:To:Cc:Subject:Date:In-Reply-To:References:From; b=ewpkGWQ4q7S6nmHP21JCVqFwdJWiml5UHV484waQHogQsCjZr2kJryVpHTWq+XuFR 8N0brvmdPWBc+gluTkaMUQw6WOFzjGg6qQP77qzbWY1jnNoosox4p30ZA+nSq1sssC Q1CTuI2kljRAft7exYVCN9SpXZyEXatEbpupE8I4=
From: Patrik Fältström <paf@frobbit.se>
To: Asmus Freytag <asmusf@ix.netcom.com>
Cc: i18nrp@ietf.org
Date: Tue, 04 Dec 2018 08:07:49 +0100
X-Mailer: MailMate (1.12.2r5568)
Message-ID: <C7F3A439-454D-48A3-965A-83678FF5CA84@frobbit.se>
In-Reply-To: <ef327a06-fb46-811a-5225-5ba8ec7a88fe@ix.netcom.com>
References: <154385119878.18333.5085298134102919486.idtracker@ietfa.amsl.com> <FF6F9EB9-C73B-4EC0-AC4F-3E3BFBABA0AB@vpnc.org> <8E20D432-01B0-4B52-80BB-3348C5FE73AF@vpnc.org> <ef327a06-fb46-811a-5225-5ba8ec7a88fe@ix.netcom.com>
MIME-Version: 1.0
Content-Type: multipart/signed; boundary="=_MailMate_8FAA01EB-2AB0-4C22-8563-769EBABD3A5C_="; micalg="pgp-sha1"; protocol="application/pgp-signature"
Archived-At: <https://mailarchive.ietf.org/arch/msg/i18nrp/241aIBJNbhzVCX058KA1qEELEtQ>
Subject: Re: [I18nrp] Last Call: <draft-faltstrom-unicode11-05.txt> (IDNA2008 and Unicode 11.0.0) to Informational RFC
X-BeenThere: i18nrp@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: Internationalization Review Procedures <i18nrp.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/i18nrp>, <mailto:i18nrp-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/i18nrp/>
List-Post: <mailto:i18nrp@ietf.org>
List-Help: <mailto:i18nrp-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/i18nrp>, <mailto:i18nrp-request@ietf.org?subject=subscribe>
X-List-Received-Date: Tue, 04 Dec 2018 07:07:59 -0000

On 4 Dec 2018, at 2:33, Asmus Freytag wrote:

> What I see Patrik's document reflecting is the inherent limitation of the normalization algorithm: it folds alternate encodings of the same abstract entity, but does not purport to be a universal glyph folding.

This is why I continue to believe IETF must investigate _every_ individual code point where decisions could be made to include exceptions in 5892.

> Given that the set of PVALID code points has long included both code points and sequences that have identical glyphs, it was surprising that the process was stopped for something that was merely a set of close lookalikes.

I do not understand what you mean by "sequence" because a sequence can never be PVALID, only individual code points. If you imply "sequence of code points, each having derived property value PVALID" then I understand.

The reason is simple, pure coincidence. If I exaggerate. And I am sure we will see more things like these.

And for IETF it does not matter as we have to evaluate code points manually. Most of these I catch when I evaluate differences when caluclating derived property values from new versions of unicode. But sometimes the catch is when people just look at the newly added/changed code points.

Detail: no incompatible changes for Unicode 12.0.0 from Unicode 11.0.0, at least not at the current beta of Unicode 12.0.0.

> Perhaps it might be possible to edit the text of Section 4.1 to make sure that the "repertoire" mentioned is to be understood as the repertoire for "any particular zone" as opposed to something like the set of PVALID code points.

Please send text :-)

Yes, the goal of this document is to emphasize what have already been said many many times, that it is not recommended to allow all PVALID code points (in any combination) in the registration policy in _any_ zone. Instead a subset is to be chosen using the conservatism principle explained by IAB, by SSAC and in IDNA2008.

   Patrik