Re: [I18nrp] Last Call: <draft-faltstrom-unicode11-05.txt> (IDNA2008 and Unicode 11.0.0) to Informational RFC

"Paul Hoffman" <paul.hoffman@vpnc.org> Tue, 04 December 2018 00:02 UTC

Return-Path: <paul.hoffman@vpnc.org>
X-Original-To: i18nrp@ietfa.amsl.com
Delivered-To: i18nrp@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id F26EC130DD9 for <i18nrp@ietfa.amsl.com>; Mon, 3 Dec 2018 16:02:04 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.9
X-Spam-Level:
X-Spam-Status: No, score=-1.9 tagged_above=-999 required=5 tests=[BAYES_00=-1.9] autolearn=ham autolearn_force=no
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 0uSVnbBtngCR for <i18nrp@ietfa.amsl.com>; Mon, 3 Dec 2018 16:02:03 -0800 (PST)
Received: from mail.proper.com (Opus1.Proper.COM [207.182.41.91]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 3B80C130DCB for <i18nrp@ietf.org>; Mon, 3 Dec 2018 16:02:03 -0800 (PST)
Received: from [10.32.60.109] (50-1-51-141.dsl.dynamic.fusionbroadband.com [50.1.51.141]) (authenticated bits=0) by mail.proper.com (8.15.2/8.15.2) with ESMTPSA id wB4010Wn049975 (version=TLSv1.2 cipher=ECDHE-RSA-AES256-GCM-SHA384 bits=256 verify=NO) for <i18nrp@ietf.org>; Mon, 3 Dec 2018 17:01:01 -0700 (MST) (envelope-from paul.hoffman@vpnc.org)
X-Authentication-Warning: mail.proper.com: Host 50-1-51-141.dsl.dynamic.fusionbroadband.com [50.1.51.141] claimed to be [10.32.60.109]
From: Paul Hoffman <paul.hoffman@vpnc.org>
To: i18nrp@ietf.org
Date: Mon, 03 Dec 2018 16:02:01 -0800
X-Mailer: MailMate (1.12.2r5568)
Message-ID: <8E20D432-01B0-4B52-80BB-3348C5FE73AF@vpnc.org>
In-Reply-To: <FF6F9EB9-C73B-4EC0-AC4F-3E3BFBABA0AB@vpnc.org>
References: <154385119878.18333.5085298134102919486.idtracker@ietfa.amsl.com> <FF6F9EB9-C73B-4EC0-AC4F-3E3BFBABA0AB@vpnc.org>
MIME-Version: 1.0
Content-Type: text/plain; format="flowed"
Archived-At: <https://mailarchive.ietf.org/arch/msg/i18nrp/Mnb69lpNqkmLQnMW8AEQ9_cW_yc>
Subject: Re: [I18nrp] Last Call: <draft-faltstrom-unicode11-05.txt> (IDNA2008 and Unicode 11.0.0) to Informational RFC
X-BeenThere: i18nrp@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: Internationalization Review Procedures <i18nrp.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/i18nrp>, <mailto:i18nrp-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/i18nrp/>
List-Post: <mailto:i18nrp@ietf.org>
List-Help: <mailto:i18nrp-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/i18nrp>, <mailto:i18nrp-request@ietf.org?subject=subscribe>
X-List-Received-Date: Tue, 04 Dec 2018 00:02:05 -0000

Before I go to the ietf@ietf.org mailing list with my concerns about 
this draft, I hope it is OK to bounce them off people here in case I'm 
wildly off track.

=====

In Section 1:
    Specifically, the Internet Architecture Board did issue a statement
    [IAB] which requested IETF to resolve the issues related to the code
    point ARABIC LETTER BEH WITH HAMZA ABOVE (U+08A1), introduced in
    Unicode 7.0.0 [Unicode-7.0.0].  This document resolves this issue 
and
    suggests IDNA2008 standard is to follow the Unicode Standard and not
    update RFC 5892 [RFC5892] or any other IDNA2008 RFCs.

In Section 4.1:
    The discussion in the IETF concluded that although it is possible to
    create "the same" character in multiple ways, the issue with U+08A1
    is not unique.  In the case of U+08A1, it can be represented with 
the
    sequence ARABIC LETTER BEH (U+0628) and ARABIC HAMZA ABOVE (U+0654).
    Just like LATIN SMALL LETTER A WITH DIAERESIS (U+00E4) can be
    represented via the sequence LATIN SMALL LETTER A (U+0061), and
    COMBINING DIAERESIS (U+0308).  One difference between these 
sequences
    is how they are treated in the normalization forms specified by the
    Unicode Consortium.

This sounds like the IETF is saying that if the Unicode Consortium 
changes how a character appears in a normalization form other than for 
case folding (Section 2.2 of RFC 5892), that change does not affect the 
tables for IDNA2008. Is that correct?

=====

In Section 4.1:
    As U+08A1 is discussed in draft-freytag-troublesome-characters
    [I-D.freytag-troublesome-characters] and elsewhere.  Regardless of
    whether those discussions ends in recommending including the code
    point in the repertoire of characters permissable for registration 
or
    not, it is acceptable to allow the code point to have a derived
    property value of PVALID.

This sounds like it is saying that even though 
draft-freytag-troublesome-characters is meant for standards track, 
because it is not yet finished, this document (which is informational) 
can ignore the other document and make changes to the IANA registry. If 
that's correct, it concerns me because it could make the IANA registry 
unstable for characters that we know about and are actively discussing. 
If I'm not correct, I'd like to hear why so that maybe this document can 
be reworded.

--Paul Hoffman