[I18ndir] Review of Unicode-07: Finishing

Harald Alvestrand <harald@alvestrand.no> Thu, 21 March 2019 16:10 UTC

From: Harald Alvestrand <harald@alvestrand.no>
To: i18ndir@ietf.org
Message-ID: <2cd9f904-d1b7-d715-289f-1535db754883@alvestrand.no>
Date: Thu, 21 Mar 2019 17:10:28 +0100
Archived-At: <https://mailarchive.ietf.org/arch/msg/i18ndir/xlnabuxi_0jh1bsodQuRZSMETDI>

With the petering out of discussion, I think it's time to file our
review of unicode-07 and call it "done". We were asked for a review, we
have written a review. The review should be part of the public record,
but I don't think we need to call attention to it by posting it all over
the place, given that we have new versions of the document to work from.

Current proposed text:



Directorate review of draft-faltstrom-unicode11-07

Overall conclusion: Not ready yet, needs some updates. New I-D recommended.


[Note: As part of the discussion that resulted in this text, a new I-D
has been issued.]


    Context issues


The discussion of draft-faltstrom-unicode11 in the directorate has shown
that the directorate members share a number of concerns about the
current state of IDNA, only some of which are directly relevant to this
memo.


IDNA2008 considered limits to what was reasonable to register and use in
the DNS at a number of levels:


  * A level of “don’t register stuff that causes confusion”. This
    requires human judgment, and reasonable people may disagree about
    what causes confusion.

  * A level of “don’t register stuff that is structurally invalid under
    the relevant writing system”. Aspects of this can be captured in
    rulesets (ICANN’s RZ-LGR efforts fall into this category), but doing
    so requires deep expertise; this is captured in IDNA2008 as the
    “don’t register what you don’t understand” rule.

  * A level of “this is stuff that you should never register, and
    applications can reasonably choose to treat it as an error or an
    attack if it ever shows up”. This is the distinction that is
    captured in the classification of codepoints as DISALLOWED, and
    where IDNA2008 (with updates) gives precise rules.


The current document focuses on the last level only - the maintenance of
the distinction between PVALID and DISALLOWED. (It also considers
whether new CONTEXTO and CONTEXTJ rules are needed).


It is clear from directorate discussion that work needs to be done at
the other levels outlined above too, but it is not clear from the
discussion what form that work should take or what fora that work is
reasonably performed in; the work may or may not involve a revision of
the basic IDNA2008 specifications.


We suggest inserting a paragraph in the document that describes the
context of the state of IDNA2008 and explains which issues this document
does not attempt to address. Specifically, it should state that the
document's conclusions cover Unicode versions up to and including 11,
and that they are not to be read as setting expectations for future
versions of Unicode.


In addition, it’s become clear that IDNA2008 does not specify the
mechanisms and expectations of the review of new versions of Unicode in
enough detail; with the review of a number of versions of Unicode behind
us, we should be able to describe those procedures and expectations
better than IDNA2008 does. However, this may need to happen in a
document other than this one.


    Content issues


Section 4.1 does not specify where to find the conclusion of the IETF
discussion on U+08A1.

It is not easy to see from the text whether the algorithms and
procedures will render U+0628 U+0654 an illegal sequence or a legal
sequence. No matter what the resolution is, the document should make it
obvious what the conclusion is (and why).
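
To make the U+0628 U+0654 question concrete, here is a minimal sketch
(not part of the proposed review text) using Python's standard
unicodedata module. It only shows what Unicode normalization does with
the two forms; it says nothing about which derived property IDNA2008
assigns to either of them. The constant and function names are
illustrative, and the result assumes the Python build ships Unicode 7.0
data or later.

```python
import unicodedata

BEH_HAMZA_SEQ = "\u0628\u0654"  # ARABIC LETTER BEH + ARABIC HAMZA ABOVE
BEH_HAMZA_PRE = "\u08A1"        # ARABIC LETTER BEH WITH HAMZA ABOVE


def same_under_nfc(a, b):
    """Return True if the two strings are identical after NFC normalization."""
    return unicodedata.normalize("NFC", a) == unicodedata.normalize("NFC", b)


# U+08A1 has no canonical decomposition, so NFC does not map the
# two-codepoint sequence onto it (or the other way round).
has_decomposition = unicodedata.decomposition(BEH_HAMZA_PRE) != ""
nfc_of_sequence = unicodedata.normalize("NFC", BEH_HAMZA_SEQ)

print("U+08A1 has a decomposition:", has_decomposition)
print("NFC leaves U+0628 U+0654 unchanged:", nfc_of_sequence == BEH_HAMZA_SEQ)
print("Equal under NFC:", same_under_nfc(BEH_HAMZA_SEQ, BEH_HAMZA_PRE))

# Contrast: ALEF + HAMZA ABOVE does compose to U+0623 under NFC.
print("ALEF case equal under NFC:", same_under_nfc("\u0627\u0654", "\u0623"))
```

Because normalization keeps the two forms distinct, a reader cannot
infer the status of the sequence from NFC alone, which is why the
document needs to state the outcome explicitly.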


RFC 5892 states that SPHERICAL ANGLE OPENING UP is DISALLOWED, not PVALID:

27D0..2B4C  ; DISALLOWED
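
The range check can be reproduced mechanically. The following is a
hedged sketch, not part of the proposed review text: it assumes table
lines of the form "XXXX..YYYY ; PROPERTY" as quoted above, and the
helper names parse_range_line and lookup are hypothetical. The code
point is taken from Python's unicodedata by character name rather than
hard-coded, since the review does not give the number.

```python
import unicodedata


def parse_range_line(line):
    """Parse 'XXXX..YYYY ; PROPERTY' (or 'XXXX ; PROPERTY') into a tuple."""
    codepoints, prop = line.split(";", 1)
    prop = prop.split("#", 1)[0].strip()  # drop any trailing comment
    first, _, last = codepoints.strip().partition("..")
    first_cp = int(first, 16)
    last_cp = int(last, 16) if last else first_cp
    return first_cp, last_cp, prop


def lookup(cp, lines):
    """Return the property of the first range containing cp, else None."""
    for line in lines:
        first, last, prop = parse_range_line(line)
        if first <= cp <= last:
            return prop
    return None


# The single line quoted in the review; a real check would load the
# full table.
TABLE = ["27D0..2B4C  ; DISALLOWED"]

cp = ord(unicodedata.lookup("SPHERICAL ANGLE OPENING UP"))
print(f"U+{cp:04X} ->", lookup(cp, TABLE))
```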


Section 4.1 ought to include numbers for how many characters ended up in
DISALLOWED vs PVALID - ideally, for each Unicode version since IDNA2008
was issued. This may also be something that is recommended for the IANA
tables rather than this document.


Given the time that has passed since this work started, we should
consider whether or not to include Unicode 12.
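
For the per-property counts suggested for Section 4.1 above, a tally
over a derived-property table could look like the sketch below. It is
not part of the proposed review text and rests on assumptions: the
input uses the "XXXX..YYYY ; PROPERTY" line format quoted earlier, the
function name tally is hypothetical, and SAMPLE is a two-line
illustration rather than a real table for any Unicode version.

```python
from collections import Counter


def tally(lines):
    """Count code points per property over 'XXXX..YYYY ; PROPERTY' lines."""
    counts = Counter()
    for line in lines:
        line = line.split("#", 1)[0].strip()  # ignore comments and blanks
        if not line:
            continue
        codepoints, prop = (part.strip() for part in line.split(";", 1))
        first, _, last = codepoints.partition("..")
        lo = int(first, 16)
        hi = int(last, 16) if last else lo
        counts[prop] += hi - lo + 1  # ranges are inclusive
    return counts


# Illustrative input only; real numbers need the full table for each
# Unicode version.
SAMPLE = [
    "27D0..2B4C  ; DISALLOWED",
    "0061..007A  ; PVALID",
]

for prop, count in sorted(tally(SAMPLE).items()):
    print(prop, count)
```

Running the same tally against the table for each Unicode version would
give the per-version numbers directly.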


    Nits


These have been submitted separately to the author, and are not
enumerated here.




-- 
Surveillance is pervasive. Go Dark.