Re: [I18ndir] Review of new characters for Unicode 12.0.0

"Patrik Fältström" <paf@frobbit.se> Mon, 18 March 2019 11:24 UTC

From: Patrik Fältström <paf@frobbit.se>
To: "Martin J. Dürst" <duerst@it.aoyama.ac.jp>
Cc: i18ndir@ietf.org
Date: Mon, 18 Mar 2019 12:23:06 +0100
Archived-At: <https://mailarchive.ietf.org/arch/msg/i18ndir/1P6_sK59EnAEYV89FHeQ43791Ec>

Martin, since I have implemented IDNA2008 from scratch (that is, without using any libraries at all), let me suggest that we agree on a common output format so that we can run "diff" on your list and mine. My list for 12.0.0 is attached, by the way.

May I suggest the format my program already produces (of course), with one line per code point, as follows:

<Codepoint>;<Derived property value>;<Rule(s) that matched>;<Name>

:
:
0058;DISALLOWED;AB;LATIN CAPITAL LETTER X
0059;DISALLOWED;AB;LATIN CAPITAL LETTER Y
005A;DISALLOWED;AB;LATIN CAPITAL LETTER Z
005B;DISALLOWED;;LEFT SQUARE BRACKET
005C;DISALLOWED;;REVERSE SOLIDUS
005D;DISALLOWED;;RIGHT SQUARE BRACKET
005E;DISALLOWED;;CIRCUMFLEX ACCENT
005F;DISALLOWED;;LOW LINE
0060;DISALLOWED;;GRAVE ACCENT
0061;PVALID;AE;LATIN SMALL LETTER A
0062;PVALID;AE;LATIN SMALL LETTER B
0063;PVALID;AE;LATIN SMALL LETTER C
0064;PVALID;AE;LATIN SMALL LETTER D
0065;PVALID;AE;LATIN SMALL LETTER E
0066;PVALID;AE;LATIN SMALL LETTER F
0067;PVALID;AE;LATIN SMALL LETTER G
0068;PVALID;AE;LATIN SMALL LETTER H
0069;PVALID;AE;LATIN SMALL LETTER I
006A;PVALID;AE;LATIN SMALL LETTER J
:
:
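
For illustration, here is a minimal Python sketch of how two lists in this format could be read and compared. It is not taken from either implementation: the names Entry, parse_line, parse_table and diff_tables are hypothetical, and it assumes that a line consisting only of ":" marks elided rows and that only the derived property value is compared.

```python
from typing import Dict, List, NamedTuple, Optional, Tuple


class Entry(NamedTuple):
    codepoint: int   # parsed from the hexadecimal first field
    value: str       # derived property value, e.g. PVALID or DISALLOWED
    rules: str       # rule(s) that matched; may be empty
    name: str        # character name


def parse_line(line: str) -> Entry:
    # Split into exactly four fields: codepoint;value;rules;name
    cp, value, rules, name = line.strip().split(";", 3)
    return Entry(int(cp, 16), value, rules, name)


def parse_table(text: str) -> Dict[int, Entry]:
    table: Dict[int, Entry] = {}
    for line in text.splitlines():
        line = line.strip()
        # Assumption: blank lines and lone ":" lines are elision markers.
        if not line or line == ":":
            continue
        entry = parse_line(line)
        table[entry.codepoint] = entry
    return table


def diff_tables(
    a: Dict[int, Entry], b: Dict[int, Entry]
) -> List[Tuple[int, Optional[str], Optional[str]]]:
    # Report every code point whose derived property value differs,
    # using None when a code point is missing from one of the lists.
    result = []
    for cp in sorted(set(a) | set(b)):
        va = a[cp].value if cp in a else None
        vb = b[cp].value if cp in b else None
        if va != vb:
            result.append((cp, va, vb))
    return result
```

Calling diff_tables(parse_table(mine), parse_table(yours)) would then return one tuple per code point on which the two lists disagree; an empty list means the derived property values match.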

   Patrik