Re: [precis] Category changes with Unicode 6.3
Patrik Fältström <paf@frobbit.se> Wed, 16 October 2013 05:40 UTC
From: Patrik Fältström <paf@frobbit.se>
In-Reply-To: <525DFB1F.8040401@it.aoyama.ac.jp>
Date: Wed, 16 Oct 2013 08:40:35 +0300
Message-Id: <7693B3B4-7204-48F0-8C42-EBF5D701BAF4@frobbit.se>
References: <525DFB1F.8040401@it.aoyama.ac.jp>
To: "Martin J. Dürst" <duerst@it.aoyama.ac.jp>
Cc: "idna-update@alvestrand.no" <idna-update@alvestrand.no>, "precis@ietf.org" <precis@ietf.org>
Subject: Re: [precis] Category changes with Unicode 6.3
Executive summary: Does not impact IDNA2008.

Longer explanation:

On 16 Oct 2013, at 05:34, Martin J. Dürst <duerst@it.aoyama.ac.jp> wrote:

> Excuse me if this has been checked and/or discussed already, but I just downloaded the Unicode 6.3 version (officially published a few days ago) of http://www.unicode.org/Public/UCD/latest/ucd/UnicodeData.txt and found several changes in character classification:
>
> OLD
> 180E;MONGOLIAN VOWEL SEPARATOR;Zs;0;WS;;;;;N;;;;;
> NEW
> 180E;MONGOLIAN VOWEL SEPARATOR;Cf;0;BN;;;;;N;;;;;

No change:

$ grep '^180E;' ../6.[23].0/allcodepoints.txt
../6.2.0/allcodepoints.txt:180E;DISALLOWED;I;C;MONGOLIAN VOWEL SEPARATOR
../6.3.0/allcodepoints.txt:180E;DISALLOWED;I;C;MONGOLIAN VOWEL SEPARATOR

> OLD
> 1A1B;BUGINESE VOWEL SIGN AE;Mc;0;L;;;;;N;;;;;
> NEW
> 1A1B;BUGINESE VOWEL SIGN AE;Mn;0;NSM;;;;;N;;;;;

No change:

$ grep '^1A1B;' ../6.[23].0/allcodepoints.txt
../6.2.0/allcodepoints.txt:1A1B;PVALID;I;A;BUGINESE VOWEL SIGN AE
../6.3.0/allcodepoints.txt:1A1B;PVALID;I;A;BUGINESE VOWEL SIGN AE

> OLD
> 2308;LEFT CEILING;Sm;0;ON;;;;;Y;;;;;
> 2309;RIGHT CEILING;Sm;0;ON;;;;;Y;;;;;
> 230A;LEFT FLOOR;Sm;0;ON;;;;;Y;;;;;
> 230B;RIGHT FLOOR;Sm;0;ON;;;;;Y;;;;;
> NEW
> 2308;LEFT CEILING;Ps;0;ON;;;;;Y;;;;;
> 2309;RIGHT CEILING;Pe;0;ON;;;;;Y;;;;;
> 230A;LEFT FLOOR;Ps;0;ON;;;;;Y;;;;;
> 230B;RIGHT FLOOR;Pe;0;ON;;;;;Y;;;;;

No change:

$ egrep '^230[89AB];' ../6.[23].0/allcodepoints.txt
../6.2.0/allcodepoints.txt:2308;DISALLOWED;I;;LEFT CEILING
../6.2.0/allcodepoints.txt:2309;DISALLOWED;I;;RIGHT CEILING
../6.2.0/allcodepoints.txt:230A;DISALLOWED;I;;LEFT FLOOR
../6.2.0/allcodepoints.txt:230B;DISALLOWED;I;;RIGHT FLOOR
../6.3.0/allcodepoints.txt:2308;DISALLOWED;I;;LEFT CEILING
../6.3.0/allcodepoints.txt:2309;DISALLOWED;I;;RIGHT CEILING
../6.3.0/allcodepoints.txt:230A;DISALLOWED;I;;LEFT FLOOR
../6.3.0/allcodepoints.txt:230B;DISALLOWED;I;;RIGHT FLOOR

> Can somebody check whether and how they affect IDNA 2008 and/or precis?
>
> Again, if that has already been done, sorry for the noise.
>
> Regards,   Martin.
>
>
> P.S.:
> All the other changes in UnicodeData.txt:
>
> Change in numerical value only:
>
> OLD
> 12456;CUNEIFORM NUMERIC SIGN NIGIDAMIN;Nl;0;L;;;;-1;N;;;;;
> 12457;CUNEIFORM NUMERIC SIGN NIGIDAESH;Nl;0;L;;;;-1;N;;;;;
> NEW
> 12456;CUNEIFORM NUMERIC SIGN NIGIDAMIN;Nl;0;L;;;;2;N;;;;;
> 12457;CUNEIFORM NUMERIC SIGN NIGIDAESH;Nl;0;L;;;;3;N;;;;;

$ egrep '^1245[67];' ../6.[23].0/allcodepoints.txt
../6.2.0/allcodepoints.txt:12456;DISALLOWED;I;;CUNEIFORM NUMERIC SIGN NIGIDAMIN
../6.2.0/allcodepoints.txt:12457;DISALLOWED;I;;CUNEIFORM NUMERIC SIGN NIGIDAESH
../6.3.0/allcodepoints.txt:12456;DISALLOWED;I;;CUNEIFORM NUMERIC SIGN NIGIDAMIN
../6.3.0/allcodepoints.txt:12457;DISALLOWED;I;;CUNEIFORM NUMERIC SIGN NIGIDAESH

> New characters (my understanding is that these are taken care of automatically):
>
> 061C;ARABIC LETTER MARK;Cf;0;AL;;;;;N;;;;;

$ egrep '^061C;' ../6.[23].0/allcodepoints.txt
../6.2.0/allcodepoints.txt:061C;UNASSIGNED;I;J;<reserved>
../6.3.0/allcodepoints.txt:061C;DISALLOWED;I;C;ARABIC LETTER MARK

> 2066;LEFT-TO-RIGHT ISOLATE;Cf;0;LRI;;;;;N;;;;;
> 2067;RIGHT-TO-LEFT ISOLATE;Cf;0;RLI;;;;;N;;;;;
> 2068;FIRST STRONG ISOLATE;Cf;0;FSI;;;;;N;;;;;
> 2069;POP DIRECTIONAL ISOLATE;Cf;0;PDI;;;;;N;;;;;

$ egrep '^206[6789];' ../6.[23].0/allcodepoints.txt
../6.2.0/allcodepoints.txt:2066;UNASSIGNED;I;CJ;<reserved>
../6.2.0/allcodepoints.txt:2067;UNASSIGNED;I;CJ;<reserved>
../6.2.0/allcodepoints.txt:2068;UNASSIGNED;I;CJ;<reserved>
../6.2.0/allcodepoints.txt:2069;UNASSIGNED;I;CJ;<reserved>
../6.3.0/allcodepoints.txt:2066;DISALLOWED;I;C;LEFT-TO-RIGHT ISOLATE
../6.3.0/allcodepoints.txt:2067;DISALLOWED;I;C;RIGHT-TO-LEFT ISOLATE
../6.3.0/allcodepoints.txt:2068;DISALLOWED;I;C;FIRST STRONG ISOLATE
../6.3.0/allcodepoints.txt:2069;DISALLOWED;I;C;POP DIRECTIONAL ISOLATE

   Patrik
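
The per-code-point grep checks above could probably be generalized into one pass over both files. The following is only a sketch, not the tooling actually used here: the function name changed_properties is made up for illustration, and it assumes that every line of allcodepoints.txt has the shape seen in the grep output (code point first, derived property value second, character name last, separated by semicolons).

```shell
# Hypothetical helper (not from the original message): print every code point
# whose derived property value differs between two allcodepoints.txt-style
# files. Assumed line format, taken from the grep output in this thread:
#   CODEPOINT;PROPERTY;...;NAME
changed_properties() {
    old_file=$1
    new_file=$2
    awk -F';' '
        # First file: remember the old property value per code point.
        NR == FNR { old[$1] = $2; next }
        # Second file: report code points whose value differs from the old one.
        # Code points that appear only in the second file are skipped.
        ($1 in old) && old[$1] != $2 {
            print $1 ";" old[$1] " -> " $2 ";" $NF
        }
    ' "$old_file" "$new_file"
}
```

Called as `changed_properties ../6.2.0/allcodepoints.txt ../6.3.0/allcodepoints.txt`, this would list all transitions at once. Of the code points discussed in this message, only the five newly assigned ones (061C and 2066 to 2069) should show up, each as UNASSIGNED -> DISALLOWED.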