[precis] Category changes with Unicode 6.3

"Martin J. Dürst" <duerst@it.aoyama.ac.jp> Wed, 16 October 2013 02:34 UTC

Message-ID: <525DFB1F.8040401@it.aoyama.ac.jp>
Date: Wed, 16 Oct 2013 11:34:07 +0900
From: =?UTF-8?B?Ik1hcnRpbiBKLiBEw7xyc3Qi?= <duerst@it.aoyama.ac.jp>
Organization: Aoyama Gakuin University
User-Agent: Mozilla/5.0 (Windows; U; Windows NT 6.1; en-US; rv:1.9.1.9) Gecko/20100722 Eudora/3.0.4
MIME-Version: 1.0
To: "precis@ietf.org" <precis@ietf.org>, "idna-update@alvestrand.no" <idna-update@alvestrand.no>
Content-Type: text/plain; charset=UTF-8; format=flowed
Content-Transfer-Encoding: 7bit
Subject: [precis] Category changes with Unicode 6.3

Excuse me if this has already been checked and/or discussed, but I just 
downloaded the Unicode 6.3 version (officially published a few days ago) 
of http://www.unicode.org/Public/UCD/latest/ucd/UnicodeData.txt and 
found several changes in character classification (General_Category, and 
for the first two characters also Bidi_Class):

OLD
180E;MONGOLIAN VOWEL SEPARATOR;Zs;0;WS;;;;;N;;;;;
NEW
180E;MONGOLIAN VOWEL SEPARATOR;Cf;0;BN;;;;;N;;;;;

OLD
1A1B;BUGINESE VOWEL SIGN AE;Mc;0;L;;;;;N;;;;;
NEW
1A1B;BUGINESE VOWEL SIGN AE;Mn;0;NSM;;;;;N;;;;;

OLD
2308;LEFT CEILING;Sm;0;ON;;;;;Y;;;;;
2309;RIGHT CEILING;Sm;0;ON;;;;;Y;;;;;
230A;LEFT FLOOR;Sm;0;ON;;;;;Y;;;;;
230B;RIGHT FLOOR;Sm;0;ON;;;;;Y;;;;;
NEW
2308;LEFT CEILING;Ps;0;ON;;;;;Y;;;;;
2309;RIGHT CEILING;Pe;0;ON;;;;;Y;;;;;
230A;LEFT FLOOR;Ps;0;ON;;;;;Y;;;;;
230B;RIGHT FLOOR;Pe;0;ON;;;;;Y;;;;;
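
For anyone who wants to reproduce the comparison, here is a minimal sketch of how such a diff could be computed. It assumes only the semicolon-separated layout of UnicodeData.txt, where the third field is General_Category and the fifth is Bidi_Class. The function names are made up for illustration, and the records are the ones quoted above rather than the full files.

```python
# Field positions in a UnicodeData.txt record (semicolon-separated, 0-based).
GENERAL_CATEGORY = 2
BIDI_CLASS = 4

# Records copied from the OLD/NEW listings in this message.
OLD = """\
180E;MONGOLIAN VOWEL SEPARATOR;Zs;0;WS;;;;;N;;;;;
1A1B;BUGINESE VOWEL SIGN AE;Mc;0;L;;;;;N;;;;;
2308;LEFT CEILING;Sm;0;ON;;;;;Y;;;;;
2309;RIGHT CEILING;Sm;0;ON;;;;;Y;;;;;
230A;LEFT FLOOR;Sm;0;ON;;;;;Y;;;;;
230B;RIGHT FLOOR;Sm;0;ON;;;;;Y;;;;;
"""

NEW = """\
180E;MONGOLIAN VOWEL SEPARATOR;Cf;0;BN;;;;;N;;;;;
1A1B;BUGINESE VOWEL SIGN AE;Mn;0;NSM;;;;;N;;;;;
2308;LEFT CEILING;Ps;0;ON;;;;;Y;;;;;
2309;RIGHT CEILING;Pe;0;ON;;;;;Y;;;;;
230A;LEFT FLOOR;Ps;0;ON;;;;;Y;;;;;
230B;RIGHT FLOOR;Pe;0;ON;;;;;Y;;;;;
"""


def parse(text):
    """Map each code point (hex string) to its list of fields."""
    records = {}
    for line in text.splitlines():
        if line.strip():
            fields = line.split(";")
            records[fields[0]] = fields
    return records


def classification_changes(old_text, new_text):
    """Return (cp, old gc, new gc, old bidi, new bidi) for records whose
    General_Category or Bidi_Class differs between the two inputs."""
    old, new = parse(old_text), parse(new_text)
    changes = []
    for cp in sorted(old.keys() & new.keys(), key=lambda s: int(s, 16)):
        o, n = old[cp], new[cp]
        before = (o[GENERAL_CATEGORY], o[BIDI_CLASS])
        after = (n[GENERAL_CATEGORY], n[BIDI_CLASS])
        if before != after:
            changes.append((cp, before[0], after[0], before[1], after[1]))
    return changes


for cp, old_gc, new_gc, old_bidi, new_bidi in classification_changes(OLD, NEW):
    print(f"U+{cp}: General_Category {old_gc} -> {new_gc}, "
          f"Bidi_Class {old_bidi} -> {new_bidi}")
```

To run it against the real data, the two strings would be replaced by the contents of the two UnicodeData.txt versions being compared.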

Can somebody check whether and how they affect IDNA 2008 and/or precis?
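
As a possible starting point for that check, the sketch below tests only one of the RFC 5892 rules: whether a General_Category change moves a code point into or out of the LetterDigits set (RFC 5892, Section 2.1). It is not a full derived-property calculation. The exception lists, the ignorable and unstable rules, and everything specific to precis are left out, and the helper name is hypothetical.

```python
# RFC 5892, Section 2.1: LetterDigits covers these General_Category values.
LETTER_DIGITS = {"Ll", "Lu", "Lo", "Nd", "Lm", "Mn", "Mc"}

# (code point, old General_Category, new General_Category), as listed above.
CHANGED = [
    (0x180E, "Zs", "Cf"),
    (0x1A1B, "Mc", "Mn"),
    (0x2308, "Sm", "Ps"),
    (0x2309, "Sm", "Pe"),
    (0x230A, "Sm", "Ps"),
    (0x230B, "Sm", "Pe"),
]


def crosses_letter_digits(old_gc, new_gc):
    """True if the change moves a code point into or out of LetterDigits."""
    return (old_gc in LETTER_DIGITS) != (new_gc in LETTER_DIGITS)


# Code points whose LetterDigits membership differs between old and new.
flagged = [cp for cp, old_gc, new_gc in CHANGED
           if crosses_letter_digits(old_gc, new_gc)]

for cp, old_gc, new_gc in CHANGED:
    status = "CROSSES" if cp in flagged else "same side"
    print(f"U+{cp:04X}: {old_gc} -> {new_gc}: {status} of LetterDigits")
```

For this one rule, none of the six code points changes sides: U+1A1B is in LetterDigits both before and after, and the other five are outside it both before and after. Whether any of the remaining rules is affected would still need to be checked separately.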

Again, if that has already been done, sorry for the noise.

Regards,   Martin.


P.S.:
All the other changes in UnicodeData.txt:

Change in numerical value only:

OLD
12456;CUNEIFORM NUMERIC SIGN NIGIDAMIN;Nl;0;L;;;;-1;N;;;;;
12457;CUNEIFORM NUMERIC SIGN NIGIDAESH;Nl;0;L;;;;-1;N;;;;;
NEW
12456;CUNEIFORM NUMERIC SIGN NIGIDAMIN;Nl;0;L;;;;2;N;;;;;
12457;CUNEIFORM NUMERIC SIGN NIGIDAESH;Nl;0;L;;;;3;N;;;;;


New characters (my understanding is that these are taken care of 
automatically):

061C;ARABIC LETTER MARK;Cf;0;AL;;;;;N;;;;;

2066;LEFT-TO-RIGHT ISOLATE;Cf;0;LRI;;;;;N;;;;;
2067;RIGHT-TO-LEFT ISOLATE;Cf;0;RLI;;;;;N;;;;;
2068;FIRST STRONG ISOLATE;Cf;0;FSI;;;;;N;;;;;
2069;POP DIRECTIONAL ISOLATE;Cf;0;PDI;;;;;N;;;;;
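
The remaining differences in the P.S. can be separated into "new code points" and "existing code points with changed fields" along the following lines. This is only a sketch over the records quoted in this message. The function names are made up, and the field index 8 for the numeric value follows from the semicolon-separated layout of UnicodeData.txt.

```python
# Records copied from the P.S. of this message.
OLD = """\
12456;CUNEIFORM NUMERIC SIGN NIGIDAMIN;Nl;0;L;;;;-1;N;;;;;
12457;CUNEIFORM NUMERIC SIGN NIGIDAESH;Nl;0;L;;;;-1;N;;;;;
"""

NEW = """\
061C;ARABIC LETTER MARK;Cf;0;AL;;;;;N;;;;;
2066;LEFT-TO-RIGHT ISOLATE;Cf;0;LRI;;;;;N;;;;;
2067;RIGHT-TO-LEFT ISOLATE;Cf;0;RLI;;;;;N;;;;;
2068;FIRST STRONG ISOLATE;Cf;0;FSI;;;;;N;;;;;
2069;POP DIRECTIONAL ISOLATE;Cf;0;PDI;;;;;N;;;;;
12456;CUNEIFORM NUMERIC SIGN NIGIDAMIN;Nl;0;L;;;;2;N;;;;;
12457;CUNEIFORM NUMERIC SIGN NIGIDAESH;Nl;0;L;;;;3;N;;;;;
"""


def parse(text):
    """Map each code point (hex string) to its list of fields."""
    return {line.split(";")[0]: line.split(";")
            for line in text.splitlines() if line.strip()}


def by_value(cp):
    """Sort key: numeric value of a hex code point string."""
    return int(cp, 16)


def added_code_points(old_text, new_text):
    """Return (cp, name, General_Category) for code points only in new_text."""
    old, new = parse(old_text), parse(new_text)
    return [(cp, new[cp][1], new[cp][2])
            for cp in sorted(new.keys() - old.keys(), key=by_value)]


def changed_fields(old_text, new_text):
    """Map each common code point to the 0-based indices of differing fields."""
    old, new = parse(old_text), parse(new_text)
    result = {}
    for cp in sorted(old.keys() & new.keys(), key=by_value):
        diff = [i for i, (a, b) in enumerate(zip(old[cp], new[cp])) if a != b]
        if diff:
            result[cp] = diff
    return result


for cp, name, gc in added_code_points(OLD, NEW):
    print(f"new: U+{cp} {name} ({gc})")
for cp, indices in changed_fields(OLD, NEW).items():
    print(f"changed: U+{cp} fields {indices}")
```

On the quoted records this reports the five new Cf characters as additions, and field 8 (the numeric value) as the only difference for U+12456 and U+12457.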