[Ltru] FYI: Unicode 5.1 Released
"Mark Davis" <mark.davis@icu-project.org> Sat, 05 April 2008 01:44 UTC
Date: Fri, 04 Apr 2008 17:20:18 -0700
From: Mark Davis <mark.davis@icu-project.org>
---------- Forwarded message ----------
From: Rick McGowan <rick@unicode.org>
Date: Fri, Apr 4, 2008 at 3:54 PM
Subject: Unicode 5.1 Released
To: unicode@unicode.org

The Unicode Consortium is pleased to announce the release of Unicode 5.1. This release contains over 100,000 characters, and provides significant additions and improvements that extend text processing for software worldwide. Some of the key features are: increased security in data exchange, significant character additions for Indic and South East Asian scripts, expanded identifier specifications for Indic and Arabic scripts, improvements in the processing of Tamil and other Indic scripts, linebreaking conformance relaxation for HTML and other protocols, strengthened normalization stability, new case pair stability, plus others given below.

The Version 5.1.0 data files and documentation are final and posted on the Unicode site. In addition to updated existing files, implementers will find new test data files (for example, for linebreaking) and new XML data files that encapsulate all of the Unicode character properties. For details, see the page for Unicode 5.1.0 at http://www.unicode.org/versions/Unicode5.1.0/.

A major feature of Unicode 5.1.0 is the enabling of ideographic variation sequences. These sequences allow standardized representation of glyphic variants needed for Japanese, Chinese, and Korean text. The first registered collection, from Adobe Systems, is now available at http://www.unicode.org/ivd/.

Unicode 5.1 contains significant changes to properties and behavioral specifications. Several important property definitions were extended, improving linebreaking for Polish and Portuguese hyphenation. The Unicode Text Segmentation Algorithms, covering sentences, words, and characters, were greatly enhanced to improve the processing of Tamil and other Indic languages. The Unicode Normalization Algorithm now defines stabilized strings and provides guidelines for buffering. Standardized named sequences are added for Lithuanian, and provisional named sequences for Tamil.

Unicode 5.1.0 adds 1,624 newly encoded characters. These additions include characters required for Malayalam and Myanmar and important individual characters such as Latin capital sharp s for German. Version 5.1 extends support for languages in Africa, India, Indonesia, Myanmar, and Vietnam, with the addition of the Cham, Lepcha, Ol Chiki, Rejang, Saurashtra, Sundanese, and Vai scripts. Scholarly support includes important editorial punctuation marks, as well as the Carian, Lycian, and Lydian scripts, and the Phaistos disc symbols. Other new symbol sets include dominoes, Mahjong, dictionary punctuation marks, and math additions.

This latest version of the Unicode Standard has exactly the same character assignments as ISO/IEC 10646:2003 plus Amendments 1 through 4.

The Unicode Collation Algorithm (UCA), the core standard for sorting all text, is also being updated at the same time (see http://www.unicode.org/reports/tr10/). The major changes in UCA include coverage of all Unicode 5.1 characters, tightened conformance for canonical equivalence, clearer definitions of internationalized search and matching, specifications of parameters for customizing collation, and definitions of collation folding. There are also important clarifications on the use of contractions (such as "ch" in Slovak) in collation.

The next version of the Unicode locale project (CLDR) is also being prepared on the basis of Unicode 5.1, and is now open for public data submission (see http://www.unicode.org/cldr/).

--
Mark