Re: RFC 4790 code point collation identifier

Bjoern Hoehrmann <derhoermi@gmx.net> Fri, 13 March 2009 15:49 UTC

Return-Path: <derhoermi@gmx.net>
X-Original-To: collation@core3.amsl.com
Delivered-To: collation@core3.amsl.com
Received: from localhost (localhost [127.0.0.1]) by core3.amsl.com (Postfix) with ESMTP id F3A413A6B2D for <collation@core3.amsl.com>; Fri, 13 Mar 2009 08:49:12 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.999
X-Spam-Level:
X-Spam-Status: No, score=-1.999 tagged_above=-999 required=5 tests=[BAYES_00=-2.599, J_CHICKENPOX_15=0.6]
Received: from mail.ietf.org ([64.170.98.32]) by localhost (core3.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id zN0III0TmbhV for <collation@core3.amsl.com>; Fri, 13 Mar 2009 08:49:05 -0700 (PDT)
Received: from mail.gmx.net (mail.gmx.net [213.165.64.20]) by core3.amsl.com (Postfix) with SMTP id C3ACD3A687E for <collation@ietf.org>; Fri, 13 Mar 2009 08:49:01 -0700 (PDT)
Received: (qmail invoked by alias); 13 Mar 2009 15:49:39 -0000
Received: from dslb-092-078-028-181.pools.arcor-ip.net (EHLO hive) [92.78.28.181] by mail.gmx.net (mp069) with SMTP; 13 Mar 2009 16:49:39 +0100
X-Authenticated: #723575
X-Provags-ID: V01U2FsdGVkX18U5nZEzgMUBqQlAbqpPEjzYgOLOR2LPa5EJnQZdL 6ve1i86LL/Jsls
From: Bjoern Hoehrmann <derhoermi@gmx.net>
To: Arnt Gulbrandsen <arnt@oryx.com>
Subject: Re: RFC 4790 code point collation identifier
Date: Fri, 13 Mar 2009 16:49:43 +0100
Message-ID: <uivkr45ijt77m95rfgpnc9rigu2kjpuepv@hive.bjoern.hoehrmann.de>
References: <rb8jr4h5bes498mtbk1maclmkbdfrc3abh@hive.bjoern.hoehrmann.de> <IkeeTWscmdhSPrs/s0A93g.md5@lochnagar.oryx.com>
In-Reply-To: <IkeeTWscmdhSPrs/s0A93g.md5@lochnagar.oryx.com>
X-Mailer: Forte Agent 3.3/32.846
MIME-Version: 1.0
Content-Type: text/plain; charset=ISO-8859-1
Content-Transfer-Encoding: 8bit
X-Y-GMX-Trusted: 0
X-FuHaFi: 0.66
Cc: www-international@w3.org, collation@ietf.org
X-BeenThere: collation@ietf.org
X-Mailman-Version: 2.1.9
Precedence: list
List-Id: "Collation registration and discussion \(see RFC 4790\)" <collation.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/listinfo/collation>, <mailto:collation-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/collation>
List-Post: <mailto:collation@ietf.org>
List-Help: <mailto:collation-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/collation>, <mailto:collation-request@ietf.org?subject=subscribe>
X-List-Received-Date: Fri, 13 Mar 2009 15:49:13 -0000

* Arnt Gulbrandsen wrote:
>Perhaps RFC 5051. At a superficial glance, it and the the default 
>collation described in XQ 7.3.1 look equivalent.

As I understand it, i;unicode-casemap performs case conversion and some
normalization and then compares the result. What I am looking for, and
what I think the default collation is, is strict Unicode identity. Put
simply, represent both strings as UTF-8 sequences and apply i;octet.

>That document landed in my lap after a while. If you care I could finish 
>it. There isn't much left to do. (I didn't finish it until now for a 
>variety of reasons. Mostly blah health blah priority blah.)

I found that draft and think it would be nice if it was finished, but
there is no hurry, from my perspective, to finish it.
-- 
Björn Höhrmann · mailto:bjoern@hoehrmann.de · http://bjoern.hoehrmann.de
Am Badedeich 7 · Telefon: +49(0)160/4415681 · http://www.bjoernsworld.de
25899 Dagebüll · PGP Pub. KeyID: 0xA4357E78 · http://www.websitedev.de/