Re: "Difficult Characters" draft
Leslie Daigle <leslie@bunyip.com> Mon, 05 May 1997 17:23 UTC
Date: Mon, 05 May 1997 12:40:23 -0400
From: Leslie Daigle <leslie@bunyip.com>
To: Alain LaBont/e'/ <alb@sct.gouv.qc.ca>
cc: "Martin J. Duerst" <mduerst@ifi.unizh.ch>, Larry Masinter <masinter@parc.xerox.com>, URI mailing list <uri@bunyip.com>
Subject: Re: "Difficult Characters" draft
In-Reply-To: <3.0.1.16.19970421143621.3d17d37e@riq.qc.ca>
Message-ID: <Pine.SUN.3.95.970505123131.3239E-100000@beethoven.bunyip.com>

On Mon, 21 Apr 1997, Alain LaBont/e'/ wrote:

> At 17:58 97-05-02 +0200, Martin J. Duerst wrote:
> [Larry] :
> >> Using UCS in identifiers that are normally "case insensitive"
> >> in ASCII, and the issues, e.g., similar upper-case forms,
> >> the role of accents and equivalence.
> [snip]
> However accents normally don't count much for alphabetic order, they are
> considered only in case of quasi-homography (cote, côte, coté, côté,
> pèche, pêche, péché).

My apologies if this has already been addressed earlier in the thread,
but this jumped out at me as being a potential point of confusion.

Namely, while accents don't count for alphabetic order in French, there
are other languages with characters which can wrongly be perceived as
"accented characters" by people familiar with only a-z.

For example, "o" and "ö" are unrelated characters in Swedish, so it
would be erroneous to say that they are equivalent in an
accent-insensitive search. Lexicographically, "ö" is the last character
in the alphabet in Swedish.

So, "accent-insensitive" matching is pretty well language-dependent.

Leslie.

----------------------------------------------------------------------------
  "_Be_                                 Leslie Daigle
     where you
        _are_."                         Bunyip Information Systems
                                        (514) 875-8611
              -- ThinkingCat            leslie@bunyip.com
----------------------------------------------------------------------------
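
To make the Swedish example concrete, here is a minimal Python sketch of why a single language-neutral "strip the accents" rule gives the wrong answer. It is illustrative only: the function names, the "fr" and "sv" language tags, and the hard-coded Swedish alphabet are assumptions made for this example, not anything proposed in the thread, and a real system would more likely rely on per-language collation tables than on hand-written rules like these.

```python
import unicodedata


def strip_accents(text):
    """Naive, language-neutral folding: decompose and drop combining marks."""
    decomposed = unicodedata.normalize("NFD", text)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


# Assumption for illustration: the modern Swedish alphabet, in which
# å, ä and ö are separate letters placed after z.
SWEDISH_ALPHABET = "abcdefghijklmnopqrstuvwxyzåäö"
SWEDISH_RANK = {ch: i for i, ch in enumerate(SWEDISH_ALPHABET)}


def swedish_sort_key(word):
    """Sort key that ranks each letter by its position in the Swedish alphabet."""
    normalized = unicodedata.normalize("NFC", word.lower())
    # Characters outside the alphabet sort after everything else.
    return [SWEDISH_RANK.get(ch, len(SWEDISH_ALPHABET)) for ch in normalized]


def fold_for_language(text, language):
    """Hypothetical language-aware folding for accent-insensitive matching."""
    normalized = unicodedata.normalize("NFC", text.lower())
    if language == "sv":
        # Keep å, ä, ö intact; fold other diacritics (e.g. the é in "idé").
        distinct = set("åäö")
        return "".join(
            ch if ch in distinct else strip_accents(ch) for ch in normalized
        )
    # Simplified default (used here for "fr"): treat accents as secondary.
    return strip_accents(normalized)


words = ["öl", "zebra", "ost"]
naive_order = sorted(words, key=strip_accents)
swedish_order = sorted(words, key=swedish_sort_key)
```

With the naive key, "öl" is treated as "ol" and sorts first; with the Swedish key it sorts last, after "zebra". Likewise `fold_for_language("côté", "fr")` matches "cote", while `fold_for_language("öl", "sv")` does not match "ol".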
- Using UTF-8 for non-ASCII Characters in URLs Larry Masinter
- Re: Using UTF-8 for non-ASCII Characters in URLs Dan Connolly
- Re: Using UTF-8 for non-ASCII Characters in URLs Michael Kung <MKUNG.US.ORACLE.COM>
- Re: Using UTF-8 for non-ASCII Characters in URLs Larry Masinter
- Re: Using UTF-8 for non-ASCII Characters in URLs Dan Oscarsson
- Re: Using UTF-8 for non-ASCII Characters in URLs Larry Masinter
- Re: Using UTF-8 for non-ASCII Characters in URLs Dan Oscarsson
- Re: Using UTF-8 for non-ASCII Characters in URLs Gary Adams - Sun Microsystems Labs BOS
- Re: Using UTF-8 for non-ASCII Characters in URLs Gary Adams - Sun Microsystems Labs BOS
- Re: Using UTF-8 for non-ASCII Characters in URLs Francois Yergeau
- Re: Using UTF-8 for non-ASCII Characters in URLs Larry Masinter
- Re: Using UTF-8 for non-ASCII Characters in URLs Michael Kung <MKUNG.US.ORACLE.COM>
- Re: Using UTF-8 for non-ASCII Characters in URLs Larry Masinter
- Re: Using UTF-8 for non-ASCII Characters in URLs Martin J. Duerst
- Re: Using UTF-8 for non-ASCII Characters in URLs Martin J. Duerst
- Re: Using UTF-8 for non-ASCII Characters in URLs Larry Masinter
- Re: Using UTF-8 for non-ASCII Characters in URLs Dan Oscarsson
- Re: "Difficult Characters" draft Martin J. Duerst
- Re: Using UTF-8 for non-ASCII Characters in URLs Martin J. Duerst
- Re: Using UTF-8 for non-ASCII Characters in URLs Edward Cherlin
- Re: Using UTF-8 for non-ASCII Characters in URLs Chris Newman
- Re: "Difficult Characters" draft Larry Masinter
- Re: "Difficult Characters" draft Alain LaBont/e'/
- Re: "Difficult Characters" draft Martin J. Duerst
- Re: Using UTF-8 for non-ASCII Characters in URLs Martin J. Duerst
- Re: "Difficult Characters" draft Leslie Daigle
- Re: "Difficult Characters" draft Alain LaBont/e'/
- Re: "Difficult Characters" draft Martin J. Duerst
- Re: "Difficult Characters" draft Patrik Faltstrom
- Re: Using UTF-8 for non-ASCII Characters in URLs Martin J. Duerst
- Re: Using UTF-8 for non-ASCII Characters in URLs Alain LaBont/e'/