Re: "Difficult Characters" draft
Alain LaBont/e'/ <alb@sct.gouv.qc.ca> Mon, 05 May 1997 18:51 UTC
Received: from cnri by ietf.org id aa26130; 5 May 97 14:51 EDT
Received: from services.Bunyip.Com by CNRI.Reston.VA.US id aa17141; 5 May 97 14:51 EDT
Received: (from daemon@localhost) by services.bunyip.com (8.8.5/8.8.5) id OAA08087 for uri-out; Mon, 5 May 1997 14:32:08 -0400 (EDT)
Received: from mocha.bunyip.com (mocha.Bunyip.Com [192.197.208.1]) by services.bunyip.com (8.8.5/8.8.5) with ESMTP id OAA08080 for <uri@services.bunyip.com>; Mon, 5 May 1997 14:32:05 -0400 (EDT)
Received: from socrate.riq.qc.ca (socrate.riq.qc.ca [199.84.128.1]) by mocha.bunyip.com (8.8.5/8.8.5) with SMTP id OAA17902; Mon, 5 May 1997 14:31:58 -0400 (EDT)
Received: from 506.riq.qc.ca (riq-44-239.riq.qc.ca) by socrate.riq.qc.ca (5.x/SMI-SVR4) id AA22679; Mon, 5 May 1997 14:34:55 -0400
Message-Id: <3.0.1.16.19970421134857.29b7b16a@riq.qc.ca>
X-Sender: alb@riq.qc.ca
X-Mailer: Windows Eudora Pro Version 3.0.1 beta 14 (16) [F]
Date: Mon, 21 Apr 1997 13:48:57 -0000
To: Leslie Daigle <leslie@bunyip.com>
From: Alain LaBont/e'/ <alb@sct.gouv.qc.ca>
Subject: Re: "Difficult Characters" draft
Cc: "Martin J. Duerst" <mduerst@ifi.unizh.ch>, Larry Masinter <masinter@parc.xerox.com>, URI mailing list <uri@bunyip.com>
In-Reply-To: <Pine.SUN.3.95.970505123131.3239E-100000@beethoven.bunyip.c om>
References: <3.0.1.16.19970421143621.3d17d37e@riq.qc.ca>
Mime-Version: 1.0
Content-Type: text/plain; charset="iso-8859-1"
Sender: owner-uri@bunyip.com
Precedence: bulk
Content-Transfer-Encoding: quoted-printable
X-MIME-Autoconverted: from 8bit to quoted-printable by services.bunyip.com id OAA08087
A 12:40 97-05-05 -0400, Leslie Daigle a écrit : > >On Mon, 21 Apr 1997, Alain LaBont/e'/ wrote: >> A 17:58 97-05-02 +0200, Martin J. Duerst a écrit : >> [Larry] : >> >> Using UCS in identifiers that are normally "case insensitive" >> >> in ASCII, and the issues, e.g., similar upper-case forms, >> >> the role of accents and equivalence. >> >[snip] [Alain] : >> However accents normally don't count much for alphabetic order, they are >> considerwed only in case of quasi-homography (cote, côte, coté, côté, >> pèche, pêche, péché). > [Leslie] : >My apologies if this has already been addressed earlier in the thread, but >this jumped out at me as being a potential point of confusion. > >Namely, while accents don't count for alphabetic order in French, there >are other languages with characters which can wrongly be perceived as "accented >characters" to people familiar with only a-z. > >For example, "o" and "ö" are unrelated characters in Swedish, so it >would be erroneous to say that they are equivalent in an accent-insensitive >search. Lexicographically, "ö" is the last character in the alphabet >in Swedish. > >So, "accent-insensitive" matching is pretty well language-dependent. [Alain] : Of course! Same for ñ which is simply an accented n in French cañon and a letter on its own in Spanish cañon... In other words, in Spanish, searching on "canon" shall never retrieve "cañon"; in French it could, for unprecise searches, as well as the word "canon"... Tack so myket! Alain LaBonté Québec
- Using UTF-8 for non-ASCII Characters in URLs Larry Masinter
- Re: Using UTF-8 for non-ASCII Characters in URLs Dan Connolly
- Re: Using UTF-8 for non-ASCII Characters in URLs Michael Kung <MKUNG.US.ORACLE.COM>
- Re: Using UTF-8 for non-ASCII Characters in URLs Larry Masinter
- Re: Using UTF-8 for non-ASCII Characters in URLs Dan Oscarsson
- Re: Using UTF-8 for non-ASCII Characters in URLs Larry Masinter
- Re: Using UTF-8 for non-ASCII Characters in URLs Dan Oscarsson
- Re: Using UTF-8 for non-ASCII Characters in URLs Gary Adams - Sun Microsystems Labs BOS
- Re: Using UTF-8 for non-ASCII Characters in URLs Gary Adams - Sun Microsystems Labs BOS
- Re: Using UTF-8 for non-ASCII Characters in URLs Francois Yergeau
- Re: Using UTF-8 for non-ASCII Characters in URLs Larry Masinter
- Re: Using UTF-8 for non-ASCII Characters in URLs Michael Kung <MKUNG.US.ORACLE.COM>
- Re: Using UTF-8 for non-ASCII Characters in URLs Larry Masinter
- Re: Using UTF-8 for non-ASCII Characters in URLs Martin J. Duerst
- Re: Using UTF-8 for non-ASCII Characters in URLs Martin J. Duerst
- Re: Using UTF-8 for non-ASCII Characters in URLs Larry Masinter
- Re: Using UTF-8 for non-ASCII Characters in URLs Dan Oscarsson
- Re: "Difficult Characters" draft Martin J. Duerst
- Re: Using UTF-8 for non-ASCII Characters in URLs Martin J. Duerst
- Re: Using UTF-8 for non-ASCII Characters in URLs Edward Cherlin
- Re: Using UTF-8 for non-ASCII Characters in URLs Chris Newman
- Re: "Difficult Characters" draft Larry Masinter
- Re: "Difficult Characters" draft Alain LaBont/e'/
- Re: "Difficult Characters" draft Martin J. Duerst
- Re: Using UTF-8 for non-ASCII Characters in URLs Martin J. Duerst
- Re: "Difficult Characters" draft Leslie Daigle
- Re: "Difficult Characters" draft Alain LaBont/e'/
- Re: "Difficult Characters" draft Martin J. Duerst
- Re: "Difficult Characters" draft Patrik Faltstrom
- Re: Using UTF-8 for non-ASCII Characters in URLs Martin J. Duerst
- Re: Using UTF-8 for non-ASCII Characters in URLs Alain LaBont/e'/