Re: "Difficult Characters" draft
Patrik Faltstrom <paf@swip.net> Tue, 06 May 1997 01:16 UTC
Received: from cnri by ietf.org id aa05461; 5 May 97 21:16 EDT
Received: from services.Bunyip.Com by CNRI.Reston.VA.US id aa25140; 5 May 97 21:16 EDT
Received: (from daemon@localhost) by services.bunyip.com (8.8.5/8.8.5) id VAA18816 for uri-out; Mon, 5 May 1997 21:03:30 -0400 (EDT)
Received: from mocha.bunyip.com (mocha.Bunyip.Com [192.197.208.1]) by services.bunyip.com (8.8.5/8.8.5) with ESMTP id VAA18810 for <uri@services.bunyip.com>; Mon, 5 May 1997 21:03:28 -0400 (EDT)
Received: from nix.swip.net (nix.swip.net [192.71.220.2]) by mocha.bunyip.com (8.8.5/8.8.5) with ESMTP id VAA21261; Mon, 5 May 1997 21:03:24 -0400 (EDT)
Received: from localhost (paf@localhost) by nix.swip.net (8.8.2/8.8.2) with ESMTP id DAA09828; Tue, 6 May 1997 03:03:19 +0200 (MET DST)
Date: Tue, 06 May 1997 11:03:18 +1000
From: Patrik Faltstrom <paf@swip.net>
To: Alain LaBont/e'/ <alb@sct.gouv.qc.ca>
cc: Leslie Daigle <leslie@bunyip.com>, "Martin J. Duerst" <mduerst@ifi.unizh.ch>, Larry Masinter <masinter@parc.xerox.com>, URI mailing list <uri@bunyip.com>
Subject: Re: "Difficult Characters" draft
In-Reply-To: <3.0.1.16.19970421134857.29b7b16a@riq.qc.ca>
Message-ID: <Pine.SUN.3.95.970506105903.7836O-100000@nix.swip.net>
MIME-Version: 1.0
Content-Type: TEXT/PLAIN; charset="ISO-8859-1"
Sender: owner-uri@bunyip.com
Precedence: bulk
Content-Transfer-Encoding: quoted-printable
X-MIME-Autoconverted: from 8bit to quoted-printable by services.bunyip.com id VAA18816
On Mon, 21 Apr 1997, Alain LaBont/e'/ wrote: > A 12:40 97-05-05 -0400, Leslie Daigle a écrit : > >For example, "o" and "ö" are unrelated characters in Swedish, so it > >would be erroneous to say that they are equivalent in an accent-insensitive > >search. Lexicographically, "ö" is the last character in the alphabet > >in Swedish. > > > >So, "accent-insensitive" matching is pretty well language-dependent. > > [Alain] : > Of course! Same for ñ which is simply an accented n in French cañon and a > letter on its own in Spanish cañon... In other words, in Spanish, searching > on "canon" shall never retrieve "cañon"; in French it could, for unprecise > searches, as well as the word "canon"... The o and the ö in Swedish is as equal as E and F in english, i.e. only "some" graphics differs :-) Is a E a F with an underline? > Tack so myket! Almost! The correct spelling is "Tack så mycket!" in Swedish :-) (and I will not write the same thing in french...) Alain do though point out one other thing, that å in swedish is closer to o than ö is to o. I.e. in Swedish, equality between o and å is ok, but not between ö and o. Yes, it is messy... Patrik
- Using UTF-8 for non-ASCII Characters in URLs Larry Masinter
- Re: Using UTF-8 for non-ASCII Characters in URLs Dan Connolly
- Re: Using UTF-8 for non-ASCII Characters in URLs Michael Kung <MKUNG.US.ORACLE.COM>
- Re: Using UTF-8 for non-ASCII Characters in URLs Larry Masinter
- Re: Using UTF-8 for non-ASCII Characters in URLs Dan Oscarsson
- Re: Using UTF-8 for non-ASCII Characters in URLs Larry Masinter
- Re: Using UTF-8 for non-ASCII Characters in URLs Dan Oscarsson
- Re: Using UTF-8 for non-ASCII Characters in URLs Gary Adams - Sun Microsystems Labs BOS
- Re: Using UTF-8 for non-ASCII Characters in URLs Gary Adams - Sun Microsystems Labs BOS
- Re: Using UTF-8 for non-ASCII Characters in URLs Francois Yergeau
- Re: Using UTF-8 for non-ASCII Characters in URLs Larry Masinter
- Re: Using UTF-8 for non-ASCII Characters in URLs Michael Kung <MKUNG.US.ORACLE.COM>
- Re: Using UTF-8 for non-ASCII Characters in URLs Larry Masinter
- Re: Using UTF-8 for non-ASCII Characters in URLs Martin J. Duerst
- Re: Using UTF-8 for non-ASCII Characters in URLs Martin J. Duerst
- Re: Using UTF-8 for non-ASCII Characters in URLs Larry Masinter
- Re: Using UTF-8 for non-ASCII Characters in URLs Dan Oscarsson
- Re: "Difficult Characters" draft Martin J. Duerst
- Re: Using UTF-8 for non-ASCII Characters in URLs Martin J. Duerst
- Re: Using UTF-8 for non-ASCII Characters in URLs Edward Cherlin
- Re: Using UTF-8 for non-ASCII Characters in URLs Chris Newman
- Re: "Difficult Characters" draft Larry Masinter
- Re: "Difficult Characters" draft Alain LaBont/e'/
- Re: "Difficult Characters" draft Martin J. Duerst
- Re: Using UTF-8 for non-ASCII Characters in URLs Martin J. Duerst
- Re: "Difficult Characters" draft Leslie Daigle
- Re: "Difficult Characters" draft Alain LaBont/e'/
- Re: "Difficult Characters" draft Martin J. Duerst
- Re: "Difficult Characters" draft Patrik Faltstrom
- Re: Using UTF-8 for non-ASCII Characters in URLs Martin J. Duerst
- Re: Using UTF-8 for non-ASCII Characters in URLs Alain LaBont/e'/