[idn] homographs in TrueType fonts
Erik van der Poel <erik@vanderpoel.org> Thu, 05 May 2005 21:43 UTC
Message-ID: <427A9171.2030409@vanderpoel.org>
Date: Thu, 05 May 2005 14:34:41 -0700
From: Erik van der Poel <erik@vanderpoel.org>
To: idn@ops.ietf.org
Subject: [idn] homographs in TrueType fonts
Content-Type: text/plain; charset="ISO-8859-1"; format="flowed"
Content-Transfer-Encoding: 7bit

I have written a small program that parses a number of TrueType font tables to determine which pairs of Unicode codepoints end up using the same glyphs. The ASCII part of the table is included below. Each line has a codepoint, its glyph, the other codepoint of the pair, and the number of fonts in which that pair is identical.

U+2044 and U+2215 use the same glyph as the slash (U+002F) in a few East Asian fonts.

Note also that the capital letters I and O have homographs, although some apps present domain names in lower case, so those homographs would stand out in those apps.

For the complete table, see:

http://nameprep.org/tt-hg.html

Erik

0021(!);01C3;2
0022(");02BA;4
0022(");05F4;12
0027(');0060;1
0027(');02B9;4
0027(');05F3;12
0027(');2032;6
0028(();FD3E;3
0029());FD3F;3
002C(,);201A;9
002D(-);2010;12
002D(-);2012;1
002D(-);2013;2
002F(/);2044;3
002F(/);2215;4
003A(:);05C3;1
003C(<);2039;1
003E(>);203A;1
0049(I);04C0;4
004F(O);2D54;1
005C(\);00A5;2
005C(\);20A9;8
0060(`);0300;1
0061(a);03B1;4
0061(a);0430;52
0063(c);0441;51
0064(d);0501;1
0065(e);0435;55
0066(f);0192;1
0067(g);0261;2
0068(h);04BB;10
0069(i);0456;60
006A(j);03F3;3
006A(j);0458;57
006D(m);0442;15
006E(n);043F;13
006F(o);03BF;48
006F(o);043E;52
006F(o);0585;1
006F(o);1D0F;1
0070(p);0440;53
0073(s);0455;57
0075(u);0438;14
0076(v);03BD;27
0076(v);03C5;1
0076(v);0475;2
0078(x);03C7;2
0078(x);0445;46
0079(y);0443;48
007C(|);01C0;1
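
For readers who want to reproduce a table of this shape, here is a minimal Python sketch of the counting step. It is not the program described above, whose source is not shown. It assumes that "using the same glyph" means two codepoints map to the same glyph identifier in a font's character map, and that each font has already been reduced to a plain dict from codepoint to glyph identifier. The function names are hypothetical.

```python
from collections import Counter, defaultdict
from itertools import combinations


def shared_glyph_pairs(cmap):
    """Return the set of (low, high) codepoint pairs that map to one glyph.

    `cmap` is assumed to be a dict {codepoint: glyph identifier} for one font.
    """
    by_glyph = defaultdict(list)
    for codepoint, glyph in cmap.items():
        by_glyph[glyph].append(codepoint)
    pairs = set()
    for codepoints in by_glyph.values():
        # Every 2-element combination of codepoints sharing a glyph is a pair.
        pairs.update(combinations(sorted(codepoints), 2))
    return pairs


def count_pairs(cmaps):
    """Count, for each pair, the number of fonts in which it shares a glyph."""
    counts = Counter()
    for cmap in cmaps:
        counts.update(shared_glyph_pairs(cmap))
    return counts


def format_ascii_rows(counts):
    """Render pairs whose lower codepoint is printable ASCII as
    'codepoint(glyph);other codepoint;font count', sorted by codepoint."""
    rows = []
    for (low, high), n in sorted(counts.items()):
        if 0x21 <= low <= 0x7E:
            rows.append(f"{low:04X}({chr(low)});{high:04X};{n}")
    return rows


if __name__ == "__main__":
    # Two made-up fonts, for illustration only (not data from the table).
    font_a = {0x61: "a", 0x430: "a", 0x6F: "o", 0x43E: "o", 0x62: "b"}
    font_b = {0x61: "a", 0x430: "a", 0x6F: "o", 0x43E: "o_cyr"}
    for row in format_ascii_rows(count_pairs([font_a, font_b])):
        print(row)
```

If the fontTools library is available, a per-font dict of this kind can probably be obtained with `TTFont(path).getBestCmap()`. Note that comparing glyph identifiers would miss homographs that are drawn identically but stored as separate glyphs; catching those would require comparing the outlines themselves.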