Re: [EAI] punctuation and number (NFC/NFD)
Joseph Yee <jyee@ca.afilias.info> Mon, 27 July 2009 15:28 UTC
From: Joseph Yee <jyee@ca.afilias.info>
To: Harald Alvestrand <harald@alvestrand.no>
In-Reply-To: <4A6DB2E4.5010406@alvestrand.no>
References: <mailman.13830.1247508102.4936.ima@ietf.org> <CAD7705D4A93814F97D3EF00790AF0B315FA6650@tk5ex14mbxc105.redmond.corp.microsoft.com> <4A5BABF8.4080900@isode.com> <CAD7705D4A93814F97D3EF00790AF0B315FA6AAF@tk5ex14mbxc105.redmond.corp.microsoft.com> <4A60AA0B.4000106@alvestrand.no> <CAD7705D4A93814F97D3EF00790AF0B315FCA179@TK5EX14MBXC104.redmond.corp.microsoft.com> <EA9664FBEBEB7127550C3D30@[192.168.1.110]> <CAD7705D4A93814F97D3EF00790AF0B315FCB1CA@TK5EX14MBXC104.redmond.corp.microsoft.com> <F2BC6EC973C4D97B22FB5FE1@p3.int.jck.com> <CAD7705D4A93814F97D3EF00790AF0B315FDE04F@tk5ex14mbxc105.redmond.corp.microsoft.com> <448328271.18687@cnnic.cn> <448338030.08886@cnnic.cn> <021001ca0b70$4bf6ab20$236ff1da@whatisfuture> <8BC1BBA8349DC191FB83819F@JcK-eee9.example.com> <CAD7705D4A93814F97D3EF00790AF0B315FDF9E5@tk5ex14mbxc105.redmond.corp.microsoft.com> <4A696140.4090902@alvestrand.no> <52385C598A7B129EE974A987@JcK-eee9.example.com> <Pg19vTtgeaaCi1W4fWmFpw.md5@lochnagar.oryx.com> <4A6DB2E4.5010406@alvestrand.no>
Message-Id: <A524E303-7139-46AC-9419-B684BCC00579@ca.afilias.info>
Content-Type: text/plain; charset="UTF-8"; format="flowed"; delsp="yes"
Content-Transfer-Encoding: quoted-printable
Mime-Version: 1.0 (Apple Message framework v935.3)
Date: Mon, 27 Jul 2009 11:28:08 -0400
X-Mailer: Apple Mail (2.935.3)
Cc: Arnt Gulbrandsen <arnt@gulbrandsen.priv.no>, ima@ietf.org
Subject: Re: [EAI] punctuation and number (NFC/NFD)
IMO this set MUST normalize first, punctuation and number: dot,
underscore, plus, minus, number, and any other punctuation characters
allowed in email now.

Some normalization may be solved by internal mapping, alias, etc. I'm
concerned that punctuation marks are not (especially characters with
special meaning in email format).

It sounds like I'm spelling NFKC/D, I'm not. I want to focus on
punctuation mark normalization in regard to email address only. It
can be special rule to apply first before other normalization, no
punctuation mark allowed beyond ASCII set, NF-something for the whole
email address (if it addressed the punctuation mark concern).

Joseph

On 27-Jul-09, at 10:00 AM, Harald Alvestrand wrote:

> Arnt Gulbrandsen wrote:
>> John C Klensin writes:
>>> --On Friday, July 24, 2009 09:22 +0200 Harald Alvestrand
>>> <harald@alvestrand.no> wrote:
>>>> The more dangerous brokenness is the business card data entry
>>>> situation, where people enter an address from a business card and
>>>> get a "no such mailbox" response, without there being any visual
>>>> difference between the two strings.
>>>>
>>>> Requiring normalization at the server (without placing a
>>>> requirement on what one normalizes to) would at least remove
>>>> this class of issue.
>>>
>>> Exactly.
>>
>> I wonder if that's a bit of a fallacy... anyone who expects humans
>> to read and reenter 42-69 differently from 42‒69 has a problem.
>> IMO that's a perfectly natural mistake, it involves a character
>> often used in today's email addresses, and I've never heard anyone
>> use this particular example before so I doubt it would be
>> implemented quickly.
> and in this case, NFC wouldn't help either....
>>
>> Alternative proposal: Extend VRFY to optionally return a fuzzily
>> correctified version of an email address. If VRFY is disabled or
>> doesn't return an address, correction isn't available for that
>> address. I think this is better because it permits the fuzzy
>> fuzziness to be invoked when the address is entered into e.g. an
>> address database, instead of when a message is transmitted. As a
>> bonus it's 99% independent of EAI.
> Fuzzy matching is a very interesting subject, and I'd argue that
> it's 100% orthogonal to EAI - I still remember the mail I got as a
> new postmaster after a new fuzzy-match algorithm had rerouted a
> misspelling of one man's wife's mail to his mailbox - "how did the
> system know we are married?".
>
> I'd like to punt this issue to the Single Global Directory, but
> that's just reducing it to a previously unsolved problem.....
>
> Harald
>
> _______________________________________________
> IMA mailing list
> IMA@ietf.org
> https://www.ietf.org/mailman/listinfo/ima
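
The point about 42-69 versus 42‒69 can be made concrete with a minimal Python sketch. It is not taken from the thread or from any EAI draft. The table PUNCT_TO_ASCII and the helper normalize_address are hypothetical names, and the table entries are only examples: which characters belong in such a mapping is exactly the open policy question. The sketch first shows that NFC leaves the figure dash (U+2012) untouched, then applies an explicit punctuation-and-digit mapping before NFC, roughly the ordering suggested in the message.

```python
import unicodedata

# Hypothetical mapping table (an assumption for illustration only):
# a few non-ASCII punctuation marks that look like ASCII ones.
PUNCT_TO_ASCII = {
    "\u2010": "-",  # HYPHEN
    "\u2012": "-",  # FIGURE DASH (the character in "42‒69")
    "\u2013": "-",  # EN DASH
    "\u2212": "-",  # MINUS SIGN
    "\uff0d": "-",  # FULLWIDTH HYPHEN-MINUS
    "\uff0e": ".",  # FULLWIDTH FULL STOP
    "\uff0b": "+",  # FULLWIDTH PLUS SIGN
    "\uff3f": "_",  # FULLWIDTH LOW LINE
}
# Fullwidth digits U+FF10..U+FF19 mapped to ASCII 0..9 (also an assumption).
PUNCT_TO_ASCII.update({chr(0xFF10 + d): str(d) for d in range(10)})

_TABLE = str.maketrans(PUNCT_TO_ASCII)


def normalize_address(addr: str) -> str:
    """Map listed punctuation and digits to ASCII first, then apply NFC."""
    return unicodedata.normalize("NFC", addr.translate(_TABLE))


if __name__ == "__main__":
    typed = "42\u201269@example.com"   # figure dash, as in the quoted example
    # NFC alone does not change the figure dash, so the strings still differ.
    print(unicodedata.normalize("NFC", typed) == "42-69@example.com")
    # With the punctuation mapping applied first, the two forms compare equal.
    print(normalize_address(typed) == "42-69@example.com")
```

The explicit table keeps the mapping narrower than NFKC, which would also rewrite many non-punctuation characters. That appears to match the distinction drawn in the message, but the choice of entries remains a policy decision.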