Re: [EAI] UTF-8 in Message-IDs

"Charles Lindsey" <chl@clerew.man.ac.uk> Thu, 18 August 2011 15:33 UTC

Date: Thu, 18 Aug 2011 16:34:12 +0100
To: IMA <ima@ietf.org>
From: Charles Lindsey <chl@clerew.man.ac.uk>
Content-Type: text/plain; format="flowed"; delsp="yes"; charset="iso-8859-1"
MIME-Version: 1.0
References: <CAHhFybo47--0YjCRcvSO4asoV_R89+ULDB3tyij+ba=O_6gKsQ@mail.gmail.com> <18B1642B54C3604C98866093@96B2F16665FF96BAE59E9B90>
Content-Transfer-Encoding: 8bit
Message-ID: <op.v0e1jadu6hl8nm@clerew.man.ac.uk>
In-Reply-To: <18B1642B54C3604C98866093@96B2F16665FF96BAE59E9B90>
User-Agent: Opera Mail/9.25 (SunOS)
Subject: Re: [EAI] UTF-8 in Message-IDs

On Wed, 17 Aug 2011 23:36:04 +0100, Chris Newman <chris.newman@oracle.com>  
wrote:

> Thank you for raising this use case.
>
> I believe a downgraded EAI message constitutes a "subsequent revision"  
> as described in RFC 5322 section 3.6.4. Specifically, it is likely that  
> attempts to reply to the author, sender and recipients of a downgraded  
> EAI message will fail so it is a fundamentally different message. And  
> thus it is important that it gets a new message id.
>
> So perhaps we should change the advice in RFC5335bis to say:
>
>   The Message-ID SHOULD include at least one UTF-8 character.

Absolutely NO-WAY!

If a message is evidently the SAME message, even though different  
recipients may encounter it in slightly different forms (possibly  
downgraded, possibly garbled, but still with the majority of it readable)  
then it MUST have the same Message-ID.

So if, by some complex rerouting, encapsulating, or forwarding, someone  
manages to acquire copies of it in both forms, it will be clear it is the  
same (e.g. it will not need to be replied to twice, and in any threaded  
list both will appear together with the same parents and same children).

An essential property of <msg-id>s, which we managed to achieve during the  
discussions that led to RFC 5322, is that two <msg-id>s can be compared  
for equality by a simple byte-for-byte comparison. Threading algorithms  
need to rely on this, and within Netnews the same property is essential  
for the transport mechanism to work (even if the transport proceeds by  
gatewaying into email and then back into Netnews).
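
To make the point concrete, here is a minimal sketch of how a threader might lean on exact byte equality. It is not taken from any real mail or news software; the function name build_children, the (msg-id, parent) tuples and the example.org identifiers are all made up for illustration.

```python
def build_children(messages):
    """Map each parent msg-id (bytes, or None for a root) to its replies.

    Hypothetical helper: msg-ids are compared purely as byte strings,
    so a second copy of a message is recognized only if its Message-ID
    is byte-for-byte identical to the first.
    """
    children = {}
    seen = set()
    for msg_id, parent_id in messages:
        if msg_id in seen:
            continue  # same message arriving by another route
        seen.add(msg_id)
        children.setdefault(parent_id, []).append(msg_id)
    return children


# Illustrative data only: (msg-id, parent msg-id) pairs.
messages = [
    (b"<a@example.org>", None),
    (b"<b@example.org>", b"<a@example.org>"),
    (b"<b@example.org>", b"<a@example.org>"),  # duplicate copy of <b>
    (b"<c@example.org>", b"<a@example.org>"),
]
threads = build_children(messages)
```

If the duplicate copy of <b> had been given a fresh Message-ID, it would show up as a separate reply rather than being recognized as the same message.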

Therefore, IF we are going to permit utf-8 in <msg-id>s, then it is  
essential to include sufficient mandatory requirements on Normalization,  
and to scrutinize the set of allowed characters for troublesome cases. If  
this WG is prepared to put in the work to do that, then well and good, and  
I will support it.
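
As a rough illustration of why Normalization matters here, the sketch below builds two <msg-id>s that would look identical on screen but differ as bytes. The identifiers and the helper nfc_bytes are invented for the example, and the choice of NFC is only an assumption; which form (if any) to mandate is exactly the work the WG would have to do.

```python
import unicodedata

# Two renderings of the same visible msg-id (illustrative values only):
composed = "<caf\u00e9@example.org>"     # e-acute as one code point
decomposed = "<cafe\u0301@example.org>"  # 'e' followed by combining acute

# Without normalization the byte-for-byte comparison fails.
raw_equal = composed.encode("utf-8") == decomposed.encode("utf-8")


def nfc_bytes(msg_id: str) -> bytes:
    """Hypothetical helper: normalize to NFC, then encode as UTF-8."""
    return unicodedata.normalize("NFC", msg_id).encode("utf-8")


# After normalizing both sides, the byte comparison succeeds.
normalized_equal = nfc_bytes(composed) == nfc_bytes(decomposed)
```

Here raw_equal is False and normalized_equal is True, so unless every generator is required to emit one agreed form, the simple comparison that threading and Netnews transport rely on would break.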

But otherwise, please count me as OPPOSED.

-- 
Charles H. Lindsey ---------At Home, doing my own thing------------------------
Tel: +44 161 436 6131                       Web: http://www.cs.man.ac.uk/~chl
Email: chl@clerew.man.ac.uk      Snail: 5 Clerewood Ave, CHEADLE, SK8 3JU, U.K.
PGP: 2C15F1A9      Fingerprint: 73 6D C2 51 93 A0 01 E7 65 E8 64 7E 14 A4 AB A5