Re: [EAI] UTF-8 in Message-IDs

"Charles Lindsey" <chl@clerew.man.ac.uk> Tue, 04 October 2011 20:38 UTC

Return-Path: <chl@clerew.man.ac.uk>
X-Original-To: ima@ietfa.amsl.com
Delivered-To: ima@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 32C2521F8C42 for <ima@ietfa.amsl.com>; Tue, 4 Oct 2011 13:38:13 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -4.017
X-Spam-Level:
X-Spam-Status: No, score=-4.017 tagged_above=-999 required=5 tests=[AWL=-0.870, BAYES_00=-2.599, MIME_8BIT_HEADER=0.3, RCVD_IN_DNSWL_LOW=-1, SARE_SUB_ENC_UTF8=0.152]
Received: from mail.ietf.org ([12.22.58.30]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id A2kkFl3YN3tj for <ima@ietfa.amsl.com>; Tue, 4 Oct 2011 13:38:12 -0700 (PDT)
Received: from outbound-queue-1.mail.thdo.gradwell.net (outbound-queue-1.mail.thdo.gradwell.net [212.11.70.34]) by ietfa.amsl.com (Postfix) with ESMTP id D396021F8C68 for <ima@ietf.org>; Tue, 4 Oct 2011 13:38:11 -0700 (PDT)
Received: from outbound-edge-2.mail.thdo.gradwell.net (bonnie.gradwell.net [212.11.70.2]) by outbound-queue-1.mail.thdo.gradwell.net (Postfix) with ESMTP id 5FC672203F; Tue, 4 Oct 2011 21:41:15 +0100 (BST)
Received: from port-89.xxx.th.newnet.co.uk (HELO clerew.man.ac.uk) (80.175.135.89) (smtp-auth username postmaster%pop3.clerew.man.ac.uk, mechanism cram-md5) by outbound-edge-2.mail.thdo.gradwell.net (qpsmtpd/0.83) with (DES-CBC3-SHA encrypted) ESMTPSA; Tue, 04 Oct 2011 21:41:14 +0100
Received: from clerew.man.ac.uk (localhost [127.0.0.1]) by clerew.man.ac.uk (8.13.7/8.13.7) with ESMTP id p94KfChG022394; Tue, 4 Oct 2011 21:41:13 +0100 (BST)
Date: Tue, 04 Oct 2011 21:41:12 +0100
To: John C Klensin <klensin@jck.com>, Julien ÉLIE <julien@trigofacile.com>, ima@ietf.org
From: Charles Lindsey <chl@clerew.man.ac.uk>
Content-Type: text/plain; format="flowed"; delsp="yes"; charset="iso-8859-1"
MIME-Version: 1.0
References: <CAHhFybo47--0YjCRcvSO4asoV_R89+ULDB3tyij+ba=O_6gKsQ@mail.gmail.com> <01O4T11O8X4M00VHKR@mauve.mrochek.com> <op.vz8z3v0a6hl8nm@clerew.man.ac.uk> <01O4VFNKDGEE00VHKR@mauve.mrochek.com> <op.v0cswsg76hl8nm@clerew.man.ac.uk> <4E8A2E61.5060308@trigofacile.com> <725E50D595CDB0AD7E4687A0@PST.JCK.COM>
Content-Transfer-Encoding: 8bit
Message-ID: <op.v2ug2y0y6hl8nm@clerew.man.ac.uk>
In-Reply-To: <725E50D595CDB0AD7E4687A0@PST.JCK.COM>
User-Agent: Opera Mail/9.25 (SunOS)
X-Gradwell-MongoId: 4e8b6f6a.121b8-499f-2
X-Gradwell-Auth-Method: mailbox
X-Gradwell-Auth-Credentials: postmaster@pop3.clerew.man.ac.uk
Subject: Re: [EAI] UTF-8 in Message-IDs
X-BeenThere: ima@ietf.org
X-Mailman-Version: 2.1.12
Precedence: list
List-Id: "EAI \(Email Address Internationalization\)" <ima.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/ima>, <mailto:ima-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/ima>
List-Post: <mailto:ima@ietf.org>
List-Help: <mailto:ima-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/ima>, <mailto:ima-request@ietf.org?subject=subscribe>
X-List-Received-Date: Tue, 04 Oct 2011 20:38:13 -0000

On Tue, 04 Oct 2011 03:34:35 +0100, John C Klensin <klensin@jck.com> wrote:

> --On Monday, October 03, 2011 23:51 +0200 Julien ÉLIE
> <julien@trigofacile.com> wrote:

>> For what is worth, I do not think there would be any major
>> issue with INN (a news server) because message-IDs are
>> internally "hashed" after having parsed them.  They can be
>> retrievable.
>>
>> The problem is the transition period.  INN currently rejects
>> any message whose Message-ID: header field does not comply
>> with RFC 5536. It also does not want to search for such
>> message-IDs.
>> ...
> Julien,
>
> The edge case of a message with a non-ASCII Message-ID but no
> other non-ASCII header fields aside, how would the news servers
> you and Charles are familiar with respond to non-ASCII addresses
> or header fields more generally?

They would propagate fine. User agents that could display them would do  
so. Making EAI an official extension to Netnews would be quite simple -  
mainly a matter of inventing a Newsgroups: header and doing something  
about normalization of Message-IDs if there were to contain UTF-8 (which  
is why I opposed allowing them in EAI without some normalization rules  
being provided at the same time).
>
> Note also that, to some extent, this is not a news reader
> problem at all but a gateway issue between Internet mail and
> news.  If articles originate in news and move to mail, there is
> no issue because they will be ASCII.  So the question is what
> the gateways in the "to news" direction do.  If they reject
> messages with non-ASCII headers (Message-IDs or otherwise), the
> readers won't see such messages... and the problem is really no
> different from an attempt to send an extended mail message to a
> server that is not UTF8SMTP-capable.

I suspect that even if EAI does not find its way 'officially' into  
Netnews, it will still just happen anyway (probably in selected newsgroups  
where its use will become common). The transport mechanism it already  
8-bit clean, so it should "just work".

-- 
Charles H. Lindsey ---------At Home, doing my own thing------------------------
Tel: +44 161 436 6131                       
   Web: http://www.cs.man.ac.uk/~chl
Email: chl@clerew.man.ac.uk      Snail: 5 Clerewood Ave, CHEADLE, SK8 3JU, U.K.
PGP: 2C15F1A9      Fingerprint: 73 6D C2 51 93 A0 01 E7 65 E8 64 7E 14 A4 AB A5