Re: [ietf-822] utf8 messages

Ned Freed <ned.freed@mrochek.com> Thu, 14 August 2014 01:06 UTC

Return-Path: <ned.freed@mrochek.com>
X-Original-To: ietf-822@ietfa.amsl.com
Delivered-To: ietf-822@ietfa.amsl.com
Received: from localhost (ietfa.amsl.com [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 0E48D1A0666 for <ietf-822@ietfa.amsl.com>; Wed, 13 Aug 2014 18:06:26 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.57
X-Spam-Level:
X-Spam-Status: No, score=-2.57 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, RP_MATCHES_RCVD=-0.668, SPF_HELO_PASS=-0.001, SPF_PASS=-0.001] autolearn=ham
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id eigQMN3PULth for <ietf-822@ietfa.amsl.com>; Wed, 13 Aug 2014 18:06:21 -0700 (PDT)
Received: from mauve.mrochek.com (mauve.mrochek.com [66.159.242.17]) by ietfa.amsl.com (Postfix) with ESMTP id B0C3C1A0665 for <ietf-822@ietf.org>; Wed, 13 Aug 2014 18:06:21 -0700 (PDT)
Received: from dkim-sign.mauve.mrochek.com by mauve.mrochek.com (PMDF V6.1-1 #35243) id <01PBCA9AOZGW0019A5@mauve.mrochek.com> for ietf-822@ietf.org; Wed, 13 Aug 2014 18:01:19 -0700 (PDT)
MIME-version: 1.0
Content-transfer-encoding: 7bit
Content-type: TEXT/PLAIN; CHARSET="us-ascii"
Received: from mauve.mrochek.com by mauve.mrochek.com (PMDF V6.1-1 #35243) id <01PBCA2453TC0000SM@mauve.mrochek.com>; Wed, 13 Aug 2014 18:01:16 -0700 (PDT)
Message-id: <01PBCA98IPI00000SM@mauve.mrochek.com>
Date: Wed, 13 Aug 2014 17:56:01 -0700
From: Ned Freed <ned.freed@mrochek.com>
In-reply-to: "Your message dated Wed, 13 Aug 2014 13:17:53 +0000" <D0111ECB.195FD%dvargha@mimecast.com>
References: <CABa8R6tWEhjjZSvq6NbM7EimokOms3suZufn0-6N1SB_fzGM8Q@mail.gmail.com> <01PB9FABWA4E0000SM@mauve.mrochek.com> <CABa8R6tns-idiZTj=+vb9fVNyH-nNYT+w9oNMb80XbCs5osvFw@mail.gmail.com> <01PBABOOL4QO0000SM@mauve.mrochek.com> <CABa8R6vBqS1ewmTtHh8tTOdzobsWpvSEokRxOqpj1Oq3hA+vsw@mail.gmail.com> <D0111ECB.195FD%dvargha@mimecast.com>
To: Daniel Vargha <dvargha@mimecast.com>
Archived-At: http://mailarchive.ietf.org/arch/msg/ietf-822/VCLaPMS0xV1DLno0aNw4mwBemyQ
Cc: Brandon Long <blong@google.com>, "ietf-822@ietf.org" <ietf-822@ietf.org>, Ned Freed <ned.freed@mrochek.com>
Subject: Re: [ietf-822] utf8 messages
X-BeenThere: ietf-822@ietf.org
X-Mailman-Version: 2.1.15
Precedence: list
List-Id: "Discussion of issues related to Internet Message Format \[RFC 822, RFC 2822, RFC 5322\]" <ietf-822.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/ietf-822>, <mailto:ietf-822-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/ietf-822/>
List-Post: <mailto:ietf-822@ietf.org>
List-Help: <mailto:ietf-822-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/ietf-822>, <mailto:ietf-822-request@ietf.org?subject=subscribe>
X-List-Received-Date: Thu, 14 Aug 2014 01:06:26 -0000

> I fully agree with Brandon, the standard SHOULD consider the use case when a
> message is transferred from one system to another as a blob (e.g. flat file) and
> the only available "metadata" is that the message is in MIME format. Having
> some sort of well defined UTF8 indicator in the header section of the message
> would make it much simpler to adopt the new standard as it would require
> substantially less development effort in most cases.

I'm skeptical of the claim, but if you absolutely have to have something, why
not add a Received: field containing a "with smtputf8" clause, assuming one
isn't there already?

> Regarding Ned's concern about inconsistent states I think it would be a workable
> solution to only honour the UTF8 indicator in the headers when the UTF8 flag
> is not available from metadata. In a well known UTF8 context where the SMTP
> protocol or the message store already "knows" that the message is UTF8 the
> indicator in the headers can be ignored.

That assumes people will read the standard. It's far more likely that,
given an obvious indicator, they will simply use it.

> I think it is generally desirable to reduce (or at least not increase) the amount
> of heuristics required to successfully parse a MIME message. We should try to
> learn from previous mistakes instead of repeating them.

That's the absolute worst example you could have picked, because the most
serious design error in MIME is the MIME-Version: field. You know, the field
that tells you whether or not a given message is a MIME message. Sound
familiar?

You might want to jot down the hall and ask Nathaniel Borenstein about it. He's
far more emphatic about how stupid that was than I am.

				Ned