Re: Last Call: 'The APPLICATION/MBOX Media-Type' to Proposed Standard

Bruce Lilly <blilly@erols.com> Fri, 13 August 2004 00:13 UTC

Received: from ietf-mx.ietf.org (ietf-mx.ietf.org [132.151.6.1]) by ietf.org (8.9.1a/8.9.1a) with ESMTP id UAA19114; Thu, 12 Aug 2004 20:13:33 -0400 (EDT)
Received: from megatron.ietf.org ([132.151.6.71]) by ietf-mx.ietf.org with esmtp (Exim 4.33) id 1BvPmT-0001dG-0H; Thu, 12 Aug 2004 20:18:49 -0400
Received: from localhost.localdomain ([127.0.0.1] helo=megatron.ietf.org) by megatron.ietf.org with esmtp (Exim 4.32) id 1BvPf7-0002Ao-Eg; Thu, 12 Aug 2004 20:11:13 -0400
Received: from odin.ietf.org ([132.151.1.176] helo=ietf.org) by megatron.ietf.org with esmtp (Exim 4.32) id 1BvPeQ-00021J-Hs; Thu, 12 Aug 2004 20:10:30 -0400
Received: from ietf-mx.ietf.org (ietf-mx.ietf.org [132.151.6.1]) by ietf.org (8.9.1a/8.9.1a) with ESMTP id UAA18991; Thu, 12 Aug 2004 20:10:29 -0400 (EDT)
Received: from smtp02.mrf.mail.rcn.net ([207.172.4.61]) by ietf-mx.ietf.org with esmtp (Exim 4.33) id 1BvPjT-0001aV-BF; Thu, 12 Aug 2004 20:15:44 -0400
X-Info: This message was accepted for relay by smtp02.mrf.mail.rcn.net as the sender used SMTP authentication
X-Trace: JkVYWWlaWB+ER4mwpsY3JSr71KNFRlxzPEKD9kJNrEU=
Received: from 65.105.220.34.ptr.us.xo.net ([65.105.220.34] helo=mail.blilly.com) by smtp02.mrf.mail.rcn.net with asmtp (Exim 3.35 #7) id 1BvPeF-0003TA-00; Thu, 12 Aug 2004 20:10:24 -0400
Received: from marty.blilly.com (localhost [127.0.0.1]) by mail.blilly.com with ESMTP id i7D0AFL2009645(8.12.10/8.12.10/mail.blilly.com sendmail.mc.mail 1.18 2004/05/15 07:23:45) ; Thu, 12 Aug 2004 20:10:16 -0400
Received: from [127.0.0.1] (localhost [127.0.0.1]) by marty.blilly.com with ESMTP id i7D0AEsm009643(8.12.10/8.12.10/blilly.com submit.mc 1.1 2003/08/26 22:21:33) ; Thu, 12 Aug 2004 20:10:14 -0400
Message-ID: <411C06E5.1030405@erols.com>
Date: Thu, 12 Aug 2004 20:10:13 -0400
From: Bruce Lilly <blilly@erols.com>
Organization: Bruce Lilly
User-Agent: Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.7) Gecko/20040616
X-Accept-Language: en-us
MIME-Version: 1.0
To: "Eric A. Hall" <ehall@ehsco.com>
References: <p06110437bd3d5d396de1@[10.20.30.249]> <411AB496.8080307@erols.com> <411BDE9B.7090706@att.com> <411BEFB1.1040808@ehsco.com>
In-Reply-To: <411BEFB1.1040808@ehsco.com>
Content-Type: text/plain; charset="us-ascii"
Content-Transfer-Encoding: 7bit
X-Spam-Score: 0.0 (/)
X-Scan-Signature: 4b800b1eab964a31702fa68f1ff0e955
Content-Transfer-Encoding: 7bit
Cc: ietf-822@imc.org, ehall@ntrg.com, iesg@ietf.org, ietf@ietf.org
Subject: Re: Last Call: 'The APPLICATION/MBOX Media-Type' to Proposed Standard
X-BeenThere: ietf@ietf.org
X-Mailman-Version: 2.1.5
Precedence: list
Reply-To: iesg@ietf.org, ietf@ietf.org, ietf-822@imc.org
List-Id: IETF-Discussion <ietf.ietf.org>
List-Unsubscribe: <https://www1.ietf.org/mailman/listinfo/ietf>, <mailto:ietf-request@ietf.org?subject=unsubscribe>
List-Post: <mailto:ietf@ietf.org>
List-Help: <mailto:ietf-request@ietf.org?subject=help>
List-Subscribe: <https://www1.ietf.org/mailman/listinfo/ietf>, <mailto:ietf-request@ietf.org?subject=subscribe>
Sender: ietf-bounces@ietf.org
Errors-To: ietf-bounces@ietf.org
X-Spam-Score: 0.0 (/)
X-Scan-Signature: d8ae4fd88fcaf47c1a71c804d04f413d
Content-Transfer-Encoding: 7bit

Eric A. Hall wrote:

> Do you have a specific URL to a specific man page that you think would be
> appropriate and authoritative?
> 
> I spent a while looking (see also http://makeashorterlink.com/?X61D16909
> for example), and couldn't find anything that seemed to be authoritative
> enough to reference.

The docs/formats.txt file in the UW IMAP distribution
ftp://ftp.cac.washington.edu/mail/imap.tar.Z
describes several variants, some of which are not addressed
by the Bernstein document.  It also discusses a number of
interoperability issues not covered in the captioned draft.

As noted by Mark Crispin in the messages at the URL which you
have provided, different software can have somewhat different
interpretations of separator lines.  The only hope of having
any semblance of interoperability in exchanging such mbox
files between systems is to include an adequate specification
of the *specific* separator format used in the *particular*
file being transported (and I have suggested a regular
expression as a possible means of communicating that specification
via a Content-Type field parameter).

And I agree with Kai Henningsen's statement about use of
application/octet-stream; if there is not provision for
sufficient information (e.g. via parameters) to account for
the interoperability issues, then there's no point in having
a separate application subtype for it -- just use octet-stream
and let the communicating parties thrash out the details
out-of-band.  Or better yet (as briefly noted in the draft)
just use multipart/digest with local-system translations at
each end.

There are some additional issues that should be considered
(and either resolved or documented as known technical
omissions):
1. It is possible that processing by some transport (especially
   gateway) software may alter content which is not protected
   by transport encoding (and quoted-printable encoding might
   be insufficient).  That includes adding or removing trailing
   whitespace, modification of lines beginning with "From ", etc.
2. It is at best unclear how various software which uses one of
   the mbox format variants handles some 8bit and binary content,
   and indeed even 7bit content in MIME messages.  In particular,
   any attempt to convert line endings to/from the on-the-wire
   RFC [2]822 CRLF ending is likely to result in problems. For
   an example of binary content, see
   http://users.erols.com/blilly/mparse/tm35
   Note that the issue is not limited to binary content; "line
   ending" only has meaning in text media (not application, audio,
   image, video, model) and attempts at transforming non-text
   content (not merely text line endings, but e.g. content that
   happens to have <CR><LF>From<SP> in application, audio, etc.
   media) may irreversibly corrupt the content.

My opinion is that none of the mbox variants can handle the above
issues adequately.  Multipart/digest avoids the message separator
problem by using separators which are required not to be present
within the body of the message text in question.  Rigid formats
such as used by mbox variants are guaranteed to conflict with
some content at some point.  Some of the mbox variants appear
to have resulted from attempts to deal with various aspects of
the above and similar issues; the local storage methods that
appear to have the best chance of avoiding all of the problems
are those that use message-per-file storage in RFC [2]822
format (i.e. no attempt to convert line endings or otherwise
fold, spindle or mutilate message content) -- of course those
aren't "mbox" formats.

_______________________________________________
Ietf mailing list
Ietf@ietf.org
https://www1.ietf.org/mailman/listinfo/ietf