Re: Last Call: 'The APPLICATION/MBOX Media-Type' to Proposed Standard

Valdis.Kletnieks@vt.edu Sat, 14 August 2004 00:45 UTC

Received: from ietf-mx.ietf.org (ietf-mx.ietf.org [132.151.6.1]) by ietf.org (8.9.1a/8.9.1a) with ESMTP id UAA28591; Fri, 13 Aug 2004 20:45:35 -0400 (EDT)
Received: from megatron.ietf.org ([132.151.6.71]) by ietf-mx.ietf.org with esmtp (Exim 4.33) id 1BvmlD-0002ci-UP; Fri, 13 Aug 2004 20:51:05 -0400
Received: from localhost.localdomain ([127.0.0.1] helo=megatron.ietf.org) by megatron.ietf.org with esmtp (Exim 4.32) id 1BvmcT-00007m-7b; Fri, 13 Aug 2004 20:42:01 -0400
Received: from odin.ietf.org ([132.151.1.176] helo=ietf.org) by megatron.ietf.org with esmtp (Exim 4.32) id 1Bvmah-00082l-As; Fri, 13 Aug 2004 20:40:13 -0400
Received: from ietf-mx.ietf.org (ietf-mx.ietf.org [132.151.6.1]) by ietf.org (8.9.1a/8.9.1a) with ESMTP id UAA28046; Fri, 13 Aug 2004 20:40:09 -0400 (EDT)
From: Valdis.Kletnieks@vt.edu
Received: from turing-police.cc.vt.edu ([128.173.14.107] ident=root) by ietf-mx.ietf.org with esmtp (Exim 4.33) id 1Bvmfx-0002X6-FE; Fri, 13 Aug 2004 20:45:38 -0400
Received: from turing-police.cc.vt.edu (IDENT:valdis@turing-police.cc.vt.edu [127.0.0.1]) by turing-police.cc.vt.edu (8.13.1/8.13.1) with ESMTP id i7E0e5Pk023772; Fri, 13 Aug 2004 20:40:05 -0400
Message-Id: <200408140040.i7E0e5Pk023772@turing-police.cc.vt.edu>
X-Mailer: exmh version 2.7.1 07/26/2004 with nmh-1.1-RC3
To: Tony Hansen <tony@att.com>
In-Reply-To: Your message of "Thu, 12 Aug 2004 17:18:19 EDT." <411BDE9B.7090706@att.com>
References: <p06110437bd3d5d396de1@[10.20.30.249]> <411AB496.8080307@erols.com> <411BDE9B.7090706@att.com>
Mime-Version: 1.0
Date: Fri, 13 Aug 2004 20:40:05 -0400
X-Spam-Score: 0.3 (/)
X-Scan-Signature: 32b73d73e8047ed17386f9799119ce43
Cc: ietf-822@imc.org, ehall@ntrg.com, iesg@ietf.org, ietf@ietf.org
Subject: Re: Last Call: 'The APPLICATION/MBOX Media-Type' to Proposed Standard
X-BeenThere: ietf@ietf.org
X-Mailman-Version: 2.1.5
Precedence: list
List-Id: IETF-Discussion <ietf.ietf.org>
List-Unsubscribe: <https://www1.ietf.org/mailman/listinfo/ietf>, <mailto:ietf-request@ietf.org?subject=unsubscribe>
List-Post: <mailto:ietf@ietf.org>
List-Help: <mailto:ietf-request@ietf.org?subject=help>
List-Subscribe: <https://www1.ietf.org/mailman/listinfo/ietf>, <mailto:ietf-request@ietf.org?subject=subscribe>
Content-Type: multipart/mixed; boundary="===============0956735048=="
Sender: ietf-bounces@ietf.org
Errors-To: ietf-bounces@ietf.org
X-Spam-Score: 0.3 (/)
X-Scan-Signature: 7fa173a723009a6ca8ce575a65a5d813

On Thu, 12 Aug 2004 17:18:19 EDT, Tony Hansen said:
> The information about the mbox format being anecdotally defined is 
> incorrect. The mbox format has traditionally been documented in the 
> binmail(1) or mail.local(8) man pages (BSD UNIX derivatives) or mail(1) 
> man page (UNIX System 3/5/III/V derivatives). There have been several 
> variants of the mbox format in use by those different systems. The most 
> complete description of an mbox format can be seen in the man page from 
> any UNIX System Vr4 derived system, such as Solaris.

Umm.. Tony?  I hate to say it, but if there have been several variants used in
the wild, and the man pages for said variants document different formats,
that's awfully close to "anecdotally defined" when you're doing a standard.

For example, a Solaris 8 box across the hall says in 'man mail.local':

     Each delivered mail message in the mailbox is preceded by  a
     "Unix From line" with the following format:

          From sender_address time_stamp

     The sender_address  is  extracted  from  the  SMTP  envelope
     address  (the  envelope  address  is  specified  with the -f
     option).

     A trailing blank line is also added to the end of each  mes-
     sage.

Hmm. Nothing about whether the sender_address is, or should be, <bracketed>.
Nothing about the format of the time_stamp. Nothing about '>From ' stuffing
(and yes, I've seen systems that don't do it at all, and systems that only
>-stuff if the From line matched a regexp for what *they* think the entire 'From '
line looks like(*)). The Sendmail 8.13.1 mail.local does say >-stuffing
happens for lines that "which could be mistaken for a ``From '' delimiter
line", and the code actually checks for exactly 5 chars...

Any doubts that this whole mess is at best anecdotally defined can be dispelled by
mentioning "Content-Length:" (interestingly enough, not even mentioned in the
Solaris or Sendmail man pages, although the Sendmail source tree does mention
that building on Solaris 2.3 or later will turn it on.  Of interest mostly because
the Content-Length: is so easily broken by later >-stuffing/unstuffing or other
similar conversion...

(*) time_stamp. Argh.  Fought with this during a data/machine migration.
Write code that will accept a 26 byte ctime format: 'Fri Sep 13 00:00:00 1986\n\0'.
Works fine once you realize that some systems just used 'From envelop_address'
without a timestamp.

Then I get handed this: 'Fri Aug 13 20:21:32 EDT 2004'.  Fix that, and find some
joker running in a French locale: 'vendredi, 13 août 2004, 20:22:01 EDT'.
And yes, his b0rked software only >-stuffed 'From ' lines that regexp-matched
the *French* variant. Took me *quite* some time to twig into THAT one...

_______________________________________________
Ietf mailing list
Ietf@ietf.org
https://www1.ietf.org/mailman/listinfo/ietf