Re: Last Call: 'The APPLICATION/MBOX Media-Type' to Proposed Standard

Valdis.Kletnieks@vt.edu Tue, 17 August 2004 17:20 UTC

Received: from ietf-mx.ietf.org (ietf-mx.ietf.org [132.151.6.1]) by ietf.org (8.9.1a/8.9.1a) with ESMTP id NAA01061; Tue, 17 Aug 2004 13:20:50 -0400 (EDT)
Received: from megatron.ietf.org ([132.151.6.71]) by ietf-mx.ietf.org with esmtp (Exim 4.33) id 1Bx7jp-00008d-78; Tue, 17 Aug 2004 13:27:09 -0400
Received: from localhost.localdomain ([127.0.0.1] helo=megatron.ietf.org) by megatron.ietf.org with esmtp (Exim 4.32) id 1Bx7Uj-0001Kk-1K; Tue, 17 Aug 2004 13:11:33 -0400
Received: from odin.ietf.org ([132.151.1.176] helo=ietf.org) by megatron.ietf.org with esmtp (Exim 4.32) id 1Bx7PA-0000G8-Fd; Tue, 17 Aug 2004 13:05:48 -0400
Received: from ietf-mx.ietf.org (ietf-mx.ietf.org [132.151.6.1]) by ietf.org (8.9.1a/8.9.1a) with ESMTP id NAA29873; Tue, 17 Aug 2004 13:05:45 -0400 (EDT)
From: Valdis.Kletnieks@vt.edu
Received: from natted.sendmail.com ([63.211.143.38] helo=turing-police.cc.vt.edu) by ietf-mx.ietf.org with esmtp (Exim 4.33) id 1Bx7VD-0008FL-7X; Tue, 17 Aug 2004 13:12:03 -0400
Received: from turing-police.cc.vt.edu (IDENT:valdis@turing-police.cc.vt.edu [127.0.0.1]) by turing-police.cc.vt.edu (8.13.1/8.13.1) with ESMTP id i7HH5Rx8019339; Tue, 17 Aug 2004 13:05:27 -0400
Message-Id: <200408171705.i7HH5Rx8019339@turing-police.cc.vt.edu>
X-Mailer: exmh version 2.7.1 07/26/2004 with nmh-1.1-RC3
To: Tony Hansen <tony@att.com>
In-Reply-To: Your message of "Mon, 16 Aug 2004 22:47:52 EDT." <412171D8.9010201@att.com>
References: <p06110437bd3d5d396de1@[10.20.30.249]> <411AB496.8080307@erols.com> <411BDE9B.7090706@att.com> <200408140040.i7E0e5Pk023772@turing-police.cc.vt.edu> <412171D8.9010201@att.com>
Mime-Version: 1.0
Date: Tue, 17 Aug 2004 13:05:27 -0400
X-Spam-Score: 0.3 (/)
X-Scan-Signature: 10ba05e7e8a9aa6adb025f426bef3a30
Cc: ietf-822@imc.org, iesg@ietf.org, ietf@ietf.org
Subject: Re: Last Call: 'The APPLICATION/MBOX Media-Type' to Proposed Standard
X-BeenThere: ietf@ietf.org
X-Mailman-Version: 2.1.5
Precedence: list
List-Id: IETF-Discussion <ietf.ietf.org>
List-Unsubscribe: <https://www1.ietf.org/mailman/listinfo/ietf>, <mailto:ietf-request@ietf.org?subject=unsubscribe>
List-Post: <mailto:ietf@ietf.org>
List-Help: <mailto:ietf-request@ietf.org?subject=help>
List-Subscribe: <https://www1.ietf.org/mailman/listinfo/ietf>, <mailto:ietf-request@ietf.org?subject=subscribe>
Content-Type: multipart/mixed; boundary="===============1850976222=="
Sender: ietf-bounces@ietf.org
Errors-To: ietf-bounces@ietf.org
X-Spam-Score: 0.3 (/)
X-Scan-Signature: 6d95a152022472c7d6cdf886a0424dc6

On Mon, 16 Aug 2004 22:47:52 EDT, Tony Hansen said:

> The claim in Appendix A is that there were no authoritative sources of 
> documentation for the mbox formats and otherwise it's "only documented 
> in anecdotal form". I'm sorry, but the the definitions ARE there, and 
> ARE almost always authoritative for those systems.

Somehow, I can't get thrilled by the concept of saying a format is documented
because we have (for example) 3 systems, and each has an authoritative
definition of the version it uses, and the definitions are incompatible (and
yes, the Solaris 'content-length:' scheme and '>from ' escaping are basically
incompatible - there exist messages that can't be converted from one to the
other without information loss).

> Because Solaris 8 is System Vr4-derived, you should look at 'man mail' 
> for the definitive definition. You'll find Content-Length: documented there.

It says:

     A letter is composed of some  header  lines  followed  by  a
     blank line followed by the message content. The header lines
     section of the letter consists of one  or  more  UNIX  post-
     marks:

           From     sender     date_and_time     [remote     from
          remote_system_name]

     followed by one or more standardized message header lines of
     the form:

           keyword-name: [printable text]

     where keyword-name  is  comprised  of  any  printable,  non-
     whitespace  characters  other  than  colon (`:'). A Content-
     Length: header line, indicating the number of bytes  in  the
     message  content  will  always  be present unless the letter
     consists of only header lines with  no  message  content.

For bonus points - is the 'crlf-crlf' between the header and the body included
in the Content-Length:?  There's other issues as well - what if the
Content-Length: is computed across a non-canonified message - how do
you send it across the wire?

'man mail' doesn't mention escaping a 'From ' inside a message,
except for this:

     The default mode for printing messages is  to  display  only
     those header lines of immediate interest. These include, but
     are not limited to,  the  UNIX  From  and  >From  postmarks,
     From:,  Date:,  Subject:,  and Content-Length: header lines,
     and any recipient header lines such as To:, Cc:,  Bcc:,  and
     so  forth.  After the header lines have been displayed, mail

Of course, that's because Solaris doesn't use '>From ' escaping
because it has Content-Length instead.

Should other systems trust the value of a Content-Length:?

Should other systems be required to include a Content-Length?

Should other systems escape a 'From ' iff there's no Content-Length?

What if an mbox file has a Content-Length on some items but not others?

How do you recover from a corrupted Content-Length?

So - where is the *one true canonical* definition of an mbox that actually
answers all these basic questions that an implementer *needs* to know the
answer to?



_______________________________________________
Ietf mailing list
Ietf@ietf.org
https://www1.ietf.org/mailman/listinfo/ietf