Re: What's left before a new RFC?

Marcel Waldvogel <marcel@news.m.wanda.ch> Thu, 18 April 2002 14:58 UTC

Received: from above.proper.com (above.proper.com [208.184.76.39]) by ietf.org (8.9.1a/8.9.1a) with ESMTP id KAA09273 for <openpgp-archive@odin.ietf.org>; Thu, 18 Apr 2002 10:58:40 -0400 (EDT)
Received: by above.proper.com (8.11.6/8.11.3) id g3IEkVE15632 for ietf-openpgp-bks; Thu, 18 Apr 2002 07:46:31 -0700 (PDT)
Received: from wanda.ch (adsl-73-116-basel1.tiscalinet.ch [212.254.73.116]) by above.proper.com (8.11.6/8.11.3) with ESMTP id g3IEkTm15628 for <ietf-openpgp@imc.org>; Thu, 18 Apr 2002 07:46:30 -0700 (PDT)
Received: from news.m.wanda.ch (pat.zurich.ibm.com [195.212.119.253]) by wanda.ch (8.11.6/8.11.3/SuSE Linux 8.11.1-0.5) with ESMTP id g3IEkMw14021; Thu, 18 Apr 2002 16:46:22 +0200
Message-ID: <3CBEDC3D.10809@news.m.wanda.ch>
Date: Thu, 18 Apr 2002 16:46:21 +0200
From: Marcel Waldvogel <marcel@news.m.wanda.ch>
User-Agent: Mozilla/5.0 (X11; U; Linux i686; en-US; rv:0.9.8) Gecko/20020204
X-Accept-Language: de, en
MIME-Version: 1.0
To: Marc Mutz <mutz@kde.org>
CC: Jon Callas <jon@callas.org>, ietf-openpgp@imc.org
Subject: Re: What's left before a new RFC?
References: <p05101585b8e3abe7a80e@[192.168.1.97]> <3CBE80FF.6040301@news.m.wanda.ch> <200204181248.57879@sendmail.mutz.com>
Content-Type: text/plain; charset="us-ascii"; format="flowed"
Content-Transfer-Encoding: 7bit
Sender: owner-ietf-openpgp@mail.imc.org
Precedence: bulk
List-Archive: <http://www.imc.org/ietf-openpgp/mail-archive/>
List-Unsubscribe: <mailto:ietf-openpgp-request@imc.org?body=unsubscribe>
List-ID: <ietf-openpgp.imc.org>
Content-Transfer-Encoding: 7bit

Marc Mutz wrote:

>-----BEGIN PGP SIGNED MESSAGE-----
>Hash: SHA1
>
>On Thursday 18 April 2002 10:17, Marcel Waldvogel wrote:
><snip>
>
>>For the Version and Comment headers, I propose to state that they are
>>UTF-8, but for interoperability, implementations SHOULD restrict
>>themselves to generate ASCII characters.
>>
><snip>
>
>I don't see the how having UTF-8 inside _ASCII_ Armor can be justified. The 
>problem is that the ascii armor is going to be used in non-8but-clean 
>environments. Else, you'd use the binary format, no?
>

One one side, we can consider cleartext signatures, which already can 
contain characters with the MSB set, and which are not necessarily 
armored for 8-bit cleanliness. Even there, the signature block is (among 
other things) ASCII-armored to not confuse the user (or his terminal) 
with weird chars.

There are multiple reasons for armoring non-7bit data:
a) The transport does not allow arbitrary 8bit sequences
b) The transport does not allow arbitrary "line" lengths
c) The transport does charset recoding, probably forth and back.

The headers that use UTF-8 are mostly for human consumption (I presume), 
and most modern non-8bit transports are of the form (b) or (c), which 
(1) should not harm typical headers nor (2) cause any damage, if they 
are slightly mangled (IMHO).


BTW: Does the "Version" header contain the product and version used 
("User-Agent"), or the version of the standard ("MIME-Version")? bis04 
seems to indicate the latter, but usage seems to be the former.

-Marcel