Re: Character encodings in headers [i74][was: Straw-man charter for http-bis]

Keith Moore <moore@cs.utk.edu> Mon, 20 August 2007 13:04 UTC

Message-ID: <46C99137.80007@cs.utk.edu>
Date: Mon, 20 Aug 2007 09:03:51 -0400
From: Keith Moore <moore@cs.utk.edu>
User-Agent: Thunderbird 2.0.0.6 (Macintosh/20070728)
MIME-Version: 1.0
To: Martin Duerst <duerst@it.aoyama.ac.jp>
Subject: Re: Character encodings in headers [i74][was: Straw-man charter for http-bis]
References: <BA772834-227A-4C1B-9534-070C50DF05B3@mnot.net> <392C98BA-E7B8-44ED-964B-82FC48162924@mnot.net> <p06240843c2833f4d7f2f@[10.20.30.108]> <465D9142.9050506@gmx.de> <6.0.0.20.2.20070610165356.0a69cec0@localhost> <088FB13E-F12F-4BE7-94FB-78B21C51512E@mnot.net> <46C93B36.7070503@cs.utk.edu> <6.0.0.20.2.20070820170314.07449b20@localhost>
In-Reply-To: <6.0.0.20.2.20070820170314.07449b20@localhost>
Cc: Felix Sasaki <fsasaki@w3.org>, Richard Ishida <ishida@w3.org>, Apps Discuss <discuss@apps.ietf.org>, Mark Nottingham <mnot@mnot.net>, "ietf-http-wg@w3.org Group" <ietf-http-wg@w3.org>, Paul Hoffman <phoffman@imc.org>

>> also, I'll note that supporting utf-8 in a way that is backward
>> compatible with existing implementations is almost certainly more
>> complex (and thus more costly, error-prone, etc) than supporting rfc 2047.
>
> Well, if "backwards compatible" means also supporting RFC 2047,

of course it does mean that, as you're not going to get rid of the need
to interoperate with the installed base of clients and servers anytime soon.

> If the choice is between UTF-8 and RFC 2047,
> however, then I'd take UTF-8 any time, because RFC 2047 includes
> UTF-8 as well as many other encodings.

if we had the luxury of starting from scratch, I'd agree with you.

though my (fuzzy) memory seems to say that at the time HTTP was
standardized, lots of people insisted on using ISO-8859-1 rather than
either ASCII or any form of Unicode - again for backward compatibility
reasons.

but how much of a problem is this really?
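
For readers who want to see the trade-off concretely, here is a minimal
Python sketch of the three representations discussed in this thread: an
RFC 2047 encoded-word, raw UTF-8 bytes, and raw ISO-8859-1 bytes. The
sample value "Düsseldorf" and the helper name decode_rfc2047 are
assumptions made up for illustration; nothing here is taken from the HTTP
specification or from any particular client or server.

```python
from email.header import Header, decode_header

# Hypothetical header value containing one non-ASCII character.
value = "D\u00fcsseldorf"

# RFC 2047: the wire form is pure ASCII and names its own charset.
encoded_word = Header(value, charset="utf-8").encode()

# Raw 8-bit alternatives: the bytes alone do not say which charset they use.
utf8_bytes = value.encode("utf-8")
latin1_bytes = value.encode("iso-8859-1")


def decode_rfc2047(text):
    """Decode a header value that may contain RFC 2047 encoded-words."""
    parts = []
    for chunk, charset in decode_header(text):
        if isinstance(chunk, bytes):
            chunk = chunk.decode(charset or "ascii")
        parts.append(chunk)
    return "".join(parts)


# A receiver that assumes ISO-8859-1 accepts the UTF-8 bytes without error
# but recovers the wrong characters.
misread = utf8_bytes.decode("iso-8859-1")

# A receiver that assumes UTF-8 cannot decode the ISO-8859-1 bytes at all.
try:
    latin1_bytes.decode("utf-8")
    latin1_is_valid_utf8 = True
except UnicodeDecodeError:
    latin1_is_valid_utf8 = False

print(encoded_word)
print(decode_rfc2047(encoded_word) == value)
print(misread != value, latin1_is_valid_utf8)
```

The encoded-word round-trips because its charset travels with it, at the
cost of an extra decoding step. The raw forms are shorter, but each one is
only readable when sender and receiver already agree on the charset, which
is where the installed base comes in.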