Re: Volunteer needed to serve as IANA charset reviewer

Tim Bray <tbray@textuality.com> Wed, 06 September 2006 23:27 UTC

Received: from [127.0.0.1] (helo=stiedprmman1.va.neustar.com) by megatron.ietf.org with esmtp (Exim 4.43) id 1GL6ng-0001V2-LX; Wed, 06 Sep 2006 19:27:20 -0400
Received: from [10.91.34.44] (helo=ietf-mx.ietf.org) by megatron.ietf.org with esmtp (Exim 4.43) id 1GL6nf-0001Ux-KC for discuss@apps.ietf.org; Wed, 06 Sep 2006 19:27:19 -0400
Received: from brmea-mail-2.sun.com ([192.18.98.43]) by ietf-mx.ietf.org with esmtp (Exim 4.43) id 1GL6nd-0003XN-9T for discuss@apps.ietf.org; Wed, 06 Sep 2006 19:27:19 -0400
Received: from fe-amer-06.sun.com ([192.18.108.180]) by brmea-mail-2.sun.com (8.13.6+Sun/8.12.9) with ESMTP id k86NR9No024196 for <discuss@apps.ietf.org>; Wed, 6 Sep 2006 17:27:09 -0600 (MDT)
Received: from conversion-daemon.mail-amer.sun.com by mail-amer.sun.com (Sun Java System Messaging Server 6.2-4.02 (built Sep 9 2005)) id <0J5700I012DG4600@mail-amer.sun.com> (original mail from tbray@textuality.com) for discuss@apps.ietf.org; Wed, 06 Sep 2006 17:27:09 -0600 (MDT)
Received: from [192.168.1.15] ([216.113.204.64]) by mail-amer.sun.com (Sun Java System Messaging Server 6.2-4.02 (built Sep 9 2005)) with ESMTPSA id <0J57007IJ2H8AOX0@mail-amer.sun.com>; Wed, 06 Sep 2006 17:27:09 -0600 (MDT)
Date: Wed, 06 Sep 2006 16:27:46 -0700
From: Tim Bray <tbray@textuality.com>
Subject: Re: Volunteer needed to serve as IANA charset reviewer
In-reply-to: <20060906174558.ff156501.moore@cs.utk.edu>
To: Keith Moore <moore@cs.utk.edu>
Message-id: <F43531E0-F945-4497-AA56-53AA37E759EB@textuality.com>
MIME-version: 1.0
X-Mailer: Apple Mail (2.752.2)
Content-type: text/plain; format="flowed"; delsp="yes"; charset="US-ASCII"
Content-transfer-encoding: 7bit
References: <p06240600c124bdb12d16@[10.0.1.2]> <BDA09F0B9086491428F8F2FC@p3.JCK.COM> <01M6VLE70BJ60008CX@mauve.mrochek.com> <20060906174558.ff156501.moore@cs.utk.edu>
X-Spam-Score: 0.0 (/)
X-Scan-Signature: ffa9dfbbe7cc58b3fa6b8ae3e57b0aa3
Cc: hardie@qualcomm.com, discuss@apps.ietf.org, Ned Freed <ned.freed@mrochek.com>, ietf-charsets@iana.org
X-BeenThere: discuss@apps.ietf.org
X-Mailman-Version: 2.1.5
Precedence: list
List-Id: general discussion of application-layer protocols <discuss.apps.ietf.org>
List-Unsubscribe: <https://www1.ietf.org/mailman/listinfo/discuss>, <mailto:discuss-request@apps.ietf.org?subject=unsubscribe>
List-Post: <mailto:discuss@apps.ietf.org>
List-Help: <mailto:discuss-request@apps.ietf.org?subject=help>
List-Subscribe: <https://www1.ietf.org/mailman/listinfo/discuss>, <mailto:discuss-request@apps.ietf.org?subject=subscribe>
Errors-To: discuss-bounces@apps.ietf.org

On Sep 6, 2006, at 2:45 PM, Keith Moore wrote:

> As for utf-8 vs. Unicode, this is a bit tricky.  I agree that merely
> specifying Unicode isn't sufficient given the potential for
> incompatible CESs.  And yet I'm sympathetic to the notion that UTF-8
> pessimizes storage and transmission of text written in certain
> languages.  IMHO it's unreasonable to exclude the potential for a
> Unicode based CES that has more-or-less equivalent information
> density across a wide variety of languages.  But I do think that  
> use of
> multiple CESs in a new protocol should require substantial
> justification, and that UTF-8 should be presumed to be the CES of
> choice for any new protocol that requires ASCII compatibility for its
> character representation.

Agreed on all counts.  Section 5.1 of RFC3470 (aka BCP70) says smart  
things about this, referencing 2277.  Basically, if you're going to  
use XML, there's probably no point trying to legislate against UTF-16  
since any conformant reader is required to accept it, and in practice  
all known XML software can handle 8859 and Shift-JIS and EUC.   But  
if you're not doing XML, compulsory UTF-8 removes a lot of failure  
points without costing much.

   -Tim