Re: Volunteer needed to serve as IANA charset reviewer
Bruce Lilly <blilly@erols.com> Thu, 07 September 2006 17:38 UTC
Received: from [127.0.0.1] (helo=stiedprmman1.va.neustar.com) by megatron.ietf.org with esmtp (Exim 4.43) id 1GLNpj-0003h6-Se; Thu, 07 Sep 2006 13:38:35 -0400
Received: from [10.91.34.44] (helo=ietf-mx.ietf.org) by megatron.ietf.org with esmtp (Exim 4.43) id 1GLHD6-0002UG-6p for discuss@apps.ietf.org; Thu, 07 Sep 2006 06:34:16 -0400
Received: from smtp02.lnh.mail.rcn.net ([207.172.157.102]) by ietf-mx.ietf.org with esmtp (Exim 4.43) id 1GLHD3-000092-QP for discuss@apps.ietf.org; Thu, 07 Sep 2006 06:34:16 -0400
Received: from mr02.lnh.mail.rcn.net ([207.172.157.22]) by smtp02.lnh.mail.rcn.net with ESMTP; 07 Sep 2006 06:34:13 -0400
Received: from smtp01.lnh.mail.rcn.net (smtp01.lnh.mail.rcn.net [207.172.4.11]) by mr02.lnh.mail.rcn.net (MOS 3.7.5a-GA) with ESMTP id MFA75622; Thu, 7 Sep 2006 06:34:10 -0400 (EDT)
Received: from 68-187-229-110.dhcp.oxfr.ma.charter.com (HELO mail.blilly.com) ([68.187.229.110]) by smtp01.lnh.mail.rcn.net with ESMTP; 07 Sep 2006 06:34:06 -0400
X-IronPort-AV: i="4.08,223,1154923200"; d="scan'208"; a="272470077:sNHT27785664"
Received: from marty.blilly.com (marty.blilly.com [192.168.99.98]) by mail.blilly.com with ESMTP id k87AXxPx006736(8.13.6/8.13.6/mail.blilly.com /etc/sendmail.mc.mail 1.28 2006/06/11 04:26:23) (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=FAIL) ; Thu, 7 Sep 2006 06:33:59 -0400
Received: from marty.blilly.com (localhost [127.0.0.1]) (authenticated (0 bits)) by marty.blilly.com with ESMTP id k87AXwtB006735(8.13.6/8.13.6/blilly.com submit.mc 1.3 2005/04/08 12:29:31) (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=NO) ; Thu, 7 Sep 2006 06:33:59 -0400
From: Bruce Lilly <blilly@erols.com>
Organization: Bruce Lilly
To: ietf-charsets@iana.org
Subject: Re: Volunteer needed to serve as IANA charset reviewer
Date: Thu, 07 Sep 2006 06:33:48 -0400
User-Agent: KMail/1.9.4
References: <p06240600c124bdb12d16@[10.0.1.2]> <20060906174558.ff156501.moore@cs.utk.edu> <01M6VTGKMA620008CX@mauve.mrochek.com>
In-Reply-To: <01M6VTGKMA620008CX@mauve.mrochek.com>
MIME-Version: 1.0
Content-Type: text/plain; charset="us-ascii"
Content-Transfer-Encoding: 7bit
Content-Disposition: inline
Message-Id: <200609070633.54738@mail.blilly.com>
X-Junkmail-Status: score=10/50, host=mr02.lnh.mail.rcn.net
X-Junkmail-SD-Raw: score=unknown, refid=str=0001.0A090205.44FFF41E.0089,ss=1,fgs=0, ip=207.172.4.11, so=2006-05-09 23:27:51, dmn=5.2.113/2006-07-26
X-Spam-Score: 0.1 (/)
X-Scan-Signature: b280b4db656c3ca28dd62e5e0b03daa8
X-Mailman-Approved-At: Thu, 07 Sep 2006 13:38:34 -0400
Cc: Keith Moore <moore@cs.utk.edu>, hardie@qualcomm.com, Ned Freed <ned.freed@mrochek.com>, discuss@apps.ietf.org
X-BeenThere: discuss@apps.ietf.org
X-Mailman-Version: 2.1.5
Precedence: list
Reply-To: ietf-charsets@iana.org
List-Id: general discussion of application-layer protocols <discuss.apps.ietf.org>
List-Unsubscribe: <https://www1.ietf.org/mailman/listinfo/discuss>, <mailto:discuss-request@apps.ietf.org?subject=unsubscribe>
List-Post: <mailto:discuss@apps.ietf.org>
List-Help: <mailto:discuss-request@apps.ietf.org?subject=help>
List-Subscribe: <https://www1.ietf.org/mailman/listinfo/discuss>, <mailto:discuss-request@apps.ietf.org?subject=subscribe>
Errors-To: discuss-bounces@apps.ietf.org
On Wed September 6 2006 18:58, Ned Freed wrote: > > I concur with the need to maintain the current charset registry to > > support legacy apps that use it. > > > And I think Ned would be an excellent choice for reviewer, though it > > wouldn' t bother me if he could have the assistance of people with > > specialized expertise in Asian writing schemes. > > Any such assistance would be hugely welcome. As an aside, it would also be nice > if more people would post comments to the list... OK. I concur with most of what has already been said by others, specifically that if a charset (i.e. something meeting the definition of charset) is in use, it ought to be registered; using the registry as a way to force some agenda is a very bad idea. Also that Ned would be an excellent choice for reviewer, and I would add that I fully support his stated plan to overhaul the existing registry, which has long been in need of such an overhaul (e.g. the registration procedure has long said that "ASCII" is disallowed, yet it is in fact registered as an alias). A few differences of opinion: Keith Moore wrote: > > But I do think that use of > > multiple CESs in a new protocol should require substantial > > justification, and that UTF-8 should be presumed to be the CES of > > choice for any new protocol that requires ASCII compatibility for its > > character representation. There may well be areas of application for new protocols which cannot fully support Unicode which underlies use of utf-8, due to character set size, huge tables needed for normalization, etc. (see sections 3.1 (paying particular attention to "memory-starved microprocessors") and 3.4 of RFC 1958). Not all protocols need to fully support utf-8 directly; the highly successful mail system, for example, supports only a subset of ANSI X3.4 in message header fields, yet it allows pass-through of utf-8 and other charsets via RFC 2047 mechanisms as amended by RFC 2231 and errata. Ted Hardie wrote: > > This question is motivated, not by a strong love for Unicode, > > but by the observation that RFC 2277 requires it and that the > > IETF is shifting toward it in a number of areas. To be precise, RFC 2277 says: " Protocols MUST be able to use the UTF-8 charset, which consists of the ISO 10646 coded character set combined with the UTF-8 character encoding scheme, as defined in [10646] Annex R (published in Amendment 2), for all text. Protocols MAY specify, in addition, how to use other charsets or other character encoding schemes for ISO 10646, such as UTF-16, but lack of an ability to use UTF-8 is a violation of this policy; such a violation would need a variance procedure ([BCP9] section 9) with clear and solid justification in the protocol specification document before being entered into or advanced upon the standards track. For existing protocols or protocols that move data from existing datastores, support of other charsets, or even using a default other than UTF-8, may be a requirement. This is acceptable, but UTF-8 support MUST be possible. When using other charsets than UTF-8, these MUST be registered in the IANA charset registry, if necessary by registering them when the protocol is published. " Several points: 1. "MUST be able to use" is a bit different from "requires" (see the above example of the mail system, which is able to use utf-8 by the mechanisms noted, but which does not require and in fact cannot directly accommodate raw utf-8). 2. The explicitly stated policy of allowing alternative charsets is important. 3. Most important, note that 2277 explicitly requires registration. > I have no problem with UTF-16 or > UTF-32 if there is a compelling reason to allow them, Well neither (as well as their "BE" and "LE" variants) is suitable for use with MIME text types, which precludes their use in a number of important applications. And one thing the charset registry sorely needs is a more explicit indication of which charsets are/are not suitable for such use (heck, some registrations have lacked the required statement of [un]suitability, so even groping through all of the registrations is of no use (and don't get me started on RFC 1345 issues)).
- Re: Volunteer needed to serve as IANA charset rev… John C Klensin
- Volunteer needed to serve as IANA charset reviewer Ted Hardie
- Re: Volunteer needed to serve as IANA charset rev… John C Klensin
- Re: Volunteer needed to serve as IANA charset rev… Tim Bray
- Re: Volunteer needed to serve as IANA charset rev… Ned Freed
- Re: Volunteer needed to serve as IANA charset rev… Keith Moore
- Re: Volunteer needed to serve as IANA charset rev… Tim Bray
- Re: Volunteer needed to serve as IANA charset rev… Ned Freed
- Re: Volunteer needed to serve as IANA charset rev… Bruce Lilly
- Re: Volunteer needed to serve as IANA charset rev… Keld Jørn Simonsen
- Re: Volunteer needed to serve as IANA charset rev… Terje Bless
- Re: Volunteer needed to serve as IANA charset rev… Mark Davis
- Re: Volunteer needed to serve as IANA charset rev… Martin Duerst
- Problems (and non-problems) in charset registry (… Martin Duerst
- Re: Problems (and non-problems) in charset regist… Ned Freed