Re: FWD: I-D ACTION:draft-klensin-unicode-escapes-00.txt

Keith Moore <moore@cs.utk.edu> Fri, 19 January 2007 16:43 UTC

Message-ID: <45B0F536.2040208@cs.utk.edu>
Date: Fri, 19 Jan 2007 11:43:34 -0500
From: Keith Moore <moore@cs.utk.edu>
User-Agent: Thunderbird 1.5.0.9 (Macintosh/20061207)
MIME-Version: 1.0
To: John C Klensin <klensin@jck.com>
Subject: Re: FWD: I-D ACTION:draft-klensin-unicode-escapes-00.txt
References: <891E235E7A867F0DB506C90A@p3.JCK.COM>
In-Reply-To: <891E235E7A867F0DB506C90A@p3.JCK.COM>
Content-Type: text/plain; charset=ISO-8859-1; format=flowed
Content-Transfer-Encoding: 7bit
Cc: discuss@apps.ietf.org

one more caveat: protocol specifications need to specify this notation 
explicitly (either directly or by reference to the published RFC) if 
they are going to use it. conversely, this notation SHOULD NOT (maybe 
MUST NOT) be used unless it is part of the protocol specification.

one problem I see with introducing any new notation for characters is 
that it creates normalization issues by introducing additional ways 
to say the same thing.  e.g. after introducing this notation, "A" could 
also be expressed as \u0041 or \U00000041.  it then becomes 
necessary to convert these escapes back to the characters they denote 
when copying fields from a protocol that supports the new notation (or 
in which it is benign) to another protocol that does not support it.
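
to make the normalization point concrete, here is a minimal sketch in 
Python.  it assumes only the two escape forms used in the example above 
(\uXXXX with four hex digits, \UXXXXXXXX with eight); the helper name 
decode_escapes is hypothetical, and the sketch deliberately ignores 
escaped backslashes and surrogate code points, which a real 
specification would have to address.

```python
import re

# Assumed escape forms, taken from the example in this message:
# \uXXXX (4 hex digits) and \UXXXXXXXX (8 hex digits).
_ESCAPE = re.compile(r"\\u([0-9A-Fa-f]{4})|\\U([0-9A-Fa-f]{8})")

MAX_CODE_POINT = 0x10FFFF


def decode_escapes(field):
    """Replace each escape with the character it names (hypothetical helper).

    Escapes naming a value beyond U+10FFFF are left untouched.
    """
    def _sub(match):
        hex_digits = match.group(1) or match.group(2)
        code_point = int(hex_digits, 16)
        if code_point > MAX_CODE_POINT:
            return match.group(0)
        return chr(code_point)

    return _ESCAPE.sub(_sub, field)


# Three spellings that compare unequal as raw strings...
spellings = ["A", r"\u0041", r"\U00000041"]

# ...but denote the same character once the escapes are decoded.
decoded = [decode_escapes(s) for s in spellings]
```

the three raw spellings are distinct strings, so a byte-for-byte 
comparison treats them as different values; only after decoding do they 
collapse to one.  a gateway copying a field into a protocol without the 
notation would need a step like this, and would need to know that the 
source protocol actually defines the notation before applying it.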

as an example of a potential source of problems, I'd hate to see this 
notation end up in X.509 certs.