Re: Typographical error in draft-klensin-unicode-escapes-00

John C Klensin <john-ietf@jck.com> Thu, 25 January 2007 14:48 UTC

Received: from [127.0.0.1] (helo=stiedprmman1.va.neustar.com) by megatron.ietf.org with esmtp (Exim 4.43) id 1HA5tQ-0005OW-Mj; Thu, 25 Jan 2007 09:48:00 -0500
Received: from [10.91.34.44] (helo=ietf-mx.ietf.org) by megatron.ietf.org with esmtp (Exim 4.43) id 1HA5tP-0005Nb-GA for discuss@apps.ietf.org; Thu, 25 Jan 2007 09:47:59 -0500
Received: from ns.jck.com ([209.187.148.211] helo=bs.jck.com) by ietf-mx.ietf.org with esmtp (Exim 4.43) id 1HA5tO-0005qE-5q for discuss@apps.ietf.org; Thu, 25 Jan 2007 09:47:59 -0500
Received: from [127.0.0.1] (helo=p3.JCK.COM) by bs.jck.com with esmtp (Exim 4.34) id 1HA5tI-000BLF-AD; Thu, 25 Jan 2007 09:47:52 -0500
Date: Thu, 25 Jan 2007 09:47:51 -0500
From: John C Klensin <john-ietf@jck.com>
To: "Clive D.W. Feather" <clive@demon.net>
Subject: Re: Typographical error in draft-klensin-unicode-escapes-00
Message-ID: <638CDDD4BD2EE58B558CB643@p3.JCK.COM>
In-Reply-To: <20070122092237.GL60599@finch-staff-1.thus.net>
References: <86EE78FED516BB38D35A7729@p3.JCK.COM> <20070122064002.GA60599@finch-staff-1.thus.net> <B94B3EA504C74B8386061138@p3.JCK.COM> <20070122092237.GL60599@finch-staff-1.thus.net>
X-Mailer: Mulberry/4.0.7 (Win32)
MIME-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Transfer-Encoding: 7bit
Content-Disposition: inline
X-Spam-Score: 1.1 (+)
X-Scan-Signature: 9ed51c9d1356100bce94f1ae4ec616a9
Cc: discuss@apps.ietf.org
X-BeenThere: discuss@apps.ietf.org
X-Mailman-Version: 2.1.5
Precedence: list
List-Id: general discussion of application-layer protocols <discuss.apps.ietf.org>
List-Unsubscribe: <https://www1.ietf.org/mailman/listinfo/discuss>, <mailto:discuss-request@apps.ietf.org?subject=unsubscribe>
List-Post: <mailto:discuss@apps.ietf.org>
List-Help: <mailto:discuss-request@apps.ietf.org?subject=help>
List-Subscribe: <https://www1.ietf.org/mailman/listinfo/discuss>, <mailto:discuss-request@apps.ietf.org?subject=subscribe>
Errors-To: discuss-bounces@apps.ietf.org


--On Monday, 22 January, 2007 09:22 +0000 "Clive D.W. Feather"
<clive@demon.net> wrote:

> We may have to agree to disagree. When I see "4*4XXX", it
> makes me wonder if the author actually meant "4*" or "*4" and
> misunderstood the notation.

I am skipping ABNF quibbles for -01, leaving it for -02 (I
hope).  It would take very little more of this particular
discussion for me to shift back to pure BNF and write a
justification for it rather than trying to use ABNF at all.  The
tool is, IMO, complicating clear expression here, rather than
aiding it.   However, see my note from yesterday.   However...
 
> Incidentally, there's no significance to the groupings of 4 in
> the \u notation, so I see little point in introducing
> "Hex-quad".

That terminology was appropriated from the C documents.

> The term "BMP" is, as I understand it, deprecated
> these days.

I thought so too, until I discovered that it is extensively used
in the Unicode 5.0 book.    Take it up with the UTC :-(

> So I would just write:
> 
>     EmbeddedUnicodeChar = %x5C.75 4HexDigit / %x5C.55 8HexDigit
> 
> or if you really want:
> 
>     EmbeddedUnicodeChar = UnicodeShortForm / UnicodeLongForm
>     UnicodeShortForm    = %x5C.75 4HexDigit
>     UnicodeLongForm     = %x5C.55 8HexDigit
> 
> [Note that you omitted the / from your definition.]

Caught and fixed.  Thanks.
    john