Re: New draft (Was: I-D ACTION:draft-klensin-unicode-escapes-00.txt

"Clive D.W. Feather" <clive@demon.net> Fri, 02 February 2007 12:21 UTC

Received: from [127.0.0.1] (helo=stiedprmman1.va.neustar.com) by megatron.ietf.org with esmtp (Exim 4.43) id 1HCxQ5-0000JK-H7; Fri, 02 Feb 2007 07:21:33 -0500
Received: from [10.91.34.44] (helo=ietf-mx.ietf.org) by megatron.ietf.org with esmtp (Exim 4.43) id 1HCxQ4-0000I3-89 for discuss@apps.ietf.org; Fri, 02 Feb 2007 07:21:32 -0500
Received: from anchor-internal-1.mail.demon.net ([195.173.56.100]) by ietf-mx.ietf.org with esmtp (Exim 4.43) id 1HCxQ2-0000gV-Rp for discuss@apps.ietf.org; Fri, 02 Feb 2007 07:21:32 -0500
Received: from finch-staff-1.server.demon.net (finch-staff-1.server.demon.net [193.195.224.1]) by anchor-internal-1.mail.demon.net with ESMTP� id l12Bl83R024325Fri, 2 Feb 2007 11:47:08 GMT
Received: from clive by finch-staff-1.server.demon.net with local (Exim 3.36 #1) id 1HCwsc-0009c1-00; Fri, 02 Feb 2007 11:46:58 +0000
Date: Fri, 02 Feb 2007 11:46:58 +0000
From: "Clive D.W. Feather" <clive@demon.net>
To: John C Klensin <john-ietf@jck.com>
Subject: Re: New draft (Was: I-D ACTION:draft-klensin-unicode-escapes-00.txt
Message-ID: <20070202114658.GX7742@finch-staff-1.thus.net>
References: <875A124D75A8B481E176CF06@p3.JCK.COM> <uppsr2hs59srbd7eufbcul5a1ekl7i09nl@hive.bjoern.hoehrmann.de> <EF59DA6FD89C4F19750C68C3@p3.JCK.COM>
Mime-Version: 1.0
Content-Type: text/plain; charset="us-ascii"
Content-Disposition: inline
In-Reply-To: <EF59DA6FD89C4F19750C68C3@p3.JCK.COM>
User-Agent: Mutt/1.5.3i
X-Spam-Score: 0.0 (/)
X-Scan-Signature: 7655788c23eb79e336f5f8ba8bce7906
Cc: discuss@apps.ietf.org
X-BeenThere: discuss@apps.ietf.org
X-Mailman-Version: 2.1.5
Precedence: list
List-Id: general discussion of application-layer protocols <discuss.apps.ietf.org>
List-Unsubscribe: <https://www1.ietf.org/mailman/listinfo/discuss>, <mailto:discuss-request@apps.ietf.org?subject=unsubscribe>
List-Post: <mailto:discuss@apps.ietf.org>
List-Help: <mailto:discuss-request@apps.ietf.org?subject=help>
List-Subscribe: <https://www1.ietf.org/mailman/listinfo/discuss>, <mailto:discuss-request@apps.ietf.org?subject=subscribe>
Errors-To: discuss-bounces@apps.ietf.org

>> In section 5.2 it is said HTML uses the &#xNNNN; form and that
>> this form has a clear terminator. This is not really false but
>> HTML allows to omit the terminator if it is not needed, for
>> example <p>Bj&#xf6rn</p> is also valid. I would suggest to
>> mention only XML or note that HTML's mechanism is similar to
>> that of XML.

If you check the HTML specification (section 5.3), it says that SGML
allows the semicolon to be omitted in certain contexts, but "strongly
suggest" not to do that.

In particular, the example
    <p>Bj&#xf6rn</p>
is not valid because it lies in the middle of a word. A permitted case
would be:
    <p>Bj&#xf6</p>
where the tag begin symbol < ends the entity.

-- 
Clive D.W. Feather  | Work:  <clive@demon.net>   | Tel:    +44 20 8495 6138
Internet Expert     | Home:  <clive@davros.org>  | Fax:    +44 870 051 9937
Demon Internet      | WWW: http://www.davros.org | Mobile: +44 7973 377646
THUS plc            |                            |