Re: [Json] JSON: remove gap between Ecma-404 and IETF draft

ht@inf.ed.ac.uk (Henry S. Thompson) Thu, 14 November 2013 09:44 UTC

Return-Path: <ht@inf.ed.ac.uk>
X-Original-To: json@ietfa.amsl.com
Delivered-To: json@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 2401421E81DB; Thu, 14 Nov 2013 01:44:44 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -7.004
X-Spam-Level:
X-Spam-Status: No, score=-7.004 tagged_above=-999 required=5 tests=[AWL=-0.405, BAYES_00=-2.599, RCVD_IN_DNSWL_MED=-4]
Received: from mail.ietf.org ([12.22.58.30]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id Yi19c38zd6zH; Thu, 14 Nov 2013 01:44:39 -0800 (PST)
Received: from nougat.ucs.ed.ac.uk (nougat.ucs.ed.ac.uk [129.215.13.205]) by ietfa.amsl.com (Postfix) with ESMTP id D895521E8174; Thu, 14 Nov 2013 01:44:38 -0800 (PST)
Received: from crunchie.inf.ed.ac.uk (crunchie.inf.ed.ac.uk [129.215.33.180]) by nougat.ucs.ed.ac.uk (8.13.8/8.13.4) with ESMTP id rAE9i3tL009868; Thu, 14 Nov 2013 09:44:03 GMT
Received: from troutbeck.inf.ed.ac.uk (troutbeck.inf.ed.ac.uk [129.215.25.32]) by crunchie.inf.ed.ac.uk (8.14.4/8.14.4) with ESMTP id rAE9i1td016584; Thu, 14 Nov 2013 09:44:01 GMT
Received: from troutbeck.inf.ed.ac.uk (localhost [127.0.0.1]) by troutbeck.inf.ed.ac.uk (8.14.4/8.14.4) with ESMTP id rAE9i2FI031737; Thu, 14 Nov 2013 09:44:02 GMT
Received: (from ht@localhost) by troutbeck.inf.ed.ac.uk (8.14.4/8.14.4/Submit) id rAE9i0V8031733; Thu, 14 Nov 2013 09:44:00 GMT
X-Authentication-Warning: troutbeck.inf.ed.ac.uk: ht set sender to ht@inf.ed.ac.uk using -f
To: John Cowan <cowan@mercury.ccil.org>
References: <AA45B3C6-1DC5-4B1E-8045-C9FE76022584@vpnc.org> <CEA92854.2CC53%jhildebr@cisco.com> <20131113224737.GI31823@mercury.ccil.org>
From: ht@inf.ed.ac.uk
Date: Thu, 14 Nov 2013 09:44:00 +0000
In-Reply-To: <20131113224737.GI31823@mercury.ccil.org> (John Cowan's message of "Wed\, 13 Nov 2013 17\:47\:38 -0500")
Message-ID: <f5bob5n71y7.fsf@troutbeck.inf.ed.ac.uk>
User-Agent: Gnus/5.101 (Gnus v5.10.10) XEmacs/21.5-b33 (linux)
MIME-Version: 1.0
Content-Type: text/plain; charset="us-ascii"
X-Edinburgh-Scanned: at nougat.ucs.ed.ac.uk with MIMEDefang 2.60, Sophie, Sophos Anti-Virus, Clam AntiVirus
X-Scanned-By: MIMEDefang 2.60 on 129.215.13.205
X-Mailman-Approved-At: Thu, 14 Nov 2013 07:37:06 -0800
Cc: IETF Discussion <ietf@ietf.org>, Paul Hoffman <paul.hoffman@vpnc.org>, JSON WG <json@ietf.org>, "Joe Hildebrand (jhildebr)" <jhildebr@cisco.com>, Anne van Kesteren <annevk@annevk.nl>, "www-tag@w3.org" <www-tag@w3.org>, es-discuss <es-discuss@mozilla.org>
Subject: Re: [Json] JSON: remove gap between Ecma-404 and IETF draft
X-BeenThere: json@ietf.org
X-Mailman-Version: 2.1.12
Precedence: list
List-Id: "JavaScript Object Notation \(JSON\) WG mailing list" <json.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/json>, <mailto:json-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/json>
List-Post: <mailto:json@ietf.org>
List-Help: <mailto:json-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/json>, <mailto:json-request@ietf.org?subject=subscribe>
X-List-Received-Date: Thu, 14 Nov 2013 09:44:44 -0000

John Cowan writes:

> Joe Hildebrand (jhildebr) scripsit:
>
>> If 404 doesn't allow [a BOM], I don't see a strong need to add it.
>> Parsers can always be more forgiving of what they will parse than what
>> the spec says, particularly since section 9 says "A JSON parser MAY
>> accept non-JSON forms or extensions".
>
> It's not clear that 404 disallows it, since 404 is defined in terms of
> characters, and a BOM is not a character but an out-of-band signal.

I think this is a crucial observation.  I note that XML approaches
this problem in what might be a useful way.  The XML ABNF makes no
mention of BOM, it's not part of any XML document as such.  But it
_is_ allowed.  The relevant wording [1] is:

  Entities ... may begin with the Byte Order Mark described by Annex H
  of [ISO/IEC 10646:2000], section 16.8 of [Unicode] (the ZERO WIDTH
  NO-BREAK SPACE character, #xFEFF). _This is an encoding signature,_
  _not part of either the markup or the character data of the XML_
  _document._ XML processors must be able to use this character to
  differentiate between UTF-8 and UTF-16 encoded documents. [emphasis
  added]

ht

[1] http://www.w3.org/TR/REC-xml/#charencoding
-- 
       Henry S. Thompson, School of Informatics, University of Edinburgh
      10 Crichton Street, Edinburgh EH8 9AB, SCOTLAND -- (44) 131 650-4440
                Fax: (44) 131 650-4587, e-mail: ht@inf.ed.ac.uk
                       URL: http://www.ltg.ed.ac.uk/~ht/
 [mail from me _always_ has a .sig like this -- mail without it is forged spam]