Re: [Json] Proposal: the minimal edit

Carsten Bormann <cabo@tzi.org> Tue, 25 June 2013 12:57 UTC

Return-Path: <cabo@tzi.org>
X-Original-To: json@ietfa.amsl.com
Delivered-To: json@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 56EC421F8D90 for <json@ietfa.amsl.com>; Tue, 25 Jun 2013 05:57:58 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -106.249
X-Spam-Level:
X-Spam-Status: No, score=-106.249 tagged_above=-999 required=5 tests=[AWL=-0.000, BAYES_00=-2.599, HELO_EQ_DE=0.35, RCVD_IN_DNSWL_MED=-4, USER_IN_WHITELIST=-100]
Received: from mail.ietf.org ([12.22.58.30]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id EPSl0zKWCo8I for <json@ietfa.amsl.com>; Tue, 25 Jun 2013 05:57:52 -0700 (PDT)
Received: from informatik.uni-bremen.de (mailhost.informatik.uni-bremen.de [IPv6:2001:638:708:30c9::12]) by ietfa.amsl.com (Postfix) with ESMTP id ECB5A21F9C10 for <json@ietf.org>; Tue, 25 Jun 2013 05:57:20 -0700 (PDT)
X-Virus-Scanned: amavisd-new at informatik.uni-bremen.de
Received: from smtp-fb3.informatik.uni-bremen.de (smtp-fb3.informatik.uni-bremen.de [134.102.224.120]) by informatik.uni-bremen.de (8.14.4/8.14.4) with ESMTP id r5PCvHxn024902; Tue, 25 Jun 2013 14:57:17 +0200 (CEST)
Received: from eduroam-pool7-0058.wlan.uni-bremen.de (eduroam-pool7-0058.wlan.uni-bremen.de [134.102.112.58]) (using TLSv1 with cipher AES128-SHA (128/128 bits)) (No client certificate requested) by smtp-fb3.informatik.uni-bremen.de (Postfix) with ESMTPSA id 9CC663796; Tue, 25 Jun 2013 14:57:17 +0200 (CEST)
Mime-Version: 1.0 (Mac OS X Mail 6.5 \(1508\))
Content-Type: text/plain; charset="iso-8859-1"
From: Carsten Bormann <cabo@tzi.org>
In-Reply-To: <20130625125111.GC26875@mercury.ccil.org>
Date: Tue, 25 Jun 2013 14:57:17 +0200
Content-Transfer-Encoding: quoted-printable
Message-Id: <83630ACB-939E-4461-94E9-62C41C9E7908@tzi.org>
References: <20130625032014.GF14060@mercury.ccil.org> <ECAD5568-05F1-44C0-A8E1-6A38DAFAF6D4@vpnc.org> <C75D5BC5-6D5F-4CC0-8ACA-9717E7607DE4@tzi.org> <CAHBU6itPYpcof20YOAhgT82VnMAfgtpH2TaSDX+OHpu__WYjGg@mail.gmail.com> <CFD38719-3EB9-448C-A712-9B0E0B592D30@tzi.org> <20130625125111.GC26875@mercury.ccil.org>
To: John Cowan <cowan@mercury.ccil.org>
X-Mailer: Apple Mail (2.1508)
Cc: "json@ietf.org WG" <json@ietf.org>
Subject: Re: [Json] Proposal: the minimal edit
X-BeenThere: json@ietf.org
X-Mailman-Version: 2.1.12
Precedence: list
List-Id: "JavaScript Object Notation \(JSON\) WG mailing list" <json.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/json>, <mailto:json-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/json>
List-Post: <mailto:json@ietf.org>
List-Help: <mailto:json-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/json>, <mailto:json-request@ietf.org?subject=subscribe>
X-List-Received-Date: Tue, 25 Jun 2013 12:57:58 -0000

On Jun 25, 2013, at 14:51, John Cowan <cowan@mercury.ccil.org> wrote:

> Carsten Bormann scripsit:
> 
>> Try writing the ABNF for surrogate pair escapes and you'll see what I mean.
>> (Sure, it *can* be done.  Would you want to?)
> 
> Sure, why not?  You just change
> 
>    unescaped = %x20-21 / %x23-5B / %x5D-10FFFF
> 
> to
> 
>     unescaped = %x20-21 / %x23-5B / %x5D-D7FF / %xE000-10FFFF

You also have to fix the "\u" 4HEXDIG part (if you don't, it seems that ill-formed Unicode is allowed in escape sequences and not in native coding).
This gets ugly quickly.
Even uglier if you want to exclude the non-character code points as well.

Remember that this was written at a time when JSON was still vying for its space.
The appearance of complexity needed to be avoided.
So why encode things in the ABNF that already are handled by Unicode?

(Again, this is all about spec exegesis, not about reality.)

Grüße, Carsten