Re: [Json] Proposal for strings/Unicode text

John Cowan <cowan@mercury.ccil.org> Mon, 17 June 2013 14:32 UTC

Date: Mon, 17 Jun 2013 10:32:32 -0400
From: John Cowan <cowan@mercury.ccil.org>
To: Paul Hoffman <paul.hoffman@vpnc.org>
Message-ID: <20130617143232.GC23594@mercury.ccil.org>
References: <20130613121620.GB11739@mercury.ccil.org> <A723FC6ECC552A4D8C8249D9E07425A70FC47B42@xmb-rcd-x10.cisco.com> <255B9BB34FB7D647A506DC292726F6E1151B931064@WSMSG3153V.srv.dir.telstra.com> <9DC91218-B255-4CA2-8337-41F7135A90BD@vpnc.org> <20130617141409.GB23594@mercury.ccil.org> <7CF56357-6A86-4BAA-BFD8-225019A11980@vpnc.org>
MIME-Version: 1.0
Content-Type: text/plain; charset=iso-8859-1
Content-Disposition: inline
Content-Transfer-Encoding: 8bit
In-Reply-To: <7CF56357-6A86-4BAA-BFD8-225019A11980@vpnc.org>
User-Agent: Mutt/1.5.20 (2009-06-14)
Sender: John Cowan <cowan@ccil.org>
Cc: json@ietf.org
Subject: Re: [Json] Proposal for strings/Unicode text

Paul Hoffman scripsit:

> > Where's the contradiction?  The prose says exactly what the ABNF says.
> > %22 (quotation mark), %5C (backslash), and %D800 through %DFFF (surrogate
> > pairs) are left out of the full Unicode range.
> 
> U+0000-0019 is not the entire list of Unicode control characters? U+0022
> is not the only Unicode quotation mark.

I assume by "19" you mean "1F".  If you want to say "C0 control
characters", fine.  But there is only one Unicode character named
"QUOTATION MARK", and it is the only one relevant to JSON.

> Did we not learn anything from the IDN WG work 13 years ago?

I don't see what that has to do with the price of pídàn in China.
Other quotation marks and control characters need not be escaped in JSON.
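
To make the point concrete, here is a minimal sketch of the escaping rule as described above: only U+0022, U+005C, and the C0 controls U+0000 through U+001F must be escaped inside a JSON string. It assumes Python's standard json module in its default strict mode as a stand-in for a conforming parser; the helper names must_escape and parses_raw are hypothetical, introduced only for this illustration.

```python
import json


def must_escape(ch: str) -> bool:
    """True if ch may not appear unescaped inside a JSON string.

    Encodes the rule under discussion: QUOTATION MARK (U+0022),
    REVERSE SOLIDUS (U+005C), and the C0 controls U+0000..U+001F.
    """
    return ch == '"' or ch == "\\" or ord(ch) <= 0x1F


def parses_raw(ch: str) -> bool:
    """Check the rule empirically: embed ch unescaped between two
    quotation marks and see whether json.loads (strict mode, the
    default) returns exactly that character."""
    try:
        return json.loads('"' + ch + '"') == ch
    except ValueError:  # json.JSONDecodeError is a subclass of ValueError
        return False


if __name__ == "__main__":
    samples = {
        "QUOTATION MARK": '"',
        "REVERSE SOLIDUS": "\\",
        "C0 control U+0001": "\u0001",
        "LEFT DOUBLE QUOTATION MARK": "\u201c",
        "DELETE U+007F": "\u007f",
        "C1 control U+0085": "\u0085",
    }
    for name, ch in samples.items():
        print(f"{name}: must_escape={must_escape(ch)}, parses_raw={parses_raw(ch)}")
```

Under these assumptions the two helpers should disagree on every sample: a character that must be escaped fails to parse raw, and every other quotation mark or control character (U+201C, U+007F, U+0085) goes through unescaped.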

-- 
John Cowan  cowan@ccil.org   http://ccil.org/~cowan
"The exception proves the rule."  Dimbulbs think: "Your counterexample proves
my theory."  Latin students think "'Probat' means 'tests': the exception puts
the rule to the proof."  But legal historians know it means "Evidence for an
exception is evidence of the existence of a rule in cases not excepted from."