Re: [Json] Proposal for strings/Unicode text

Paul Hoffman <paul.hoffman@vpnc.org> Mon, 17 June 2013 14:43 UTC

On Jun 17, 2013, at 7:32 AM, John Cowan <cowan@mercury.ccil.org> wrote:

> Paul Hoffman scripsit:
> 
>>> Where's the contradiction?  The prose says exactly what the ABNF says.
>>> %22 (quotation mark), %5C (backslash), and %D800 through %DFFF (surrogate
>>> pairs) are left out of the full Unicode range.
>> 
>> U+0000-0019 is not the entire list of Unicode control characters? U+0022
>> is not the only Unicode quotation mark.
> 
> I assume by "19" you mean "1F".  

Yes. :-)

> If you want to say "C0 control
> characters", fine.  

That would work for me. But I suspect that some people will say "no, all control characters".
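
To make the gap between the two readings concrete, here is a minimal Python sketch. It assumes "all control characters" means every code point with Unicode general category Cc, and "C0 control characters" means U+0000 through U+001F; the names C0_CONTROLS, ALL_CONTROLS and NOT_C0 are made up for this illustration.

```python
import unicodedata

# C0 controls: U+0000 through U+001F (the range the JSON string grammar excludes).
C0_CONTROLS = {chr(cp) for cp in range(0x00, 0x20)}

# Assumption: "all control characters" = general category Cc ("Control").
ALL_CONTROLS = {
    chr(cp)
    for cp in range(0x110000)
    if unicodedata.category(chr(cp)) == "Cc"
}

# Control characters that a reader of "all control characters" would
# include but that fall outside the C0 range.
NOT_C0 = sorted(ALL_CONTROLS - C0_CONTROLS)

if __name__ == "__main__":
    print(len(C0_CONTROLS), len(ALL_CONTROLS))
    print([f"U+{ord(ch):04X}" for ch in NOT_C0])
```

Under that assumption, the difference is DELETE (U+007F) plus the C1 range U+0080 through U+009F, which is exactly where the two readings of the comment diverge.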

> But there is only one Unicode character named
> "QUOTATION MARK", and it is the only one relevant to JSON.

Then we should say that. Or, better, not have a comment that is open to misinterpretation.

>> Did we not learn anything from the IDN WG work 13 years ago?
> 
> I don't see what that has to do with the price of pídàn in China.

An excellent accidental analogy! I was referring to the fact that "everyone" understood that the "dot" between DNS labels was U+002E... except the Japanese. And we had to put a complicated exception in for them, late in the process.

> Other quotation marks and control characters need not be escaped in JSON.

If everyone agrees with that, then we do not need the comment at all. But I suspect that "everyone" does not.
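
As a data point only, here is a rough Python sketch of how one existing implementation treats these characters. It uses the standard-library json module with ensure_ascii=False; the helper needs_escape is a hypothetical name for this illustration, and the behavior of one library says nothing about what the specification text should read.

```python
import json


def needs_escape(ch: str) -> bool:
    """Return True if json.dumps escapes ch when ensure_ascii is off.

    Hypothetical helper: compares the serialized form with the
    character simply wrapped in U+0022 quotation marks.
    """
    encoded = json.dumps(ch, ensure_ascii=False)
    return encoded != '"' + ch + '"'


if __name__ == "__main__":
    samples = {
        "QUOTATION MARK U+0022": '"',
        "REVERSE SOLIDUS U+005C": "\\",
        "C0 control U+001F": "\x1f",
        "DELETE U+007F": "\x7f",
        "C1 control U+0085": "\x85",
        "LEFT DOUBLE QUOTATION MARK U+201C": "\u201c",
    }
    for name, ch in samples.items():
        print(name, needs_escape(ch))
```

In this implementation only U+0022, U+005C and the C0 range get escaped; U+007F, the C1 controls and the other quotation marks pass through unescaped and parse back unchanged.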

--Paul Hoffman