Re: [Json] Proposal: the minimal edit

Paul Hoffman <paul.hoffman@vpnc.org> Tue, 25 June 2013 02:11 UTC

Return-Path: <paul.hoffman@vpnc.org>
X-Original-To: json@ietfa.amsl.com
Delivered-To: json@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 74EA221F9A3C for <json@ietfa.amsl.com>; Mon, 24 Jun 2013 19:11:54 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -102.515
X-Spam-Level:
X-Spam-Status: No, score=-102.515 tagged_above=-999 required=5 tests=[AWL=0.084, BAYES_00=-2.599, USER_IN_WHITELIST=-100]
Received: from mail.ietf.org ([12.22.58.30]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id o60gt7d2v1Nj for <json@ietfa.amsl.com>; Mon, 24 Jun 2013 19:11:54 -0700 (PDT)
Received: from hoffman.proper.com (IPv6.Hoffman.Proper.COM [IPv6:2605:8e00:100:41::81]) by ietfa.amsl.com (Postfix) with ESMTP id E4F5921F9A16 for <json@ietf.org>; Mon, 24 Jun 2013 19:11:53 -0700 (PDT)
Received: from [10.20.30.90] (50-0-66-165.dsl.dynamic.sonic.net [50.0.66.165]) (authenticated bits=0) by hoffman.proper.com (8.14.5/8.14.5) with ESMTP id r5P2Bqhd096436 (version=TLSv1/SSLv3 cipher=AES128-SHA bits=128 verify=NO); Mon, 24 Jun 2013 19:11:53 -0700 (MST) (envelope-from paul.hoffman@vpnc.org)
Content-Type: text/plain; charset="us-ascii"
Mime-Version: 1.0 (Mac OS X Mail 6.5 \(1508\))
From: Paul Hoffman <paul.hoffman@vpnc.org>
In-Reply-To: <20130625011904.GJ19899@mercury.ccil.org>
Date: Mon, 24 Jun 2013 19:11:52 -0700
Content-Transfer-Encoding: quoted-printable
Message-Id: <5FF84B9F-F637-48AC-921F-173F3370EF17@vpnc.org>
References: <CAChr6SyN4Z3Hh8OFGHkK+AJN0+S09wMfjeobZ51GjKNL+GhPsw@mail.gmail.com> <CAHBU6ivhoUM9cfUnc1YfnyDdQnWQ=Mj10cSoYn0qouMQ0F94XA@mail.gmail.com> <CAChr6SyQDjik_BTojXdw3G7_B=W5iZXksuM15VYwGJqr8WHdhw@mail.gmail.com> <CAChr6SwbFfR5UQuU2ceJhDeGAhv5Zy0dKA3szzO_KGfjA7fx5Q@mail.gmail.com> <B4858680-D319-4603-A1C3-D6A84195B300@vpnc.org> <CAHBU6ivUL6YvtMiajQwfftNgrRJk4dqiFs1yowoxfLi5wh2_hQ@mail.gmail.com> <20130624233521.GF19899@mercury.ccil.org> <3D442E5F-F9ED-47DB-9E1D-29ACA8588717@vpnc.org> <20130625011904.GJ19899@mercury.ccil.org>
To: John Cowan <cowan@mercury.ccil.org>
X-Mailer: Apple Mail (2.1508)
Cc: "json@ietf.org WG" <json@ietf.org>
Subject: Re: [Json] Proposal: the minimal edit
X-BeenThere: json@ietf.org
X-Mailman-Version: 2.1.12
Precedence: list
List-Id: "JavaScript Object Notation \(JSON\) WG mailing list" <json.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/json>, <mailto:json-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/json>
List-Post: <mailto:json@ietf.org>
List-Help: <mailto:json-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/json>, <mailto:json-request@ietf.org?subject=subscribe>
X-List-Received-Date: Tue, 25 Jun 2013 02:11:54 -0000

<no hat>

On Jun 24, 2013, at 6:19 PM, John Cowan <cowan@mercury.ccil.org> wrote:

> Paul Hoffman scripsit:
> 
>> I'm not sure what you mean by "exclude". The sentence Tim was referring
>> to is about inclusion:
>> 
>>  - ECMAScript implementations can generate and consume code points
>>    in JSON strings that are not Unicode characters.
> 
> There is no point in putting this item into a list of differences
> unless it is not true of RFC-JSON.

Agree.

>  If this language is to be added, it
> follows that RFC-JSON excludes certain code points that are not Unicode
> characters.  

There are many people in the WG who believe that is the case.

> But exactly which code points?  

Why is that question important? As long as there is a difference, why does there have to be a specific list for this purpose?

> As worded, presumably
> all of them: the surrogate code points, the non-character code points
> (a term of art in Unicode; it does not mean "code points that are not
> characters", but refers to a specific list of 66 code points), and the
> unassigned (as of some particular Unicode version) code points.

Errr, why make that presumption? The sentence stands on its own, without presumptions. Rob's suggestion to make this all simple is the basis for this thread.

> Tim's proposed revision replaced the last five words with "that can never
> be Unicode characters", thus excluding from RFC-JSON only the surrogate
> code points and the non-character code points, but not the unassigned
> code points.  As it turns out, this is indeed what he meant to say.

The sentence above is more general. If it is still true, I thought it would probably be supported by more people who wanted simplicity in the -bis document.

> My preference would be to exclude only the surrogates, not the
> non-characters, which is yet a third position.  My suggested wording
> would be simply "that are surrogates".

Why are you wanting to exclude anything? This section is a list of differences. Do you believe you know exactly which non-characters are allowed by ECMAScript? I ask because there were differing opinions earlier.

--Paul Hoffman