Re: [Json] Call for Consensus: Proposed Text for "8.1 Character Encoding"

"Matthew A. Miller" <linuxwolf+ietf@outer-planes.net> Wed, 10 May 2017 17:13 UTC

Sender: Matthew Miller <linuxwolf@outer-planes.net>
To: Pete Cordell <petejson@codalogic.com>, Julian Reschke <julian.reschke@gmx.de>, "json@ietf.org" <json@ietf.org>
References: <e69d7c21-85cb-45f4-c0c2-34c624e63049@outer-planes.net> <1e94516c-9c82-8b0e-0d2d-7dbaa83b21bd@outer-planes.net> <40e3207f-e047-c898-1f0c-4422de1d597a@it.aoyama.ac.jp> <1b3ec14a-927a-8d46-e3d3-9807a9588437@outer-planes.net> <CAHBU6ivsq8+Z=MMkUH+=Q0uwc5NCtaJLYw5cp0Qg8eX2hQQ6sA@mail.gmail.com> <b74cb31b-8e04-17d0-548a-fc164ce07c05@outer-planes.net> <20170417175627.GK23461@localhost> <10B651F1-7FE0-484D-BD2E-FD146BC5FB04@tzi.org> <eabbccb0-8d15-d595-7cd0-37acc0621c57@it.aoyama.ac.jp> <6eb23f90-6623-7888-bc1c-6640a9dababc@codalogic.com> <61bfad2b-850d-a11f-e80b-d5ed9ccb4dc9@codalogic.com> <08a88696-65ef-da05-0d77-1a07d04ebfc8@outer-planes.net> <bb9fead6-23e7-8c1d-bc80-b60c81c4b89a@codalogic.com> <6f047d01-ad72-59ab-9d34-20a8177ab3af@outer-planes.net> <be4d9f12-a4be-3723-e52a-56a60722a75f@gmx.de> <a3805f67-620b-67f0-9c06-c865b71029e7@codalogic.com> <bb1ef6a8-506c-344b-b903-980ed50ad2d3@gmx.de> <44b4523a-5e4b-ccad-af96-931d8b9ad1c2@codalogic.com>
From: "Matthew A. Miller" <linuxwolf+ietf@outer-planes.net>
Message-ID: <ac1d1b68-67e7-c19f-a556-280df73f465b@outer-planes.net>
Date: Wed, 10 May 2017 11:13:06 -0600
User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.12; rv:53.0) Gecko/20100101 Thunderbird/53.0
MIME-Version: 1.0
In-Reply-To: <44b4523a-5e4b-ccad-af96-931d8b9ad1c2@codalogic.com>
Content-Type: multipart/signed; micalg="pgp-sha512"; protocol="application/pgp-signature"; boundary="QqjApumpEovQe0FCdGwlDUDuAj7OcM16T"
Archived-At: <https://mailarchive.ietf.org/arch/msg/json/Bu10Fq9l_kZJY8ecBnacX4EK0OU>
Subject: Re: [Json] Call for Consensus: Proposed Text for "8.1 Character Encoding"
Precedence: list

On 5/10/17 8:04 AM, Pete Cordell wrote:
> On 10/05/2017 14:08, Julian Reschke wrote:
>> I believe we should have separate names for JSON represented as a
>> sequence of characters (such as in a string variable in a programming
>> language) and for a JSON-shaped octet sequence inside a
>> "application/json"-typed (HTTP) message. For the latter, enforcing UTF-8
>> IMHO is attractive.
>>
>> I think it's ok for the spec to talk about both, but it really needs to
>> be clear what we are talking about in each section.
> 
> It's an interesting thought on JSON in programs.  It would be strange to
> be able to say it was not valid JSON if it was encoded in a string
> inside a Shift-JIS encoded Ruby program for example.
> 
> My ISO layers are rusty, but it looks like we can talk about JSON
> character sequences somewhere above the transport layer, and JSON
> encoded messages somewhere below the transport layer.  The latter
> possibly being transport specific.
> 
> To me it would seem discussion of "JSON" (without any further
> refinement) ought to be independent of the lower layer transport
> encoding aspects.
> 
> "application/json" would be one transport specific encoding (for which I
> think most are happy with only UTF-8).  JSON inside a Shift-JIS encoded
> Ruby program is in effect another form of transport for a JSON message.
> 
> Cheers,
> 
> Pete.
> 

I believe in essence it is within our purview to set expectations of
what transits a wire protocol.

That phrasing is broad and likely vague.  To get a more specific, I
believe it is within scope to cover instances where a media type is
specified and that media type is "application/json" (e.g. HTTP bodies),
but I think it is also within scope for this document to essentially say
"where a protocol says 'use JSON here' then it is encoded as".  I don't
believe it is within scope to dictate JSON be encoded in any particular
manner when placed into a storage medium, or when embedded within other
content (e.g., the Shift-JIS encoded Ruby program even if said program
were transmitted over a network protocol).

Assuming the Working Group finds that scope acceptable and finds UTF-8
only acceptable, here is a starting proposal for text:

"""
8.1.  Character Encoding

When transmitting over a network protocol, JSON text MUST be
encoded in UTF-8 (Section 3 of [UNICODE]).

Previous specifications of JSON have not required the use of UTF-8
when transmitting JSON text. However, the vast majority of
JSON-based software implementations have chosen to use the UTF-8
encoding, to the extent that it is the only encoding that achieves
interoperability.

Implementations MUST NOT add a byte order mark (U+FEFF) to the
beginning of a JSON text.  In the interests of interoperability,
implementations that parse JSON texts MAY ignore the presence of a
byte order mark rather than treating it as an error.
"""

If you find this acceptable, please indicate that.  Otherwise, please
provide suggested changes.

- m&m

Matthew A. Miller

Attachment: signature.asc

Re: [Json] Call for Consensus: Proposed Text for … HANSEN, TONY L
Re: [Json] Call for Consensus: Proposed Text for … Nico Williams
Re: [Json] Call for Consensus: Proposed Text for … Matthew A. Miller
Re: [Json] Call for Consensus: Proposed Text for … Carsten Bormann
Re: [Json] Call for Consensus: Proposed Text for … Martin J. Dürst
Re: [Json] Call for Consensus: Proposed Text for … Tim Bray
Re: [Json] Call for Consensus: Proposed Text for … John Cowan
Re: [Json] Call for Consensus: Proposed Text for … HANSEN, TONY L
Re: [Json] Call for Consensus: Proposed Text for … Pete Cordell
Re: [Json] Call for Consensus: Proposed Text for … Carsten Bormann
Re: [Json] Call for Consensus: Proposed Text for … Pete Cordell
Re: [Json] Call for Consensus: Proposed Text for … Martin J. Dürst
Re: [Json] Call for Consensus: Proposed Text for … Pete Cordell
Re: [Json] Call for Consensus: Proposed Text for … Matthew A. Miller
[Json] Call for Consensus: Proposed Text for "8.1… Matthew Miller
Re: [Json] Call for Consensus: Proposed Text for … Carsten Bormann
Re: [Json] Call for Consensus: Proposed Text for … Julian Reschke
Re: [Json] Call for Consensus: Proposed Text for … Nico Williams
Re: [Json] Call for Consensus: Proposed Text for … Nico Williams
Re: [Json] Call for Consensus: Proposed Text for … Carsten Bormann
Re: [Json] Call for Consensus: Proposed Text for … Julian Reschke
Re: [Json] Call for Consensus: Proposed Text for … Carsten Bormann
Re: [Json] Call for Consensus: Proposed Text for … Julian Reschke
Re: [Json] Call for Consensus: Proposed Text for … Peter Cordell
Re: [Json] Call for Consensus: Proposed Text for … Carsten Bormann
Re: [Json] Call for Consensus: Proposed Text for … Julian Reschke
Re: [Json] Call for Consensus: Proposed Text for … Tim Bray
Re: [Json] Call for Consensus: Proposed Text for … Julian Reschke
Re: [Json] Call for Consensus: Proposed Text for … Carsten Bormann
Re: [Json] Call for Consensus: Proposed Text for … Peter Cordell
Re: [Json] Call for Consensus: Proposed Text for … Carsten Bormann
Re: [Json] Call for Consensus: Proposed Text for … Martin J. Dürst
Re: [Json] Call for Consensus: Proposed Text for … Peter Cordell
Re: [Json] Call for Consensus: Proposed Text for … Paul Hoffman
Re: [Json] Call for Consensus: Proposed Text for … Julian Reschke
Re: [Json] Call for Consensus: Proposed Text for … Matthew A. Miller
Re: [Json] Call for Consensus: Proposed Text for … Julian Reschke
Re: [Json] Call for Consensus: Proposed Text for … Julian Reschke
Re: [Json] Call for Consensus: Proposed Text for … Carsten Bormann
Re: [Json] Call for Consensus: Proposed Text for … Julian Reschke
Re: [Json] Call for Consensus: Proposed Text for … Matthew A. Miller
Re: [Json] Call for Consensus: Proposed Text for … John Cowan
Re: [Json] Call for Consensus: Proposed Text for … Julian Reschke
Re: [Json] Call for Consensus: Proposed Text for … Peter Cordell
Re: [Json] Call for Consensus: Proposed Text for … Julian Reschke
Re: [Json] Call for Consensus: Proposed Text for … HANSEN, TONY L
Re: [Json] Call for Consensus: Proposed Text for … Pete Cordell
Re: [Json] Call for Consensus: Proposed Text for … Matthew A. Miller
Re: [Json] Call for Consensus: Proposed Text for … HANSEN, TONY L
Re: [Json] Call for Consensus: Proposed Text for … Julian Reschke
Re: [Json] Call for Consensus: Proposed Text for … HANSEN, TONY L
Re: [Json] Call for Consensus: Proposed Text for … John Cowan
Re: [Json] Call for Consensus: Proposed Text for … Matthew A. Miller
Re: [Json] Call for Consensus: Proposed Text for … Pete Cordell
Re: [Json] Call for Consensus: Proposed Text for … Nico Williams
Re: [Json] Call for Consensus: Proposed Text for … John Cowan
[Json] FW: Call for Consensus: Proposed Text for … Manger, James
Re: [Json] FW: Call for Consensus: Proposed Text … John Cowan
Re: [Json] FW: Call for Consensus: Proposed Text … Manger, James
Re: [Json] Call for Consensus: Proposed Text for … Martin J. Dürst
Re: [Json] Call for Consensus: Proposed Text for … Martin J. Dürst
Re: [Json] Call for Consensus: Proposed Text for … Matthew A. Miller
Re: [Json] Call for Consensus: Proposed Text for … Martin J. Dürst
Re: [Json] Call for Consensus: Proposed Text for … Tim Bray
Re: [Json] Call for Consensus: Proposed Text for … Julian Reschke
Re: [Json] Call for Consensus: Proposed Text for … Pete Cordell
Re: [Json] Call for Consensus: Proposed Text for … Matthew A. Miller
Re: [Json] Call for Consensus: Proposed Text for … Julian Reschke
Re: [Json] Call for Consensus: Proposed Text for … Pete Cordell
Re: [Json] Call for Consensus: Proposed Text for … Julian Reschke
Re: [Json] Call for Consensus: Proposed Text for … Carsten Bormann
Re: [Json] Call for Consensus: Proposed Text for … Pete Cordell
Re: [Json] Call for Consensus: Proposed Text for … Matthew A. Miller
Re: [Json] Call for Consensus: Proposed Text for … Joe Hildebrand
Re: [Json] Call for Consensus: Proposed Text for … John Cowan
Re: [Json] Call for Consensus: Proposed Text for … Pete Cordell
Re: [Json] Call for Consensus: Proposed Text for … Tim Bray
Re: [Json] Call for Consensus: Proposed Text for … Matthew A. Miller
Re: [Json] Call for Consensus: Proposed Text for … Martin J. Dürst
Re: [Json] Call for Consensus: Proposed Text for … Julian Reschke
Re: [Json] Call for Consensus: Proposed Text for … Alexey Melnikov
Re: [Json] Call for Consensus: Proposed Text for … Pete Cordell
Re: [Json] Call for Consensus: Proposed Text for … Allen Wirfs-Brock
Re: [Json] Call for Consensus: Proposed Text for … Tim Bray
Re: [Json] Call for Consensus: Proposed Text for … Julian Reschke
Re: [Json] Call for Consensus: Proposed Text for … Alexey Melnikov
Re: [Json] Call for Consensus: Proposed Text for … Pete Cordell

Re: [Json] Call for Consensus: Proposed Text for "8.1 Character Encoding"

Attachment: signature.asc