Re: [Json] Call for Consensus: Proposed Text for "8.1 Character Encoding"

Julian Reschke <julian.reschke@gmx.de> Mon, 20 March 2017 16:46 UTC

To: "Matthew A. Miller" <linuxwolf+ietf@outer-planes.net>, "json@ietf.org" <json@ietf.org>
Cc: The IESG <iesg@ietf.org>

On 2017-03-20 17:26, Matthew A. Miller wrote:
> Thank you for the suggested changes, Julian.  To consolidate the
> changes, I believe the following is your suggested text for all of
> Section 8.1:
>
> """
> JSON text MUST be encoded in UTF-8, UTF-16, or UTF-32 Section 3 of
> [UNICODE].  The default encoding is UTF-8, and JSON texts that are

Put "Section 3 of [UNICODE]" into parentheses.

> encoded in UTF-8 are interoperable in the sense that they will be
> read successfully by the maximum number of implementations; there are
> many implementations that cannot successfully read texts in other
> encodings (such as UTF-16 and UTF-32).  Text encoded in character

Replace "in other encodings (such as UTF-16 and UTF-32)" with just "in
other encodings". The proposed text implies that there are encodings
other than UTF-16 or UTF-32 that could be allowed.

> encodings other than UTF-8, UTF-16, or UTF-32 cannot be used with
> the media type "application/json".
>
> Implementations MUST NOT add a byte order mark (U+FEFF) to the
> beginning of a JSON text.  In the interests of interoperability,
> implementations that parse JSON texts MAY ignore the presence of a
> byte order mark rather than treating it as an error.
>
> Recipients that wish to support Unicode encodings other than UTF-8
> can do this using a detection mechanism that is based on the fact
> that the first character will always have a Unicode code point less
> than or equal to 127, thus the UTF-16/32 variants can be detected by
> inspecting the first octets for nulls.
> """
>
> Does the working group object to this change?

+1, with the comments above.
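
As an aside, the detection mechanism described in the last quoted
paragraph, together with the "MAY ignore a byte order mark" rule, could
be sketched roughly as follows in Python. This is only an illustration
and not proposed spec text: the names sniff_encoding and loads_bytes
are made up for the example, and it assumes the input is a complete,
valid JSON text, so that the first character is ASCII and the second
character is never U+0000.

```python
import codecs
import json

# BOMs to tolerate on input. UTF-32 comes first because the UTF-32LE
# BOM begins with the same two octets as the UTF-16LE BOM.
_BOMS = (
    (codecs.BOM_UTF32_BE, "utf-32-be"),
    (codecs.BOM_UTF32_LE, "utf-32-le"),
    (codecs.BOM_UTF8, "utf-8"),
    (codecs.BOM_UTF16_BE, "utf-16-be"),
    (codecs.BOM_UTF16_LE, "utf-16-le"),
)


def sniff_encoding(data: bytes) -> str:
    """Guess the encoding from the null pattern in the first octets."""
    head = data[:4]
    if len(head) >= 4:
        # 00 00 00 xx: one ASCII character in UTF-32, big-endian.
        if head[:3] == b"\x00\x00\x00" and head[3] != 0:
            return "utf-32-be"
        # xx 00 00 00: one ASCII character in UTF-32, little-endian.
        if head[0] != 0 and head[1:4] == b"\x00\x00\x00":
            return "utf-32-le"
    if len(head) >= 2:
        # 00 xx: ASCII character in UTF-16, big-endian.
        if head[0] == 0 and head[1] != 0:
            return "utf-16-be"
        # xx 00: ASCII character in UTF-16, little-endian.
        if head[0] != 0 and head[1] == 0:
            return "utf-16-le"
    # No leading nulls: treat as the default encoding.
    return "utf-8"


def loads_bytes(data: bytes):
    """Decode and parse a JSON text, ignoring a leading BOM if present."""
    for bom, name in _BOMS:
        if data.startswith(bom):
            return json.loads(data[len(bom):].decode(name))
    return json.loads(data.decode(sniff_encoding(data)))
```

For example, loads_bytes('[1]'.encode('utf-16-le')) takes the "xx 00"
branch, while a UTF-8 text with a leading BOM is parsed after the three
BOM octets are dropped. A generator would still never write the BOM.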

Best regards, Julian