Re: [Json] Call for Consensus: Proposed Text for "8.1 Character Encoding"

Carsten Bormann <cabo@tzi.org> Tue, 14 March 2017 06:50 UTC

From: Carsten Bormann <cabo@tzi.org>
In-Reply-To: <9c0dee1b-341c-e783-a30b-1afeb841e693@gmx.de>
Date: Tue, 14 Mar 2017 07:50:10 +0100
References: <1fb5849e-8dbf-835d-65b7-2403686248f9@outer-planes.net> <b3cb2651-2d9f-d68d-2191-814e8dd5f5e2@gmx.de> <4B0A7371-9D85-4BEF-BC3C-14175E563178@tzi.org> <98ba10a0-6e44-9ff0-5993-f7ec9c66d74b@gmx.de> <E30CE52F-CE3E-4888-99D8-58899D3652EB@tzi.org> <CAHBU6ivb1meRgGZ8QPcicQY7awq1FSVCUNB2zkXGq2WJ6bsspQ@mail.gmail.com> <9c0dee1b-341c-e783-a30b-1afeb841e693@gmx.de>
To: Julian Reschke <julian.reschke@gmx.de>
Archived-At: <https://mailarchive.ietf.org/arch/msg/json/moGH12VibMciKzlMSmpMyo1EnZ0>
Cc: "Matthew A. Miller" <linuxwolf+ietf@outer-planes.net>, draft-ietf-jsonbis-rfc7159bis.all@ietf.org, Tim Bray <tbray@textuality.com>, "json@ietf.org" <json@ietf.org>
Subject: Re: [Json] Call for Consensus: Proposed Text for "8.1 Character Encoding"

On 14 Mar 2017, at 07:44, Julian Reschke <julian.reschke@gmx.de> wrote:
> 
> "Character encoding detection can be done based on the fact that the first character is always US-ASCII, so the UTF-16/32 variants can be detected by inspecting the first octets for zeros."

Prefix this with “If UTF-16 and UTF-32 forms need to be supported,”, and I’m happy.
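
For illustration only, the zero-octet check described in the quoted sentence could be sketched roughly as follows. This is not proposed draft text. The function name detect_json_encoding is hypothetical, and the sketch assumes that the input has no byte order mark, that the first character is US-ASCII as the quoted sentence states, and that no raw U+0000 appears in the text.

```python
def detect_json_encoding(data: bytes) -> str:
    """Guess the Unicode encoding form of a JSON text from its first octets.

    Assumptions (not taken from the draft): no byte order mark is present,
    the first character is US-ASCII, and the text holds no raw U+0000.
    """
    head = data[:4]

    # UTF-32: one ASCII character occupies four octets, three of them zero.
    if len(head) == 4:
        if head[0] == 0 and head[1] == 0 and head[2] == 0 and head[3] != 0:
            return "utf-32-be"
        if head[0] != 0 and head[1] == 0 and head[2] == 0 and head[3] == 0:
            return "utf-32-le"

    # UTF-16: one ASCII character occupies two octets, one of them zero.
    if len(head) >= 2:
        if head[0] == 0 and head[1] != 0:
            return "utf-16-be"
        if head[0] != 0 and head[1] == 0:
            return "utf-16-le"

    # No zero octet at the start: treat the text as UTF-8.
    return "utf-8"


if __name__ == "__main__":
    sample = '{"a": 1}'
    for name in ("utf-8", "utf-16-le", "utf-16-be", "utf-32-le", "utf-32-be"):
        print(name, detect_json_encoding(sample.encode(name)))
```

The UTF-32 patterns are tested before the UTF-16 ones because a UTF-32LE text also starts with a non-zero octet followed by a zero octet. An implementation that only accepts UTF-8 would not need any of this, which is the point of the suggested prefix.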

Regards, Carsten