Re: [Json] Naked surrogates already banned?

Stephen Dolan <stephen.dolan@cl.cam.ac.uk> Fri, 18 October 2013 02:00 UTC

Return-Path: <stedolan@stedolan.net>
X-Original-To: json@ietfa.amsl.com
Delivered-To: json@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 1F33011E81BF for <json@ietfa.amsl.com>; Thu, 17 Oct 2013 19:00:03 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.377
X-Spam-Level:
X-Spam-Status: No, score=-2.377 tagged_above=-999 required=5 tests=[BAYES_00=-2.599, FM_FORGED_GMAIL=0.622, J_CHICKENPOX_14=0.6, RCVD_IN_DNSWL_LOW=-1]
Received: from mail.ietf.org ([12.22.58.30]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id TqxaHK+DqNIp for <json@ietfa.amsl.com>; Thu, 17 Oct 2013 18:59:56 -0700 (PDT)
Received: from mail-ie0-f172.google.com (mail-ie0-f172.google.com [209.85.223.172]) by ietfa.amsl.com (Postfix) with ESMTP id 419FF11E81A6 for <json@ietf.org>; Thu, 17 Oct 2013 18:59:55 -0700 (PDT)
Received: by mail-ie0-f172.google.com with SMTP id tp5so5440763ieb.31 for <json@ietf.org>; Thu, 17 Oct 2013 18:59:54 -0700 (PDT)
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:mime-version:sender:in-reply-to:references:date :message-id:subject:from:to:cc:content-type :content-transfer-encoding; bh=8z7bNstEhRME8qKZYMfjtaIezSoIiJiD3Plr/KJq304=; b=RdziEkxbBXlOUpBbgf9TLM/U35FTDFYyeKsX8Qqv/HXstHK7YqiHCAQ/JcP56pNAme AAtjCXefljhz+o+H06vHxUZslCkN7W0DG0KI+HB6FYXVTWCZ/zQkwLS4I2ZfLJmyzdeZ CiU8ybYSvyui1VvL61nz4HEXGR0VG7Hvd3a86ktSf4AehRN6SaTdxgpn24Wn+sUgLc1x mNTdvaLHfi4kD5+qwbLRMAYkQqsMy83PZAv7eaZYGmfXGrYJfdPXtBNSsA2EzoAEyxHs cjR18yngNprHEc1kK9W9J2WhYJyKKypelvfLrHny5dQP30sMquNQMgz2pqLiLpnuH16I 04TA==
X-Gm-Message-State: ALoCoQnABvMgiDPn7PfCAzcEAoBPtQRNrWNqAXUdl3BIytMytX6kCzq3txSqtJCaOxWc0tbOvq3J
MIME-Version: 1.0
X-Received: by 10.50.79.164 with SMTP id k4mr919713igx.10.1382061594594; Thu, 17 Oct 2013 18:59:54 -0700 (PDT)
Sender: stedolan@stedolan.net
Received: by 10.43.132.5 with HTTP; Thu, 17 Oct 2013 18:59:54 -0700 (PDT)
X-Originating-IP: [131.111.184.8]
In-Reply-To: <CAHBU6itESY9bzSZ-0VnLq-VNcPg_LbR_q-kuaVzTyLQL9tTKcw@mail.gmail.com>
References: <CAHBU6itESY9bzSZ-0VnLq-VNcPg_LbR_q-kuaVzTyLQL9tTKcw@mail.gmail.com>
Date: Fri, 18 Oct 2013 02:59:54 +0100
X-Google-Sender-Auth: lmTMYcSvzi4F5RFkO3Vh_r8UXBE
Message-ID: <CA+mHimNgVXVSwFxrLZoQZ1N+GPvhxT0NLPPCxJiDQEXamqQvJQ@mail.gmail.com>
From: Stephen Dolan <stephen.dolan@cl.cam.ac.uk>
To: Tim Bray <tbray@textuality.com>
Content-Type: text/plain; charset=windows-1252
Content-Transfer-Encoding: quoted-printable
Cc: "json@ietf.org" <json@ietf.org>
Subject: Re: [Json] Naked surrogates already banned?
X-BeenThere: json@ietf.org
X-Mailman-Version: 2.1.12
Precedence: list
List-Id: "JavaScript Object Notation \(JSON\) WG mailing list" <json.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/json>, <mailto:json-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/json>
List-Post: <mailto:json@ietf.org>
List-Help: <mailto:json-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/json>, <mailto:json-request@ietf.org?subject=subscribe>
X-List-Received-Date: Fri, 18 Oct 2013 02:00:03 -0000

On Fri, Oct 18, 2013 at 2:36 AM, Tim Bray <tbray@textuality.com> wrote:
> But U+DEAD is not a character.  So a narrow reading would make that illegal.
> Not that any implementation I’ve encountered enforces that. -T

Several do, including libjansson, Python and Perl. In fact, I suspect
that most implementations on platforms that haven't baked in UTF16 as
the One True Unicode (javascript, JVM) reject your example.

Stephen