Re: [Json] On characters and code points

Tim Bray <tbray@textuality.com> Fri, 07 June 2013 16:19 UTC

Return-Path: <tbray@textuality.com>
X-Original-To: json@ietfa.amsl.com
Delivered-To: json@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 7B00E21F95DD for <json@ietfa.amsl.com>; Fri, 7 Jun 2013 09:19:48 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.157
X-Spam-Level:
X-Spam-Status: No, score=-1.157 tagged_above=-999 required=5 tests=[AWL=1.819, BAYES_00=-2.599, FM_FORGED_GMAIL=0.622, HTML_MESSAGE=0.001, RCVD_IN_DNSWL_LOW=-1]
Received: from mail.ietf.org ([12.22.58.30]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id w6yQvoT1vDUN for <json@ietfa.amsl.com>; Fri, 7 Jun 2013 09:19:42 -0700 (PDT)
Received: from mail-vc0-f182.google.com (mail-vc0-f182.google.com [209.85.220.182]) by ietfa.amsl.com (Postfix) with ESMTP id 825DA21F9302 for <json@ietf.org>; Fri, 7 Jun 2013 09:19:42 -0700 (PDT)
Received: by mail-vc0-f182.google.com with SMTP id gf12so2925564vcb.13 for <json@ietf.org>; Fri, 07 Jun 2013 09:19:41 -0700 (PDT)
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20120113; h=mime-version:x-originating-ip:in-reply-to:references:date :message-id:subject:from:to:cc:content-type:x-gm-message-state; bh=/OO3t5nEI9BvHDUl9ESGAmOYDFEi5592QMfCRY0V/qQ=; b=dOq1/1VanzcX3KcPWtgYxI5jhrYmVEKfV9x6DaeKVQ6We4FCrooNtrjSqBElvgxgtW eZiRgntBg8RRUvnuS3DUFHlBUpT1MH+0aTzb7hUKjQqfPz4U6Ng7TmaZsIRhHNbog4QD BxLtWm9KnkzfZu2yovpnZpeKl/UvhH5FwhjFZ2qa2FPo0aS8UkAiB/N1KTSH8rOaLzYb 2DyHyFU29WwZVwvfsYxQQQLa6pAAqFW9SBCxhbKnNgLbzqJYcxORo7wAm9ccNQUHvehF towo+ySihcORbXyTa069nl/al65YXSH72ydRqstBS2+MS4hTemswk8rgUqcjOFGm+/h8 ejUA==
MIME-Version: 1.0
X-Received: by 10.58.88.4 with SMTP id bc4mr1186105veb.48.1370621981869; Fri, 07 Jun 2013 09:19:41 -0700 (PDT)
Received: by 10.220.48.14 with HTTP; Fri, 7 Jun 2013 09:19:41 -0700 (PDT)
X-Originating-IP: [24.84.235.32]
In-Reply-To: <51B20731.3040300@drees.name>
References: <A723FC6ECC552A4D8C8249D9E07425A70FC2E7E1@xmb-rcd-x10.cisco.com> <51B06F38.8050707@crockford.com> <CAHBU6iuFBuW-RfgBLQF5q4BnUOzs088QXW3uOQG1OjBFjZttkw@mail.gmail.com> <51B1B4E7.8090101@it.aoyama.ac.jp> <9ld3r8pc0tufif18dohb2fmi0ijna1vs4n@hive.bjoern.hoehrmann.de> <56A163E9-E7CD-46B3-9984-8F009EBFF500@vpnc.org> <CAHBU6ivG=ONc8roT7W=LdpKYNMqRH_d5BobZ=pHnk=mVaKZKaA@mail.gmail.com> <51B20731.3040300@drees.name>
Date: Fri, 07 Jun 2013 09:19:41 -0700
Message-ID: <CAHBU6iufTsLoBoeFxT4pHSGAUi8H-wUFQYj1VcVQu1K_QCdhww@mail.gmail.com>
From: Tim Bray <tbray@textuality.com>
To: stefan@drees.name
Content-Type: multipart/alternative; boundary="047d7b33920759e2bd04de92cca2"
X-Gm-Message-State: ALoCoQkzzxN0pnWzZSU6xKxPcNhPmHEpCWb7GKO9Nn02M87SxfKqpugUTwh7C0dkpixLgmzNBrcp
Cc: Paul Hoffman <paul.hoffman@vpnc.org>, "json@ietf.org" <json@ietf.org>
Subject: Re: [Json] On characters and code points
X-BeenThere: json@ietf.org
X-Mailman-Version: 2.1.12
Precedence: list
List-Id: "JavaScript Object Notation \(JSON\) WG mailing list" <json.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/json>, <mailto:json-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/json>
List-Post: <mailto:json@ietf.org>
List-Help: <mailto:json-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/json>, <mailto:json-request@ietf.org?subject=subscribe>
X-List-Received-Date: Fri, 07 Jun 2013 16:19:48 -0000

On Fri, Jun 7, 2013 at 9:15 AM, Stefan Drees <stefan@drees.name> wrote:

> and what about { "Decorate my slash": "\/" } and "general-purpose string
> processing software". Isn't this also a case, where you need a
> "pre-conditioner" that replaces the JSON specific escape sequence "\" with
> "/" before feeding it into "general-purpose string processing software" :-?)


Red herring.  JSON, just like XML, has ways to encode characters that are
hard to type.  What we're arguing about is the actual content of the
payload after the parser/pre-conditioner.  -T


>
>
>
>  Also, consider the lovely consequences when unpaired surrogates start
>> showing  up in key fields and are fed to hash functions in every
>> programming language in the world, which expect to receive Unicode
>> characters.
>>   -T
>>
>>
> For today I better not imagine all these laguages and implementations
> blindly stuffing some json text transformed into their own memory
> structures ... maybe later during the weekend
>
>
>      ...
>>
>
> Stefan.
>
>
>
>