Re: [Json] Unpaired surrogates in JSON strings

John Cowan <cowan@mercury.ccil.org> Thu, 06 June 2013 04:29 UTC

Return-Path: <cowan@ccil.org>
X-Original-To: json@ietfa.amsl.com
Delivered-To: json@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 2B18C21F8CF4 for <json@ietfa.amsl.com>; Wed, 5 Jun 2013 21:29:28 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.949
X-Spam-Level:
X-Spam-Status: No, score=-2.949 tagged_above=-999 required=5 tests=[AWL=0.650, BAYES_00=-2.599, RCVD_IN_DNSWL_LOW=-1]
Received: from mail.ietf.org ([12.22.58.30]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id M-irxcco9jy1 for <json@ietfa.amsl.com>; Wed, 5 Jun 2013 21:29:23 -0700 (PDT)
Received: from earth.ccil.org (earth.ccil.org [192.190.237.11]) by ietfa.amsl.com (Postfix) with ESMTP id CF52521F8613 for <json@ietf.org>; Wed, 5 Jun 2013 21:29:23 -0700 (PDT)
Received: from cowan by earth.ccil.org with local (Exim 4.72) (envelope-from <cowan@ccil.org>) id 1UkRpB-0003cS-QZ; Thu, 06 Jun 2013 00:29:22 -0400
Date: Thu, 06 Jun 2013 00:29:21 -0400
From: John Cowan <cowan@mercury.ccil.org>
To: Carsten Bormann <cabo@tzi.org>
Message-ID: <20130606042921.GC1362@mercury.ccil.org>
References: <A723FC6ECC552A4D8C8249D9E07425A70FC2C12D@xmb-rcd-x10.cisco.com> <83728898-9A2D-4758-9C06-1157E2954CCB@vpnc.org> <1A39ACCF-4B4D-4F69-9D97-ECEF1BB41D47@tzi.org>
MIME-Version: 1.0
Content-Type: text/plain; charset="us-ascii"
Content-Disposition: inline
In-Reply-To: <1A39ACCF-4B4D-4F69-9D97-ECEF1BB41D47@tzi.org>
User-Agent: Mutt/1.5.20 (2009-06-14)
Sender: John Cowan <cowan@ccil.org>
Cc: Paul Hoffman <paul.hoffman@vpnc.org>, "json@ietf.org" <json@ietf.org>
Subject: Re: [Json] Unpaired surrogates in JSON strings
X-BeenThere: json@ietf.org
X-Mailman-Version: 2.1.12
Precedence: list
List-Id: "JavaScript Object Notation \(JSON\) WG mailing list" <json.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/json>, <mailto:json-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/json>
List-Post: <mailto:json@ietf.org>
List-Help: <mailto:json-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/json>, <mailto:json-request@ietf.org?subject=subscribe>
X-List-Received-Date: Thu, 06 Jun 2013 04:29:28 -0000

Carsten Bormann scripsit:

> Code points can refer to those of the characters or those of the code
> units (byte for UTF-8, etc.).

Code points are (mathematical) integers corresponding to Unicode
characters, though not all of them are assigned to characters.

Code *units* are 8-bit, 16-bit, or 32-bit values corresponding to the
components of a particular encoding.  A single code point is represented
by one or more code units.

-- 
Go, and never darken my towels again!           John Cowan
        --Rufus T. Firefly                      http://ccil.org/~cowan