Re: [Json] A possible summary of the discussion so far on code points and characters

Stephen Dolan <stephen.dolan@cl.cam.ac.uk> Mon, 10 June 2013 17:02 UTC

Return-Path: <stedolan@stedolan.net>
X-Original-To: json@ietfa.amsl.com
Delivered-To: json@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 9285821F9964 for <json@ietfa.amsl.com>; Mon, 10 Jun 2013 10:02:37 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.449
X-Spam-Level:
X-Spam-Status: No, score=-1.449 tagged_above=-999 required=5 tests=[AWL=1.528, BAYES_00=-2.599, FM_FORGED_GMAIL=0.622, RCVD_IN_DNSWL_LOW=-1]
Received: from mail.ietf.org ([12.22.58.30]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id NDQ6iXYfpU+I for <json@ietfa.amsl.com>; Mon, 10 Jun 2013 10:02:32 -0700 (PDT)
Received: from mail-lb0-f176.google.com (mail-lb0-f176.google.com [209.85.217.176]) by ietfa.amsl.com (Postfix) with ESMTP id 0DE5421F995F for <json@ietf.org>; Mon, 10 Jun 2013 10:02:31 -0700 (PDT)
Received: by mail-lb0-f176.google.com with SMTP id z5so6560131lbh.35 for <json@ietf.org>; Mon, 10 Jun 2013 10:02:31 -0700 (PDT)
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20120113; h=mime-version:sender:x-originating-ip:in-reply-to:references:date :x-google-sender-auth:message-id:subject:from:to:cc:content-type :content-transfer-encoding:x-gm-message-state; bh=EiZ2eFAokufupA1l2nBB7qDdmnSZARSSYEWhLLyKA0A=; b=X0v596B8hKp70R8TR8XhtdmkADuDwebVYbWAKA7uzhJC8GNmjhHboQge1AH3Le1jtR +R1jEEq74dGACWHXgoxhosFN/rUrwHuTASLQY27URxzu7TPrNmoghoibpG2aOcyYsGvc iK3i+04dRounNOlttlja72cLQ2HB/EjZVM8n/rQXbkAUtotMrU0/rRrBXH+UPi5XXOPN XBcO2LZM4D8ccb2gNR2KvpyqXmtaPjrUGQ8qp7cDiEpcFPrfUaKbQHxJ9sZdc6AQUmou qSnbkWeU+uKp2E287P+kYbSiqGy0Aamuc/fSdC3TKt4Nv2dRZOD0l+TMGmP7SLu/xsth +iqw==
MIME-Version: 1.0
X-Received: by 10.112.61.199 with SMTP id s7mr1123418lbr.53.1370883750948; Mon, 10 Jun 2013 10:02:30 -0700 (PDT)
Sender: stedolan@stedolan.net
Received: by 10.114.176.231 with HTTP; Mon, 10 Jun 2013 10:02:30 -0700 (PDT)
X-Originating-IP: [131.111.184.8]
In-Reply-To: <CAChr6SyM0ERZ6bqEbG4ULDZx-MsKo8sx-9WB5sVLFyONm++kbQ@mail.gmail.com>
References: <AF793CAF-B30B-44A7-B864-82CEF79EA34D@vpnc.org> <CAChr6SwLDCUk0DC9pGTKqUu_V5vJHvs7Sgv4EneTJMryn1iKSA@mail.gmail.com> <CA+mHimPdoN0vf8c3AzYrZ8HXgPbUJPkvViwU4iWrcZBBKJRmNg@mail.gmail.com> <CAChr6SyM0ERZ6bqEbG4ULDZx-MsKo8sx-9WB5sVLFyONm++kbQ@mail.gmail.com>
Date: Mon, 10 Jun 2013 18:02:30 +0100
X-Google-Sender-Auth: uPuYRrSDkozlztx35KqcxdPwQZw
Message-ID: <CA+mHimMmDQkPK9rSe+ny_1A9jx7skhpPvxf14UV_Q9EwSJp-7Q@mail.gmail.com>
From: Stephen Dolan <stephen.dolan@cl.cam.ac.uk>
To: R S <sayrer@gmail.com>
Content-Type: text/plain; charset=windows-1252
Content-Transfer-Encoding: quoted-printable
X-Gm-Message-State: ALoCoQkFoCUqqTN6LR+5S2crUm34EYH2VIZwthmyYadK85XPZr/5yGKcK4rv/Fw2Zm9fTtAU1146
Cc: Paul Hoffman <paul.hoffman@vpnc.org>, "json@ietf.org" <json@ietf.org>
Subject: Re: [Json] A possible summary of the discussion so far on code points and characters
X-BeenThere: json@ietf.org
X-Mailman-Version: 2.1.12
Precedence: list
List-Id: "JavaScript Object Notation \(JSON\) WG mailing list" <json.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/json>, <mailto:json-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/json>
List-Post: <mailto:json@ietf.org>
List-Help: <mailto:json-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/json>, <mailto:json-request@ietf.org?subject=subscribe>
X-List-Received-Date: Mon, 10 Jun 2013 17:02:37 -0000

On Sun, Jun 9, 2013 at 1:23 AM, R S <sayrer@gmail.com> wrote:
> It's a requirement. Here are some additional references:
>
> <http://wiki.ecmascript.org/doku.php?id=strawman:support_full_unicode_in_strings&rev=1305822947>
> <https://mail.mozilla.org/pipermail/es-discuss/2011-May/014337.html>
>
> The paragraph following the one I cited:
>
> 'Throughout the rest of this document, the phrase “code unit” and the word
> “character” will be used to refer to a 16-bit unsigned value used to
> represent a single 16-bit unit of text. The phrase “Unicode character” will
> be used to refer to the abstract linguistic or typographical unit
> represented by a single Unicode scalar value (which may be longer than 16
> bits and thus may be represented by more than one code unit). The phrase
> “code point” refers to such a Unicode scalar value. “Unicode character” only
> refers to entities represented by single Unicode scalar values: the
> components of a combining character sequence are still individual “Unicode
> characters,” even though a user might think of the whole sequence as a
> single character.' <http://es5.github.io/x6.html>
>
> - Rob

Sorry, I misread.

Stephen