Re: [I18ndir] [art] Fwd: New Version Notification for draft-bray-unichars-06.txt

"Manger, James" <James.H.Manger@team.telstra.com> Mon, 02 October 2023 08:08 UTC

From: "Manger, James" <James.H.Manger@team.telstra.com>
To: Tim Bray <tbray@textuality.com>
CC: "i18ndir@ietf.org" <i18ndir@ietf.org>, ART Area <art@ietf.org>
Thread-Topic: [art] Fwd: New Version Notification for draft-bray-unichars-06.txt
Thread-Index: AQHZ79CDx7kIJf8SrkaIgd4ICEsz7rAyr4kzgAE/8ACAAjaJ8w==
Date: Mon, 02 Oct 2023 08:08:24 +0000
Message-ID: <SY4PR01MB5980D009F1623E3694B871B7E5C5A@SY4PR01MB5980.ausprd01.prod.outlook.com>
References: <169566019635.41806.9804796677919971070@ietfa.amsl.com> <CAHBU6is-wU2NLXNWL56nSJ4=nKvDzGv_Aw4qJN6N2O8CuM4-yw@mail.gmail.com> <SYBPR01MB59814B3448F5754AAEDA1740E5C7A@SYBPR01MB5981.ausprd01.prod.outlook.com> <CAHBU6iueqtd5T1T-ciYUMWvmo8XqBQqO5LkWbdRaoXQzPYSQOQ@mail.gmail.com>
In-Reply-To: <CAHBU6iueqtd5T1T-ciYUMWvmo8XqBQqO5LkWbdRaoXQzPYSQOQ@mail.gmail.com>
Archived-At: <https://mailarchive.ietf.org/arch/msg/i18ndir/DkbesKr16eDS6-Rg-g145c6cIvg>
Subject: Re: [I18ndir] [art] Fwd: New Version Notification for draft-bray-unichars-06.txt
List-Id: Internationalization Directorate <i18ndir.ietf.org>

Thought experiment: Unicode is invented in almost the same way, except that UTF-16 is slightly rearranged. D800-DFFF are still high/low surrogates, but the calculation between code points and code units differs:
  1 code unit X in [0000, D7FF] => code point = X
  1 code unit X in [E000, FFFF] => code point = X - 2048
  2 code units X Y forming a surrogate pair => code point = (X - 0xD800) * 1024 + (Y - 0xDC00) + 0xF800
  Unpaired surrogate => ill-formed
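
To make the rearranged mapping concrete, here is a minimal Python sketch of the four rules above. The function names `encode` and `decode` are invented for illustration, and the upper bound 0x10F7FF is simply what the pair formula yields for the largest pair (DBFF DFFF); none of this describes a real encoding.

```python
def encode(cp: int) -> list[int]:
    """Code point -> 16-bit code units under the thought-experiment scheme."""
    if not 0 <= cp <= 0x10F7FF:
        raise ValueError("code point out of range")
    if cp <= 0xD7FF:
        return [cp]                      # unchanged below the surrogate block
    if cp <= 0xF7FF:
        return [cp + 2048]               # lands in E000..FFFF
    offset = cp - 0xF800                 # 20-bit offset carried by a pair
    return [0xD800 + (offset >> 10), 0xDC00 + (offset & 0x3FF)]


def decode(units: list[int]) -> list[int]:
    """16-bit code units -> code points; unpaired surrogates are ill-formed."""
    out = []
    i = 0
    while i < len(units):
        x = units[i]
        if not 0 <= x <= 0xFFFF:
            raise ValueError("not a 16-bit code unit")
        if x <= 0xD7FF:
            out.append(x)
        elif x >= 0xE000:
            out.append(x - 2048)
        elif (x <= 0xDBFF and i + 1 < len(units)
              and 0xDC00 <= units[i + 1] <= 0xDFFF):
            y = units[i + 1]
            out.append((x - 0xD800) * 1024 + (y - 0xDC00) + 0xF800)
            i += 1                       # consumed the low surrogate too
        else:
            raise ValueError("unpaired surrogate: ill-formed")
        i += 1
    return out
```

For example, `encode(0xD800)` gives the single unit E000, and `encode(0xF800)` gives the first pair, D800 DC00.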

The result is exactly the same number of scalars (0x10F800 of them), but they are now contiguous: %x0-10F7FF. So scalars = code points.
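
The count can be checked with a little arithmetic. This is only a sketch; the variable names are made up for illustration.

```python
# Real Unicode: code points 0..10FFFF minus the 2048 surrogates D800..DFFF.
real_scalars = 0x110000 - (0xDFFF - 0xD800 + 1)

# Thought experiment: single units from [0000, D7FF] and [E000, FFFF],
# plus one code point per (high, low) surrogate pair.
single_units = (0xD7FF - 0x0000 + 1) + (0xFFFF - 0xE000 + 1)
pairs = 1024 * 1024
hypothetical_scalars = single_units + pairs

# Largest code point: the pair formula applied to X = DBFF, Y = DFFF.
max_code_point = (0xDBFF - 0xD800) * 1024 + (0xDFFF - 0xDC00) + 0xF800

print(hex(real_scalars), hex(hypothetical_scalars), hex(max_code_point))
```

Both counts come to 0x10F800 (1,112,064), and the largest code point is 0x10F7FF, so the range has no hole in it.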

JSON is identical, including escapes using UTF-16 code units. You can still get "\uDEAD" as a JSON text. You still have to deal with that (and explain it if desired).
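
As an illustration of the unpaired-escape case in today's world, here is what CPython's standard `json` module does with that text. This is a sketch of one parser's behaviour; other JSON parsers may reject the input or substitute U+FFFD.

```python
import json

text = r'"\uDEAD"'            # the JSON text: 8 ASCII characters
value = json.loads(text)      # CPython accepts the lone surrogate escape

# The result is a one-element str holding the surrogate code point itself.
is_lone_surrogate = len(value) == 1 and ord(value) == 0xDEAD

# It is not a scalar value, so it cannot be encoded as well-formed UTF-8.
try:
    value.encode("utf-8")
    encodable = True
except UnicodeEncodeError:
    encodable = False

print(is_lone_surrogate, encodable)
```

The string exists in memory as a "code point" but cannot be serialised as UTF-8, which is the awkward in-between state that has to be dealt with either way.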

But you can no longer use “code point” as a label for unicode-plus-unpaired-surrogates. You have to explain its actual nature.

P.S. Of course I am NOT suggesting any such change. But I think it hints at how “code points” is misused to straddle the scalar/code-unit divide in a way that causes confusion.

--
James Manger

