Re: [I18ndir] [art] Just uploaded draft-bray-unichars-03

Tim Bray <tbray@textuality.com> Sat, 09 September 2023 23:09 UTC

From: Tim Bray <tbray@textuality.com>
Date: Sat, 09 Sep 2023 16:09:01 -0700
Message-ID: <CAHBU6itLfpGSZngJ7z3CZbOg5o-Je4O4y4vD1eiY7QCK+QKfiw@mail.gmail.com>
To: Rob Sayre <sayrer@gmail.com>
Cc: i18ndir@ietf.org, ART Area <art@ietf.org>
Content-Type: multipart/alternative; boundary="000000000000ca6a350604f52d3a"
Archived-At: <https://mailarchive.ietf.org/arch/msg/i18ndir/H-ikSZ5yurGuRGDm301YzD6DzBs>
Subject: Re: [I18ndir] [art] Just uploaded draft-bray-unichars-03

On Sep 8, 2023 at 1:55:10 PM, Rob Sayre <sayrer@gmail.com> wrote:

>
> I do think you missed discussion of escape sequences, though. For example,
> you can create an absolutely infuriating sequence of surrogate characters
> in JSON while the document is correct UTF-8. This dual layering needs to be
> discussed for anyone that is trying to figure this stuff out. We're not
> writing for the people on this mailing list. It's for the people that show
> up later.
>

On consideration, I think you’re right.  The example “\uDEAD” might give
the impression that an escape sequence is a legal way to sneak a surrogate
into a text field even when the specified character repertoire (defined
using, for example, this document) says you shouldn’t do that, so yeah, the
draft needs to make that explicit.
Thanks.
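
To make the two layers concrete, here is a rough sketch, assuming Python's
standard json module (which happens to accept a lone surrogate escape
rather than reject it). The field name "field" and the helper
has_surrogate are made up for illustration and are not from the draft.

```python
import json

def has_surrogate(text: str) -> bool:
    """Hypothetical helper: True if any code point is in U+D800..U+DFFF."""
    return any(0xD800 <= ord(ch) <= 0xDFFF for ch in text)

# Layer 1: the bytes on the wire. The escape is spelled with ASCII
# characters only, so the document is perfectly valid UTF-8.
raw = b'{"field": "\\uDEAD"}'
wire_text = raw.decode("utf-8")   # succeeds
wire_clean = not has_surrogate(wire_text)

# Layer 2: the value after JSON unescaping. Python's json module turns
# the escape into an actual lone surrogate code point.
value = json.loads(raw)["field"]
value_dirty = has_surrogate(value)

# The decoded value can no longer be written back out as UTF-8.
try:
    value.encode("utf-8")
    reencodable = True
except UnicodeEncodeError:
    reencodable = False
```

So a check run only on the incoming bytes passes, and the surrogate shows
up only after unescaping. A repertoire restriction has to be applied to
the unescaped value as well, which is the point that needs spelling out.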


