Re: [I18ndir] [art] Fwd: New Version Notification for draft-bray-unichars-05.txt

Rob Sayre <sayrer@gmail.com> Tue, 19 September 2023 23:04 UTC

References: <169514412895.12827.17924518978945582691@ietfa.amsl.com> <CAHBU6iuUsa4H_9BNvf3XEuOg3ZB5qB31vQuodQhacQUMxFiUMg@mail.gmail.com>
In-Reply-To: <CAHBU6iuUsa4H_9BNvf3XEuOg3ZB5qB31vQuodQhacQUMxFiUMg@mail.gmail.com>
From: Rob Sayre <sayrer@gmail.com>
Date: Tue, 19 Sep 2023 16:04:02 -0700
Message-ID: <CAChr6SxMswjKACUr3cpZymjEOqnrxTQV2hX9mpsZO1=H2TwEZg@mail.gmail.com>
To: Tim Bray <tbray@textuality.com>
Cc: i18ndir@ietf.org, ART Area <art@ietf.org>
Content-Type: multipart/alternative; boundary="000000000000fd1bec0605be4621"
Archived-At: <https://mailarchive.ietf.org/arch/msg/i18ndir/6iVgSayuJdCtDh_kWhLbuLmPmsU>
Subject: Re: [I18ndir] [art] Fwd: New Version Notification for draft-bray-unichars-05.txt

On Tue, Sep 19, 2023 at 10:29 AM Tim Bray <tbray@textuality.com> wrote:

> Two big differences here:
>
>
>    1. The %0-10FFFF production, and discussion of All The Code Points as
>    a subset, is removed. Which ended up re-organizing the draft quite a bit
>    (diffs vs -04 may not be very helpful) but we think improved the flow quite
>    a bit; thanks to James and Carsten for arguing this point so fiercely.
>    2. The discussion around RFC9413 is changed quite a bit based on
>    Asmus’ input.
>
>
> Unless substantial new issues are raised, we plan to consult our ADs about
> advancing the document.
>

There are still some bugs here. Generally, I think "Abstract Character
Repertoire" as used here is good:
https://unicode.org/reports/tr17/#Repertoire

It makes it clear that the various encoding and escaping routines happen
before or after this idea. I don't think you need to add "Abstract" as a
qualifier. Just explain it.

> The Unicode Standard's definition of "Unicode character" is conceptual.
> However, each Unicode character is assigned a code point, used to represent
> the characters in computer memory and storage systems and, in specifications,
> to specify the allowed repertoires of Unicode characters.

I think you want to add: "Not all code points represent characters."
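
To illustrate what I mean, here is a rough Python sketch. The helper names
(is_surrogate, is_noncharacter, general_category) are made up for this
example and are not from the draft; the ranges are the standard Unicode
ones for surrogates and noncharacters.

```python
import unicodedata


def is_surrogate(cp: int) -> bool:
    # Surrogate code points: U+D800 through U+DFFF.
    return 0xD800 <= cp <= 0xDFFF


def is_noncharacter(cp: int) -> bool:
    # Noncharacters: U+FDD0..U+FDEF, plus the last two code points
    # of every plane (U+FFFE, U+FFFF, U+1FFFE, U+1FFFF, ...).
    return 0xFDD0 <= cp <= 0xFDEF or (cp & 0xFFFE) == 0xFFFE


def general_category(cp: int) -> str:
    # General_Category as reported by Python's bundled Unicode data.
    return unicodedata.category(chr(cp))


for cp in (0x41, 0xD800, 0xFFFE):
    print(f"U+{cp:04X}", general_category(cp),
          "surrogate" if is_surrogate(cp) else "",
          "noncharacter" if is_noncharacter(cp) else "")
```

All three values are valid code points, but only U+0041 is a character in
the sense the draft seems to want.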

> In ABNF, the hexadecimal values for characters are preceded by "%x"
> rather than "U+".

But these are code points in the ABNF, right? For example:

https://www.ietf.org/archive/id/draft-bray-unichars-05.html#section-4.1

"; exclude surrogates"

Surrogates are among the problematic code point types. They are not
characters, so it's probably best to go through the draft and clean up
that wording.
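
A small sketch of why I read the ABNF as being about code points. I'm
assuming the production in section 4.1 is roughly
"%x0-D7FF / %xE000-10FFFF ; exclude surrogates"; the SCALAR_RANGES table
and the matches() helper below are hypothetical names of mine, not
anything in the draft.

```python
# Assumed ranges, modeled on a "%x0-D7FF / %xE000-10FFFF" production.
SCALAR_RANGES = [(0x0000, 0xD7FF), (0xE000, 0x10FFFF)]


def matches(cp: int, ranges=SCALAR_RANGES) -> bool:
    # An ABNF %x range is just an interval test over numeric values.
    return any(lo <= cp <= hi for lo, hi in ranges)


print(matches(0x41))    # ordinary letter, inside the first range
print(matches(0xD800))  # surrogate, excluded by the gap between ranges
print(matches(0xFFFE))  # noncharacter, yet still inside the second range
```

The last case is the point: the range accepts a value that is not a
character, so the %x values are best described as code points.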

I think the "Restricting Character Repertoires" section should be run
through a grammar checker (MS Word or something). It doesn't say anything
incorrect, but I often thought "hmm, there should be a comma there" and
little things like that. Thank you for taking the "conforming JSON text"
suggestion, but the capitalization differs between the two uses: "JSON
text" vs "JSON Text".

thanks,
Rob