Re: [I18ndir] [art] Fwd: New Version Notification for draft-bray-unichars-04.txt

Steffen Nurpmeso <steffen@sdaoden.eu> Fri, 15 September 2023 20:44 UTC

Return-Path: <steffen@sdaoden.eu>
X-Original-To: i18ndir@ietfa.amsl.com
Delivered-To: i18ndir@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 20137C151077; Fri, 15 Sep 2023 13:44:02 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.91
X-Spam-Level:
X-Spam-Status: No, score=-1.91 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, RCVD_IN_ZEN_BLOCKED_OPENDNS=0.001, SPF_PASS=-0.001, T_SCC_BODY_TEXT_LINE=-0.01] autolearn=ham autolearn_force=no
Received: from mail.ietf.org ([50.223.129.194]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 4O1p8YmxC9a3; Fri, 15 Sep 2023 13:44:00 -0700 (PDT)
Received: from sdaoden.eu (sdaoden.eu [217.144.132.164]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature RSA-PSS (2048 bits) server-digest SHA256) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 194FAC151060; Fri, 15 Sep 2023 13:43:56 -0700 (PDT)
Date: Fri, 15 Sep 2023 22:43:53 +0200
Author: Steffen Nurpmeso <steffen@sdaoden.eu>
From: Steffen Nurpmeso <steffen@sdaoden.eu>
To: Tim Bray <tbray@textuality.com>
Cc: ART Area <art@ietf.org>, i18ndir@ietf.org, Steffen Nurpmeso <steffen@sdaoden.eu>
Message-ID: <20230915204353.Yu_dp%steffen@sdaoden.eu>
In-Reply-To: <CAHBU6ivzUV947N+n7AoYkCFT3ZfaLobCQ4fBXw3dvkqTT=LBAw@mail.gmail.com>
References: <169479938668.18742.9199862891950651366@ietfa.amsl.com> <CAHBU6ivzUV947N+n7AoYkCFT3ZfaLobCQ4fBXw3dvkqTT=LBAw@mail.gmail.com>
Mail-Followup-To: Tim Bray <tbray@textuality.com>, ART Area <art@ietf.org>, i18ndir@ietf.org, Steffen Nurpmeso <steffen@sdaoden.eu>
User-Agent: s-nail v14.9.24-510-g8b7a513518
OpenPGP: id=EE19E1C1F2F7054F8D3954D8308964B51883A0DD; url=https://ftp.sdaoden.eu/steffen.asc; preference=signencrypt
BlahBlahBlah: Any stupid boy can crush a beetle. But all the professors in the world can make no bugs.
MIME-Version: 1.0
Content-Type: text/plain; charset="utf-8"
Content-Transfer-Encoding: quoted-printable
Archived-At: <https://mailarchive.ietf.org/arch/msg/i18ndir/qGxYNdbsamHt7qAf_VgKd0fejzM>
Subject: Re: [I18ndir] [art] Fwd: New Version Notification for draft-bray-unichars-04.txt
X-BeenThere: i18ndir@ietf.org
X-Mailman-Version: 2.1.39
Precedence: list
List-Id: Internationalization Directorate <i18ndir.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/i18ndir>, <mailto:i18ndir-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/i18ndir/>
List-Post: <mailto:i18ndir@ietf.org>
List-Help: <mailto:i18ndir-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/i18ndir>, <mailto:i18ndir-request@ietf.org?subject=subscribe>
X-List-Received-Date: Fri, 15 Sep 2023 20:44:02 -0000

Hello.

Tim Bray wrote in
 <CAHBU6ivzUV947N+n7AoYkCFT3ZfaLobCQ4fBXw3dvkqTT=LBAw@mail.gmail.com>:
 | Hello again.  This draft corrects errors pointed out on these mailing
 |lists (thanks!) and goes to some length to address the arguments that
 |mentioning the All-Code-Points option is toxic, and that the definition of
 |UTF-8 excludes the possibility of surrogates. It relies more strongly on
 |the discussions in RFC 9413 (which BTW I hadn’t read except in an early
 |draft - I recommend to everyone, it’s really good).
 |
 |Comments are specifically requested about the desirability of advancing

That not, but one, and only as a remark for the interested:

  5.  Dealing With Problematic Code Points

     Noncharacters and legacy controls are unlikely to cause software
     failures, but they cannot usefully be displayed to humans, and can be
     used in attacks based on misleading human readers of text that
     display them.  [TR36]

For the purpose of "usefully displaying to humans" Unicode has the
dedicated code points U+2400 "SYMBOL FOR NULL" - U+2421 "SYMBOL
FOR DELETE", plus U+2424 - U+2426.

  6.  Restricting Character Repertoires
  ...
   It is unlikely that anyone specifying a new data format would choose
   to allow the Unicode Code Points character repertoire.

As a non-native english speaker i would think of "full" or
"complete" repertoire's?

Thank you very much!  I think the document is a good and helpful
read, especially with TR36 and the RFC 9413 that i did not have
yet!  (My personal fun is banging against that legacy C0 etc.)

A nice weekend i wish, and sorry for possibly to rude speech.

--steffen
|
|Der Kragenbaer,                The moon bear,
|der holt sich munter           he cheerfully and one by one
|einen nach dem anderen runter  wa.ks himself off
|(By Robert Gernhardt)