Re: [I18ndir] [art] New Version Notification for draft-bray-unichars-06.txt

Tim Bray <tbray@textuality.com> Sun, 01 October 2023 22:37 UTC

Return-Path: <tbray@textuality.com>
X-Original-To: i18ndir@ietfa.amsl.com
Delivered-To: i18ndir@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id BD895C15154E for <i18ndir@ietfa.amsl.com>; Sun, 1 Oct 2023 15:37:59 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.105
X-Spam-Level:
X-Spam-Status: No, score=-2.105 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, HTML_MESSAGE=0.001, RCVD_IN_ZEN_BLOCKED_OPENDNS=0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, T_SCC_BODY_TEXT_LINE=-0.01, URIBL_BLOCKED=0.001, URIBL_DBL_BLOCKED_OPENDNS=0.001, URIBL_ZEN_BLOCKED_OPENDNS=0.001] autolearn=unavailable autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (1024-bit key) header.d=textuality.com
Received: from mail.ietf.org ([50.223.129.194]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id psaJUSdpUQlo for <i18ndir@ietfa.amsl.com>; Sun, 1 Oct 2023 15:37:55 -0700 (PDT)
Received: from mail-ed1-x52d.google.com (mail-ed1-x52d.google.com [IPv6:2a00:1450:4864:20::52d]) (using TLSv1.3 with cipher TLS_AES_128_GCM_SHA256 (128/128 bits) key-exchange X25519 server-signature RSA-PSS (2048 bits) server-digest SHA256) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id F0DFFC151091 for <i18ndir@ietf.org>; Sun, 1 Oct 2023 15:37:54 -0700 (PDT)
Received: by mail-ed1-x52d.google.com with SMTP id 4fb4d7f45d1cf-536071e79deso5522828a12.1 for <i18ndir@ietf.org>; Sun, 01 Oct 2023 15:37:54 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=textuality.com; s=google; t=1696199873; x=1696804673; darn=ietf.org; h=cc:to:subject:message-id:date:from:in-reply-to:references :mime-version:from:to:cc:subject:date:message-id:reply-to; bh=NAV9eYC8M9IecX7SyTMfyTnm1dPZoDVm0tMjFsumsLI=; b=V4JU8w41vHT7NU/cOl+YVgIv6Gc4sMkXWXbk4+a/yu3FTxL0d5x2dSEVnxxG6MFZQ1 frY532Wp/pcdC7wJvXNzqIo4Vd6kn9x8jXPr8BbpAtwglp+o76pcnRQhS0wiq8f096Vg 6aUje4T/rIQTspgiLlw/Wqblnv8CaOnK5F1iQ=
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1696199873; x=1696804673; h=cc:to:subject:message-id:date:from:in-reply-to:references :mime-version:x-gm-message-state:from:to:cc:subject:date:message-id :reply-to; bh=NAV9eYC8M9IecX7SyTMfyTnm1dPZoDVm0tMjFsumsLI=; b=w8QhvEBr2pDoCV3ZbFfF+OlhAyqgk7a1AE5jlB+SoEox6oGto5cBC0YgEXlx9H5efC EX20tCVyicHLSx1WXRLbghBoHHEyNfcYF/L6XoI7amiLuPfH9nHCEyVK2fKp9pmW7f90 HRh9yyclm3+ceK80w02wvi1qigOJNfBeOlAFDn/FQBoXmfpVwguKx/XV9o97jvbtfrOv zjf9KTEabew5sEc7GLhaZ5rOBgmOthvsbvHLaHA44Zl+Sjy3IooYUMjVzeldIrX5maUE MpNmfloUYuInlvY+vMNcwubKFnjcqapljEoBN/HA3oW7KWFmyAT2nNzk1RfR8ha8NVVs cnMg==
X-Gm-Message-State: AOJu0Yzd3Ha4yx4pSa7l2HcSnvAfzpyY/IGMydQFDax+++bwbSVyPvsj zVvzrCgSoPXeanmNbNm2kxch+5Gwq6GK7fFgp0g1dA==
X-Google-Smtp-Source: AGHT+IEU3MEluYhNr4PDh7ZnMu8jn0NPeZGG6mQXbSXWie0a39BOv45Faun19uCyBRDUH9tKOPzu6JOTRPa2W1ifAD4=
X-Received: by 2002:a05:6402:40cf:b0:531:14c4:ae30 with SMTP id z15-20020a05640240cf00b0053114c4ae30mr8677707edb.0.1696199873055; Sun, 01 Oct 2023 15:37:53 -0700 (PDT)
Received: from 1064022179695 named unknown by gmailapi.google.com with HTTPREST; Sun, 1 Oct 2023 22:37:52 +0000
Received: from 1064022179695 named unknown by gmailapi.google.com with HTTPREST; Sun, 1 Oct 2023 22:37:48 +0000
Mime-Version: 1.0 (Mimestream 1.1.2)
References: <169566019635.41806.9804796677919971070@ietfa.amsl.com> <CAHBU6is-wU2NLXNWL56nSJ4=nKvDzGv_Aw4qJN6N2O8CuM4-yw@mail.gmail.com> <SYBPR01MB59814B3448F5754AAEDA1740E5C7A@SYBPR01MB5981.ausprd01.prod.outlook.com> <CAHBU6iueqtd5T1T-ciYUMWvmo8XqBQqO5LkWbdRaoXQzPYSQOQ@mail.gmail.com> <SYBPR01MB59819A9F0BDD785F74EB2855E5C7A@SYBPR01MB5981.ausprd01.prod.outlook.com> <CAChr6SwLcEX3Oox-CMCui+p8LQQFJBf+kG8p9WNpD8HzgXsm9Q@mail.gmail.com> <219F675E-0184-4FD9-BD48-6B62AD8A4353@tzi.org> <CAChr6SyDMHKB6p-gdS1OdJRG+X=6ozZDtT8j1hehyTvCKaskSA@mail.gmail.com> <90BF7B53-584A-44D9-824E-F2AA8BD70A94@tzi.org> <CAHBU6iuS2J0KEOYWOscVAoNthjU4G1BR-Bo+jYetBBGXjrt-Ew@mail.gmail.com> <F9C8BA7D-54E0-489E-88BD-E089D2EA297B@tzi.org>
In-Reply-To: <F9C8BA7D-54E0-489E-88BD-E089D2EA297B@tzi.org>
From: Tim Bray <tbray@textuality.com>
Date: Sun, 01 Oct 2023 22:37:52 +0000
Message-ID: <CAHBU6ivTTrKCBJ0=yAQb7+NriDyN5udZ9pQOhSLjJAMHM8_D7Q@mail.gmail.com>
To: Carsten Bormann <cabo@tzi.org>
Cc: "Manger, James" <James.H.Manger=40team.telstra.com@dmarc.ietf.org>, "i18ndir@ietf.org" <i18ndir@ietf.org>, ART Area <art@ietf.org>, Rob Sayre <sayrer@gmail.com>
Content-Type: multipart/alternative; boundary="000000000000e6e5dc0606af4e73"
Archived-At: <https://mailarchive.ietf.org/arch/msg/i18ndir/XdF_T78DnMEw5eTEaBcz2IrAGXQ>
Subject: Re: [I18ndir] [art] New Version Notification for draft-bray-unichars-06.txt
X-BeenThere: i18ndir@ietf.org
X-Mailman-Version: 2.1.39
Precedence: list
List-Id: Internationalization Directorate <i18ndir.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/i18ndir>, <mailto:i18ndir-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/i18ndir/>
List-Post: <mailto:i18ndir@ietf.org>
List-Help: <mailto:i18ndir-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/i18ndir>, <mailto:i18ndir-request@ietf.org?subject=subscribe>
X-List-Received-Date: Sun, 01 Oct 2023 22:37:59 -0000

The unichars draft is immensely less ambitious than modern-network-unicode,
which is covering a lot of complex subjects. Unichars covers only the issue
of code points, what the issues are with them, and subsetting them.  It
would be useful as a reference point in many currently active Internet
drafts, which tend to ignore this issue.

Turning modern-network-unicode into an IETF consensus document will be a
long complicated task, with a lot of difficult issues. For example:


   1. At the moment, I don’t believe in the distinction between “1d" and
   "2d”.  Because I don’t think that controls like \n and \r shouldn be given
   semantic significance in modern protocols, we have things like JSON and
   CBOR for packaging.  So this issue needs a lot more work.
   2. I think it should be a best practice that protocol designs SHOULD NOT
   go anywhere near normalization because so many text fields are used as
   unique identifiers and thus hash keys.


Neither of those positions are very strongly held, but it is clear that a
general best-Unicode-practices spec is a big task.

In the meantime, unichars explains one poorly-understood problem and
provides handy references for sane solutions to it, and has a chance of
being completed quickly.

I just don’t think a discussion of unichars vs modern-network-unicode makes
sense. They’re not trying to do the same things.

On Oct 1, 2023 at 10:56:34 AM, Carsten Bormann <cabo@tzi.org> wrote:

> On 2023-10-01, at 19:44, Tim Bray <tbray@textuality.com> wrote:
>
>
> I repeat: To inform protocol designers that it’s insufficient to say
> “Unicode characters” for their text fields. And, having explained why,
> offer reasonable alternative character repertoires.
>
>
> That is approximately the reason why I wrote modern-network-unicode.
> Please explain how draft-bray-unichars is more useful for this purpose.
> It doesn’t even distinguish 1d from 2d applications.
> That is not the level of advice I saw that people needed when I decided to
> write modern-network-unicode.
>
> Grüße, Carsten
>
>