Re: [I18ndir] [art] Fwd: New Version Notification for draft-bray-unichars-06.txt

Rob Sayre <sayrer@gmail.com> Wed, 27 September 2023 17:24 UTC

Return-Path: <sayrer@gmail.com>
X-Original-To: i18ndir@ietfa.amsl.com
Delivered-To: i18ndir@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 7C366C15155E; Wed, 27 Sep 2023 10:24:41 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.106
X-Spam-Level:
X-Spam-Status: No, score=-2.106 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, FREEMAIL_FROM=0.001, HTML_MESSAGE=0.001, RCVD_IN_DNSWL_BLOCKED=0.001, RCVD_IN_ZEN_BLOCKED_OPENDNS=0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, T_SCC_BODY_TEXT_LINE=-0.01] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com
Received: from mail.ietf.org ([50.223.129.194]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id Eust0fZbas5U; Wed, 27 Sep 2023 10:24:39 -0700 (PDT)
Received: from mail-ed1-x530.google.com (mail-ed1-x530.google.com [IPv6:2a00:1450:4864:20::530]) (using TLSv1.3 with cipher TLS_AES_128_GCM_SHA256 (128/128 bits) key-exchange X25519 server-signature RSA-PSS (2048 bits) server-digest SHA256) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 56EB5C151074; Wed, 27 Sep 2023 10:24:39 -0700 (PDT)
Received: by mail-ed1-x530.google.com with SMTP id 4fb4d7f45d1cf-533c4d20b33so18341375a12.0; Wed, 27 Sep 2023 10:24:39 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20230601; t=1695835477; x=1696440277; darn=ietf.org; h=cc:to:subject:message-id:date:from:in-reply-to:references :mime-version:from:to:cc:subject:date:message-id:reply-to; bh=dgvZsGKtrLGTL5VmJKZIM0tIiAh1YUm7uQ1CY4cOypg=; b=ImeJAgsQZFt7YAYXHKVet/97EC9MrUL1+E5coaaiPa2LzvT78dyG2Pa6y4U8KRo8Rf fPnhQJQ/jTCiXES+c/EeEYeXqOgulAkbJ001qES48UcWTPJ4M13OVRMunZFXEfcCwpyl A0gUT+BPFSjkTa9T8+lA607L20QrTDiKAoyRzPCywv7XZDP94DwSDHCePNNoD8auCflH TWdmSwdxawsVOLSdeoTxlzA8ZFrkn+KDPMd7UWVdt43aHN9ENJWv2rfeYd+HF/ePlfW+ sd/gyEGYIOm6jASR23WwL3jUsiqY6IfHrgkotUfJrltq7g54W6/PQWQKARD+e44lwstr BD8A==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1695835477; x=1696440277; h=cc:to:subject:message-id:date:from:in-reply-to:references :mime-version:x-gm-message-state:from:to:cc:subject:date:message-id :reply-to; bh=dgvZsGKtrLGTL5VmJKZIM0tIiAh1YUm7uQ1CY4cOypg=; b=dbD80ZpDfsTdIH9yM28upAIztEI/NOyBju427rJg5s5goxyLSHqFQu3Zl1DdgTwqNm +IGq+ltGcc8je8wnMojXMG0J83vCwnrQ/ZGeYOlc4YOK0zwe+q7UjniJ4M5BHDsztjPk OSgCYWD9acSkBDt7Q3Y2AkW23vHNR0dbVWl8bZO4jkqV2oiYsWB87WaPnpKUWobgrHKD +y0X5Zcf+Olqn9BpDnDGQnTHBM4eA52fyvfuJPv3TBvCp3d37uH3l2c6tqBBvbIw17O6 r+XQCCU7DLQylxnV/giVdgfy7+f+klWzUCuAyFNArj8DYi905RJANVMZxwuKIPplGF/b 7pTQ==
X-Gm-Message-State: AOJu0Yw7DQHAbR25wSeeZsBRQ03dFw80wrcK833FyFP6Z1MTQV83RBtH OspZNXirriUfdcCyN7+sjvsoSUtIIPP/ui353tY=
X-Google-Smtp-Source: AGHT+IEU1J35bJLkdYzG9NVgrc9cTNbQYO+3LWx/PKoW60GPEceH4tnMGeBUB7NmjZITSjDpjkpvkFGWTEOc70+v8Jg=
X-Received: by 2002:a05:6402:430e:b0:52e:3ce8:e333 with SMTP id m14-20020a056402430e00b0052e3ce8e333mr9930019edc.18.1695835477394; Wed, 27 Sep 2023 10:24:37 -0700 (PDT)
MIME-Version: 1.0
References: <169566019635.41806.9804796677919971070@ietfa.amsl.com> <CAHBU6is-wU2NLXNWL56nSJ4=nKvDzGv_Aw4qJN6N2O8CuM4-yw@mail.gmail.com> <CAChr6SwM9re+0X8V9YkFLxkuxhSnu0chW9ecKq1JuNuo4fAEWw@mail.gmail.com> <CAHBU6ivSkEv0AcT52BWrYadmutdYNFx0D0MYR3Sv62a2LXckJw@mail.gmail.com> <CAChr6SyuLc6-fLsThQJie2G_K4-vZtPK_emnFyA7NWoakBowiA@mail.gmail.com> <CAHBU6is1c8onz4CWsh=7vTNP_s4=0sCC2Q4jFrrb50JVOqxRTA@mail.gmail.com> <8c520016-391f-62ad-49c1-09f942e195db@ix.netcom.com> <CAHBU6is=_PCqe0zPuHuipTi96jDZt4+40BNaiH3KQsfqOcmjfQ@mail.gmail.com>
In-Reply-To: <CAHBU6is=_PCqe0zPuHuipTi96jDZt4+40BNaiH3KQsfqOcmjfQ@mail.gmail.com>
From: Rob Sayre <sayrer@gmail.com>
Date: Wed, 27 Sep 2023 10:24:25 -0700
Message-ID: <CAChr6SwNuXRn8SLvwLju3HPs9_PXjnOFbxEVGfD32KhNOfswvQ@mail.gmail.com>
To: Tim Bray <tbray@textuality.com>
Cc: Asmus Freytag <asmusf@ix.netcom.com>, i18ndir@ietf.org, ART Area <art@ietf.org>
Content-Type: multipart/alternative; boundary="0000000000003a4b8d06065a77f5"
Archived-At: <https://mailarchive.ietf.org/arch/msg/i18ndir/Gy-IngcyURxBriH-MNR49rgYRSw>
Subject: Re: [I18ndir] [art] Fwd: New Version Notification for draft-bray-unichars-06.txt
X-BeenThere: i18ndir@ietf.org
X-Mailman-Version: 2.1.39
Precedence: list
List-Id: Internationalization Directorate <i18ndir.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/i18ndir>, <mailto:i18ndir-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/i18ndir/>
List-Post: <mailto:i18ndir@ietf.org>
List-Help: <mailto:i18ndir-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/i18ndir>, <mailto:i18ndir-request@ietf.org?subject=subscribe>
X-List-Received-Date: Wed, 27 Sep 2023 17:24:41 -0000

On Wed, Sep 27, 2023 at 9:21 AM Tim Bray <tbray@textuality.com> wrote:

> On Sep 25, 2023 at 10:53:44 PM, Asmus Freytag <asmusf@ix.netcom.com>
> wrote:
>
>>
>> I would argue that at the end of 1. Introduction would be a good place to
>> mention what this document does *not* address.
>>
>> "This document does not address protocols where every single Unicode code
>> point must be supported for specific reasons, or protocols and data formats
>> that need to further subset the valid code points based on criteria
>> different from the ones presented here."
>>
>
> Having thought about this, I can see no value in mentioning
> situations/components that handle all-the-code-points garbage.  This
> document exists to provide options for people specifying new protocols &
> data formats and explains (clearly, I hope) why all-the-code-points is not
> provided as one of those options.
>
> On the other half, it is certainly the case that some text fields are
> going to have much more restricted repertoires than any of the ones listed
> in this document. I’m trying to imagine a situation in which stating this
> fact would be helpful to a reader, and what that reader would need to hear.
> So far I’m coming up empty.
>

At first, I thought it was a nice way to deal with this problem:
"In a way, just adding that explicitly also prevents abuse of the document
by people making false claims that it is intended to prevent you from
dealing with the full set or restricted sets,"

But the sentence is not actually true. The document does address the JSON
issue in a few places. Maybe

"Some protocols and data formats allow every single Unicode code point, and
some further subset the valid code points based on criteria different from
the ones presented here."

The question for the authors is whether they think the claim Asmus
describes would be a bogus one. I think it would be, and I believe what I
wrote is true. It's less direct than saying one way to deal with
problematic code points is to accept them (also true), but it gets the job
done.

Side note: Section 3 says "Surrogate characters have been observed to cause
software failures." but I think you want to say "Surrogate code points"
here.

thanks,
Rob