Re: [art] draft-bray-unichars

Carsten Bormann <cabo@tzi.org> Thu, 31 August 2023 10:49 UTC

Return-Path: <cabo@tzi.org>
X-Original-To: art@ietfa.amsl.com
Delivered-To: art@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 57B6DC1522AF for <art@ietfa.amsl.com>; Thu, 31 Aug 2023 03:49:34 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -6.895
X-Spam-Level:
X-Spam-Status: No, score=-6.895 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, RCVD_IN_DNSWL_HI=-5, RCVD_IN_ZEN_BLOCKED_OPENDNS=0.001, SPF_HELO_NONE=0.001, T_SCC_BODY_TEXT_LINE=-0.01, T_SPF_TEMPERROR=0.01, URIBL_BLOCKED=0.001, URIBL_DBL_BLOCKED_OPENDNS=0.001, URIBL_ZEN_BLOCKED_OPENDNS=0.001] autolearn=ham autolearn_force=no
Received: from mail.ietf.org ([50.223.129.194]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id APEWXvvjQpxe for <art@ietfa.amsl.com>; Thu, 31 Aug 2023 03:49:28 -0700 (PDT)
Received: from smtp.zfn.uni-bremen.de (smtp.zfn.uni-bremen.de [IPv6:2001:638:708:32::21]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature RSA-PSS (2048 bits) server-digest SHA256) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id E55FEC14CE44 for <art@ietf.org>; Thu, 31 Aug 2023 03:49:27 -0700 (PDT)
Received: from smtpclient.apple (p548dc15c.dip0.t-ipconnect.de [84.141.193.92]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by smtp.zfn.uni-bremen.de (Postfix) with ESMTPSA id 4Rbyb72yQfzDCfM; Thu, 31 Aug 2023 12:49:23 +0200 (CEST)
Content-Type: text/plain; charset="utf-8"
Mime-Version: 1.0 (Mac OS X Mail 16.0 \(3731.700.6\))
From: Carsten Bormann <cabo@tzi.org>
In-Reply-To: <GVXPR07MB9728D092A7611EE40C251877A0E5A@GVXPR07MB9728.eurprd07.prod.outlook.com>
Date: Thu, 31 Aug 2023 12:49:12 +0200
Cc: Tim Bray <tbray@textuality.com>, "art@ietf.org" <art@ietf.org>
Content-Transfer-Encoding: quoted-printable
Message-Id: <183BF6FE-8DE6-4B86-9F65-622927E394D5@tzi.org>
References: <CAHBU6iuDwquhacp1r7qREfaA1CGLR5LjqdasMdOQUQim6NeJsw@mail.gmail.com> <GVXPR07MB9728D092A7611EE40C251877A0E5A@GVXPR07MB9728.eurprd07.prod.outlook.com>
To: tom petch <ietfc@btconnect.com>
X-Mailer: Apple Mail (2.3731.700.6)
Archived-At: <https://mailarchive.ietf.org/arch/msg/art/-CpAIH8MEw_h0msf-YxNThq8cPk>
Subject: Re: [art] draft-bray-unichars
X-BeenThere: art@ietf.org
X-Mailman-Version: 2.1.39
Precedence: list
List-Id: Applications and Real-Time Area Discussion <art.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/art>, <mailto:art-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/art/>
List-Post: <mailto:art@ietf.org>
List-Help: <mailto:art-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/art>, <mailto:art-request@ietf.org?subject=subscribe>
X-List-Received-Date: Thu, 31 Aug 2023 10:49:34 -0000

On 31. Aug 2023, at 12:17, tom petch <ietfc@btconnect.com> wrote:
> 
> And AFAICT none of them cover the case I know best, that of YANG RFC7950 p.209.

At least you excluded the surrogates code points...

> This is a topic that surfaces regularly with YANG modules where there are many display strings and the question is what to allow and what not to allow.  It was an issue with SMI previously but there there was a type intended for that use but YANG does not have that.  I cannot recall a YANG module specifying a Unicode subset to use.

Just a random observation: If you want to *display* these strings outside the English-language-only engineering environment, you’d probably want to provide for a language tag.
(I think YANG 1.1 did a good thing keeping this open.)

I wrote modern-network-unicode [1] for a similar case, where the authors of a draft naturally weren’t experts in the relevant questions.
There then is a strong danger of the restatement anti-pattern (*), as well as some incompleteness in the actual design.

So I agree we want to crystallize some advice, as well as providing a document that specifications can point to.
I just seem to have a slightly different perception than draft-bray-unichars on what the information in there should be…

Grüße, Carsten



[1]: https://datatracker.ietf.org/doc/draft-bormann-dispatch-modern-network-unicode/

(*) E.g., should every piece of ABNF respecify what are the Unicode non-characters today?  I don’t think so.