Re: [xml2rfc] [Rfc-markdown] [Tools-discuss] New xml2rfc release: v3.16.0

"Martin J. Dürst" <duerst@it.aoyama.ac.jp> Fri, 20 January 2023 06:10 UTC

Message-ID: <c3b3064f-e505-f504-f258-06f0d824ed4b@it.aoyama.ac.jp>
Date: Fri, 20 Jan 2023 15:10:07 +0900
To: Marc Petit-Huguenin <marc@petit-huguenin.org>, Jay Daley <exec-director@ietf.org>
Cc: xml2rfc@ietf.org, tools-discuss <tools-discuss@ietf.org>
References: <CAD2=Z87EMetcpv66YY_b2+X1-yFy4cTpKMjPoJL=cH99c7P_Uw@mail.gmail.com> <9d719176-a4eb-7cce-e706-10325700531c@petit-huguenin.org> <F1A5624B-16D0-4463-AC5F-B0A03F3B94B6@ietf.org> <8f5a497e-4135-7c0c-46cb-c3fe4791e9f3@petit-huguenin.org>
From: "Martin J. Dürst" <duerst@it.aoyama.ac.jp>
Organization: Aoyama Gakuin University
In-Reply-To: <8f5a497e-4135-7c0c-46cb-c3fe4791e9f3@petit-huguenin.org>
Archived-At: <https://mailarchive.ietf.org/arch/msg/xml2rfc/W1oZpyyrqLsGz2M54lIvgnilMyk>
Subject: Re: [xml2rfc] [Rfc-markdown] [Tools-discuss] New xml2rfc release: v3.16.0

On 2023-01-20 02:20, Marc Petit-Huguenin wrote:

> RFC 7997 clearly says that Unicode CANNOT be used except in a finite list of cases:
> 
> 1. Purely part of an example (3.1)
> 2. For English words imported from foreign languages, with the strict constraint that they are defined in the Merriam-Webster dictionary (3.1).
> 3. Person or organization name (3.2, 3.3)
> 4. when the Unicode character is described, instead of being used (3.4)
> 5. in a table
> 6. in code
> 7. in a bibliographic item
> 8. in address information
> 
> The modification above is clearly not restricted to these cases.
> 
> I notice that the xml2rfc language already contains some elements that can be used to enforce these cases.  Where one is missing, new elements could be added:
> 
> (1) An <artwork> element can contain Unicode
> (2) a new element (as <t> content) can mark words that can contain Unicode.  xml2rfc can then extract them and check that they are valid English words
> (3) <contact> can contain Unicode
> (4) <u>, used to describe a Unicode character, can contain Unicode
> (5) a <tr> element can contain Unicode
> (6) a <sourcecode> element can contain Unicode
> (7) a <reference> can contain Unicode (not just the organization/address)
> (8) The <address> and <organization> elements can contain Unicode.

This doesn't cover point 1 in your first list (purely part of an 
example (3.1)). I guess such examples could go under (2), but then the 
part "check that they are valid English words" would have to move from 
xml2rfc to people. Or there could be a second new element, such as 
<example>, whose content would again have to be checked by people.

But then, we don't actually have to go that far. The announcement to 
which you reacted contains the following:

```
* New flag --warn-bare-unicode when set, xml2rfc warns about bare
Unicode in the <t> elements. By default, this is set to False.
```

Because it is very easy for a program to detect (non-ASCII) Unicode, 
there isn't even a need for any new element.
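
To illustrate how little is needed, here is a minimal sketch using only 
Python's standard library. It is not the actual xml2rfc implementation: 
the function names are made up for this example, and "bare" is assumed 
to mean text sitting directly inside a <t> element, outside any child 
element such as <u>.

```python
import xml.etree.ElementTree as ET


def non_ascii_chars(text):
    """Return the non-ASCII characters in text, in order of appearance."""
    return [ch for ch in text if ord(ch) > 0x7F]


def bare_unicode_in_t(xml_source):
    """Return (index, chars) for each <t> whose own text has non-ASCII.

    Hypothetical helper for illustration; 'index' counts <t> elements
    in document order, starting at 0.
    """
    root = ET.fromstring(xml_source)
    findings = []
    for index, t in enumerate(root.iter("t")):
        # Text directly inside <t>: the leading text plus the tail that
        # follows each direct child. Child element content is skipped.
        pieces = [t.text or ""] + [child.tail or "" for child in t]
        found = non_ascii_chars("".join(pieces))
        if found:
            findings.append((index, found))
    return findings


if __name__ == "__main__":
    sample = (
        "<rfc><middle><section>"
        "<t>Plain ASCII text.</t>"
        "<t>A na\u00efve example with <u>\u00fc</u> described.</t>"
        "</section></middle></rfc>"
    )
    for index, chars in bare_unicode_in_t(sample):
        print(f"warning: <t> number {index} contains bare Unicode: {chars}")
```

A real check would also want to report line numbers and to decide how 
to treat text nested more deeply, which this sketch ignores.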

> Doing that and documenting it in the next revision of RFC 7991 seems the sensible thing to do.
> 
> But unconditionally letting everyone add Unicode characters willy-nilly looks to me like a way to be able to say, at some point in the future, that we have no other choice than officially authorizing Unicode everywhere because there are already too many legacy RFCs doing that (a well-known tactic to work around standards).

The addition of the --warn-bare-unicode flag should be enough to show 
that there is no intention to let everyone add Unicode characters 
willy-nilly. I'm sure the RPC knows how to use that flag.

Regards,   Martin.