[Tools-discuss] [Cellar] non-ascii characters (fwd) Dave Rice: [Cellar] non-ascii characters

Michael Richardson <mcr+ietf@sandelman.ca> Tue, 27 August 2019 18:29 UTC

Return-Path: <mcr+ietf@sandelman.ca>
X-Original-To: tools-discuss@ietfa.amsl.com
Delivered-To: tools-discuss@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 880C7120825; Tue, 27 Aug 2019 11:29:16 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -4.199
X-Spam-Level:
X-Spam-Status: No, score=-4.199 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, RCVD_IN_DNSWL_MED=-2.3, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=ham autolearn_force=no
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id TdEjVKXArDmT; Tue, 27 Aug 2019 11:29:14 -0700 (PDT)
Received: from tuna.sandelman.ca (tuna.sandelman.ca [IPv6:2607:f0b0:f:3:216:3eff:fe7c:d1f3]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 3E7B01201B7; Tue, 27 Aug 2019 11:29:13 -0700 (PDT)
Received: from sandelman.ca (obiwan.sandelman.ca [IPv6:2607:f0b0:f:2::247]) by tuna.sandelman.ca (Postfix) with ESMTP id 1BA243818D; Tue, 27 Aug 2019 14:28:05 -0400 (EDT)
Received: from localhost (localhost [IPv6:::1]) by sandelman.ca (Postfix) with ESMTP id CF44889; Tue, 27 Aug 2019 14:29:12 -0400 (EDT)
From: Michael Richardson <mcr+ietf@sandelman.ca>
To: tools-discuss@ietf.org, cellar@ietf.org
X-Mailer: MH-E 8.6; nmh 1.7+dev; GNU Emacs 24.5.1
X-Face: $\n1pF)h^`}$H>Hk{L"x@)JS7<%Az}5RyS@k9X%29-lHB$Ti.V>2bi.~ehC0; <'$9xN5Ub# z!G,p`nR&p7Fz@^UXIn156S8.~^@MJ*mMsD7=QFeq%AL4m<nPbLgmtKK-5dC@#:k
MIME-Version: 1.0
Content-Type: multipart/signed; boundary="=-=-="; micalg="pgp-sha256"; protocol="application/pgp-signature"
Date: Tue, 27 Aug 2019 14:29:12 -0400
Message-ID: <31591.1566930552@localhost>
Archived-At: <https://mailarchive.ietf.org/arch/msg/tools-discuss/z6d-6Skw3f-BTHnRbuJoigAQztE>
Subject: [Tools-discuss] [Cellar] non-ascii characters (fwd) Dave Rice: [Cellar] non-ascii characters
X-BeenThere: tools-discuss@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: IETF Tools Discussion <tools-discuss.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/tools-discuss>, <mailto:tools-discuss-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/tools-discuss/>
List-Post: <mailto:tools-discuss@ietf.org>
List-Help: <mailto:tools-discuss-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/tools-discuss>, <mailto:tools-discuss-request@ietf.org?subject=subscribe>
X-List-Received-Date: Tue, 27 Aug 2019 18:29:17 -0000

reposting to tools-discuss for comments.
I suspect that the idnits tool may be in error in this case.

We are also using xml-v3 for ffv1, since it is full of math which gets
rendered to SVG.  

From: Dave Rice <dave@dericed.com>
Date: Tue, 27 Aug 2019 10:24:41 -0400
To: Codec Encoding for LossLess Archiving and Realtime transmission
 <cellar@ietf.org>

Hi cellar,

I’m doing an idnits review on the latest ffv1 draft and am confused about the
IETF rules for non-ascii characters in txt rfc drafts. The update at
https://tools.ietf.org/html/rfc7997 <https://tools.ietf.org/html/rfc7997>
seems to allow the RFC Editor to use some discretion in using non-ascii
characters, but idnits <https://tools.ietf.org/tools/idnits/> still complains
about that. One instance is using a Unicode minus sign to be more
semantically clear than an ascii hyphen. Also since the Unicode minus sign
uses more bytes than the ascii hyphen, idnits will say the line is too long,
even though in appearance the line displays within the correct width. 

In the RFC for ffv1, we’ve used unicode symbols for floor and ceiling
functions, such as ⌊a⌋ and ⌈a⌉, but rfc2xml transforms these into their
hexadecimal expressions such as &#8970;a&#8971; and &#8968;a&#8969;. Any
advice, should we listen to idnits and avoid non-ascii characters or listen
to rfc7997 and use them when it makes the content more clear? 

Kind Regards,
Dave Rice

-- 
Michael Richardson <mcr+IETF@sandelman.ca>, Sandelman Software Works
 -= IPv6 IoT consulting =-