Re: [Rfc-markdown] [rfc-i] [Tools-discuss] Tool to convert TXT to RFCXML

Martin Thomson <mt@lowentropy.net> Mon, 17 July 2023 11:11 UTC

Return-Path: <mt@lowentropy.net>
X-Original-To: rfc-markdown@ietfa.amsl.com
Delivered-To: rfc-markdown@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 8B640C151533; Mon, 17 Jul 2023 04:11:20 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -7.096
X-Spam-Level:
X-Spam-Status: No, score=-7.096 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, RCVD_IN_DNSWL_HI=-5, RCVD_IN_MSPIKE_H3=0.001, RCVD_IN_MSPIKE_WL=0.001, RCVD_IN_ZEN_BLOCKED_OPENDNS=0.001, SPF_PASS=-0.001, URIBL_DBL_BLOCKED_OPENDNS=0.001, URIBL_ZEN_BLOCKED_OPENDNS=0.001] autolearn=unavailable autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=lowentropy.net header.b="KRmPXeFz"; dkim=pass (2048-bit key) header.d=messagingengine.com header.b="yvkveCvg"
Received: from mail.ietf.org ([50.223.129.194]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id YtPkL2wCPxFF; Mon, 17 Jul 2023 04:11:15 -0700 (PDT)
Received: from out1-smtp.messagingengine.com (out1-smtp.messagingengine.com [66.111.4.25]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature RSA-PSS (2048 bits) server-digest SHA256) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id E6A23C1516E9; Mon, 17 Jul 2023 04:11:14 -0700 (PDT)
Received: from compute6.internal (compute6.nyi.internal [10.202.2.47]) by mailout.nyi.internal (Postfix) with ESMTP id 8BCBE5C0087; Mon, 17 Jul 2023 07:11:13 -0400 (EDT)
Received: from imap41 ([10.202.2.91]) by compute6.internal (MEProxy); Mon, 17 Jul 2023 07:11:13 -0400
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=lowentropy.net; h=cc:cc:content-type:content-type:date:date:from:from :in-reply-to:in-reply-to:message-id:mime-version:references :reply-to:sender:subject:subject:to:to; s=fm1; t=1689592273; x= 1689678673; bh=g9hMgktxIk/qA5piBvqI896WpT+g2jbLm7e6oFAH8Qs=; b=K RmPXeFzLOdm53pxChtwROxrzCG2uJ4A+GuYmzro2W/dgjwvkc2Q3QY1mQrGCAvQ6 NUot2n/NzPpHXzgUr4Vc12tgBJwG8tEO0Hx22f6wDkzSLe8iXzpzl1Ut42waYPfD LFme20J5YDGlCkXmDVONaA19xWlUBxwf6LtRRxHoU7RnKlVH+hx/k5PJVKpOsKSF BpscAwFGBB3KSbKRQcC49dMxOLb8r/WyjAvoncjnv8w/pkC2RWMBJyecwbYSDYGl huV/D5+jwxfhAIVB/LbafpvBk7hfoMUhbAR2OLsc5cO2zoMUiRubfHVDuPfBZFO+ xT9aME26GG7dQbZfaj6BQ==
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d= messagingengine.com; h=cc:cc:content-type:content-type:date:date :feedback-id:feedback-id:from:from:in-reply-to:in-reply-to :message-id:mime-version:references:reply-to:sender:subject :subject:to:to:x-me-proxy:x-me-proxy:x-me-sender:x-me-sender :x-sasl-enc; s=fm3; t=1689592273; x=1689678673; bh=g9hMgktxIk/qA 5piBvqI896WpT+g2jbLm7e6oFAH8Qs=; b=yvkveCvg1GLbuoLcoHcV2fK0o3WDi fKtBvwfgJQsz78MFLw7RfMjnz5lX9fQJBNLy6mRAgYz8eo9rl2xMe37NyXmkyz6B sc9co633wFZDfg+6+FaPMuRk6sJC5pXMgGJFGM93ypTB4ZDIgiFzMmsEWXgcljjV fYBfzgh+4z9D/9cruXr227dRdgEPjtqbiKNVjo8QGptbT9sgPTQ77JYi0kgv8Djt 2wtBirkeCRWA86t0AvUSsMbU+5HywR0BGQQput/Kl0sn4xh7qmyrEtTp+x/tzW34 WZNta2WhdinGg7T87RNeowtE4f2zBTRwJ6ocH8zxGb447AIku7ct08Otw==
X-ME-Sender: <xms:0SG1ZGdzL_ohdmSYMz3R94jZbQRx33hW-1VWMfmOYs7al71HEQBcsg> <xme:0SG1ZAPJzCTWVK_8o7oKU08tCbOofKHyvxK6RyX_ZiEeNVLM2ddkXARo5jhrAhneq aEiF3o43vEmLKPNJAA>
X-ME-Proxy-Cause: gggruggvucftvghtrhhoucdtuddrgedviedrgedvgdefjecutefuodetggdotefrodftvf curfhrohhfihhlvgemucfhrghsthforghilhdpqfgfvfdpuffrtefokffrpgfnqfghnecu uegrihhlohhuthemuceftddtnecusecvtfgvtghiphhivghnthhsucdlqddutddtmdenuc fjughrpefofgggkfgjfhffhffvvefutgesthdtredtreertdenucfhrhhomhepfdforghr thhinhcuvfhhohhmshhonhdfuceomhhtsehlohifvghnthhrohhphidrnhgvtheqnecugg ftrfgrthhtvghrnhepudelueeftdfhgeeiieeikeekjedvjefgveduffegfedvffelveef keduieeikeelnecuvehluhhsthgvrhfuihiivgeptdenucfrrghrrghmpehmrghilhhfrh homhepmhhtsehlohifvghnthhrohhphidrnhgvth
X-ME-Proxy: <xmx:0SG1ZHig7ocPNKj7vOA1_Cn8Rbp32RBeZJ6IXJphc6MTRzcRTA5eRg> <xmx:0SG1ZD-PSYgthbr2DSC9oRJAL0mutpn26FzrJTYXbj18Vq_sP5SG8g> <xmx:0SG1ZCtcXAr_MCKi-bZVgJZN7Pxo2Y-vzzWyuGHsH3Pd6G6ttk968g> <xmx:0SG1ZL6jKHe7zAUvwVOdOJA8SVajqsRYcF4kDVIN79k7Ia4dLso9jw>
Feedback-ID: ic129442d:Fastmail
Received: by mailuser.nyi.internal (Postfix, from userid 501) id 47E37234007B; Mon, 17 Jul 2023 07:11:13 -0400 (EDT)
X-Mailer: MessagingEngine.com Webmail Interface
User-Agent: Cyrus-JMAP/3.9.0-alpha0-531-gfdfa13a06d-fm-20230703.001-gfdfa13a0
Mime-Version: 1.0
Message-Id: <22b0c566-30b1-467b-ae87-5ce1ab3f5a44@app.fastmail.com>
In-Reply-To: <0f1ee0fa-e65c-097f-73e0-288f279c7c49@gmx.de>
References: <CABXxEz97ZDeHhtMeX6CwX842d=s9CXfUtG5DFWpxNKbtBcoW6Q@mail.gmail.com> <CAD2=Z86=DyfT0Fp23DqdCyz32Od4uhAhA48K=pst64eZ+CS8NA@mail.gmail.com> <CABXxEz-KC5Nayv=KMce9mDi_ngu9J8PY1-pAbMUBypxEX2Pw8Q@mail.gmail.com> <430B62BB-45BB-4ECF-812F-39E84926CFA7@tzi.org> <CABXxEz-_6MV0P9sc=GBJw+eReFpLZC2vDFFvo9katc1ECwpyRw@mail.gmail.com> <9f95474b-c049-2e5a-1cfd-f3e9ebf48e2d@gmx.de> <fe96fb1a-1c56-4416-b937-b0b66c38ae71@betaapp.fastmail.com> <66ef1749-f4b1-51af-8e08-06eb58a9005a@gmx.de> <869922F5-DA96-40DB-A749-32E801A1A632@tzi.org> <d76f7ddf-46ae-4046-a1e0-9499da86e700@betaapp.fastmail.com> <E960579F-0750-471B-8729-8A59E89735D5@ietf.org> <0f1ee0fa-e65c-097f-73e0-288f279c7c49@gmx.de>
Date: Mon, 17 Jul 2023 04:10:53 -0700
From: Martin Thomson <mt@lowentropy.net>
To: "Julian F. Reschke" <julian.reschke@gmx.de>, IETF Executive Director <exec-director@ietf.org>
Cc: Carsten Bormann <cabo@tzi.org>, rfc-markdown@ietf.org, RFC Interest <rfc-interest@rfc-editor.org>
Content-Type: text/plain
Archived-At: <https://mailarchive.ietf.org/arch/msg/rfc-markdown/NmNaz5xKwA85QZ-RrRWiubcOBlc>
Subject: Re: [Rfc-markdown] [rfc-i] [Tools-discuss] Tool to convert TXT to RFCXML
X-BeenThere: rfc-markdown@ietf.org
X-Mailman-Version: 2.1.39
Precedence: list
List-Id: "rfc-markdown is a discussion list for people writing I-Ds and RFCs in Markdown and the authors of the tools used for that." <rfc-markdown.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/rfc-markdown>, <mailto:rfc-markdown-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/rfc-markdown/>
List-Post: <mailto:rfc-markdown@ietf.org>
List-Help: <mailto:rfc-markdown-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/rfc-markdown>, <mailto:rfc-markdown-request@ietf.org?subject=subscribe>
X-List-Received-Date: Mon, 17 Jul 2023 11:11:20 -0000

On Mon, Jul 17, 2023, at 02:45, Julian Reschke wrote:
> I agree partly.

Same.

> The trouble with that is that even a rule like "do not insert LF into
> terms" may not be always desirable. For instance, jn HTTP we have
> several longish header field names, and I would not want them to be not
> broken across lines.

Yep, and there are lots of those examples.  Sometimes you want line breaking and other times not.  The "don't break me" ultimately ends up needing some amount of authorial control, which is sometimes orthogonal to any semantic distinction we might invent.  For instance, a blanket rule for HTTP fields seems wise... up until the point when you are introduced to all the terrible CORS names, which are so long that avoiding wrapping creates a worse outcome than just letting them flop around all over the place.

I have a real soft spot for purity as far as it relates to the split between semantics and presentation.  CSS takes a pretty good swing at this.  But in practice, this line never holds firm.  For instance, when authoring HTML you always have a bunch of <span or <div elements that exist only so that you can hang styling rules on them.  I've seen a lot of attempts at a clean split and the discipline just never lasts.

So I tend to think that we need to identify those things that we know we need to make things work properly.  Acknowledge that this isn't a pure stance, but a pragmatic one.  We won't get it right always, but we should probably stop the ad hoc use of Unicode and edge closer to giving authors hinting tools.

I'd start with line breaking.  Disabling line breaking is an extremely common request[1].  

The rest of the list is pretty short: Soft breaks would also be a nice feature (so that you can break X_Y_Z at underscores to keep wide tables tight).  Emphasis/monospace is also something I'd want to fix (Carsten's <tt bugbear).  Widow/orphan control would be nice, but I have less of a good handle on that (and the need is diminished given our current output format status) so it can probably sit unattended for a good long time.

--Martin

[1] Almost 100%, even: the BCP 14 boilerplate includes a non-breaking space between the "BCP" and the "14" in most RFCs, which I don't like.  But "BCP 14" would otherwise wrap in the text output and that is not cool.