Re: [rfc-i] getting SVG of RFC diagrams

Carsten Bormann <cabo@tzi.org> Sun, 26 November 2023 19:46 UTC

Return-Path: <cabo@tzi.org>
X-Original-To: rfc-interest@ietfa.amsl.com
Delivered-To: rfc-interest@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id A7DA2C14CF1E for <rfc-interest@ietfa.amsl.com>; Sun, 26 Nov 2023 11:46:01 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.909
X-Spam-Level:
X-Spam-Status: No, score=-1.909 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, RCVD_IN_ZEN_BLOCKED_OPENDNS=0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, T_SCC_BODY_TEXT_LINE=-0.01] autolearn=ham autolearn_force=no
Received: from mail.ietf.org ([50.223.129.194]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 9976JAeVBagw for <rfc-interest@ietfa.amsl.com>; Sun, 26 Nov 2023 11:45:57 -0800 (PST)
Received: from smtp.zfn.uni-bremen.de (smtp.zfn.uni-bremen.de [IPv6:2001:638:708:32::21]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature RSA-PSS (2048 bits) server-digest SHA256) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 956EFC14CEE3 for <rfc-interest@rfc-editor.org>; Sun, 26 Nov 2023 11:45:56 -0800 (PST)
Received: from smtpclient.apple (p548dcbf2.dip0.t-ipconnect.de [84.141.203.242]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by smtp.zfn.uni-bremen.de (Postfix) with ESMTPSA id 4SdfN16RNSzDCbr; Sun, 26 Nov 2023 20:45:53 +0100 (CET)
Content-Type: text/plain; charset="utf-8"
Mime-Version: 1.0 (Mac OS X Mail 16.0 \(3774.200.91.1.1\))
From: Carsten Bormann <cabo@tzi.org>
In-Reply-To: <d9b2052b-8f23-4cbf-99b8-9d2b85892fb4@alum.mit.edu>
Date: Sun, 26 Nov 2023 20:45:43 +0100
Cc: rfc-interest@rfc-editor.org
Content-Transfer-Encoding: quoted-printable
Message-Id: <75157E6C-99B3-4E17-8FFD-57A1D55E0CD4@tzi.org>
References: <310371.1700735021@dyas> <7C81D316-7D04-4941-BEE5-6AE8C96F0DE1@tzi.org> <D1F8F86E-80D7-4FC7-B7B5-32648ACB5D65@tzi.org> <MN2PR11MB3757AF8CCDF388BA5CECFE1AB9B8A@MN2PR11MB3757.namprd11.prod.outlook.com> <153bde12-5773-42ce-bfdf-392e4492c06b@gmx.de> <9D66A3A2-5516-4924-8B1E-420871D862C5@tzi.org> <8373b6b5-4a0a-4abb-8880-d618154b2f81@alum.mit.edu> <1F2DBD33-1553-4BBE-8ADB-115632C8373D@tzi.org> <d9b2052b-8f23-4cbf-99b8-9d2b85892fb4@alum.mit.edu>
To: Paul Kyzivat <pkyzivat@alum.mit.edu>
X-Mailer: Apple Mail (2.3774.200.91.1.1)
Archived-At: <https://mailarchive.ietf.org/arch/msg/rfc-interest/jH4-Bq9NsW-XUQC6v6x70XgHl-c>
Subject: Re: [rfc-i] getting SVG of RFC diagrams
X-BeenThere: rfc-interest@rfc-editor.org
X-Mailman-Version: 2.1.39
Precedence: list
List-Id: "A list for discussion of the RFC series and RFC Editor functions." <rfc-interest.rfc-editor.org>
List-Unsubscribe: <https://mailman.rfc-editor.org/mailman/options/rfc-interest>, <mailto:rfc-interest-request@rfc-editor.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/rfc-interest/>
List-Post: <mailto:rfc-interest@rfc-editor.org>
List-Help: <mailto:rfc-interest-request@rfc-editor.org?subject=help>
List-Subscribe: <https://mailman.rfc-editor.org/mailman/listinfo/rfc-interest>, <mailto:rfc-interest-request@rfc-editor.org?subject=subscribe>
X-List-Received-Date: Sun, 26 Nov 2023 19:46:01 -0000

Hi Paul,

>> I put in a proof of concept at
> 
> This is a great proof of concept. It is very much like I was imagining.

Thank you.

> I'm not sure exactly how you are constructing the names of the parts. But the details of that can be bike shedded as needed.

File names should primarily come from the name= attributes of artwork/sourcecode.
However, few authors fill these in (and some get it wrong, e.g., in RFC 8688).
So, as a fallback, the code uses the slugifiedName= attribute created by the preptool (and, if that hasn’t be performed, it simulates the behavior of the preptool).

Multiple artwork/sourcecode with the same name= are aggregated (concatenated) under that name.
That would be mostly wrong with the guessed filenames from slugifiedName=, so here the extractor appends -b, -c… in case of a collision.
(Note that the slugifiedName= creation appends -2, -3…, so you might find a -2-b or so…)

> Question: are you actually extracting and storing all the pieces as a preprocessing step? Or is this virtual, where the pieces are extracted on demand?

In the PoC, this is done as preprocessing, essentially simply calling (an updated version of) kramdown-rfc-extract-sourcecode.

> If you are using a preprocessing step to extract the pieces, this could be made to work for docs prior to 8650 by using ad hoc techniques to extract the pieces from the .txt file. That could be done gradually on a demand basis. That would however require some sort of review process to ensure that is is done correctly.

I think that would be useful.
I’m not sure about how a review process would work, but people already are using aex and similar tools to extract ABNF, so by focusing on these higher-value targets we could cover 80 % of the needs with 20 % of the work.

(If the review process would be too arduous, we might need to fall back to a community effort.)

Grüße, Carsten