Re: [art] New RFCs text formatting

John C Klensin <john-ietf@jck.com> Thu, 28 November 2019 16:33 UTC

Return-Path: <john-ietf@jck.com>
X-Original-To: ietf@ietfa.amsl.com
Delivered-To: ietf@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 028AA12089D; Thu, 28 Nov 2019 08:33:12 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.897
X-Spam-Level:
X-Spam-Status: No, score=-1.897 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, SPF_HELO_NONE=0.001, SPF_NONE=0.001, URIBL_BLOCKED=0.001] autolearn=ham autolearn_force=no
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 4ClKjM3LKWu5; Thu, 28 Nov 2019 08:33:10 -0800 (PST)
Received: from bsa2.jck.com (bsa2.jck.com [70.88.254.51]) (using TLSv1 with cipher DHE-RSA-AES256-SHA (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 05D79120859; Thu, 28 Nov 2019 08:33:10 -0800 (PST)
Received: from [198.252.137.10] (helo=PSB) by bsa2.jck.com with esmtp (Exim 4.82 (FreeBSD)) (envelope-from <john-ietf@jck.com>) id 1iaMj5-000DLQ-9b; Thu, 28 Nov 2019 11:33:07 -0500
Date: Thu, 28 Nov 2019 11:33:01 -0500
From: John C Klensin <john-ietf@jck.com>
To: Steffen Nurpmeso <steffen@sdaoden.eu>, ietf@ietf.org
cc: art@ietf.org, rfc-editor@rfc-editor.org
Subject: Re: [art] New RFCs text formatting
Message-ID: <08EE9B7B7C15D8F8B5DE6AF5@PSB>
In-Reply-To: <20191128005000.yMa3P%steffen@sdaoden.eu>
References: <20191127233129.9829F%steffen@sdaoden.eu> <db882e3c-d3fb-4742-8456-7b400225ecce@www.fastmail.com> <20191128005000.yMa3P%steffen@sdaoden.eu>
X-Mailer: Mulberry/4.0.8 (Win32)
MIME-Version: 1.0
Content-Type: text/plain; charset="us-ascii"
Content-Transfer-Encoding: 7bit
Content-Disposition: inline
X-SA-Exim-Connect-IP: 198.252.137.10
X-SA-Exim-Mail-From: john-ietf@jck.com
X-SA-Exim-Scanned: No (on bsa2.jck.com); SAEximRunCond expanded to false
Archived-At: <https://mailarchive.ietf.org/arch/msg/ietf/OTmG0_5rgEAahdqRb0qb2-PDBug>
X-BeenThere: ietf@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: IETF-Discussion <ietf.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/ietf>, <mailto:ietf-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/ietf/>
List-Post: <mailto:ietf@ietf.org>
List-Help: <mailto:ietf-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/ietf>, <mailto:ietf-request@ietf.org?subject=subscribe>
X-List-Received-Date: Thu, 28 Nov 2019 16:33:12 -0000

(copying the IETF list -- this is not an ART matter alone)

TL;DL warning: while not terribly long, this is a rant.

--On Thursday, November 28, 2019 01:50 +0100 Steffen Nurpmeso
<steffen@sdaoden.eu> wrote:

>...
> Yah, hm, hm.  It surely has merits, both HTML as well as PDF.
> I have a local RFC pool since, pooh, maybe 2003 or so.  The
> german computer magazine c't once put all (?) of them on one
> of their CDs by then, and i am only throwing onto that stock
> ever since. I personally am almost perfectly served with plain
> grep(1) on plain text.  Actually i do not even really care
> about missing form feeds, i was only surprised.

Hi.  I realize it is very late, probably too late, to be
complaining about this, but my clear recollection was that, when
the v3 project was started, the community was promised that,
while the authoritative/ canonical/ archival format would shift
to the XML and fully-functional HTML and PDF format (not just
HTML and PDF produced from plain text forms) would become
available, the plain text format would be preserved sufficiently
that tools [1] that were designed for, and used with, RFCs for
the last 40-odd years would continue to work and work equally
well [2].  If page breaks, running headers and footers, and TOCs
and indices with page references have disappeared, some of those
tools will cease working (even if grep still does) and that
violates that particular commitment.  If there were an xml2rfc
v3 option that was guaranteed to be supported and that would
generate the older format, even if that format were never stored
on IETF servers, I think that would be sufficient to let us (and
our tools) keep working, but that does not seem to be the case
at present (nor does it address any TOC or index issues).[3]

This is especially important because at least some of us who
prefer the text form when working with RFCs (even if some of us
might prefer something else for reading them), after losing the
argument about whether XML (or any other generic or formatting
markup arrangement) was appropriate as the canonical/archival
form essentially tuned out of the discussions of xml2rfc v3
details and decided to devote our available IETF time to
technical standards and other work that we felt had impact on
the Internet rather than just on the IETF and how it operated.
The tuning out was reinforced as it became clear that xml2rfc v3
was evolving into a odd hybrid between generic and formatting
markup, one that was as or more likely to share the disadvantage
of each rather than the advantages and that, in terms we use
when talking about networking, raises all of the issues of layer
violations.  So we didn't read the reports and documents
carefully; we just assumed that, as long as the plain text
format was stable (pagination and all), we would continue to be
able to work without having to spend time converting habits and
tools.

Perhaps what all of this means is that I (and a few others) are
just old and stuck in our ways and that the IETF would be better
off if we just retired or otherwise moved on.   The "old" part
is certainly true for some of us but I'm not convinced about the
"better off" part.  However, by requiring us to develop new
tools or learn new tricks, this change may have the effect of
raising the bar for effective participation in the IETF enough
to push us out.   That is especially likely when we need to
review and revise older documents.  Personally, I think that
pushing anyone out who is willing to do work and might have
useful contributions to make is a bad idea.

So, even if it is too late to reconsider these decisions, I hope
we can at least be aware of possible negative side effects.

best,
   john


[1] The concern is about user-developed tools, used by one
person or a cluster of people who have shared them, not ones
developed or maintained under IETF auspices.

[2] Reading RFC 7994 only after these changes were identified, I
find that Sections 4.1ff remove headers and footers and the page
numbers from the TOC.  But the introductory paragraph of Section
4 does say 

	"One plain-text output will be created during the
	publication process with basic pagination that includes
	a form feed instruction every 58 lines at most,
	including blank lines."

So perhaps that omission is a bug.


[3] The last argument for removing pagination in Section 2.1.3
of RFC 6949 is almost completely bogus.   The problems with
unbroken sections that span several pages, including
difficulties with references to material in those sections, was
noticed not long after RFC 1543 was published with its specific
comments about cross-references to sections and not pages, IIR
much earlier.  There was discussion even before then about
insisting on numbered paragraphs, an idea that was abandoned in
large measure because of difficulties with document preparation
and tooling.  For some years, the RFC Editor pushed back
aggressively or (or simply modified) such very long sections,
especially when there were cross-references to them (by either
section number or page number).  Later, for reasons unrelated to
this rant, the pushback became much less frequent.  The problem
is over-long sections; the solution is to manage RFC Style
decisions (during final editing if not earlier) so that they
don't happen, not to pretend that eliminating page numbers will
eliminate those sections by precluding the possibility of using
page number for more granular references.