Re: [rfc-i] Poll: RFCs with page numbers (pretty please) ?

Henrik Levkowetz <henrik@levkowetz.com> Tue, 27 October 2020 19:27 UTC

Return-Path: <henrik@levkowetz.com>
X-Original-To: wgchairs@ietfa.amsl.com
Delivered-To: wgchairs@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 14CDC3A154F for <wgchairs@ietfa.amsl.com>; Tue, 27 Oct 2020 12:27:41 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.146
X-Spam-Level:
X-Spam-Status: No, score=-2.146 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, NICE_REPLY_A=-0.247, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=ham autolearn_force=no
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id Rd24To79LDRo for <wgchairs@ietfa.amsl.com>; Tue, 27 Oct 2020 12:27:38 -0700 (PDT)
Received: from zinfandel.tools.ietf.org (zinfandel.tools.ietf.org [64.170.98.42]) (using TLSv1.2 with cipher DHE-RSA-AES128-SHA (128/128 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 883403A1582 for <wgchairs@ietf.org>; Tue, 27 Oct 2020 12:27:38 -0700 (PDT)
Received: from h-202-242.a357.priv.bahnhof.se ([158.174.202.242]:51814 helo=tannat.localdomain) by zinfandel.tools.ietf.org with esmtpsa (TLS1.2:DHE_RSA_AES_128_CBC_SHA1:128) (Exim 4.80) (envelope-from <henrik@levkowetz.com>) id 1kXUd1-0004dV-1m; Tue, 27 Oct 2020 12:27:35 -0700
To: "HANSEN, TONY L" <tony@att.com>, Paul Kyzivat <pkyzivat@alum.mit.edu>, "wgchairs@ietf.org" <wgchairs@ietf.org>
References: <20201026020433.GA19475@faui48f.informatik.uni-erlangen.de> <20201026162814.GP39170@kduck.mit.edu> <20201026164036.GO48111@faui48f.informatik.uni-erlangen.de> <1a56dc3b-56ef-3ffb-a12b-44d5e0d0f835@levkowetz.com> <20201026171931.GP48111@faui48f.informatik.uni-erlangen.de> <b733240-fc78-5a71-8920-ff84fbf64287@iecc.com> <20201026180105.GQ48111@faui48f.informatik.uni-erlangen.de> <03976f9f-7f49-7bf7-ce29-ee989232a44d@gmail.com> <7FA8EF59-5CDE-42B9-A487-520531EEA1F0@juniper.net> <65374aef-e018-7bc8-ce50-d5c0a3982bf7@gmail.com> <DE3C9D6AE8EF94D87936DAE7@PSB> <75918E93-96A2-4C9A-9D60-570E7A0E1B22@ribose.com> <C393B7270B2043C75B6CA7B8@PSB> <EB282B9A-8562-43B5-AC65-31FD2CF64C5D@ribose.com> <3F09EF2A87B102C8666C72D0@PSB> <baa56cf3-bf4b-fbb7-321c-80d3af92373f@ntlworld.com> <9ab56d82-854a-9b33-a3e9-acd537bc1e70@alum.mit.edu> <3125A9F3-B2F5-43FE-B7B6-B4D58FD48308@att.com> <99862761-9654-946e-4d36-8fe6ad73a2c0@levkowetz.com> <083860E2-29EE-4799-96CE-1BD0F201EDE6@att.com>
From: Henrik Levkowetz <henrik@levkowetz.com>
Message-ID: <4b151b62-c40b-6100-f4ab-ecc427f017a5@levkowetz.com>
Date: Tue, 27 Oct 2020 20:27:23 +0100
User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.11; rv:45.0) Gecko/20100101 Thunderbird/45.8.0
MIME-Version: 1.0
In-Reply-To: <083860E2-29EE-4799-96CE-1BD0F201EDE6@att.com>
Content-Type: multipart/signed; micalg="pgp-sha256"; protocol="application/pgp-signature"; boundary="F4fiM7t3nSGMFjbBDkvw6xK46mldE7WeO"
X-SA-Exim-Connect-IP: 158.174.202.242
X-SA-Exim-Rcpt-To: wgchairs@ietf.org, pkyzivat@alum.mit.edu, tony@att.com
X-SA-Exim-Mail-From: henrik@levkowetz.com
Subject: Re: [rfc-i] Poll: RFCs with page numbers (pretty please) ?
X-SA-Exim-Version: 4.2.1 (built Mon, 26 Dec 2011 16:24:06 +0000)
X-SA-Exim-Scanned: Yes (on zinfandel.tools.ietf.org)
X-Clacks-Overhead: GNU Terry Pratchett
Archived-At: <https://mailarchive.ietf.org/arch/msg/wgchairs/lEp3sBH2YZgnUZLV_DMe7nr-xUc>
X-BeenThere: wgchairs@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: Working Group Chairs <wgchairs.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/wgchairs>, <mailto:wgchairs-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/wgchairs/>
List-Post: <mailto:wgchairs@ietf.org>
List-Help: <mailto:wgchairs-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/wgchairs>, <mailto:wgchairs-request@ietf.org?subject=subscribe>
X-List-Received-Date: Tue, 27 Oct 2020 19:27:48 -0000

Hi Tony,

On 2020-10-27 19:53, HANSEN, TONY L wrote:
> I did some quick poking at the HTML&CSS in a recent HTML-formatted RFC. 
> 
> The paragraph pilcrows are represented as an anchor
> 
> <a href="#section-1.2-1.2" class="pilcrow">¶</a>
> 
> This means that the text of the section number itself is not present
> within the output.

Yes, this was the reason I thought a bit more work would be needed :-)

> You would need to have something this instead to
> get the data:
> 
> <a href="#section-1.2-1.2" class="pilcrow"><span
> class="pilcrow-text">1.2-1.2</span>¶</a>
> 
> and then the CSS can be adjusted to display or not display that text
> as appropriate.

Right.

Best,
	Henrik

> 
> Just some thoughts.
> 
> 	Tony
> 
> On 10/27/20, 2:32 PM, "Henrik Levkowetz" <henrik@levkowetz.com> wrote:
> 
>     Hi Tony,
> 
>     On 2020-10-27 19:25, HANSEN, TONY L wrote:
>     > Typical placement for paragraph numbers is within the right margin.
>     > 
>     > RFC 7995 (RFC PDF format) has this recommendation:
>     > 
>     >    Recommendation:  When the XML "editing=yes" option has been chosen,
>     >       show paragraph numbers in the right margin, typeset in a way so as
>     >       to be unobtrusive.  (The right margin instead of the left margin
>     >       prevents the paragraph numbers from being confused with the
>     >       section numbers.)  If possible, the paragraph numbers should be
>     >       coded in such a way that they do not interfere with screen
>     >       readers.
>     > 
>     > RFC 7994 (RFC text format) has this to say about paragraph numbering:
>     > 
>     >    Where practical, the original guidance for the structure of a
>     >    plain-text RFC has been kept (e.g., with line lengths, with lines
>     >    per page [INS2AUTH]).  Other publication formats, such as HTML and
>     >    PDF, will include additional features that will not be present in the
>     >    plain text (e.g., paragraph numbering, typographical emphasis).
>     > 
>     >    The details described in this document are expected to change based
>     >    on experience gained in implementing the new publication toolsets.
>     > 
>     > The HTML format generates paragraph numbering with pilcrows. However,
>     > you have to hover over the pilcrows to see those numbers.
>     > 
>     > Personally, I would support extending support for paragraph numbers
>     > to the text format as well, possibly even enabled using the same
>     > "editing=yes" option.
>     > 
>     > Note: I haven't tried this option to see if it actually works with
>     > the PDF format.
> 
>     It doesn't.  I'm afraid the premise of the xml2rfc SoW, that the PDF version
>     should be generated from the HTML version, resulted in me loosing track of
>     that recommendation in 7995.
> 
>     It might be possible to use a different CSS style for the PDF which turned the
>     pilcrows into right-hand paragraph numbers, but I suspect that a bit more would
>     be required than a CSS tweak.
> 
> 
>     Best regards,
> 
>     	Henrik
> 
>     > 	Tony Hansen
>     > 
>     > On 10/27/20, 11:23 AM, "WGChairs on behalf of Paul Kyzivat" <wgchairs-bounces@ietf.org on behalf of pkyzivat@alum.mit.edu> wrote:
>     > 
>     >     On 10/27/20 7:33 AM, Keith Drage wrote:
>     >     > I do agree that referencing by section numbering is the appropriate method.
>     >     > 
>     >     > However, as you say, it gets difficult with overly long sections, and 
>     >     > this is something on which no guidance or checking is currently given. 
>     >     > Perhaps it should be?
>     >     > 
>     >     > Further there are difficulties with section references if you allow text 
>     >     > between x.y.z and x.y.z.1. Does a reference to x.y.z mean the entirety 
>     >     > of the text including x.y.z.1, or merely the text before x.y.z. Other 
>     >     > organisations restrict such usage in order to make section references 
>     >     > unambiguous.
>     > 
>     >     +1
>     > 
>     >     Sections that contain subsections are commonly long. And not only can 
>     >     x.y.z contain text prior to x.y.z.1, it can also contain text after 
>     >     subsections that is not part of a subsection.
>     > 
>     >     Perhaps the solution is paragraph numbers. But I don't know how you 
>     >     would include them in the txt format.
>     > 
>     >     	Thanks,
>     >     	Paul
>     > 
>     >     > I also note people are referring to the XML version as the definitive 
>     >     > version, but surely the section numbers do not appear until the 
>     >     > presentation version is created?
>     >     > 
>     >     > Keith
>     >     > 
>     >     > On 27/10/2020 01:53, John C Klensin wrote:
>     >     >>
>     >     >> --On Tuesday, October 27, 2020 01:30 +0000 Ronald Tse
>     >     >> <tse@ribose.com> wrote:
>     >     >>
>     >     >>> Thanks John for the clarification. There is some confusion to
>     >     >>> me whether the intention is just about the TXT output having
>     >     >>> page numbers, or for the PDF to also have the same page
>     >     >>> numbers, and whether to use page numbers inside cross
>     >     >>> references. There was a also discussion about a ToC and page
>     >     >>> numbers, but perhaps that was a diversion.
>     >     >>>
>     >     >>> If the discussion is only about the ASCII output having page
>     >     >>> numbers, I have no objection because it is (nearly) purely
>     >     >>> cosmetic (in publication and in usage of the text, being done
>     >     >>> by xml2rfc).
>     >     >>>
>     >     >>> If having page numbers will require the PDF output to also
>     >     >>> have page numbers, this inevitably leads to some shared spec
>     >     >>> between the TXT and PDF outputs on the topic of pagination,
>     >     >>> which is less ideal, but since I assume that is the work of
>     >     >>> xml2rfc, it's not a concern to us as tool maintainers.
>     >     >> Actually, I think you have it a bit backward.  The PDF has page
>     >     >> numbers today.  More to the point, PDF is (almost) inherently a
>     >     >> page image format so there is no way to escape pagination.  One
>     >     >> could decide to not number those pages in the footers, or one
>     >     >> could eliminate the headers and footers entirely, but the page
>     >     >> boundaries are going to be there regardless.
>     >     >>
>     >     >> However, as long as a strict discipline is maintained that
>     >     >> references (within an RFC, between RFCs, and whatever we can
>     >     >> do/encourage about external references to RFCs use are to
>     >     >> section numbers and not pages (and there Brian and I agree)
>     >     >> _and_ as long as we don't allow sections to become so long that
>     >     >> people seek other mechanisms, then whether the page numbers in a
>     >     >> text format agree with those in the PDF format or not is largely
>     >     >> irrelevant.  That issue is centuries old: if I reference a
>     >     >> chapter by number in a book, that reference is typically stable
>     >     >> for different printings and formats (and often but not always
>     >     >> between editions).  But the page numbers often are not, so,
>     >     >> unless the people doing the layout are _really_ careful, using
>     >     >> the index from, e.g., a hardbound copy and trying to apply its
>     >     >> numbers to a paperback that uses different size type and page
>     >     >> layouts is, to use a technical term, just plain dumb.  And
>     >     >> external references that use page numbers need to be very
>     >     >> careful to specify exactly what form is being referred to.
>     >     >>
>     >     >>> Adding page numbers to cross references can make reading
>     >     >>> confusing — since the cross references between the paginated
>     >     >>> and flowed versions will render these references differently.
>     >     >>> It's doable, but again this requirement ties the paginated
>     >     >>> versions (TXT and PDF) together for consistency.
>     >     >> And, again, this is why there has been a long-term prohibition
>     >     >> in the RFC Series against using page numbers in cross references.
>     >     >>
>     >     >>> Of course, if the PDF output is simply an enhanced PDF-ized
>     >     >>> TXT version, these aren't really issues.
>     >     >> But two of several advantages of the contemporary (xml2rfc v3)
>     >     >> PDF version is that it can contain and render pictures easily
>     >     >> and that, if characters are used outside the ASCII repertoire,
>     >     >> they are rendered correctly as well.
>     >     >>
>     >     >> best,
>     >     >>      john
>     >     >>
>     >     > 
>     > 
>     > 
> 
>