Re: [Rswg] A citation as a substantive [was draft-irse-xml2rfc-changes...]

Julian Reschke <julian.reschke@gmx.de> Wed, 28 September 2022 19:03 UTC

Return-Path: <julian.reschke@gmx.de>
X-Original-To: rswg@ietfa.amsl.com
Delivered-To: rswg@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 4C6EFC159826 for <rswg@ietfa.amsl.com>; Wed, 28 Sep 2022 12:03:00 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -6.906
X-Spam-Level:
X-Spam-Status: No, score=-6.906 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, FREEMAIL_FROM=0.001, NICE_REPLY_A=-0.001, RCVD_IN_DNSWL_HI=-5, RCVD_IN_ZEN_BLOCKED_OPENDNS=0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, T_SCC_BODY_TEXT_LINE=-0.01, URIBL_BLOCKED=0.001, URIBL_DBL_BLOCKED_OPENDNS=0.001, URIBL_ZEN_BLOCKED_OPENDNS=0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (1024-bit key) header.d=gmx.net
Received: from mail.ietf.org ([50.223.129.194]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id A192Yft1IZwJ for <rswg@ietfa.amsl.com>; Wed, 28 Sep 2022 12:02:56 -0700 (PDT)
Received: from mout.gmx.net (mout.gmx.net [212.227.17.22]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange ECDHE (P-256) server-signature RSA-PSS (2048 bits) server-digest SHA256) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 75282C159A30 for <rswg@rfc-editor.org>; Wed, 28 Sep 2022 12:01:58 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=gmx.net; s=badeba3b8450; t=1664391708; bh=XtrjXgT7zrO385pgS9Tc89t3HIuFCI7nzIZeJpLV6oI=; h=X-UI-Sender-Class:Date:Subject:To:References:From:In-Reply-To; b=DiUfyXdRU0sUAnOzdDsUZ8CWhYnhO0AqHDwyAZD9c2HdNAy9G3ANuK78xELC7+4P1 FcPxJoVWWfXjnI4HZoWuekgCmJF18WpZyhRWXsk+Rpibpum9itf8Q6PkruteavZgIb gqaRrMILsH9AkMPYZqIjrig6jBG0vmzyOh94PNBY=
X-UI-Sender-Class: 01bb95c1-4bf8-414a-932a-4f6e2808ef9c
Received: from [192.168.178.20] ([91.61.55.162]) by mail.gmx.net (mrgmx104 [212.227.17.168]) with ESMTPSA (Nemesis) id 1N3bX1-1pMP9D1Zp5-010eGJ; Wed, 28 Sep 2022 21:01:48 +0200
Message-ID: <415f4ca0-723b-e73e-9321-30348f64955e@gmx.de>
Date: Wed, 28 Sep 2022 21:01:45 +0200
MIME-Version: 1.0
User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:91.0) Gecko/20100101 Thunderbird/91.13.1
To: John C Klensin <john-ietf@jck.com>, rswg@rfc-editor.org
References: <interim-rfc-series-project-manager/draft-irse-xml2rfc-changes/issues/1@github.com> <interim-rfc-series-project-manager/draft-irse-xml2rfc-changes/is sues/1/1259679973@github.com> <YzNE5EIA7HI0WsEk@faui48e.informatik.uni-erlangen.de> <0276E311EA9F8008B13ABEBE@PSB> <4ee10c7a-506c-3799-a726-6f9147e4b58c@gmail.com> <035f585b-d96e-ac89-7964-2ae0f2cb56cf@cs.tcd.ie> <91e44938-8549-4acc-9be8-85c74603552a@amsl.com> <1AFA9E0F699E174E121DF124@PSB> <021ea908-85e6-a3eb-a539-4afb603ace14@gmx.de> <B0350AE53001A41BEF7C0114@PSB>
From: Julian Reschke <julian.reschke@gmx.de>
In-Reply-To: <B0350AE53001A41BEF7C0114@PSB>
Content-Type: text/plain; charset="UTF-8"; format="flowed"
Content-Transfer-Encoding: quoted-printable
X-Provags-ID: V03:K1:+srFhAy0rRnG9A23u7HGVgg5BIHbeogXaIYPCyDUcumKi2c89H+ l/QGi1yRCJZzeKEFSC+zQfJ6JQBv62I6ieC4PMGlUavhV4Yu3Qvlx4lzPXZMpoD+uqwFclW VhrQG+/rSqGwaOUqBCnoMwdTxCN8r7vB52IKoBuSh4wtLBmbN/FzLaH7lQhnvfwlki8GrE2 aPGc6BFTAOtNRWoJr4Prw==
X-UI-Out-Filterresults: notjunk:1;V03:K0:+56239If6vQ=:C4qfirL9vO4UAdu6catn6y YCL1m64yAY2Kfnxc375QtDWmSoJN2cmyv8rQKxOdUHQ5Sqttr9Y0NG3bATctKySBSsFlAm/RY S/7p2LEsGcOH5EdFQK8NM0lO9lKkLgJlq4iu3B8OADY/31JkDtOb2GPFNU2D4/Vu4bjumV6fn yaD2D1oqzExUpyceXdS3E4z7KV+N0OO/LXAgVUcHMZtCASqA4GtymT2QsDi8AZauOZjduprwJ ib3Qg52XiHsPZK8bxYrcnJp5Q8+xRh23KrOIHim4FTaw6sSmxjmwcLBpNLRH7KJ95dLjyMGxj ia1JBCjvX2Uo6DZluCk4y44xEUc3bErCCquWx0bxkvtZGGrmWG0LmTLuBJKj/BFDpBxB7goMz Y8H0G2zOkMThMsdLH/2l4pacUrwvEj5WLrrgNm6paeuNmjlqnuPeZtCtsI4FIY/o3nKk2issr u8NkbddNGirnD+zEMRMCIKEAVAePXikJYeP1QrdLY5Qwvs9HubUEBwNr3Hx9f6ngoxWSNAuti M9NoFXOiVb++quthlY6TjQAAP6ugg8IheXetSuVWu0arDmxMg76+h78SBuPuCC/AGO6Ajs6bE YupE/igwm6RANZSYLtofG6zfVTDnd+qsnHvP9irb/XhcqQARSdQTLUBChWXdbkOKKiEDPQc89 6Q5iBB4vH5bpUe+zu1rTt/WEogTifIHraT7J0BYFZAfoX43MCokOh4jfNtVY6o2cibHgo07mo pHqhZ0Y847ZrrAhjIPOO5YF1fwPGKtW0cUlbb0R087cliipMytLRatKMZGFwYV4G8rtB6CDlr yhT2PZXtvY56xk/SeJwv+Vz76umUEKMrIEn1aLAfhOyIu83N5QQ0gS78lAjSXU4PlSviNmk4J 1qA+3ClYCLe67mME2OiOzO79T1HuDWwtyy8oHtEVPuNtMKfeFZijPNIa5przkBV5Pctn+whI5 VpyIFhZhD6WxzCeu49jHIhboCJPfW+BoJIIELghyisZ8vjW0JIjVqgKMkYShNmuuVZGTC31nw JgMuBFn/Us19aDKGrYu2kVxLasmf3Th/1IAjnoyVair9XC25ZHKZ3ocigU8ul7XBNcLxFHQug /XOIaGhyjxKO1/KdGoGrIs+00tz82kUkeOoM4/vryCGEWkj+7t5jq7UFkNGlKK2j/drj5Qd+O l4fzuN5c2N8QxJ3x8grccfw3GS
Archived-At: <https://mailarchive.ietf.org/arch/msg/rswg/LMhUMhi3HwOgOPWPUc-qMWHkT7E>
Subject: Re: [Rswg] A citation as a substantive [was draft-irse-xml2rfc-changes...]
X-BeenThere: rswg@rfc-editor.org
X-Mailman-Version: 2.1.39
Precedence: list
List-Id: "RFC Series Working Group \(RSWG\)" <rswg.rfc-editor.org>
List-Unsubscribe: <https://mailman.rfc-editor.org/mailman/options/rswg>, <mailto:rswg-request@rfc-editor.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/rswg/>
List-Post: <mailto:rswg@rfc-editor.org>
List-Help: <mailto:rswg-request@rfc-editor.org?subject=help>
List-Subscribe: <https://mailman.rfc-editor.org/mailman/listinfo/rswg>, <mailto:rswg-request@rfc-editor.org?subject=subscribe>
X-List-Received-Date: Wed, 28 Sep 2022 19:03:00 -0000

Am 28.09.2022 um 19:40 schrieb John C Klensin:
> Response to Julian, in probably more detail than he wished for,
> inline but:
>
> (a) Question for the co-chairs: How deeply into this swamp do
> you consider it worthwhile to descend at this point?  Should we
> pose the issues and wait for a recommendation for the RPC and/or
> RSCE as suggested/explained below or continue to have at it?
>
> (b) Note to John Levine: Discussed in more detail inline but, in
> looking at draft-irse-draft-irse-xml2rfcv3-implemented-03 (and
> its RFC Editor Function predecessor), it appears that a least
> some of the new/added attributes ("section" for <xref> in this
> case) lack definitions of what is permitted in their values.
> Clearly some definition has been implemented so, as your
> document continues to evolve, it would probably be worth
> tracking those attributes down and completing the definitions.
>
> Now for the more substantive issues...
>
> --On Wednesday, September 28, 2022 17:52 +0200 Julian Reschke
> <julian.reschke@gmx.de> wrote:
>
>> Am 28.09.2022 um 17:21 schrieb John C Klensin:
>>>
>>>
>>> --On Wednesday, September 28, 2022 08:35 -0500 Jean Mahoney
>>> <jmahoney@amsl.com> wrote:
>>>
>>>>> I don't object to guidance as to better phrasing at all,
>>>>> but imposing such as a requirement seems a bit silly when
>>>>> the meaning is clear.
>>>>
>>>> [JM]  The current guidance for in-text citations can be
>>>> found in the RECOMMENDED section of the Web Portion of the
>>>> Style Guide. Both styles are okay:
>>>> https://www.rfc-editor.org/styleguide/part2/#citation_usage
>>>
>>> Jean,
>>>
>>> My apologies to you, and everyone else, for mentioning that
>>> issue in my note because it seems to have distracted from the
>>> more important question, one that I do not believe is
>>> addressed in the Style Guide.  That is whether, if one needs
>>> to cite, e.g. (and simplifying a bit from Toerless's
>>> example), two different sections of the same document,
>>> whether you have guidance about whether to use some variation
>>> on
>>>
>>> 	The background for the current Nomcom selection criteria
>>> 	[RFC8989] Section 1,...
>>>
>>> and
>>>
>>> 	The specific, experimental Nomcom criteria [RFC8989]
>>> 	Section 4...
>>>
>>> or whether to embed the section number in the references so
>>> one would have simply "[RFC8989a]" and "[RFC8989b]" in the
>>> running text.  That second form is probably more friendly to
>>> both Toerless's concerns about very complex within-document
>>> citations and to HTML formats, where an author could actually
>>> specify a link of
>>> <https://www.rfc-editor.org/info/rfc8989#Section1> if that
>>> anchor were supported in the text (it is not for RFCs, but
>>> could easily be available for other types of documents).
>>>
>>>
>>> As far as I can tell, the current RFCXML vocabulary is not
>>> particularly friendly to either.  For the first form and itsy
>>> 	"[RFC8989 Section 1]" to "[RFC8989] Section 1"
>>> because the former would make it more clear that "Section 1"
>>> is part of the citation and not running text.   However, I
>>> cannot write
>>>      <xref target="RFC8959" citationDetail="Section 1"/>
>>> or the equivalent, which I presume would be needed to produce
>>> it.
>>
>> You could write
>>
>>     <xref target="RFC8959" section="1"/>
>>
>> This will produce
>>
>>     Section 1 of [RFC8959]
>>
>> with both parts nicely hyperlinked in formats that support
>> linking.
>>
>>   > ...
>
> Noting that a "section" attribute does not appear in RFC 7991, I
> obviously have not been tracking the vocabulary closely enough,
> instead prioritizing actually getting documents written and,
> apparently, reading this list, for which I guess I should
> apologize.  However:

RFC 7991 has "<relref>". Nobody liked that, and thus we have an
extension on <xref> instead.

> While the above works for "an RFC or Internet-Draft in the v3
> format", it is an error for some other sort of document unless
> "relative" is also specified (Section 2.66.4 of
> [id.draft-iab-rfc7991bis-04] and Section 2.66.3 of
> [id.draft-iab-rfc7991bis-04])
>
> The above illustrates several of the points I (and I think
> Toerless) were trying to ask Jean about.
>
> (1) For readability, the brackets should really be outside the
> entire citation, because "Section 1" is actually part of it.

Sounds you would like "[Section x of RFCYYYY]"? That's not common
practice in RFCs and also not the preferred format as stated by the RFC
Editor (I believe).

Yes, we can discuss this. xml2rfc happens to support was has been widely
used before.

> For the more complicated case I show above, the parentheses are
> needed to prevent things from becoming hopelessly confused.
> Maybe the two citations should be parenthesized separately, to
> be consistent with having references to two separate RFCs (e.g.,
> "... [RFC7991][id.draft-iab-rfc7991bis-04]" or "[3][4]".
>
> (2) Even Section 3.66.5 of the "As Implemented" spec
> [id.draft-irse-draft-irse-xml2rfcv3-implemented-03] does not
> specify the potential values of the "section" attribute, so I
> can't figure out, without either experimenting or appeal to
> higher authority, whether I could write (for a different
> reference)
>    <xref target="foo" section="Appendix D, part 2"/>
> your example with section="1"' hints that might not work.

It's a section number (where number is alphanumeric so that appendices
can be referenced).

> (3) Item (2) above also illustrates the citation bracketing
> problem.  While it would be perfectly sensible to write either
> 	The "As implemented" spec [Section 3.66.5 of
> 	id.draft-irse-draft-irse-xml2rfcv3-implemented-03]
> or
>    The "As implemented" spec
> [id.draft-irse-draft-irse-xml2rfcv3-implemented-03a]
> the sentence in (2) strikes me as awkward.  YMMD and that may be
> partially a plain text problem.  But having it as
>
> 	Even the "As Implemented" spec Section 3.66.5
> 	[id.draft-irse-draft-irse-xml2rfcv3-implemented-03]
>
> which is presumably what would be generated using the section
> attribute, is, IMO, even more awkward and, I believe that, in
> the absence of a comma after "spec" also grammatically
> incorrect.  So the comma belongs there in plain text but not in

I would simply say "Section 3.66.5 of the "As Implemented spec [...]". I
*believe* you can get that with the right combination of sectionFormat
attributes and text content of <xref>.

At the end of the day, the goal of the xml2rfc vocabulary is to make
simple things simple, and also get authors to use a consistent style. If
your style preferences are different, you might have to type a lot more.

> HTML where the whole business is a citation anchor and link?  I
> am not optimistic about getting the right in the output
> generation process.

At this point it's pretty clear to me that you haven't tried the format.
How about doing that instead of elaborating what could in theory be bad
about it?

> (4) I also have no idea what the sentence in (2) would look like
> in an HTMLized process.  We would either end up with the
> "Section 3.66.5" in plaintext (not a link) and a link into the
> references for the citation anchor or, with some clever
> heuristics and some lookahead, a link into the actual document
> (https://www.ietf.org/archive/id/draft-irse-draft-irse-xml2rfcv3-implemented-03.html#section-3.66.1)
> with the actual citation anchor still pointing to the
> references.

I recommend just trying it, or looking at the hundreds of RFCs that have
been published in the last few years.

Example: <https://www.rfc-editor.org/rfc/rfc9110.html#section-2.1-4>.

> Or maybe I'm still missing something but these seem to be more
> questions for Jean and her colleagues about their preferences
> and what the Style Manual should say.  Once they have expressed
> opinions, the WG can decide whether to disagree or nit-pick.
> After there is agreement, I'm confident that we can
> reverse-engineer the RFCXML vocabulary to allow accommodating
> them (including any author flexibility they think appropriate).

I happen to have tests for this. See
<https://greenbytes.de/tech/webdav/xref-tests-hlext.xml2rfcv3.html>.

> However, dropping a "section" attribute into the <xref> element
> without examining the issue that Toerless first raised (I'm just
> digging deeper) in this discussion, most of which have been
> raised on and off with the RFC Editor Function for decades, is
> exactly the sort of thing that causes some of us to feel that
> things are in rather a mess.
>
> best,
>     john

Best regards, Julian