Re: [Xml-sg-cmt] Please consider adding a way to speak about references as a token to the grammar

Robert Sparks <rjsparks@nostrum.com> Mon, 07 March 2022 15:22 UTC

Return-Path: <rjsparks@nostrum.com>
X-Original-To: xml-sg-cmt@ietfa.amsl.com
Delivered-To: xml-sg-cmt@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 2911A3A0B01 for <xml-sg-cmt@ietfa.amsl.com>; Mon, 7 Mar 2022 07:22:54 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.089
X-Spam-Level:
X-Spam-Status: No, score=-2.089 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, NICE_REPLY_A=-0.001, RCVD_IN_DNSWL_BLOCKED=0.001, T_SCC_BODY_TEXT_LINE=-0.01, T_SPF_HELO_PERMERROR=0.01, T_SPF_PERMERROR=0.01, URIBL_BLOCKED=0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (1024-bit key) header.d=nostrum.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id j_k3sKzvu4re for <xml-sg-cmt@ietfa.amsl.com>; Mon, 7 Mar 2022 07:22:47 -0800 (PST)
Received: from nostrum.com (raven-v6.nostrum.com [IPv6:2001:470:d:1130::1]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 8317B3A0AC0 for <xml-sg-cmt@ietf.org>; Mon, 7 Mar 2022 07:22:46 -0800 (PST)
Received: from [192.168.1.114] ([47.186.48.51]) (authenticated bits=0) by nostrum.com (8.17.1/8.16.1) with ESMTPSA id 227FMhAd049973 (version=TLSv1.3 cipher=TLS_AES_256_GCM_SHA384 bits=256 verify=NO); Mon, 7 Mar 2022 09:22:44 -0600 (CST) (envelope-from rjsparks@nostrum.com)
DKIM-Signature: v=1; a=rsa-sha256; c=simple/simple; d=nostrum.com; s=default; t=1646666564; bh=VkW6gdayf0hoOimgcI7YgK1AA7o2hhEc8xZZqpAjMeo=; h=Date:To:References:From:Subject:In-Reply-To; b=rYl+ZbDt/cT4dDWOGxfGYJVmyRZ1bSsmJTT7fqYy67PzolVDeXrS70JXjRQ3uOtpc oDTaRuCx1sWgqm1955eqMIzULRMtJQSxZQHoPXuzRXv/lICqTwaRRlZCRpbvA/q8JC KaSI9GEhh2GeJI4XMZF7o7VaNqcl4HYnd7FDjQ5Q=
X-Authentication-Warning: raven.nostrum.com: Host [47.186.48.51] claimed to be [192.168.1.114]
Message-ID: <c6410619-6852-d440-8e06-aca716442d55@nostrum.com>
Date: Mon, 07 Mar 2022 09:22:38 -0600
MIME-Version: 1.0
User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.15; rv:91.0) Gecko/20100101 Thunderbird/91.6.2
Content-Language: en-US
To: John R Levine <johnl@taugh.com>, XML weed whackers <xml-sg-cmt@ietf.org>
References: <7362f22b-b636-2331-7ebe-08e8a92fd041@taugh.com> <97D1D158-8DA9-40B5-BD67-F3839CFFDCD7@nostrum.com> <bd7a39d4-5475-ffac-eb7b-e62d5a36d064@taugh.com> <133f5370-5b48-1ef5-6005-f17d9d41690b@gmx.de> <ba9e6eb1-47f6-41bd-80f2-756eda755d83@nostrum.com> <6d98c810-f241-a3ab-f226-e95cf17e0c19@taugh.com> <10bc9689-eea2-98a4-1f79-9f893ae44046@nostrum.com> <dae56272-be6e-5f6d-d898-dd76639d7a22@taugh.com>
From: Robert Sparks <rjsparks@nostrum.com>
In-Reply-To: <dae56272-be6e-5f6d-d898-dd76639d7a22@taugh.com>
Content-Type: text/plain; charset="UTF-8"; format="flowed"
Content-Transfer-Encoding: 8bit
Archived-At: <https://mailarchive.ietf.org/arch/msg/xml-sg-cmt/U3Ens4hGrIGlaAyP3Oi7oettYvU>
Subject: Re: [Xml-sg-cmt] Please consider adding a way to speak about references as a token to the grammar
X-BeenThere: xml-sg-cmt@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: Working list for the xml and style guide change management team <xml-sg-cmt.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/xml-sg-cmt>, <mailto:xml-sg-cmt-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/xml-sg-cmt/>
List-Post: <mailto:xml-sg-cmt@ietf.org>
List-Help: <mailto:xml-sg-cmt-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/xml-sg-cmt>, <mailto:xml-sg-cmt-request@ietf.org?subject=subscribe>
X-List-Received-Date: Mon, 07 Mar 2022 15:22:54 -0000

On 3/5/22 7:10 PM, John R Levine wrote:
> On Sat, 5 Mar 2022, Robert Sparks wrote:
>> I think that's an orthogonal discussion. _Just_ having that anchor 
>> doesn't address the full benefit I think I see from not having such 
>> an explicit path - particularly when we get to trying to make it easy 
>> for working copies to continue to have pointers while the repository 
>> has resolved content.
>
> Something is going to have to know where the bibxml files are, and 
> either we figure out how to maintain a list of stable URLs or we need 
> to manage the list of tools that know where they are, and how we are 
> going to update them all and push out the updates to all of the users 
> when the tools change.
A much more tractable problem (tens of these things) than the current 
state of managing this information in the thousands of author's heads.
>
> Doing it this way at least makes it obvious what is broken and what 
> you need to fix if they move:
>
> <references xml:base="https://bibxml.ietf.org/">
>   <xi:include href="I-D.sparks-sipcore-multiple-reasons"/>
>   <xi:include href="RFC2549"/>
> </references>
>
> In your proposal with the link in the reference, would it 
> automagically replace the link with the contents when an I-D is 
> submitted, or when it's rendered, or when it's prepped,, or add the 
> contents and keep the link so it could update later if the library 
> changes, or something else?

It would expand the reference pointer , but leave it in the source as a 
comment, structured such that it makes it easy for another program to 
remove what resulted from the expansion, and restore the pointer.

The biggest goal here is to make it easier for authors to continue to 
maintain their source in a way that has indirection to references while 
allowing us to make draft-submission more accurate than it currently is.

There are secondary, but large goals.

1) Right now the URL structure the older bibxml service uses suffers 
from having to build a really arcane bit of structure into the tag 
that's used to distinguish between "give me a particular version of a 
draft" and "give me the latest version of the draft". At 
xml2rfc.tools.ietf.org, references that start 'I-D.draft-' are always 
references to a specific version, and references that start 'I-D.ietf-' 
or 'I-D.name-' are to the latest version. The datatracker's bibxml3 
service and the new bibxml service are currently different - they will 
take the token and look for a version match and return that if there is 
one, and then look to see if it matches a draft name without a version, 
returning the latest if there is a match. In both ways of doing things, 
there is a level of complexity that is unkind to the authors (and the 
coders - the guessing heuristics for finding 'latest' are ugly - people 
have submitted drafts that look like draft-foo-bar-21-22-23). I would 
_much_ rather that we provided the authors a way to explicitly say what 
they want rather than relying on side effect. Mocking up an alternate 
future again, I would like to see something like:

<reference type='I-D' name='draft-foo-bar'> for 'current' and <reference 
type='I-D' name='draft-foo-bar' rev='08'> for a specific version.

We could encode this things back into URL paths and do something like 
what you have above, but then people have to learn the conventions of 
what bit of info lives in what place in the url path, where something 
like the above makes the statement very explicit.

2) For xml2rfc, internally, we plan to use the API that's been developed 
(https://dev.bibxml.org/api/v1/) rather than the current url based form 
for retrieval. We'll be converting URL when given. Yes, I realize that 
we'll always need to be able to recognize the older hosts and paths, so 
that code will be with us anyhow, but I still think we'd be much better 
off going forward with something that allowed the kind of explicit and 
self-documenting statement in 1).

RjS

>
> R's,
> John
>