Re: [rfc-i] archiving outlinks in RFCs

Jay Daley <exec-director@ietf.org> Fri, 28 April 2023 10:54 UTC

Return-Path: <jay@staff.ietf.org>
X-Original-To: rfc-interest@ietfa.amsl.com
Delivered-To: rfc-interest@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id D6793C1516F2 for <rfc-interest@ietfa.amsl.com>; Fri, 28 Apr 2023 03:54:52 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.896
X-Spam-Level:
X-Spam-Status: No, score=-1.896 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, HTML_MESSAGE=0.001, RCVD_IN_DNSWL_NONE=-0.0001, RCVD_IN_ZEN_BLOCKED_OPENDNS=0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, URIBL_DBL_BLOCKED_OPENDNS=0.001, URIBL_ZEN_BLOCKED_OPENDNS=0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=ietf-org.20221208.gappssmtp.com
Received: from mail.ietf.org ([50.223.129.194]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id rEIcc9t5NvIe for <rfc-interest@ietfa.amsl.com>; Fri, 28 Apr 2023 03:54:49 -0700 (PDT)
Received: from mail-wr1-x42c.google.com (mail-wr1-x42c.google.com [IPv6:2a00:1450:4864:20::42c]) (using TLSv1.3 with cipher TLS_AES_128_GCM_SHA256 (128/128 bits) key-exchange X25519 server-signature RSA-PSS (2048 bits) server-digest SHA256) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id F2CDCC151520 for <rfc-interest@rfc-editor.org>; Fri, 28 Apr 2023 03:54:48 -0700 (PDT)
Received: by mail-wr1-x42c.google.com with SMTP id ffacd0b85a97d-3003b97fa12so5980016f8f.2 for <rfc-interest@rfc-editor.org>; Fri, 28 Apr 2023 03:54:48 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=ietf-org.20221208.gappssmtp.com; s=20221208; t=1682679287; x=1685271287; h=references:to:cc:in-reply-to:date:subject:mime-version:message-id :from:from:to:cc:subject:date:message-id:reply-to; bh=k82EOJ/hBpKmdfd+f4g57HieD2wjgOPDLfYbxrPr+zM=; b=tCu+XBfxeDdIONhF7po7A1myPQdjEKAtZYdpGB3k4JMhcNOXswpwVU85XY79sSeZiZ t4PrPynjK5846enas9COfq4fIoCuKlOqjUJjEXchRO6Twp1FWw6q0P7SWI+qCIVuO/f8 Eea4iCqeK31U7CdbtiptBARVrA7sbrbWal+vsGVe7+SPMegtxazMs0Iu7cWcqICZIJCy D/TAO89PPWpSkLA5ACI7SolNPg4M1Nhad15JrzFl2qCL7gFDY0vt7lzgfPO5NFKQI9FO SK6oQwrfVtkUey5Uy97tNrhfB/7AL7iDPoWFF3X1qMjKZENGIQSYBwcFN5SDvqucDpp3 wuuQ==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20221208; t=1682679287; x=1685271287; h=references:to:cc:in-reply-to:date:subject:mime-version:message-id :from:x-gm-message-state:from:to:cc:subject:date:message-id:reply-to; bh=k82EOJ/hBpKmdfd+f4g57HieD2wjgOPDLfYbxrPr+zM=; b=iLQy4gxww9K5UPimX1VMTDT/7gfu7LNspc2q8qoNku2eYIedgnsefeeAQrw/ObAfnL R6zqWPgQHUvu48IRjag4wIsRaH6B5H+wG2DCrAKq+qICqQLqDDm1OZnZek3wWCTqWU2P 0/gkSW/JxqQbNV5K62Alqgu4ttY2O44XGKz+7NSFNfY+qrfN0QrGCg+bb5FVSxy3o/L3 J1k+tHXg+D0ZraTRBSueLRjWwELk2VlRSVESaGxtkVJ7PIMzc0CUVqB72Xs9kfDEjW2E llFZq/oJJFWyW+hbmU9HiwfAdymYUsl9jGCRq2O5Txrde3nWepBhuvW2AbFtJT3YgsLO z5dQ==
X-Gm-Message-State: AC+VfDydt9Lfi6GT9oDhGtYSYQN45E2fEYvUzJ5GmvwLS+tesxW1+j6x FJwERmoLNvJbK0/dfxLOL9xtxg==
X-Google-Smtp-Source: ACHHUZ7AwXWFjKd7wPaoL7RRsfGUOXlK1ATMnBw9QFJs/0vL/6cnTb5EpBgiVKnzoibIRUOIHnwssQ==
X-Received: by 2002:adf:e707:0:b0:2fe:ca6e:edb with SMTP id c7-20020adfe707000000b002feca6e0edbmr3733862wrm.39.1682679286382; Fri, 28 Apr 2023 03:54:46 -0700 (PDT)
Received: from smtpclient.apple (host-92-27-125-209.static.as13285.net. [92.27.125.209]) by smtp.gmail.com with ESMTPSA id j22-20020a5d6e56000000b002fbb285b01fsm20778043wrz.25.2023.04.28.03.54.45 (version=TLS1_2 cipher=ECDHE-ECDSA-AES128-GCM-SHA256 bits=128/128); Fri, 28 Apr 2023 03:54:45 -0700 (PDT)
From: Jay Daley <exec-director@ietf.org>
Message-Id: <BBAA1DE3-20EF-4DB7-9C0C-E24221B983A1@ietf.org>
Content-Type: multipart/alternative; boundary="Apple-Mail=_83D42115-8966-4830-A523-F715C6EED7C7"
Mime-Version: 1.0 (Mac OS X Mail 16.0 \(3731.500.231\))
Date: Fri, 28 Apr 2023 11:54:35 +0100
In-Reply-To: <CA+9kkMCBw7U65EuabLirAODsdR46xBLOqmQCgmsmRa+2BicsDQ@mail.gmail.com>
Cc: RFC Interest <rfc-interest@rfc-editor.org>
To: Ted Hardie <ted.ietf@gmail.com>
References: <E024D9AC-2B92-4720-9713-519592D2362B@rfc-editor.org> <30c30c2f-4e96-560a-73dd-a51ba8d04714@comcast.net> <771B7586-FFBB-49E4-9B99-5578863FBD8B@rfc-editor.org> <CABcZeBOevOj8cWY7dacWxzwZS82+iAjf1p+DZWF=7WZ9JydnrQ@mail.gmail.com> <48de4d92-e279-4c26-ab3c-15dd854b56f8@betaapp.fastmail.com> <CABcZeBPqePQwPAq5pWda1pGaY_=kLkcOxCjZWmOv9yRZ_MNb7g@mail.gmail.com> <CA+9kkMBVMTG7Zku4gt_DwCNWArYTauR_O0u70zceCMtN2GNN_Q@mail.gmail.com> <796.1682529129@localhost> <CA+9kkMBiqZCqbDviOVQFmjROYJtViz=S7ZsW6T41mv4XGbZ3=g@mail.gmail.com> <04BE48FA-322D-457A-9D7B-A9DA8FCE8E50@rfc-editor.org> <CA+9kkMCKM7A81+EU0OegtE5UbjLoVwsK7FVig8toddj-1APwxw@mail.gmail.com> <CANMZLAakmafNpe91TGG0eioR_yHt=n=ncV7nKLMCvCaQevoH8A@mail.gmail.com> <1718A586-7CFE-42CB-8206-DD7B18383BC9@ietf.org> <CA+9kkMCm1C762sTXiiP=MLLP9huuzdTbjJ-zROEXXJKGuwoGdg@mail.gmail.com> <0BC1C666-732D-497D-AC69-5D0CBD029F0D@ietf.org> <CA+9kkMCBw7U65EuabLirAODsdR46xBLOqmQCgmsmRa+2BicsDQ@mail.gmail.com>
X-Mailer: Apple Mail (2.3731.500.231)
Archived-At: <https://mailarchive.ietf.org/arch/msg/rfc-interest/HfitapQv7uMGWiYjwDvTlcSxCls>
Subject: Re: [rfc-i] archiving outlinks in RFCs
X-BeenThere: rfc-interest@rfc-editor.org
X-Mailman-Version: 2.1.39
Precedence: list
List-Id: "A list for discussion of the RFC series and RFC Editor functions." <rfc-interest.rfc-editor.org>
List-Unsubscribe: <https://mailman.rfc-editor.org/mailman/options/rfc-interest>, <mailto:rfc-interest-request@rfc-editor.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/rfc-interest/>
List-Post: <mailto:rfc-interest@rfc-editor.org>
List-Help: <mailto:rfc-interest-request@rfc-editor.org?subject=help>
List-Subscribe: <https://mailman.rfc-editor.org/mailman/listinfo/rfc-interest>, <mailto:rfc-interest-request@rfc-editor.org?subject=subscribe>
X-List-Received-Date: Fri, 28 Apr 2023 10:54:52 -0000


> On 28 Apr 2023, at 11:03, Ted Hardie <ted.ietf@gmail.com> wrote:
> 
> Hi Jay,
> 
> Snipped, then in-line.
> 
>> 
>> Not quite.  My point is that the current errata process does not see any changes applied to the HTML and so if we were to use that process we would either have to accept that broken links remain broken unless someone goes looking for errata,
> 
> The presence of errata is pretty prominent: https://www.rfc-editor.org/rfc/rfc9110 for example, has a link at the very top of the page and https://datatracker.ietf.org/doc/rfc9110/ shows you both an html version and one with errata inline. So I believe that anyone who clicks an outlink and finds it broken will work out what to do pretty quickly if they start at the RFC editor site and/or the datatracker.  For those  documents which have been reproduced elsewhere we don't have any control over what errata get shown, but we wouldn't have any control no matter which option we picked.

We clearly have a very, very different idea of "pretty prominent".  Even if someone spots that text, they still need to clear some other hurdles:

- they need to understand what 'errata' are (this is not a common word)

- they need to be aware that there is the possibility that there are things wrong with the document that we know are wrong but have chosen not to fix in the current page (my totally unevidenced view is that 99.99% of people will fall at that hurdle)

- they need to read through our remarkably unfriendly errata page to find details of the broken link.

>  
>> or that a fundamental change is made to that process to apply some errata and not others.  As the former is unsatisfactory, and the latter is in the "really hard as demonstrated by long-standing debate" category, I am suggesting a new process that avoids either trap.
>> 
> 
> I personally use the errata inline version as often as I remember it exists, and I would be happy if it were the datatracker default view.  But that's a different issue and a slightly different set of decision makers. For this issue, it seems we simply disagree about the extent which it is unsatisfactory to have to look at errata to find the community decision on an updated link after one is broken; we may also disagree about the extent to which it is satisfactory to have a process which updates an RFC when some data in it is wrong but not use that same process for other data (even for trivial fixes like typos).  It's easy enough to disagree about that sort of thing, and I suppose the final word goes to the new WG.

Indeed.

Jay

> 
> regards,
> 
> Ted
> 
> 
>  
>> Jay
>> 
>>> 
>>> regards,
>>> 
>>> Ted
>>> 
>>> 
>>>  
>>>> 
>>>> Jay 
>>>> 
>>>> 
>>>>> 
>>>>> (via tiny screen & keyboard)
>>>>> Regards,
>>>>>         Brian Carpenter
>>>>> 
>>>>> On Fri, 28 Apr 2023, 20:03 Ted Hardie, <ted.ietf@gmail.com <mailto:ted.ietf@gmail.com>> wrote:
>>>>>> On Thu, Apr 27, 2023 at 9:10 PM Alexis Rossi <rsce@rfc-editor.org <mailto:rsce@rfc-editor.org>> wrote:
>>>>>>> It seems like using the errata system would maybe be a more haphazard method of fixing broken links over time, since it relies on humans to notice the original link is dead, report that as an errata, and then another human to check and approve it. Unless the proposal is to have a bot that files errata when links die (and then there’s just one human step to approve)?
>>>>>>> 
>>>>>>> 
>>>>>> Yes, sorry that was clear.  Essentially, the steps I propose:
>>>>>> 
>>>>>> On RFC publication, archive the outlink.
>>>>>> 
>>>>>> Run a periodic check on the status of the links in each RFC.
>>>>>> 
>>>>>> When one is determined to be unavailable, file an erratum mechanically
>>>>>> 
>>>>>> The community that would normally determine erratum validity examines the issue and determines the next step, which might be one of:
>>>>>> 
>>>>>> 1) The erratum is approved and lists the archived information as the target for approval on document update 
>>>>>> 2) The erratum is approved a different URL is listed as the target for approval on document update (e.g. the ieee.example/standards/08675 replacing ieee.example/standards/8675
>>>>>> 3) The erratum is rejected as the error was transient or will be corrected by the origin (where these are sibling SDOs, we generally have a way to reach out to them for this information).
>>>>>> 
>>>>>> The normal erratum process is then used to provide this information to the community (either separately or in-line, depending on the method they choose).
>>>>>> 
>>>>>> The advantage of this approach is that we are using community approved processes in a pretty easily understood way.  We can also use the same process when the link is live but something like a paywall has changed the state of availability.  That's not something we can likely identify mechanically, but we can re-use this set of steps.
>>>>>> 
>>>>>> Sorry that this wasn't clearer before.
>>>>>> 
>>>>>> regards,
>>>>>> 
>>>>>> Ted
>>>>>> 
>>>>>>  
>>>>>>>> On Apr 27, 2023, at 1:40 AM, Ted Hardie <ted.ietf@gmail.com <mailto:ted.ietf@gmail.com>> wrote:
>>>>>>>> 
>>>>>>>> On Wed, Apr 26, 2023 at 6:12 PM Michael Richardson <mcr+ietf@sandelman.ca <mailto:mcr%2Bietf@sandelman.ca>> wrote:
>>>>>>>>> 
>>>>>>>>> Ted Hardie <ted.ietf@gmail.com <mailto:ted.ietf@gmail.com>> wrote:
>>>>>>>>>     > I agree with Ekr that this is problematic, but my concern is with external
>>>>>>>>>     > links to other standards.  If I replace a link to https://ieee.example/876.1
>>>>>>>>>     > to an archive link like
>>>>>>>>>     > https://archivesite.example/see?https_ieee_example/876.1_retrieved_the_day_of_publication
>>>>>>>>>     > then ieee.example has no chance to use its own redirects or tombstones to
>>>>>>>>> 
>>>>>>>>> It also keeps ieee.example from replacing the link that we were using with a
>>>>>>>>> link that goes through a paywall.
>>>>>>>>> 
>>>>>>>>> (Which, btw, DOES HAPPEN regularly)
>>>>>>>>> 
>>>>>>>> 
>>>>>>>> I think this would also be grounds for filing an erratum.  But my basic point remains that the erratum process triggers the right thing: discussion among the folks within the IETF who are responsible for the relevant RFC.  There are several different "correct" responses depending on the circumstances, and they are the right folks to make the decision.  We already have a way to indicate that there are errata and/or display them in line.  We can use that here, rather than trying to decide in advance.
>>>>>>>> 
>>>>>>>> regards,
>>>>>>>> 
>>>>>>>> Ted
>>>>>>>> 
>>>>>>>> 
>>>>>>>>  
>>>>>>>>> --
>>>>>>>>> Michael Richardson <mcr+IETF@sandelman.ca <mailto:mcr%2BIETF@sandelman.ca>>   . o O ( IPv6 IøT consulting )
>>>>>>>>>            Sandelman Software Works Inc, Ottawa and Worldwide
>>>>>>>>> 
>>>>>>>>> 
>>>>>>>>> 
>>>>>>>>> 
>>>>>>>> _______________________________________________
>>>>>>>> rfc-interest mailing list
>>>>>>>> rfc-interest@rfc-editor.org <mailto:rfc-interest@rfc-editor.org>
>>>>>>>> https://mailman.rfc-editor.org/mailman/listinfo/rfc-interest
>>>>>>> 
>>>>>> _______________________________________________
>>>>>> rfc-interest mailing list
>>>>>> rfc-interest@rfc-editor.org <mailto:rfc-interest@rfc-editor.org>
>>>>>> https://mailman.rfc-editor.org/mailman/listinfo/rfc-interest
>>>>> _______________________________________________
>>>>> rfc-interest mailing list
>>>>> rfc-interest@rfc-editor.org <mailto:rfc-interest@rfc-editor.org>
>>>>> https://mailman.rfc-editor.org/mailman/listinfo/rfc-interest
>>>> 
>>>> -- 
>>>> Jay Daley
>>>> IETF Executive Director
>>>> exec-director@ietf.org <mailto:exec-director@ietf.org>
>>> _______________________________________________
>>> rfc-interest mailing list
>>> rfc-interest@rfc-editor.org <mailto:rfc-interest@rfc-editor.org>
>>> https://mailman.rfc-editor.org/mailman/listinfo/rfc-interest
>> 
>> -- 
>> Jay Daley
>> IETF Executive Director
>> exec-director@ietf.org <mailto:exec-director@ietf.org>

-- 
Jay Daley
IETF Executive Director
exec-director@ietf.org