Re: [rfc-i] archiving outlinks in RFCs

Ted Hardie <ted.ietf@gmail.com> Fri, 28 April 2023 10:04 UTC

Return-Path: <ted.ietf@gmail.com>
X-Original-To: rfc-interest@ietfa.amsl.com
Delivered-To: rfc-interest@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 1970BC14CE45 for <rfc-interest@ietfa.amsl.com>; Fri, 28 Apr 2023 03:04:09 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.095
X-Spam-Level:
X-Spam-Status: No, score=-2.095 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, FREEMAIL_FROM=0.001, HTML_MESSAGE=0.001, RCVD_IN_DNSWL_NONE=-0.0001, RCVD_IN_ZEN_BLOCKED_OPENDNS=0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, URIBL_DBL_BLOCKED_OPENDNS=0.001, URIBL_ZEN_BLOCKED_OPENDNS=0.001] autolearn=unavailable autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com
Received: from mail.ietf.org ([50.223.129.194]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id yngiP5SaUBhL for <rfc-interest@ietfa.amsl.com>; Fri, 28 Apr 2023 03:04:05 -0700 (PDT)
Received: from mail-ed1-x52f.google.com (mail-ed1-x52f.google.com [IPv6:2a00:1450:4864:20::52f]) (using TLSv1.3 with cipher TLS_AES_128_GCM_SHA256 (128/128 bits) key-exchange X25519 server-signature RSA-PSS (2048 bits) server-digest SHA256) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 48AB0C151545 for <rfc-interest@rfc-editor.org>; Fri, 28 Apr 2023 03:04:05 -0700 (PDT)
Received: by mail-ed1-x52f.google.com with SMTP id 4fb4d7f45d1cf-504eac2f0b2so16711201a12.3 for <rfc-interest@rfc-editor.org>; Fri, 28 Apr 2023 03:04:05 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20221208; t=1682676243; x=1685268243; h=cc:to:subject:message-id:date:from:in-reply-to:references :mime-version:from:to:cc:subject:date:message-id:reply-to; bh=eN3ZO4PH/rRy0Fjr7BfwFsNqAapHWckXGCNDaf0LO6Y=; b=g4QHcdpVD+ihtZ4GO+aeqx4e5EHlIJjr+LfN7W5Ne8+6sA1EWHAyySgoUV67AJ1wym KyPTfbwHsLDTxcZXMfYTYsXcA0ZEiKORnyM/QeSeHiRIN6eON4oSw3diPEY5tf9BLKJ9 8lnpxdX5smbwto072NpWtdepRBiMmhjnTIhZff6bkvV3yygUtBp95xqH6+E8VTh5NTT4 w79U4HvLISYd9t1hB9halUIX5j13bICP8Xw/XNC969YfZYzwJjRfp32JqipEBsV4aMld TkoiPMtiOVZ6uxcj7YNXtMws6Jz8q9Y+7YMM4uxlXduZs7zUF7DfpQfSEGwI/3zu9dTd FKYA==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20221208; t=1682676243; x=1685268243; h=cc:to:subject:message-id:date:from:in-reply-to:references :mime-version:x-gm-message-state:from:to:cc:subject:date:message-id :reply-to; bh=eN3ZO4PH/rRy0Fjr7BfwFsNqAapHWckXGCNDaf0LO6Y=; b=aJulnK2m3aH1JQ87aLeC6shxmplCFaOp4wRqg/tgeaZdJkZIHFwfRg6ptOsRxqpMU/ 8krK7QcsfMLDv9SyTWDiIuZ2pCjPJ3u5h99OQ9cI6W377Txig5tfb6XbBYVUn5gK93K9 08qTTmnoHOy9ljnCkBt95tmMhyIs1tiCaPVpw66NaK0iW/4AtT73AzpuZNPUHKWqNxjc gAzh92YConAbsLaxd2msLYQX2nc1hhlW2cokLWz2cFSTMtD6291DxMx2E5qJpPr1x5H7 yRGZW+LLYd0FRz5up3XOVlHzvUhNVctYUjkPp49TH/WXevfIbQ+/JPoqoI4cB23Dc44U 0cUg==
X-Gm-Message-State: AC+VfDwVQ+JVZzLMIwqgrPmKD/bT5pH5Rit+4z8xq5rQ5yREY8Qm5Ydy MkmEgBtUtPyeChJpYI6KZPibezfzdBxP1BGdm+k=
X-Google-Smtp-Source: ACHHUZ5FUp302uzCC78gbeh52YDCNY5QRQIwgqBggbuv+bvKzVRpBfPL39W3V1v5eN/WlyescWkdzvIZyV7fG9tnSaE=
X-Received: by 2002:a05:6402:4d9:b0:506:73a7:ce12 with SMTP id n25-20020a05640204d900b0050673a7ce12mr3530713edw.36.1682676242875; Fri, 28 Apr 2023 03:04:02 -0700 (PDT)
MIME-Version: 1.0
References: <E024D9AC-2B92-4720-9713-519592D2362B@rfc-editor.org> <30c30c2f-4e96-560a-73dd-a51ba8d04714@comcast.net> <771B7586-FFBB-49E4-9B99-5578863FBD8B@rfc-editor.org> <CABcZeBOevOj8cWY7dacWxzwZS82+iAjf1p+DZWF=7WZ9JydnrQ@mail.gmail.com> <48de4d92-e279-4c26-ab3c-15dd854b56f8@betaapp.fastmail.com> <CABcZeBPqePQwPAq5pWda1pGaY_=kLkcOxCjZWmOv9yRZ_MNb7g@mail.gmail.com> <CA+9kkMBVMTG7Zku4gt_DwCNWArYTauR_O0u70zceCMtN2GNN_Q@mail.gmail.com> <796.1682529129@localhost> <CA+9kkMBiqZCqbDviOVQFmjROYJtViz=S7ZsW6T41mv4XGbZ3=g@mail.gmail.com> <04BE48FA-322D-457A-9D7B-A9DA8FCE8E50@rfc-editor.org> <CA+9kkMCKM7A81+EU0OegtE5UbjLoVwsK7FVig8toddj-1APwxw@mail.gmail.com> <CANMZLAakmafNpe91TGG0eioR_yHt=n=ncV7nKLMCvCaQevoH8A@mail.gmail.com> <1718A586-7CFE-42CB-8206-DD7B18383BC9@ietf.org> <CA+9kkMCm1C762sTXiiP=MLLP9huuzdTbjJ-zROEXXJKGuwoGdg@mail.gmail.com> <0BC1C666-732D-497D-AC69-5D0CBD029F0D@ietf.org>
In-Reply-To: <0BC1C666-732D-497D-AC69-5D0CBD029F0D@ietf.org>
From: Ted Hardie <ted.ietf@gmail.com>
Date: Fri, 28 Apr 2023 11:03:36 +0100
Message-ID: <CA+9kkMCBw7U65EuabLirAODsdR46xBLOqmQCgmsmRa+2BicsDQ@mail.gmail.com>
To: Jay Daley <exec-director@ietf.org>
Cc: RFC Interest <rfc-interest@rfc-editor.org>
Content-Type: multipart/alternative; boundary="000000000000ba88b005fa629791"
Archived-At: <https://mailarchive.ietf.org/arch/msg/rfc-interest/uX9ER9RmHed1miI7O0fRTHrhKNs>
Subject: Re: [rfc-i] archiving outlinks in RFCs
X-BeenThere: rfc-interest@rfc-editor.org
X-Mailman-Version: 2.1.39
Precedence: list
List-Id: "A list for discussion of the RFC series and RFC Editor functions." <rfc-interest.rfc-editor.org>
List-Unsubscribe: <https://mailman.rfc-editor.org/mailman/options/rfc-interest>, <mailto:rfc-interest-request@rfc-editor.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/rfc-interest/>
List-Post: <mailto:rfc-interest@rfc-editor.org>
List-Help: <mailto:rfc-interest-request@rfc-editor.org?subject=help>
List-Subscribe: <https://mailman.rfc-editor.org/mailman/listinfo/rfc-interest>, <mailto:rfc-interest-request@rfc-editor.org?subject=subscribe>
X-List-Received-Date: Fri, 28 Apr 2023 10:04:09 -0000

Hi Jay,

Snipped, then in-line.


> Not quite.  My point is that the current errata process does not see any
> changes applied to the HTML and so if we were to use that process we would
> either have to accept that broken links remain broken unless someone goes
> looking for errata,
>

The presence of errata is pretty prominent:
https://www.rfc-editor.org/rfc/rfc9110 for example, has a link at the very
top of the page and https://datatracker.ietf.org/doc/rfc9110/ shows you
both an html version and one with errata inline. So I believe that anyone
who clicks an outlink and finds it broken will work out what to do pretty
quickly if they start at the RFC editor site and/or the datatracker.  For
those  documents which have been reproduced elsewhere we don't have any
control over what errata get shown, but we wouldn't have any control no
matter which option we picked.


> or that a fundamental change is made to that process to apply some errata
> and not others.  As the former is unsatisfactory, and the latter is in the
> "really hard as demonstrated by long-standing debate" category, I am
> suggesting a new process that avoids either trap.
>
>
I personally use the errata inline version as often as I remember it
exists, and I would be happy if it were the datatracker default view.  But
that's a different issue and a slightly different set of decision makers.
For this issue, it seems we simply disagree about the extent which it is
unsatisfactory to have to look at errata to find the community decision on
an updated link after one is broken; we may also disagree about the extent
to which it is satisfactory to have a process which updates an RFC when
some data in it is wrong but not use that same process for other data (even
for trivial fixes like typos).  It's easy enough to disagree about that
sort of thing, and I suppose the final word goes to the new WG.

regards,

Ted




> Jay
>
>
> regards,
>
> Ted
>
>
>
>
>>
>> Jay
>>
>
>>
>>
>> (via tiny screen & keyboard)
>> Regards,
>>         Brian Carpenter
>>
>> On Fri, 28 Apr 2023, 20:03 Ted Hardie, <ted.ietf@gmail.com> wrote:
>>
>>> On Thu, Apr 27, 2023 at 9:10 PM Alexis Rossi <rsce@rfc-editor.org>
>>> wrote:
>>>
>>>> It seems like using the errata system would maybe be a more haphazard
>>>> method of fixing broken links over time, since it relies on humans to
>>>> notice the original link is dead, report that as an errata, and then
>>>> another human to check and approve it. Unless the proposal is to have a bot
>>>> that files errata when links die (and then there’s just one human step to
>>>> approve)?
>>>>
>>>>
>>>> Yes, sorry that was clear.  Essentially, the steps I propose:
>>>
>>> On RFC publication, archive the outlink.
>>>
>>> Run a periodic check on the status of the links in each RFC.
>>>
>>> When one is determined to be unavailable, file an erratum mechanically
>>>
>>> The community that would normally determine erratum validity examines
>>> the issue and determines the next step, which might be one of:
>>>
>>> 1) The erratum is approved and lists the archived information as the
>>> target for approval on document update
>>> 2) The erratum is approved a different URL is listed as the target for
>>> approval on document update (e.g. the ieee.example/standards/08675
>>> replacing ieee.example/standards/8675
>>> 3) The erratum is rejected as the error was transient or will be
>>> corrected by the origin (where these are sibling SDOs, we generally have a
>>> way to reach out to them for this information).
>>>
>>> The normal erratum process is then used to provide this information to
>>> the community (either separately or in-line, depending on the method they
>>> choose).
>>>
>>> The advantage of this approach is that we are using community approved
>>> processes in a pretty easily understood way.  We can also use the same
>>> process when the link is live but something like a paywall has changed the
>>> state of availability.  That's not something we can likely identify
>>> mechanically, but we can re-use this set of steps.
>>>
>>> Sorry that this wasn't clearer before.
>>>
>>> regards,
>>>
>>> Ted
>>>
>>>
>>>
>>>> On Apr 27, 2023, at 1:40 AM, Ted Hardie <ted.ietf@gmail.com> wrote:
>>>>
>>>> On Wed, Apr 26, 2023 at 6:12 PM Michael Richardson <
>>>> mcr+ietf@sandelman.ca> wrote:
>>>>
>>>>>
>>>>> Ted Hardie <ted.ietf@gmail.com> wrote:
>>>>>     > I agree with Ekr that this is problematic, but my concern is
>>>>> with external
>>>>>     > links to other standards.  If I replace a link to
>>>>> https://ieee.example/876.1
>>>>>     > to an archive link like
>>>>>     >
>>>>> https://archivesite.example/see?https_ieee_example/876.1_retrieved_the_day_of_publication
>>>>>     > then ieee.example has no chance to use its own redirects or
>>>>> tombstones to
>>>>>
>>>>> It also keeps ieee.example from replacing the link that we were using
>>>>> with a
>>>>> link that goes through a paywall.
>>>>>
>>>>> (Which, btw, DOES HAPPEN regularly)
>>>>>
>>>>>
>>>> I think this would also be grounds for filing an erratum.  But my basic
>>>> point remains that the erratum process triggers the right thing: discussion
>>>> among the folks within the IETF who are responsible for the relevant RFC.
>>>> There are several different "correct" responses depending on the
>>>> circumstances, and they are the right folks to make the decision.  We
>>>> already have a way to indicate that there are errata and/or display them in
>>>> line.  We can use that here, rather than trying to decide in advance.
>>>>
>>>> regards,
>>>>
>>>> Ted
>>>>
>>>>
>>>>
>>>>
>>>>> --
>>>>> Michael Richardson <mcr+IETF@sandelman.ca>   . o O ( IPv6 IøT
>>>>> consulting )
>>>>>            Sandelman Software Works Inc, Ottawa and Worldwide
>>>>>
>>>>>
>>>>>
>>>>>
>>>>> _______________________________________________
>>>> rfc-interest mailing list
>>>> rfc-interest@rfc-editor.org
>>>> https://mailman.rfc-editor.org/mailman/listinfo/rfc-interest
>>>>
>>>>
>>>> _______________________________________________
>>> rfc-interest mailing list
>>> rfc-interest@rfc-editor.org
>>> https://mailman.rfc-editor.org/mailman/listinfo/rfc-interest
>>>
>> _______________________________________________
>> rfc-interest mailing list
>> rfc-interest@rfc-editor.org
>> https://mailman.rfc-editor.org/mailman/listinfo/rfc-interest
>>
>>
>> --
>> Jay Daley
>> IETF Executive Director
>> exec-director@ietf.org
>>
>> _______________________________________________
> rfc-interest mailing list
> rfc-interest@rfc-editor.org
> https://mailman.rfc-editor.org/mailman/listinfo/rfc-interest
>
>
> --
> Jay Daley
> IETF Executive Director
> exec-director@ietf.org
>
>