Re: [rfc-i] standards for references/URLs in RFCs ? (Was: Re: archiving outlinks in RFCs)

Brian Carpenter <brian.e.carpenter@gmail.com> Wed, 26 April 2023 11:05 UTC

Return-Path: <brian.e.carpenter@gmail.com>
X-Original-To: rfc-interest@ietfa.amsl.com
Delivered-To: rfc-interest@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id D3B47C16952B for <rfc-interest@ietfa.amsl.com>; Wed, 26 Apr 2023 04:05:24 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.093
X-Spam-Level:
X-Spam-Status: No, score=-2.093 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, FREEMAIL_FROM=0.001, HTML_MESSAGE=0.001, RCVD_IN_DNSWL_BLOCKED=0.001, RCVD_IN_ZEN_BLOCKED_OPENDNS=0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001, URIBL_DBL_BLOCKED_OPENDNS=0.001, URIBL_ZEN_BLOCKED_OPENDNS=0.001] autolearn=unavailable autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com
Received: from mail.ietf.org ([50.223.129.194]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id sSayPyT1dRfr for <rfc-interest@ietfa.amsl.com>; Wed, 26 Apr 2023 04:05:19 -0700 (PDT)
Received: from mail-vs1-xe2c.google.com (mail-vs1-xe2c.google.com [IPv6:2607:f8b0:4864:20::e2c]) (using TLSv1.3 with cipher TLS_AES_128_GCM_SHA256 (128/128 bits) key-exchange X25519 server-signature RSA-PSS (2048 bits) server-digest SHA256) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id D281DC169533 for <rfc-interest@rfc-editor.org>; Wed, 26 Apr 2023 04:05:19 -0700 (PDT)
Received: by mail-vs1-xe2c.google.com with SMTP id ada2fe7eead31-42e3909fd58so5343453137.3 for <rfc-interest@rfc-editor.org>; Wed, 26 Apr 2023 04:05:19 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20221208; t=1682507119; x=1685099119; h=cc:to:subject:message-id:date:from:in-reply-to:references :mime-version:from:to:cc:subject:date:message-id:reply-to; bh=UiVXt25y8STZQLT+hdJ69Yr/BHOBH70MKP1LCvJ9+rM=; b=VtyiH4awYdLhl0vlz73hIF2IKvpjD5rNmiLXXF1sPZipRFNkPiUkdFUJHFPNvkNckI 3eth6lm6uQTAZibX7mjwIH5Vmip4xxljTQKoXOMOMk2R6WGxWb77AVH1+0xqxxmhtJ4D /4nglfuQOoml8KeGLW0RIOrZAiug48t5DtmrjnpJr3nSJwxdD4SvJrOnf3hYUXzoGhZp D++pWYdGKqFVlrP3ym9bsOMXBoCt4fza8h9do0N3IijHyaKgygUtlgdyfLu9HHZDmCbY Sdd/bcu1T7FwwP4bw0mVDly8k79L8x+YC/okOEiLWTXP1eaPzAi1c1UrkF+6JCp3eSXZ xLew==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20221208; t=1682507119; x=1685099119; h=cc:to:subject:message-id:date:from:in-reply-to:references :mime-version:x-gm-message-state:from:to:cc:subject:date:message-id :reply-to; bh=UiVXt25y8STZQLT+hdJ69Yr/BHOBH70MKP1LCvJ9+rM=; b=kIef/HP4i5a4AdxEKFN+MHpoIdKFzl6YRF15oa66KOXjr1TviIFUc5N6RLiUOJrN1L eCA+pxVc/sJPDNcAHy0KrjQf2EB3g5tIPIheWHNpwbj4/q+gQSK6YtjKk7Wy4OL1X18W ZeSq4B+mx2KmSc7pQ4rnaYE6EkQT6bl5VZH8gSItVJ5PzHtKIVByeYAAR16dmKVmNVjF HsIVJw95jD8FRajCqlXLkDZQWiSfQAmPlYtcR1o/uhlwYGGHpIYUGvkx1tH9r/RPZQl3 /jhEEzVlAh4K3iTyg53JIh44MMTJ6UBh8UCbroXYDAd3qHh4G+PD8iF/xZu/JNuHhpu2 Rl5w==
X-Gm-Message-State: AAQBX9dyV29eb0BOJwDTJTSClPy+cMCGC6mHOKQD98k9xp6VK3VOUyIX dZBuIDtyiu39LZ+yaPgS63p+LUDUBTRa6fyFVJ0=
X-Google-Smtp-Source: AKy350ZAK9D9Hp+NsT9Wz1wc8AlNXarn9AO3s3YtiAqb3A6WmgRMUPzhBqKUUyVg93WgpcVcDh8aET9XGOHV3LhUzNU=
X-Received: by 2002:a05:6102:3663:b0:42e:4f6a:d077 with SMTP id bg3-20020a056102366300b0042e4f6ad077mr8728377vsb.4.1682507118690; Wed, 26 Apr 2023 04:05:18 -0700 (PDT)
MIME-Version: 1.0
References: <E024D9AC-2B92-4720-9713-519592D2362B@rfc-editor.org> <ZEjKrJK/LGvyHzog@faui48e.informatik.uni-erlangen.de> <A8D2E753-A8D7-4F17-822A-95998AD36F6E@ietf.org>
In-Reply-To: <A8D2E753-A8D7-4F17-822A-95998AD36F6E@ietf.org>
From: Brian Carpenter <brian.e.carpenter@gmail.com>
Date: Wed, 26 Apr 2023 23:05:06 +1200
Message-ID: <CANMZLAYdN5NYjAhMGv84GTm8DZAJdyCKzU=PKF18285zavX5zg@mail.gmail.com>
To: Jay Daley <exec-director@ietf.org>
Cc: Toerless Eckert <tte@cs.fau.de>, RFC Interest <rfc-interest@rfc-editor.org>
Content-Type: multipart/alternative; boundary="00000000000024484705fa3b3731"
Archived-At: <https://mailarchive.ietf.org/arch/msg/rfc-interest/Y0MkA7FEd8ugn88RgkiVb1QfK58>
Subject: Re: [rfc-i] standards for references/URLs in RFCs ? (Was: Re: archiving outlinks in RFCs)
X-BeenThere: rfc-interest@rfc-editor.org
X-Mailman-Version: 2.1.39
Precedence: list
List-Id: "A list for discussion of the RFC series and RFC Editor functions." <rfc-interest.rfc-editor.org>
List-Unsubscribe: <https://mailman.rfc-editor.org/mailman/options/rfc-interest>, <mailto:rfc-interest-request@rfc-editor.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/rfc-interest/>
List-Post: <mailto:rfc-interest@rfc-editor.org>
List-Help: <mailto:rfc-interest-request@rfc-editor.org?subject=help>
List-Subscribe: <https://mailman.rfc-editor.org/mailman/listinfo/rfc-interest>, <mailto:rfc-interest-request@rfc-editor.org?subject=subscribe>
X-List-Received-Date: Wed, 26 Apr 2023 11:05:25 -0000

The original comment Toerless referred to was about the suitability of a
github citation, not about the mechanics of it.

(via tiny screen & keyboard)
Regards,
        Brian Carpenter

On Wed, 26 Apr 2023, 20:32 Jay Daley, <exec-director@ietf.org> wrote:

> Hi Toerless
>
> See https://www.rfc-editor.org/styleguide/part2/#ref_repo for the
> specifics of referencing GitHub.
>
> Jay
>
> On 26 Apr 2023, at 07:54, Toerless Eckert <tte@cs.fau.de> wrote:
>
> Alexis,
>
> Thanks a lot for the initiative, but before having more opinion about it,
> i would
> like to reconfirm our current policies for new draft->RFC, and what type of
> references/URLs are permitted/required by our process (IETF and
> RFC-editor).
>
> For example, in one of my ongoing WG drafts, WG members made the comment
> that
> a reference to a github location would not be looked upon favorably by RFC
> editor.
> Aka: inappropriate reference because it is not stable.
>
> In that particular case, the github file was part of a presentation given
> at a
> WG meeting in the past, so our plan for this reference is to use the IETF
> proceeding URL
> for the presentation instead.
>
> [ Which should be considered to be an eternally stable URL, one would
> hope, unless we
>  do get IETF LLC? web admins that like in almost all commercial web pages
> of our industry
>  seem to be run by admins that like to drive users mad by randomnly
> deleting important
>  reference/history information or just changing URLs for spite or some new
> wb page tooling
>  that "did not allow us to keep existing URLs" (typical excuse i hear).
> /rant ]
>
> Aka: I am totally unclear what type of URLs are or are not seen as
> appropriate today by IETF process/RFC-Editor, and if someone could point
> me to any
> reference we have (RFC ? www.ietf.org/.... ?), that would be lovely.
>
> As another example, when defining terms, i sometimes thought it would be
> appropriate
> to point to wikipedia. But given how the understanding of terms is
> changing over time,
> wikipedia definitions might be the worst references to use. Just think of
> all the
> technical terms we where fond of using (e.g.: blacklist) and that are now
> shunned/deprecated
> by us (not even sure what the right word for the process is ;-) - as one
> example category
> for this problem. If at all, it seems to me that references to wikipedia
> could
> really only go to an archived version of a wikipedia page that was used by
> the authors
> when writing the draft/RFC.
>
> Cheers
>    Toerless
>
> On Tue, Apr 25, 2023 at 11:50:24AM -0700, Alexis Rossi wrote:
>
> Hi all,
>
> I wanted to let the community know about something I’ve been working on.
> As you might know, one of my previous jobs was running the Wayback Machine,
> so when I started working with with this collection of RFCs one of my first
> thoughts was, “I wonder how many broken links are in these RFCs from the
> past few decades?”
>
> In general, the average lifespan of a URL before the content changes or
> disappears is on the order of 100 days. Fortunately for us, the links used
> in RFC references seem to be much more stable than that. For instance, so
> far I’ve only found one broken link in an RFC from the past 6 months [1].
>
> Even though we favor these more stable URLs, some of them will eventually
> change or go 404 and having archival documents with link rot is something
> we can take steps to avoid in the future.
>
> The first thing I wanted to do was just make sure we were archiving these
> outlinks somewhere. This won’t fix a broken link in the RFC, but at least
> the resource can be saved elsewhere for someone curious enough to go look
> (and potentially we could fix links in some version of the RFC in future).
>
> The main services that are well qualified for this purpose are
> Archive-It.org <http://archive-it.org/> (run by the Internet Archive) and
> Perma.cc <http://perma.cc/> (run by Harvard Law School Library). I chose
> Archive-It, and when I approached them they offered us an account [2] with
> enough free data storage for our needs. Yay for non-profits supporting each
> other!
>
> So far I have used Archive-It to:
> Archive rfc-editor.org <http://rfc-editor.org/>, iab.org <http://iab.org/>,
> irtf.org <http://irtf.org/>, and ietf.org <http://ietf.org/> (minus
> datatracker and the mail archive)
> There are lots of references to these sites in RFCs, but I also wanted to
> preserve the contents for their own sake. I plan to revisit these sites
> once per year.
> I am avoiding datatracker (except for outlinks from RFCs) because of
> concern about the extra traffic causing problems for the team that
> maintains the site.
> I have not concentrated on archiving the mail archive yet, though I know
> some of it has been saved incidentally.
> Archive outlinks from RFCs
> About once per quarter I’ll grab the outlinks from newly published RFCs
> and get them crawled.
> I am also going backwards through the entire series - I’ve started with
> the most recent RFCs (links are more likely to still be live) and am
> working my way back in time.
>
> There may be more room for improvements here, for example including
> archived links in RFCs from the start w here appropriate, or potentially
> defining a way for links to be self-healing in published RFCs.
>
> Please let me know if you have ideas or feedback on this.
>
> Thanks,
> Alexis
>
>
> [1] RFC9311 published in September 2022, in Section 11 (Informative
> References) this link is 404:
> https://www.ietf.org/how/meetings/98/bits-n-bites/ <
> https://www.ietf.org/how/meetings/98/bits-n-bites/>[2]
> https://archive-it.org/organizations/2540 <
> https://archive-it.org/organizations/2540>
>
>
> _______________________________________________
> rfc-interest mailing list
> rfc-interest@rfc-editor.org
> https://mailman.rfc-editor.org/mailman/listinfo/rfc-interest
>
>
>
> --
> ---
> tte@cs.fau.de
>
> _______________________________________________
> rfc-interest mailing list
> rfc-interest@rfc-editor.org
> https://mailman.rfc-editor.org/mailman/listinfo/rfc-interest
>
>
> --
> Jay Daley
> IETF Executive Director
> exec-director@ietf.org
>
> _______________________________________________
> rfc-interest mailing list
> rfc-interest@rfc-editor.org
> https://mailman.rfc-editor.org/mailman/listinfo/rfc-interest
>