Re: [rfc-i] Meta decorations in generated HTML

John R Levine <johnl@taugh.com> Fri, 27 May 2022 14:56 UTC

Return-Path: <rfc-interest-bounces@rfc-editor.org>
X-Original-To: ietfarch-rfc-interest-archive@ietfa.amsl.com
Delivered-To: ietfarch-rfc-interest-archive@ietfa.amsl.com
Received: from ietfa.amsl.com (localhost [IPv6:::1]) by ietfa.amsl.com (Postfix) with ESMTP id B2932C2740CA for <ietfarch-rfc-interest-archive@ietfa.amsl.com>; Fri, 27 May 2022 07:56:05 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=ietf.org; s=ietf1; t=1653663365; bh=Gu2elF/Wf6idTrqoJqvyHbTJL5DKn9AfYRA/EGMdla4=; h=Date:From:To:In-Reply-To:References:Subject:List-Id: List-Unsubscribe:List-Archive:List-Post:List-Help:List-Subscribe; b=e5FWd9Ea8ViCTozSduRZmWv+AIapDVxfnb/BaztwV+7sg0KgyvAmm7O2BjWghiQE0 sIAwUZex5L5qsX2fBjZOw6PRkSuDgc6MdnMyWCbQI6QNNO1TvsGeG3av00B0VfcH6V bePV16CCWfaVFcfyrWjgm2vlUQGG5RkkePYAJT14=
X-Mailbox-Line: From rfc-interest-bounces@rfc-editor.org Fri May 27 07:56:05 2022
Received: from ietfa.amsl.com (localhost [IPv6:::1]) by ietfa.amsl.com (Postfix) with ESMTP id 7BFC8C2740B6; Fri, 27 May 2022 07:56:05 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=ietf.org; s=ietf1; t=1653663365; bh=Gu2elF/Wf6idTrqoJqvyHbTJL5DKn9AfYRA/EGMdla4=; h=Date:From:To:In-Reply-To:References:Subject:List-Id: List-Unsubscribe:List-Archive:List-Post:List-Help:List-Subscribe; b=e5FWd9Ea8ViCTozSduRZmWv+AIapDVxfnb/BaztwV+7sg0KgyvAmm7O2BjWghiQE0 sIAwUZex5L5qsX2fBjZOw6PRkSuDgc6MdnMyWCbQI6QNNO1TvsGeG3av00B0VfcH6V bePV16CCWfaVFcfyrWjgm2vlUQGG5RkkePYAJT14=
X-Original-To: rfc-interest@ietfa.amsl.com
Delivered-To: rfc-interest@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 3EB90C2740B4 for <rfc-interest@ietfa.amsl.com>; Fri, 27 May 2022 07:56:04 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.096
X-Spam-Level:
X-Spam-Status: No, score=-2.096 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, RCVD_IN_DNSWL_BLOCKED=0.001, RCVD_IN_ZEN_BLOCKED_OPENDNS=0.001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001, URIBL_DBL_BLOCKED_OPENDNS=0.001, URIBL_ZEN_BLOCKED_OPENDNS=0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=iecc.com header.b=p+WkSKO0; dkim=pass (2048-bit key) header.d=taugh.com header.b=J/scjTT1
Received: from mail.ietf.org ([50.223.129.194]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id oN-4QQWcfbvd for <rfc-interest@ietfa.amsl.com>; Fri, 27 May 2022 07:55:59 -0700 (PDT)
Received: from gal.iecc.com (gal.iecc.com [IPv6:2001:470:1f07:1126:0:43:6f73:7461]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature RSA-PSS (2048 bits) server-digest SHA256) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 31864C2740BA for <rfc-interest@rfc-editor.org>; Fri, 27 May 2022 07:55:58 -0700 (PDT)
Received: (qmail 72848 invoked from network); 27 May 2022 14:55:56 -0000
DKIM-Signature: v=1; a=rsa-sha256; c=simple; d=iecc.com; h=date:message-id:from:to:subject:in-reply-to:references:mime-version:content-type; s=11c8a.6290e67c.k2205; bh=WJBzMIkCYFkt5B+KCjBnQucVnOX0t+z4hjQwAkRiBOA=; b=p+WkSKO0A1GygxmnFhNehqBLxWykWiHieLgaXohzT2mQQuuhuwKFwSWAKxkAkAWMQk7iljmAqQdmY86cyVmvu9UUzLiPEbX5z0lph1dsQ5A84aAC88+fP0GpFLfTD0oVEFFppX5kr26JScII3SqmHc1i3ERQmW6DTvp1MW9wR5femaS5iehVlnjgM43Y9/adZJ5GqrpMvX9U/QbkM28W+8OjQGZPqlpOqLtfjA+X5G8ZrrmsClsmGzZztz6aGwN5We5O+XC7OWSUK5auIJYxMVUN6XMpJF//8N/fvVFBIfkXKBBRuxpod6hrnKBfabCUa/G//bbFOC2whw+jD6FYEA==
DKIM-Signature: v=1; a=rsa-sha256; c=simple; d=taugh.com; h=date:message-id:from:to:subject:in-reply-to:references:mime-version:content-type; s=11c8a.6290e67c.k2205; bh=WJBzMIkCYFkt5B+KCjBnQucVnOX0t+z4hjQwAkRiBOA=; b=J/scjTT1LugUTF2+gWyGr34t5JvhHy+3SaSGIB5MeuY9NGp0RN+CIclef6/XzbA0ehu6gEYnv22hpitFTrGNKRSbp9wMnYJjDasBmr6yePUSSoHkf4t5gKO1bsRsZ+le4fzC+VUeCUNh20f/U4XQkfp2jdK0UsKWwb8p0+fQP6yXbViHsbLFVvUO9v+eZN4g39+WzOMhzioWxD6CSrMVlFyPVrFVV2IGhGHLFMqSA101v7qRjAY5bf5hEBUi9hfiPK75hbXcoyu7UgzRzEfxkJOc+VPwiU9y0JVxV4m/jQC7eUmiacCrEBvq2Xc9xcR8Po+AL//63SPvxQXPEgaxuQ==
Received: from ary.qy ([IPv6:2001:470:1f07:1126::78:696d:6170]) by imap.iecc.com ([IPv6:2001:470:1f07:1126::78:696d:6170]) with ESMTPS (TLS1.3 ECDHE-RSA AES-256-GCM AEAD) via TCP6; 27 May 2022 14:55:56 -0000
Received: by ary.qy (Postfix, from userid 501) id DEC5D41C2762; Fri, 27 May 2022 10:55:55 -0400 (EDT)
Received: from localhost (localhost [127.0.0.1]) by ary.qy (Postfix) with ESMTP id 6C8BC41C2742; Fri, 27 May 2022 10:55:55 -0400 (EDT)
Date: Fri, 27 May 2022 10:55:55 -0400
Message-ID: <d603174c-db26-4fcb-0a60-a1f8b714c951@taugh.com>
From: John R Levine <johnl@taugh.com>
To: Michael Richardson <mcr+ietf@sandelman.ca>, rfc-interest@rfc-editor.org
X-X-Sender: johnl@ary.qy
In-Reply-To: <27659.1653660447@localhost>
References: <20220525203826.8606A41A4E93@ary.qy> <f0f92d4c-8cc4-c3bb-0f0d-96c3ad422303@gmx.de> <C826D239-7CCB-404E-9591-B33C34ED82C9@tzi.org> <5afe0f29-ab5a-b79e-cad4-7c18cf8fc5d3@gmx.de> <0ab66d2e-aa7d-eb17-83dc-2774e9d021a7@taugh.com> <27659.1653660447@localhost>
MIME-Version: 1.0
Archived-At: <https://mailarchive.ietf.org/arch/msg/rfc-interest/aabboUV-CBX3Hx13veSBz25VXKs>
Subject: Re: [rfc-i] Meta decorations in generated HTML
X-BeenThere: rfc-interest@rfc-editor.org
X-Mailman-Version: 2.1.34
Precedence: list
List-Id: "A list for discussion of the RFC series and RFC Editor functions." <rfc-interest.rfc-editor.org>
List-Unsubscribe: <https://mailman.rfc-editor.org/mailman/options/rfc-interest>, <mailto:rfc-interest-request@rfc-editor.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/rfc-interest/>
List-Post: <mailto:rfc-interest@rfc-editor.org>
List-Help: <mailto:rfc-interest-request@rfc-editor.org?subject=help>
List-Subscribe: <https://mailman.rfc-editor.org/mailman/listinfo/rfc-interest>, <mailto:rfc-interest-request@rfc-editor.org?subject=subscribe>
Content-Transfer-Encoding: 7bit
Content-Type: text/plain; charset="us-ascii"; Format="flowed"
Errors-To: rfc-interest-bounces@rfc-editor.org
Sender: rfc-interest <rfc-interest-bounces@rfc-editor.org>

On Fri, 27 May 2022, Michael Richardson wrote:
> view-source:https://www.rfc-editor.org/rfc/rfc6698.html (which was an open tab)
> view-source:https://www.rfc-editor.org/rfc/rfc8995.html
>
> and in the former, I see lots of interesting <meta name="citation_...
> stuff, but in the later, I do not.

Right.  For the pre-XML RFCs, the HTML version is generated on the fly by 
the RFC Editor's web server so it was easy to adjust the script to add the 
tags.  But now the HTML is a static version created by xml2rfc, so it will 
have to add the tags.  This also means we need to re-render the existing 
RFCs to get the tags but I expect to do that for other reasons.

> In the former, I do see an almost good self-reference:
>   <meta name="citation_pdf_url" content="https://www.rfc-editor.org/rfc/pdfrfc/rfc6698.txt.pdf"/>

That's a link to the PDF version of an HTML document.  Scholar says they 
use it, so I add it.

> I looked this up because I was thinking that some of our indexing of AUTH48
> drafts might be dealt with if the indexers could be told where the canonical
> URL of the document is.

The way to keep AUTH48 pages out of search engines is to tell them not to 
index them.  Earlier this month I helped the RPC update the web site so 
spiders will skip the authors' directory.  The AUTH48 pages that were 
already indexed will eventually age out of the search engines.

Regards,
John Levine, johnl@taugh.com, Taughannock Networks, Trumansburg NY
Please consider the environment before reading this e-mail. https://jl.ly

_______________________________________________
rfc-interest mailing list
rfc-interest@rfc-editor.org
https://mailman.rfc-editor.org/mailman/listinfo/rfc-interest