Re: [Tools-discuss] missing DC.Creator in some RFCs for HTML versions

Henrik Levkowetz <henrik@levkowetz.com> Tue, 11 December 2018 23:31 UTC

To: Daniel Kahn Gillmor <dkg@fifthhorseman.net>, tools-discuss@ietf.org
References: <87k1kfreuc.fsf@fifthhorseman.net>
From: Henrik Levkowetz <henrik@levkowetz.com>
Message-ID: <6bb0b2f1-fcfa-6b30-83b1-444e6c0599cd@levkowetz.com>
Date: Wed, 12 Dec 2018 00:31:01 +0100
In-Reply-To: <87k1kfreuc.fsf@fifthhorseman.net>
Archived-At: <https://mailarchive.ietf.org/arch/msg/tools-discuss/KzjnX_SGAhGxWJ2U-6-4abUj2nM>
Subject: Re: [Tools-discuss] missing DC.Creator in some RFCs for HTML versions

Hi Daniel,

I'll respond to all three DC\..* subject emails in one, as the response
is pretty much the same: the underlying code is quite old, between 10
and 15 years by now, and did not (and does not) have access to any of
the richer data that is available today through the datatracker
database.  It cannot, for instance, differentiate between the author
sets of different versions of a document.

Rewriting the code today to fetch information from the database instead
would give much better results.  That's not on the table at present, but
is something I might spend some time on when all the xml2rfc and idnits
rewrite deliverables are out, and I've recovered.  Maybe in the spring.

However, with respect to author information for old RFCs and drafts, the
datatracker also has missing information.  In a different context it has
recently been discussed whether to try to manually fill in the gaps in
that information.  The format of early drafts and RFCs is sufficiently
irregular that I don't see anything but manual data entry as workable.


Best regards,

	Henrik

On 2018-12-11 21:24, Daniel Kahn Gillmor wrote:
> A handful of published RFCs have an author/editor listed, but do not
> include any <meta> tags that identify DC.Creator.  They're listed at the
> end of this e-mail.
> 
> I don't know why they're missing, but it would be great if we could
> refresh these RFCs' HTML versions so that they have the correct
> metadata elements.
> 
> Regards,
> 
>         --dkg
> 
> published RFCs that are missing DC.Creator tags:
> 
>   rfc109
>   rfc206
>   rfc347
>   rfc571
>   rfc588
>   rfc616
>   rfc1465
>   rfc1591
>   rfc2070
>   rfc2201
>   rfc2279
>   rfc2634
>   rfc2886
>   rfc2901
>   rfc2941
>   rfc2946
>   rfc2953
>   rfc3117
>   rfc3296
>   rfc3305
>   rfc3564
>   rfc3619
>   rfc5540
>   rfc5941
>   rfc6207
>   rfc6262
>   rfc6414
>   rfc6604
> 
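
For anyone who wants to reproduce the kind of check Daniel describes, here
is a minimal sketch of how one might test an HTML rendering for DC.Creator
metadata.  It assumes the author metadata appears as
<meta name="DC.Creator" content="..."> elements; the class and function
names are hypothetical, and the sample markup below is made up for
illustration rather than copied from any real RFC page.

```python
from html.parser import HTMLParser


class CreatorCollector(HTMLParser):
    """Collect the content of every <meta name="DC.Creator"> element."""

    def __init__(self):
        super().__init__()
        self.creators = []

    def handle_starttag(self, tag, attrs):
        # Self-closing <meta ... /> tags are also routed here by HTMLParser.
        if tag != "meta":
            return
        attributes = dict(attrs)
        name = attributes.get("name") or ""
        # Assumption: compare the name case-insensitively.
        if name.lower() == "dc.creator":
            self.creators.append(attributes.get("content") or "")


def dc_creators(html_text):
    """Return the DC.Creator values found in html_text (possibly empty)."""
    parser = CreatorCollector()
    parser.feed(html_text)
    return parser.creators


# Made-up sample documents, one with and one without the tag.
with_creator = '<html><head><meta name="DC.Creator" content="Doe, J." /></head></html>'
without_creator = '<html><head><meta name="DC.Title" content="Example" /></head></html>'

print(dc_creators(with_creator))
print(dc_creators(without_creator))
```

A document for which dc_creators() returns an empty list would be a
candidate for the list above.  Fetching the HTML files themselves is left
out on purpose, since the right source for them is not stated here.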