Re: [rfc-i] rfcNNNNN.json

Alice Russo <arusso@amsl.com> Tue, 11 May 2021 04:22 UTC

Return-Path: <rfc-interest-bounces@rfc-editor.org>
X-Original-To: ietfarch-rfc-interest-archive@ietfa.amsl.com
Delivered-To: ietfarch-rfc-interest-archive@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 25BEE3A064A; Mon, 10 May 2021 21:22:47 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.651
X-Spam-Level:
X-Spam-Status: No, score=-2.651 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, HEADER_FROM_DIFFERENT_DOMAINS=0.249, MAILING_LIST_MULTI=-1, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=ham autolearn_force=no
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id bRbOMBNHYUof; Mon, 10 May 2021 21:22:41 -0700 (PDT)
Received: from rfc-editor.org (rfc-editor.org [4.31.198.49]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id B87A73A0650; Mon, 10 May 2021 21:22:41 -0700 (PDT)
Received: from rfcpa.amsl.com (localhost [IPv6:::1]) by rfc-editor.org (Postfix) with ESMTP id 091E0F407D5; Mon, 10 May 2021 21:22:40 -0700 (PDT)
X-Original-To: rfc-interest@rfc-editor.org
Delivered-To: rfc-interest@rfc-editor.org
Received: from localhost (localhost [127.0.0.1]) by rfc-editor.org (Postfix) with ESMTP id E3969F407D5 for <rfc-interest@rfc-editor.org>; Mon, 10 May 2021 21:22:38 -0700 (PDT)
X-Virus-Scanned: amavisd-new at rfc-editor.org
Received: from rfc-editor.org ([127.0.0.1]) by localhost (rfcpa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id TU4VUJGzUVGm for <rfc-interest@rfc-editor.org>; Mon, 10 May 2021 21:22:33 -0700 (PDT)
Received: from mail.amsl.com (c8a.amsl.com [4.31.198.40]) by rfc-editor.org (Postfix) with ESMTPS id 536ECF407AA for <rfc-interest@rfc-editor.org>; Mon, 10 May 2021 21:22:33 -0700 (PDT)
Received: from localhost (localhost [127.0.0.1]) by c8a.amsl.com (Postfix) with ESMTP id 84BFF389FB5; Mon, 10 May 2021 21:22:34 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
Received: from c8a.amsl.com ([127.0.0.1]) by localhost (c8a.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id rOAG6YXzsZRI; Mon, 10 May 2021 21:22:34 -0700 (PDT)
Received: from [10.0.0.219] (c-71-227-153-114.hsd1.wa.comcast.net [71.227.153.114]) by c8a.amsl.com (Postfix) with ESMTPSA id 4C1EA389FB4; Mon, 10 May 2021 21:22:34 -0700 (PDT)
Mime-Version: 1.0 (Mac OS X Mail 10.3 \(3273\))
From: Alice Russo <arusso@amsl.com>
In-Reply-To: <0DF87392-01E3-4E24-BD99-B48DF5DFB685@tzi.org>
Date: Mon, 10 May 2021 21:22:31 -0700
Message-Id: <07D75228-A388-4B7D-AB64-E96556AC2517@amsl.com>
References: <20210423211312.3FBBA73F0D94@ary.qy> <0DF87392-01E3-4E24-BD99-B48DF5DFB685@tzi.org>
To: Carsten Bormann <cabo@tzi.org>
X-Mailer: Apple Mail (2.3273)
Subject: Re: [rfc-i] rfcNNNNN.json
X-BeenThere: rfc-interest@rfc-editor.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: "A list for discussion of the RFC series and RFC Editor functions." <rfc-interest.rfc-editor.org>
List-Unsubscribe: <https://www.rfc-editor.org/mailman/options/rfc-interest>, <mailto:rfc-interest-request@rfc-editor.org?subject=unsubscribe>
List-Archive: <http://www.rfc-editor.org/pipermail/rfc-interest/>
List-Post: <mailto:rfc-interest@rfc-editor.org>
List-Help: <mailto:rfc-interest-request@rfc-editor.org?subject=help>
List-Subscribe: <https://www.rfc-editor.org/mailman/listinfo/rfc-interest>, <mailto:rfc-interest-request@rfc-editor.org?subject=subscribe>
Cc: rfc-interest@rfc-editor.org, John Levine <johnl@taugh.com>
Content-Type: text/plain; charset="utf-8"
Content-Transfer-Encoding: base64
Errors-To: rfc-interest-bounces@rfc-editor.org
Sender: rfc-interest <rfc-interest-bounces@rfc-editor.org>

Hi Carsten,

Following up re: JSON files of RFC metadata.  To summarize, items #1 and #3 have been addressed; #2 requires ongoing data maintenance; #4 is queued to be addressed. Details below.

1) missing DOIs for 252 recent RFCs. 
-> Corrected files were posted 28 April. (The underlying process issue has been corrected.)

2) mismatch of JSON files for "Not Issued" and the list you provided of files that don't have .txt files.
-> It's intentional that there are records for "not issued" RFC numbers. The list on https://www.rfc-editor.org/never-issued/ as well as corresponding records in rfc-index.xml (and JSON files) are periodically updated. 

3) nonsensical pub-dates for "Not Issued" RFC numbers
-> Corrected files were posted 28 April. (This was an issue in index_json_creator.php; it has been updated.)

4) "broken RFC name" - extraneous whitespace in, e.g., obsoleted_by field
-> Queued to be addressed.

Regarding whether the JSON files are intended to be a complete resource:
Essentially, yes. At this point, rfc-index.xml is a more complete resource (and has rfc-index.xsd). Currently there is not a schema file for the JSON files; their content is intended to match rfc-index.xml. More info is here:  https://www.rfc-editor.org/rse/wiki/doku.php?id=rfc_metadata_in_the_v3_era

Alice Russo
RFC Production Center

On Apr 23, 2021, at 9:35 PM, Sandy Ginoza <sginoza@amsl.com> wrote:
> 
> Carsten,
> 
> Thank you for bringing this to our attention. We will examine these issues and get back to you.  
> 
> When you notice bugs or inconsistencies, please send mail to <rfc-editor@rfc-editor.org>.  The right people will get the message and look into the issues. 
> 
> Thanks,
> Sandy 
> 
>> On Apr 23, 2021, at 4:40 PM, Carsten Bormann <cabo@tzi.org> wrote:
>> 
>> On 23. Apr 2021, at 23:13, John Levine <johnl@taugh.com> wrote:
>>> 
>>> That's a bug.
>> 
>> Thanks for the confirmation.
>> 
>> So my real question is maybe:
>> 
>> Are these files on a trajectory to become a reliable data source?
>> And, if yes, who should I be talking to when I find bugs or inconsistencies?
>> Is there an issue tracker somewhere?
>> Where is documentation collected (so I could, e.g., submit my CDDL file)?
>> 
>> More weirdness below ("pub_date"=>"November -0001”, spurious spaces in RFC names).
>> There also is the interesting case of RFC 1384 that was obsoleted by “RTR0011”, but that is also in rfc-index.txt, so I should probably believe that.
>> 
>> Grüße, Carsten
>> 
>> 
>> 
>> ** Features potentially used (rfc10.json): broken RFC name: [" RFC0016"]
>> ** Features potentially used (rfc1009.json): broken RFC name: [" RFC1812"]
[...]
_______________________________________________
rfc-interest mailing list
rfc-interest@rfc-editor.org
https://www.rfc-editor.org/mailman/listinfo/rfc-interest