Re: [rfc-i] looking for a volunteer to write a simple script

Joe Touch <touch@strayalpha.com> Fri, 12 July 2019 17:31 UTC

Date: Fri, 12 Jul 2019 10:30:52 -0700
From: Joe Touch <touch@strayalpha.com>
To: Julian Reschke <julian.reschke@gmx.de>
In-Reply-To: <e86b8894-4d7a-4c9d-3476-0221a94c9eb0@gmx.de>
References: <62c8413d-c735-4ec3-8b22-eb0fa5356636@Spark> <38d0704f-348c-4ec0-9d94-340747960201@Spark> <e86b8894-4d7a-4c9d-3476-0221a94c9eb0@gmx.de>
Message-ID: <6a0ca8d52b7ac7ec004b6479381d61e7@strayalpha.com>
Subject: Re: [rfc-i] looking for a volunteer to write a simple script
Cc: RFC Interest <rfc-interest@rfc-editor.org>, Heather Flanagan <rse@rfc-editor.org>

On 2019-07-12 10:23, Julian Reschke wrote:

> On 12.07.2019 18:55, Heather Flanagan wrote: 
> 
>> Hello everyone!
>> 
>> The RFC Editor has the need for a comparatively simple script that would
>> automatically add <bcp14></bcp14> tags to requirement language in v3 RFCs.
>> 
>> Specifically, this would take a v3 XML input file, and create a v3 XML
>> output file with <bcp14></bcp14> added around each instance of a 2119
>> keyword in the file. (MUST, MUST NOT, REQUIRED, SHALL, SHALL NOT,
>> SHOULD, SHOULD NOT, RECOMMENDED, NOT RECOMMENDED, MAY, and OPTIONAL)
>> 
>> Anyone up for helping us out with that?
>> 
>> Thanks! Heather
>> ...
> 
> The tricky part is to find the right instances. For instance, what if it
> appears in a quote, or in artwork? Or if "SHALL NOT" is across a line
> break...

That's actually quite easy in Perl - including retaining the original
line breaks/whitespace. 
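
A minimal sketch of the matching step, written in Python rather than Perl
purely for illustration. The names KEYWORDS, KEYWORD_RE and tag_keywords
are made up for this example. It assumes the keywords are always upper
case and that any run of whitespace, including a line break, may separate
the words of a multi-word keyword. The matched text is wrapped as-is, so
the original whitespace is retained.

```python
import re

# BCP 14 (RFC 2119) keywords from the request, multi-word forms first so
# that "MUST NOT" is preferred over "MUST" at the same position.
KEYWORDS = [
    "NOT RECOMMENDED", "SHOULD NOT", "SHALL NOT", "MUST NOT",
    "RECOMMENDED", "REQUIRED", "OPTIONAL", "SHOULD", "SHALL", "MUST", "MAY",
]

# Allow any whitespace run (spaces, tabs, newlines) between the words of a
# multi-word keyword. \b keeps "MAY" from matching inside "MAYBE".
KEYWORD_RE = re.compile(
    r"\b(?:" + "|".join(k.replace(" ", r"\s+") for k in KEYWORDS) + r")\b"
)


def tag_keywords(text: str) -> str:
    """Wrap each keyword in <bcp14> tags without altering any whitespace."""
    return KEYWORD_RE.sub(lambda m: "<bcp14>" + m.group(0) + "</bcp14>", text)


if __name__ == "__main__":
    # The keyword is split across a line break; the break is kept verbatim.
    print(tag_keywords("Senders SHALL\n   NOT retry, but MAY log."))
```

On its own this tags every occurrence, including ones inside artwork or
ones that are already tagged, so it is only the first half of the job.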

> So the output will require sanity checking.
> 
> I assume that the tool is supposed to preserve whitespace, line breaks
> etc? This essentially rules out running the input through an XML parser...

That shouldn't be necessary. 
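
One way to do this without a parser, sketched in Python for illustration:
copy "protected" regions through byte for byte and rewrite only the text
between them. The names PROTECTED_RE and apply_outside are hypothetical,
and the choice of regions to skip (comments, CDATA, <artwork>,
<sourcecode>, already tagged <bcp14> elements, and the tags themselves)
is an assumption, not an agreed list. It also assumes no attribute value
contains a literal ">".

```python
import re

# Regions that are copied through untouched. The last alternative matches
# any remaining tag, so attribute values are never rewritten.
PROTECTED_RE = re.compile(
    r"<!--.*?-->"
    r"|<!\[CDATA\[.*?\]\]>"
    r"|<(artwork|sourcecode|bcp14)\b[^>]*?(?<!/)>.*?</\1\s*>"
    r"|<[^>]*>",
    re.DOTALL,
)


def apply_outside(text, transform):
    """Apply transform() only to the text between protected regions."""
    out = []
    pos = 0
    for m in PROTECTED_RE.finditer(text):
        out.append(transform(text[pos:m.start()]))  # rewritable text
        out.append(m.group(0))                      # protected, verbatim
        pos = m.end()
    out.append(transform(text[pos:]))               # trailing text
    return "".join(out)


if __name__ == "__main__":
    # Stand-in transform for the demo; a real run would pass a function
    # that tags all of the 2119 keywords.
    def demo(s):
        return s.replace("MUST", "<bcp14>MUST</bcp14>")

    src = "<t>You MUST.</t>\n<artwork>\n  MUST\n</artwork>\n"
    print(apply_outside(src, demo))
```

Everything outside the rewritten spans is emitted unchanged, so line
breaks and indentation survive. Keywords inside quoted text cannot be
told apart this way, so the output still needs a manual check.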

Joe