Re: [Tools-discuss] [rfc-i] missing line ends in generated TXT (from datatracker)

Julian Reschke <julian.reschke@gmx.de> Tue, 12 November 2019 16:15 UTC

Return-Path: <julian.reschke@gmx.de>
X-Original-To: tools-discuss@ietfa.amsl.com
Delivered-To: tools-discuss@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 9C14E12004A for <tools-discuss@ietfa.amsl.com>; Tue, 12 Nov 2019 08:15:57 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.599
X-Spam-Level:
X-Spam-Status: No, score=-2.599 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, FREEMAIL_FROM=0.001, RCVD_IN_DNSWL_LOW=-0.7, SPF_HELO_NONE=0.001, SPF_PASS=-0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (1024-bit key) header.d=gmx.net
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id A4A9a-RWG9HR for <tools-discuss@ietfa.amsl.com>; Tue, 12 Nov 2019 08:15:54 -0800 (PST)
Received: from mout.gmx.net (mout.gmx.net [212.227.17.20]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 492B912008B for <tools-discuss@ietf.org>; Tue, 12 Nov 2019 08:15:54 -0800 (PST)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=gmx.net; s=badeba3b8450; t=1573575280; bh=6Y4TBXbPqFNbcszUIpP/vN8C5/rOgah4mPN/QlqhStY=; h=X-UI-Sender-Class:Subject:To:Cc:References:From:Date:In-Reply-To; b=b88WbS3oqyfRmLk169Gj8TiLTogPWJlI8dbuA7+uNuoh08lhdL+8F/JYgzk5TOOTR q7mfewya/LWXvhk3oAyTkAMr+HktExXgSfoAooYmPRz8s/Y9m7pACF9g/anie8HSqk pLk5iMN3VtmkrVy23Y5XuiXE/qQxh0eiMnot018g=
X-UI-Sender-Class: 01bb95c1-4bf8-414a-932a-4f6e2808ef9c
Received: from [192.168.1.34] ([217.91.35.233]) by mail.gmx.com (mrgmx105 [212.227.17.168]) with ESMTPSA (Nemesis) id 1MIx3C-1iA8Fe0zUI-00KPZ5; Tue, 12 Nov 2019 17:14:40 +0100
To: Henrik Levkowetz <henrik@levkowetz.com>, Dave Rice <dave@dericed.com>
Cc: rfc-interest <rfc-interest@rfc-editor.org>, tools-discuss@ietf.org, Robert Sparks <rjsparks@nostrum.com>
References: <20285.1568652805@localhost> <705AA16B-68B1-4B5E-AE47-0714F6781C1A@dericed.com> <26105.1568818137@localhost> <E70CD0FF-0294-47E5-873F-5E184A248577@dericed.com> <1268.1568828850@localhost> <B4047303-E2B9-4B9D-B368-2ACC85A06D15@dericed.com> <15814.1569013442@localhost> <3a8f5865-9f5c-8c65-63be-bca31c5f1c86@levkowetz.com> <3650B5EF-A442-4291-928A-2FCB2144134C@dericed.com> <06122b6d-d5f9-30b9-2db4-a2427d299a1f@levkowetz.com> <CB32A50B-D6F2-41C4-8AC7-750E35F4941D@dericed.com> <97e68838-ac0c-25e4-67a9-a2a998b6e8af@gmx.de> <3ef6a53c-e18a-9fc7-8324-67f388acd32f@gmx.de> <60a9d29f-03ef-5842-0c21-6930293759da@gmx.de> <cf651c95-9a22-775e-16d9-ebfdc9ee6ed0@gmx.de> <730f143a-f751-e367-22ad-f1fbf53b1c5a@nostrum.com> <6b05a391-ad8c-285a-b8bd-c5f13fbfa260@gmx.de> <83EE89DB-B6EB-4495-A5B9-2793BF22F220@dericed.com> <2d06c521-0218-e3d4-61e4-cc329369f4aa@levkowetz.com>
From: Julian Reschke <julian.reschke@gmx.de>
Message-ID: <0558fd8e-4ab6-3c6d-0e50-321ba02f61da@gmx.de>
Date: Tue, 12 Nov 2019 17:14:32 +0100
User-Agent: Mozilla/5.0 (Windows NT 10.0; WOW64; rv:60.0) Gecko/20100101 Thunderbird/60.9.1
MIME-Version: 1.0
In-Reply-To: <2d06c521-0218-e3d4-61e4-cc329369f4aa@levkowetz.com>
Content-Type: text/plain; charset="utf-8"; format="flowed"
Content-Language: en-US
Content-Transfer-Encoding: quoted-printable
X-Provags-ID: V03:K1:V/v3LSlnqeqKCNr6C/r+xqxt34X7PoD5m9XBhuZ8ZjJmyBa+sg+ eTFbWNncOjaDzTrqtVh1d8MgPEZnfNy7NPvCnKqXgERfdJiddOVe7iURzSOVI+ZxEdCg6cx GLyORh8/d0v0dsc1iUSVlLyMTBdzIcuy/o0Up8cBHT703PG8w9HMiblg9L1r3X6mvLMr0qO VEc1v4UsU8AyhV+TJ0iLA==
X-UI-Out-Filterresults: notjunk:1;V03:K0:+WwqXV29G/c=:CpJBB5I+L2x+wwj1ZuYg+q vGcEzlPIQSt/C6wH4B7/KJzxwxzaVJ+i1dnbRuKum1LI38mqqE2KMETMfkDoeP0+KXrXAiMNj RF0qzQSAbnPsmdNwqsBASKWxR3Ze4AI4tYxp0eJIXouSNBE4n+3eW4llclcf3P3GqXVw4P95w mHyxLfuMIHek3+l65kztuZNvbx9tIj8Zu0vKuZgxwyoFS8KEDSWJCa2hr8DqFC00y/Fx8dI5l jR8bE+oHVvzILgEfZG7kqe59PYVBkjD5WVZtWlZGD/wTDpBo5FJU/aXdl5JNtMEb58ozEaXhg nBV8XtT+oAOT9p85Xe5UqpEelQAxHCyZozNGm08/WMtdsd3XY8Wv5qkaU/e+vgMRRSxd4R31M VzVUOXc8R/jE3vDVmpWMwfHks4N01qDfjr+F89hH8HDEtOOYkuNE9785N9Zp2/kdSeSFMR0Py 0P8pOZI8QFEAxixElxWueEgQOx+Dvnx2pf9WQgGefPLxODaAJPb5rWXoAasXrpdfm/RtIy8lB yRUlhp92T67XaOrjgU0mTAnxoQeNtrEiP5cwVAXMklMESwWCpLeUBmXXIHvu52Lfh27RIt6Dg vsHSKPyM6TA9hoiWaFfWz+FFFmVD0B0R4yXurJAUMkSJvFEg2wRl+e60CKWDJQejG+j/BJW5/ /m/XYjMO5zAR7MGB1Rrm79Pag3SKeAsqbHkIJWU4nnaMvHQmcl/StL4HVHYv6XU/eiPxcjEQ/ Az4zI8ibQAWPbTeCX4DyD0drHBc7ei5c9JGZDSDtBwxJN2AMGyEQgaMH7XFx+7zx9Q8thn4Fp 10IIETFGlj0CzGovumNXB15M+pw8qQbNDsoDskfzDq9RHRSaUEcgqf5i+PVF048UnePTQ7Ob1 XI22oFiaiqF2eqVAgZmmHYUHGCHOWibXt2vcVGlFjkApEkMDaZnVa0tkYtE57S31GeC5fuMZj 3IiwOMpppboJdlU4UPW01+sjjJs5SgwrTNOP37/reVYl5wlU7FaAcwUL0dTBYiCJwlFO0OMP8 kIqISKYDT5ARbboXcGhrPd45Mo1kvvOiQxUA7CNjlAJ76uvNgz+5G9T/BzFSMc/m8lUNulkF6 OVPQJkZjPMjI0lXaHB29t4ddgnxEE16LIBNtczDnq6DVlwOt7vhqF2zBncU/tAPCuNHzYgti4 QwlpTVbocft7Rb8hRtJ1nxLe7eYFbhjEJRvI4OSVEk25uT+LcZCDQhCkX5OY3gcT4WEVzrVXZ CbxKxAKCnhS5lLNOUH5DlzvQT9bBetA/lcBUuSKfU8eQp9F1J8SGhx9sHagc=
Archived-At: <https://mailarchive.ietf.org/arch/msg/tools-discuss/KwCW_4Jr5VejD3huzezjcojLINQ>
Subject: Re: [Tools-discuss] [rfc-i] missing line ends in generated TXT (from datatracker)
X-BeenThere: tools-discuss@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: IETF Tools Discussion <tools-discuss.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/tools-discuss>, <mailto:tools-discuss-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/tools-discuss/>
List-Post: <mailto:tools-discuss@ietf.org>
List-Help: <mailto:tools-discuss-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/tools-discuss>, <mailto:tools-discuss-request@ietf.org?subject=subscribe>
X-List-Received-Date: Tue, 12 Nov 2019 16:15:57 -0000

On 12.11.2019 16:28, Henrik Levkowetz wrote:
> Hi David,
>
> On 2019-11-11 22:23, Dave Rice wrote:
>> FWIW, for the cellar working group, we pipe our rfc xml through sed
>> to find all occurrence of <sourcecode> and replace them with
>> <artwork>. It’s a loss in semantics but seems to workaround this
>> issue for now. We’ll watch the progress of the ticket and remove our
>> workaround when it’s resolved.
>
> Another workaround is to generate the txt file yourself, and upload with
> the XML file.  I've started looking into why standalone invocation of
> xml2rfc handles <sourecode> correctly, while invocation as a library
> function looses newlines, but I don't have an answer yet.
> ...

A wild guess would be that the XML whitespace treatment might be
affected by the grammar (when the parser is in validating mode).

See:

>  artwork =
>      element artwork {
>        attribute xml:base { text }?,
>        attribute xml:lang { text }?,
>        attribute anchor { xsd:ID }?,
>        attribute pn { xsd:ID }?,
>        attribute xml:space { text }?,
>        [ a:defaultValue = "" ] attribute name { text }?,
>        [ a:defaultValue = "" ] attribute type { text }?,
>        attribute src { text }?,
>        [ a:defaultValue = "left" ]
>        attribute align { "left" | "center" | "right" }?,
>        [ a:defaultValue = "" ] attribute alt { text }?,
>        [ a:defaultValue = "" ] attribute width { text }?,
>        [ a:defaultValue = "" ] attribute height { text }?,
>        attribute originalSrc { text }?,
>        (text* | svg)
>      }

>    sourcecode =
>      element sourcecode {
>        attribute xml:base { text }?,
>        attribute xml:lang { text }?,
>        attribute anchor { xsd:ID }?,
>        attribute pn { xsd:ID }?,
>        [ a:defaultValue = "" ] attribute name { text }?,
>        [ a:defaultValue = "" ] attribute type { text }?,
>        [ a:defaultValue = "false" ]
>        attribute markers { "true" | "false" }?,
>        attribute src { text }?,
>        attribute originalSrc { text }?,
>        text
>      }

So those differ in the presence of xml:space, and the actual content
model (just text vs text or <svg>).

Best regards, Julian