Re: [xml2rfc] #332 (Version 2 cli): Getting error at column 33851 ...

worley@ariadne.com (Dale R. Worley) Mon, 12 June 2017 23:49 UTC

From: worley@ariadne.com
To: "HANSEN, TONY L" <tony@att.com>
Cc: henrik@levkowetz.com, jordi.palet@consulintel.es, xml2rfc@ietf.org
In-Reply-To: <0324DF3A-4072-401B-B875-C9A7D31804AE@att.com> (tony@att.com)
Sender: worley@ariadne.com
Date: Mon, 12 Jun 2017 19:49:49 -0400
Message-ID: <8737b4kbqa.fsf@hobgoblin.ariadne.com>
Archived-At: <https://mailarchive.ietf.org/arch/msg/xml2rfc/SOlidp9TE7wHzNze7t7odRVf4_I>
Subject: Re: [xml2rfc] #332 (Version 2 cli): Getting error at column 33851 ...

"HANSEN, TONY L" <tony@att.com> writes:
> Using CR as the line separator is the root of the issue. The XML
> library we use would treat that as whitespace, but not a newline. That
> would definitely explain things.

That's strange behavior, because section 2.11 ("End-of-Line Handling")
of the XML specification ("Extensible Markup Language (XML) 1.0 (Fifth
Edition)", http://www.w3.org/TR/xml/#sec-line-ends) says that a bare
CR (#xD) must be translated into an LF (#xA):

    To simplify the tasks of applications, the XML processor MUST behave
    as if it normalized all line breaks in external parsed entities
    (including the document entity) on input, before parsing, by
    translating both the two-character sequence #xD #xA and any #xD that
    is not followed by #xA to a single #xA character.
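
To make the rule concrete, here is a minimal Python sketch of the
normalization that section 2.11 describes. The function name
normalize_line_breaks is hypothetical, and this only illustrates the
rule as quoted; it is not how xml2rfc or its XML library actually
handles input.

```python
import re

# Matches CR LF as one unit, or a CR that is not followed by LF.
_LINE_BREAK = re.compile(r"\r\n?")


def normalize_line_breaks(text: str) -> str:
    """Apply the XML 1.0 section 2.11 rule (hypothetical helper).

    Both the two-character sequence #xD #xA and any #xD not followed
    by #xA become a single #xA. Existing #xA characters are untouched.
    """
    return _LINE_BREAK.sub("\n", text)


if __name__ == "__main__":
    # A file that uses bare CR as its line separator (old Mac style).
    source = "<rfc>\r<front/>\r</rfc>\r"
    normalized = normalize_line_breaks(source)
    # After normalization every line ends in LF, so a parser that
    # counts lines by LF would see four line breaks' worth of text
    # instead of one very long line.
    print(repr(normalized))
    print(len(normalized.splitlines()))
```

Under this rule a CR-only file and an LF-only file look the same to
the application, so an error position reported as one line with a
very large column number suggests the normalization (or at least the
line counting) did not treat the CRs as line breaks.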

Dale