Re: [xml2rfc-dev] <contact> element in <t>

"Martin Thomson" <mt@lowentropy.net> Wed, 11 December 2019 02:16 UTC

Date: Wed, 11 Dec 2019 13:16:32 +1100
From: Martin Thomson <mt@lowentropy.net>
To: Henrik Levkowetz <henrik@levkowetz.com>, xml2rfc-dev@ietf.org
Archived-At: <https://mailarchive.ietf.org/arch/msg/xml2rfc-dev/ORQV0FD_z59vezE_fqWfPPcRLR8>

The XML in question is produced from kramdown, but you can see the input here:

https://github.com/quicwg/base-drafts/compare/contributors?expand=1

This expands into pretty much what you see below.
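
For anyone wanting a self-contained fragment to poke at, here is a minimal sketch in Python (standard library only) that rebuilds the reduced <t> input quoted further down. The helper names build_reduced_input and non_ascii_fullnames are made up for this illustration, and the contact list mixes the three fullname values mentioned in this thread. It only produces the <t> fragment, not a complete document, and it does not call xml2rfc itself.

```python
import xml.etree.ElementTree as ET


def build_reduced_input(contacts):
    """Build a <t> element holding one <contact> per (fullname, ascii_fullname) pair.

    Hypothetical helper: it mirrors the reduced input from this thread,
    not the actual kramdown output.
    """
    t = ET.Element("t")
    t.text = "The following people helped:\n"
    for fullname, ascii_fullname in contacts:
        attrs = {"fullname": fullname}
        if ascii_fullname is not None:
            attrs["asciiFullname"] = ascii_fullname
        contact = ET.SubElement(t, "contact", attrs)
        # Separate the names with a comma and a newline, as in the reduced input.
        contact.tail = ",\n"
    return t


def non_ascii_fullnames(t):
    """Return the fullname values that contain non-ASCII characters."""
    return [
        c.get("fullname")
        for c in t.iter("contact")
        if not c.get("fullname").isascii()
    ]


# The fullname values discussed in this thread.
contacts = [
    ("Mirja Kühlewind", None),
    ("?", "Kazuho Oku"),
    ("投稿者", None),
]

t = build_reduced_input(contacts)
xml_text = ET.tostring(t, encoding="unicode")
print(xml_text)
print(non_ascii_fullnames(t))
```

Pasting the printed fragment into a section of an otherwise valid v3 document should give a small test case for the text writer. Whether it triggers the same pagination error is untested here.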

On Wed, Dec 11, 2019, at 12:39, Henrik Levkowetz wrote:
> Hi Martin,
> 
> On 2019-12-11 02:13, Martin Thomson wrote:
> > Hi Henrik,
> > 
> > I just tested this new element out in xml2rfc 2.37.0.  I think that there are some bugs to work out.
> > 
> > I have the following input (in reduced form):
> > 
> > <t>The following people helped:
> > ... (ascii names) ...,
> > <contact fullname="Mirja Kühlewind"/>,
> > ...
> > <contact asciiFullname="Kazuho Oku" fullname="?"/>,
> > ...
> > </t>
> > 
> > I don't have Kazuho's real name just yet, so I just added "?" for the moment.
> > 
> > What I have observed is that the output includes many additional line
> > breaks. If I get this right, this is producing a new paragraph break
> > for the name included in <contact>. That's not expected.
> 
> No, and I haven't seen this here in my tests.  Could you send your XML?
> 
> > It also appears to remove the "?" I'm using as a placeholder in this
> > example. I can put some Unicode in fullname (for instance, "øøø"
> > works fine), but the Japanese text I copied from somewhere failed.
> > 
> > fullname="投稿者" causes xml2rfc to choke:
> > 
> >   File "/home/martin/bin/xml2rfc", line 11, in <module>
> >     load_entry_point('xml2rfc==2.37.0', 'console_scripts', 'xml2rfc')()
> >   File "/home/martin/.local/lib/python3.8/site-packages/xml2rfc/run.py", line 590, in main
> >     writer.write(filename)
> >   File "/home/martin/.local/lib/python3.8/site-packages/xml2rfc/writers/text.py", line 274, in write
> >     lines = self.paginate(lines)
> >   File "/home/martin/.local/lib/python3.8/site-packages/xml2rfc/writers/text.py", line 412, in paginate
> >     paginated[i].elem.page = page
> > AttributeError: 'lxml.etree._Element' object has no attribute 'page'
> > 
> > I thought that that specific error might be due to placement, but
> > moving the text around didn't help.
> 
> Interesting.  I haven't seen this in my tests either.  I used Georgian
> Mkhedruli though, not Japanese Kanji.  I'll try that tomorrow, fix, and do a
> bugfix release.  I see this is related to pagination, so again your XML file
> might help debug this.
> 
> 
> 	Henrik
> 
> 