Re: [Tools-discuss] idnits 2.16.02 counts line length using bytes and not characters (but xml2rfc 2.34.0 line-wraps counting chars, not bytes)

Daniel Kahn Gillmor <dkg@fifthhorseman.net> Tue, 05 November 2019 15:39 UTC

From: Daniel Kahn Gillmor <dkg@fifthhorseman.net>
To: Henrik Levkowetz <henrik@levkowetz.com>, tools-discuss@ietf.org
Date: Tue, 05 Nov 2019 07:34:53 -0500
Archived-At: <https://mailarchive.ietf.org/arch/msg/tools-discuss/n5tRaWEDdKT1n_PaBm93payYv2g>

On Mon 2019-11-04 22:07:00 +0100, Henrik Levkowetz wrote:

> Agreed.  A rewrite of idnits which will do better in this respect is
> in progress.

great, thanks!

By the way, the inclusion of UTF-8 characters has been really helpful for
this specific draft, as it permits pretty diagrams of MIME structure
that would be tedious to explain in text, or harder to follow visually
if we left out edges between parent and child nodes.
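
To illustrate the mismatch in the subject line, here is a minimal Python
sketch of how one diagram line measures differently in characters and in
UTF-8 bytes.  The helper names, the sample lines, and the 72-column
default are assumptions made for illustration; none of this is taken
from idnits or xml2rfc.

```python
# Illustrative sketch only: these helpers are hypothetical and are not
# part of idnits or xml2rfc.

def line_lengths(line):
    """Return (characters, UTF-8 bytes) for one line of text."""
    return len(line), len(line.encode("utf-8"))


def over_limit(line, limit=72, by_bytes=False):
    """Report whether a line exceeds `limit`.

    The default of 72 is an assumed column limit for this example.
    With by_bytes=True the UTF-8 byte count is compared instead of the
    character count.
    """
    chars, nbytes = line_lengths(line)
    return (nbytes if by_bytes else chars) > limit


# A made-up tree edge drawn with box-drawing characters, each of which
# occupies three bytes in UTF-8.
sample = "└─╴multipart/signed"

# A made-up wide diagram row: 30 vertical bars separated by spaces,
# followed by an ASCII label.
wide = "│ " * 30 + "text/plain"

if __name__ == "__main__":
    print(line_lengths(sample))
    print(over_limit(wide), over_limit(wide, by_bytes=True))
```

A line like `wide` stays within the limit when counted in characters but
exceeds it when counted in bytes, so a character-based wrapper and a
byte-based checker can disagree about the same line.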

I really appreciate the work that goes into modernizing the IETF toolchain
to permit these improvements.

My only regret is that the htmlizer on tools.ietf.org seems to work from
the uploaded .txt variant, rather than building the html directly from
the .xml source.  "xml2rfc --html --v3" produces a document that is
*much* nicer to read (and better structured from an HTML point of view)
than the htmlized txt source.

Is there any plan to generate the HTML from the uploaded xml in this way
for future documents?

   --dkg