Re: [Tools-discuss] Why do we even have text formats any more?

Toerless Eckert <tte@cs.fau.de> Wed, 28 July 2021 14:32 UTC

Return-Path: <eckert@i4.informatik.uni-erlangen.de>
X-Original-To: tools-discuss@ietfa.amsl.com
Delivered-To: tools-discuss@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 7F7763A12C5 for <tools-discuss@ietfa.amsl.com>; Wed, 28 Jul 2021 07:32:02 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -4.198
X-Spam-Level:
X-Spam-Status: No, score=-4.198 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, HEADER_FROM_DIFFERENT_DOMAINS=0.001, RCVD_IN_DNSWL_MED=-2.3, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=ham autolearn_force=no
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id WTcc0DrnTDbx for <tools-discuss@ietfa.amsl.com>; Wed, 28 Jul 2021 07:31:58 -0700 (PDT)
Received: from faui40.informatik.uni-erlangen.de (faui40.informatik.uni-erlangen.de [131.188.34.40]) (using TLSv1.2 with cipher ADH-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id D41D73A12C3 for <tools-discuss@ietf.org>; Wed, 28 Jul 2021 07:31:57 -0700 (PDT)
Received: from faui48e.informatik.uni-erlangen.de (faui48e.informatik.uni-erlangen.de [IPv6:2001:638:a000:4134::ffff:51]) by faui40.informatik.uni-erlangen.de (Postfix) with ESMTP id 4D222548053; Wed, 28 Jul 2021 16:31:52 +0200 (CEST)
Received: by faui48e.informatik.uni-erlangen.de (Postfix, from userid 10463) id 40A4E4E7B8B; Wed, 28 Jul 2021 16:31:52 +0200 (CEST)
Date: Wed, 28 Jul 2021 16:31:52 +0200
From: Toerless Eckert <tte@cs.fau.de>
To: Martin Thomson <mt@lowentropy.net>
Cc: tools-discuss@ietf.org
Message-ID: <20210728143152.GA57091@faui48e.informatik.uni-erlangen.de>
References: <4d70a1ac-a275-420a-83f6-99dfd5b5385c@www.fastmail.com>
MIME-Version: 1.0
Content-Type: text/plain; charset="us-ascii"
Content-Disposition: inline
In-Reply-To: <4d70a1ac-a275-420a-83f6-99dfd5b5385c@www.fastmail.com>
User-Agent: Mutt/1.10.1 (2018-07-13)
Archived-At: <https://mailarchive.ietf.org/arch/msg/tools-discuss/7BlkXvYwuR7r-JHpq2Q3NAvG86I>
Subject: Re: [Tools-discuss] Why do we even have text formats any more?
X-BeenThere: tools-discuss@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: IETF Tools Discussion <tools-discuss.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/tools-discuss>, <mailto:tools-discuss-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/tools-discuss/>
List-Post: <mailto:tools-discuss@ietf.org>
List-Help: <mailto:tools-discuss-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/tools-discuss>, <mailto:tools-discuss-request@ietf.org?subject=subscribe>
X-List-Received-Date: Wed, 28 Jul 2021 14:32:03 -0000

a) As mentioned in before, i would love for datatracker to offer
   the option for users to choose or even upload an html style sheet
   which would then be used for rendering of datatracker drafta/RFC
   for them. This IMHO would be the most easy and useful starting point 
   to leverage the flexibility of HTML.

b) I can not see any way to move away from a pure text editor
   for all core work - writing drafts and commenting on them.
   I can see that this can be done with either a txt or
   a markdown version of drafts/RFCs, not with HTML/XML.

c) RFCdiff is a big thing for me, i guess that can evolve to use other
   imputs than just txt over time.

Cheers
    Toerless

On Wed, Jul 28, 2021 at 11:53:49AM +1000, Martin Thomson wrote:
> I realize that this might be a little inflammatory as far as subjects go, but bear with me.
> 
> There are probably a few narrow cases where rendering plain text is better than HTML.  But what we've been doing for years (thanks to Henrik's great tool) is take text and turn it into HTML using the power of regular expressions.  That's been good, but it's not always reliable (how many errata mention that "Section X of [FOO]" links to Section X of this document?).  It's also been lagging as the text format changed (case in point: lack of a table of contents).
> 
> Here's an alternative: style the HTML so that it looks like the text.  I tried this and it worked shockingly well.
> 
> Repo: https://github.com/martinthomson/rfc-txt-html
> Demo: https://martinthomson.github.io/rfc-txt-html/diff.html
> 
> This isn't perfect, but it seems pretty good to me.  Keep in mind that this took only a little bit of time to sketch out. No doubt it can be improved.  The readme has a bunch of things I found, all minor.
> 
> I don't think that this is the end of text, but a possible way to limit our use of the htmlizer[1].  People who need to automate access to content might still use text, though I will argue that XML is superior in that regard.   The other thing that comes to mind is diffs: HTML-native diff tools are somewhat less than ideal.  Either way, serving HTML is just better.
> 
> Enjoy,
> Martin
> 
> 
> [1] Though I still see a shocking number of people authoring in XML (or XML-capable input formats) and submitting in text.  But I think we have plans to limit that.
> 
> ___________________________________________________________
> Tools-discuss mailing list - Tools-discuss@ietf.org
> This list is for discussion, not for action requests or bug reports.
> * Report datatracker and mailarchive bugs to: datatracker-project@ietf.org
> * Report tools.ietf.org bugs to: webmaster@tools.ietf.org
> * Report all other bugs or issues to: ietf-action@ietf.org
> List info (including how to Unsubscribe): https://www.ietf.org/mailman/listinfo/tools-discuss

-- 
---
tte@cs.fau.de