Re: [xml2rfc] assuming that period (.) ends a sentence is sometimes wrong

Carsten Bormann <cabo@tzi.org> Sat, 27 February 2021 02:54 UTC

From: Carsten Bormann <cabo@tzi.org>
In-Reply-To: <87wnuucjra.fsf@fifthhorseman.net>
Date: Sat, 27 Feb 2021 03:54:13 +0100
Cc: xml2rfc@ietf.org
Message-Id: <1B52AA0D-1FA1-41F2-8826-9C47C80D378A@tzi.org>
References: <87wnuucjra.fsf@fifthhorseman.net>
To: Daniel Kahn Gillmor <dkg@fifthhorseman.net>
Archived-At: <https://mailarchive.ietf.org/arch/msg/xml2rfc/TJCVFLnm7yJ6SFUcQQ7izLUJKfs>

I’d replace the “R. Z” with “R. Z”, using a no-break space (nbsp, U+00A0) after the period in the second one.
OK, that may be hard to see.
(Actually, it’s not there, as my mail reader swallows it.  Wow.)

Any software that consumes plain text should follow the age-old convention that a sentence is ended by a period followed by a newline, or by a period followed by two spaces.
Unfortunately, this simple rule has fallen out of fashion…
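
For illustration, the convention above can be sketched as a small Python helper. This is only a sketch under stated assumptions: `split_sentences` and `_SENTENCE_BREAK` are hypothetical names, not part of xml2rfc, and the rule covers only the period (not "!" or "?").

```python
import re

# Hypothetical helper, not xml2rfc code: a sentence break is a period
# followed by a newline or by two or more spaces.
_SENTENCE_BREAK = re.compile(r"(?<=\.)(?: {2,}|\n)")


def split_sentences(text: str) -> list[str]:
    """Split plain text into sentences using the period convention."""
    parts = _SENTENCE_BREAK.split(text)
    # Collapse line wraps and runs of ordinary spaces inside a sentence,
    # leaving any no-break space (U+00A0) untouched.
    return [re.sub(r"[ \t\n]+", " ", p).strip() for p in parts if p.strip()]


if __name__ == "__main__":
    sample = (
        "PGP - Pretty Good Privacy.  PGP is a family of software systems\n"
        "developed by Philip R. Zimmermann from which OpenPGP is based.\n"
    )
    for sentence in split_sentences(sample):
        print(sentence)
```

With input that follows the convention, the single space after "R." does not count as a sentence break, so a consumer written this way needs no list of abbreviations or initials.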

Grüße, Carsten


> On 27. Feb 2021, at 03:44, Daniel Kahn Gillmor <dkg@fifthhorseman.net> wrote:
> 
> The toolchain to build draft-ietf-openpgp-crypto-refresh produces XML
> that contains:
> …
>          <li>PGP - Pretty Good Privacy.
> PGP is a family of software systems developed by Philip R. Zimmermann from which OpenPGP is based.</li>
> …
> 
> xml2rfc renders this to text as:
> 
> …
>   *  PGP - Pretty Good Privacy.  PGP is a family of software systems
>      developed by Philip R.  Zimmermann from which OpenPGP is based.
> …
> 
> It looks like it is assuming that a period (.) ends a sentence, and that
> two spaces should follow each sentence.  It is correct about the . after
> "Privacy", but it is wrong about the . after "R". :(
> 
> This is kind of a dumb nit-pick, but it was noticed during a review on
> the mailing list.
> 
> Is there a recommended way to fix this?  What XML input would produce a
> .txt with only one space after the "R."?
> 
>   --dkg