Re: [xml2rfc] assuming that period (.) ends a sentence is sometimes wrong

Carsten Bormann <cabo@tzi.org> Sun, 28 February 2021 06:32 UTC

Return-Path: <cabo@tzi.org>
X-Original-To: xml2rfc@ietfa.amsl.com
Delivered-To: xml2rfc@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id BA21A3A07EB for <xml2rfc@ietfa.amsl.com>; Sat, 27 Feb 2021 22:32:08 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -0.019
X-Spam-Level:
X-Spam-Status: No, score=-0.019 tagged_above=-999 required=5 tests=[RCVD_IN_MSPIKE_H3=-0.01, RCVD_IN_MSPIKE_WL=-0.01, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=ham autolearn_force=no
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id qTJM6dkNvoIm for <xml2rfc@ietfa.amsl.com>; Sat, 27 Feb 2021 22:32:04 -0800 (PST)
Received: from gabriel-vm-2.zfn.uni-bremen.de (gabriel-vm-2.zfn.uni-bremen.de [134.102.50.17]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id B96C13A07E2 for <xml2rfc@ietf.org>; Sat, 27 Feb 2021 22:32:03 -0800 (PST)
Received: from [192.168.217.152] (p5089a828.dip0.t-ipconnect.de [80.137.168.40]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by gabriel-vm-2.zfn.uni-bremen.de (Postfix) with ESMTPSA id 4DpD810MJrzyS1; Sun, 28 Feb 2021 07:32:01 +0100 (CET)
Content-Type: text/plain; charset=utf-8
Mime-Version: 1.0 (Mac OS X Mail 14.0 \(3654.60.0.2.21\))
From: Carsten Bormann <cabo@tzi.org>
In-Reply-To: <3dc1abe5-24bf-3b12-7b58-d06c7cde428e@taugh.com>
Date: Sun, 28 Feb 2021 07:32:00 +0100
Cc: xml2rfc@ietf.org
Content-Transfer-Encoding: quoted-printable
Message-Id: <BBA9B16E-5B06-419D-9ABE-BFB7E69B54C9@tzi.org>
References: <20210227191644.165F76F105E2@ary.qy> <28B528D6-7CBA-4735-A5EE-C7061D1C1D0C@tzi.org> <3dc1abe5-24bf-3b12-7b58-d06c7cde428e@taugh.com>
To: John R Levine <johnl@taugh.com>
X-Mailer: Apple Mail (2.3654.60.0.2.21)
Archived-At: <https://mailarchive.ietf.org/arch/msg/xml2rfc/txw9CJliGtcliOPsL4nO9GSEX1I>
Subject: Re: [xml2rfc] assuming that period (.) ends a sentence is sometimes wrong
X-BeenThere: xml2rfc@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: <xml2rfc.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/xml2rfc>, <mailto:xml2rfc-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/xml2rfc/>
List-Post: <mailto:xml2rfc@ietf.org>
List-Help: <mailto:xml2rfc-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/xml2rfc>, <mailto:xml2rfc-request@ietf.org?subject=subscribe>
X-List-Received-Date: Sun, 28 Feb 2021 06:32:09 -0000

On 28. Feb 2021, at 00:51, John R Levine <johnl@taugh.com> wrote:
> 
> Having been through the publishing process in a lot of books, I can report that no matter how good your tools are, the only way to typeset stuff of professional quality is to do hand tweaks where the tools don't get it quite right.  For a bunch of reasons we have decided we're not doing that and I would prefer not to say oh, but THIS tweak is worth it.

For properly doing sentence spacing, what is needed is a way to signal sentence ends.
For 50 years, the convention in keyboarding manuscripts has been that dots at the end of the input line and dots followed by two spaces (here we are actually using two spaces — in the manuscript!) are periods (i.e., sentence ends).
That works exceedingly well.
Authors that keyboard carelessly don’t get proper sentence spacing, but no major disaster happens.

No tweaks needed.

The discussion came up because xml2rfc treated the dot in “Philip R. Zimmermann” as a sentence end.
This is a mere bug, and bugs can be fixed.
I’d send a pull request, but...

Grüße, Carsten