Re: [xml2rfc] assuming that period (.) ends a sentence is sometimes wrong

Julian Reschke <julian.reschke@gmx.de> Sun, 28 February 2021 18:56 UTC

Return-Path: <julian.reschke@gmx.de>
X-Original-To: xml2rfc@ietfa.amsl.com
Delivered-To: xml2rfc@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 2A5F13A1AAC for <xml2rfc@ietfa.amsl.com>; Sun, 28 Feb 2021 10:56:42 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -0.001
X-Spam-Level:
X-Spam-Status: No, score=-0.001 tagged_above=-999 required=5 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, FREEMAIL_FROM=0.001, NICE_REPLY_A=-0.001, RCVD_IN_MSPIKE_H2=-0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (1024-bit key) header.d=gmx.net
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id tUyel4z6uMnx for <xml2rfc@ietfa.amsl.com>; Sun, 28 Feb 2021 10:56:41 -0800 (PST)
Received: from mout.gmx.net (mout.gmx.net [212.227.15.18]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 7998A3A1AAB for <xml2rfc@ietf.org>; Sun, 28 Feb 2021 10:56:40 -0800 (PST)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=gmx.net; s=badeba3b8450; t=1614538598; bh=RZu1wxBk6TTv4mW2qEu8mZKZ2pq6ejfwzmU0ZLszkvg=; h=X-UI-Sender-Class:Subject:To:References:From:Date:In-Reply-To; b=ji5z8NTM8fbK/zt7iE2T7/u9kunX7Nnv5XkJVxZSqzPrE80PASHTILAWs0I1G4x8b yXH8tTYPnEFAzr1rjmVPTngm6ZcPwT5O/dGjqFdzc8C3xsDKxMwSeZy1jNRkLfTTPm 0lPtVjy/UG5hf1bvMtiipq2qqiBMjyFX+3AkjVoc=
X-UI-Sender-Class: 01bb95c1-4bf8-414a-932a-4f6e2808ef9c
Received: from [192.168.178.20] ([217.251.129.137]) by mail.gmx.net (mrgmx005 [212.227.17.190]) with ESMTPSA (Nemesis) id 1N2V0H-1lwGsr0Q2B-013urz for <xml2rfc@ietf.org>; Sun, 28 Feb 2021 19:56:38 +0100
To: xml2rfc@ietf.org
References: <20210227191644.165F76F105E2@ary.qy> <28B528D6-7CBA-4735-A5EE-C7061D1C1D0C@tzi.org> <3dc1abe5-24bf-3b12-7b58-d06c7cde428e@taugh.com> <BBA9B16E-5B06-419D-9ABE-BFB7E69B54C9@tzi.org> <6603926-561f-c9b8-2612-2afb9847b71@taugh.com> <20210228173825.GE30153@localhost> <14ad2b3e-852a-28b1-27ae-5e25ec7823bc@taugh.com> <20210228175959.GF30153@localhost> <631d06e1-1f33-58d6-6661-4da0fc18a2ef@gmx.de>
From: Julian Reschke <julian.reschke@gmx.de>
Message-ID: <1bba4da7-114d-ed7c-e1ce-39518704e777@gmx.de>
Date: Sun, 28 Feb 2021 19:56:36 +0100
User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:78.0) Gecko/20100101 Thunderbird/78.8.0
MIME-Version: 1.0
In-Reply-To: <631d06e1-1f33-58d6-6661-4da0fc18a2ef@gmx.de>
Content-Type: text/plain; charset=utf-8; format=flowed
Content-Transfer-Encoding: quoted-printable
X-Provags-ID: V03:K1:pF/KrBlHNXU89d2aYlea9CgGslEAPF2XIq/nyeLGC5ytpnsYvqe bR4BX9bbE4BKMarWMZqw0uY/q8x2PsFfaFULh+8VRhuVueaYQzThW010hf19TlFISJjq6t7 RcJF0nXMvgeEJbkRlLNKCkbfyLiaBrFO/F3tv9/rnHT0oOfJftoPJ7QkZZbn6vyrLrqZWnl VxkSED/PnDGu9yD0MO5ew==
X-UI-Out-Filterresults: notjunk:1;V03:K0:hE+UdHMSTog=:pKjGhB+fFRZxrQxbB2+CVS U/e5F5kk30X4zLtQSOsjgdlibwWPeWQdR3fGbFQC4eFbfXzdf8J+r0p93JVMqFM4c+zIyM33j jwDbYy/DVVAIgRP23KG4tG/iAqwcEBZ5gHrwos+gqMxdsDPFkZKb61y6FtCq66qo8frFuL6XT ND8bB8PgrO89VCv92nIJfwAhufGHAdkfm5gNDYsETT8WrEzO6JuEYoNYqK+lDPOp/Zyx4ag6q siyKGiyvIBzEGwVIYTjKt3k3o/rad8s+XtUWwmKfRdduzw5Tf0tJpDgqJVuXNuXltQi7HIn+V +g7e373iHJ79DS2jiGuUSR1L7r5D1h9GDxGfMlktMgVGL7lJnBTdZk4onw15Zy6nncUAtQ5Tr XFo707pRAf8sxNeBPLHDLSDSIrtF7gx4ZVoyelxoW4C8sd47aczmOeEkxDK7sBAqZgS3JNRmY V/LkdGmY3d2X8K7EMuaiOiEEIb9Es8wLa30ZiK1xHqnoW3gg1IGsba08pIeCHwoysCjQomBD9 haieohmZ9H/BIIgxe80XzVYXbCLj3oPohlHJ/w2x/MdQVz5//sBqcBRedz/blJ4o4ihQ4HhAb BJiJQ/ZosOjZnebep9BCO6yDdMGbhn8Ph95Rfa4tNU6AiTZc9gTeaWqnP8iULSt+mcKcL/3D4 Jh1MclTbAnJZsfOujDoU/4mzziLvDwWduCMZTM+e67lFLEqmLJrImrtAk+0YB+LAZStJb6w1d KczAfjLcV/ggpwQAThF4OZ0+f2l/xXpeB1H1X/Oa4di/CSneJ35GkTRkfM9ck9rkUSIfNmN3o gK1nVlueYpXg6pc0rZOr9qAXLkyyhPIysKQYNCbSov4hbQVRfHa3v3urzd/4Uwq5hpIHx058Q flPRtqfTohsMxY4+G39Q==
Archived-At: <https://mailarchive.ietf.org/arch/msg/xml2rfc/viP8-resSiBMNjndlLJRh14kebM>
Subject: Re: [xml2rfc] assuming that period (.) ends a sentence is sometimes wrong
X-BeenThere: xml2rfc@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: <xml2rfc.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/xml2rfc>, <mailto:xml2rfc-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/xml2rfc/>
List-Post: <mailto:xml2rfc@ietf.org>
List-Help: <mailto:xml2rfc-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/xml2rfc>, <mailto:xml2rfc-request@ietf.org?subject=subscribe>
X-List-Received-Date: Sun, 28 Feb 2021 18:56:42 -0000

Am 28.02.2021 um 19:49 schrieb Julian Reschke:
> Am 28.02.2021 um 18:59 schrieb Nico Williams:
>> On Sun, Feb 28, 2021 at 12:54:45PM -0500, John R Levine wrote:
>>>> Provided it doesn't also lose alternative Unicode whitespace
>>>> characters,
>>>> using &emsp; is an option.  In a pinch we could have an element to mark
>>>> the end of a sentence (<s/>).
>>>
>>> At the end of every sentence?  That's, uh, quite a stretch.  Are we sure
>>> this problem is worth that much effort by every author?
>>
>> If some such markup were only needed when the sentence-ending-period
>> heuristic would fail, then its usage would be very rare.  Maybe
>> non-breaking spacing could be used for that?
>
> Nope.
>
> Please let's not leak "optimizations" for text output into the canonical
> format.
>
> Assuming we continue with that 2SP practice (in contrast to what 7322bis
> says): there are other alternatives, such as a config file to be
> supplied to xml2rfc which continues known exception strings.


s/continues/contains/

BR, Julian