Re: [xml2rfc] assuming that period (.) ends a sentence is sometimes wrong

Julian Reschke <julian.reschke@gmx.de> Mon, 01 March 2021 13:36 UTC

To: xml2rfc@ietf.org
References: <20210227191644.165F76F105E2@ary.qy> <28B528D6-7CBA-4735-A5EE-C7061D1C1D0C@tzi.org> <3dc1abe5-24bf-3b12-7b58-d06c7cde428e@taugh.com> <BBA9B16E-5B06-419D-9ABE-BFB7E69B54C9@tzi.org> <6603926-561f-c9b8-2612-2afb9847b71@taugh.com> <20210228173825.GE30153@localhost> <14ad2b3e-852a-28b1-27ae-5e25ec7823bc@taugh.com> <a7734631-a4f3-cee1-1ee7-e9e0bd3d534a@gmail.com> <d96fc964-f367-dc8f-bdf3-a76b90abd042@alum.mit.edu> <26DCBA0D-AA14-461F-9992-CC631774877E@tzi.org> <45ca32a4-65df-7eea-84f0-b5451698a27b@gmx.de> <D3D8A513-87A6-4A74-97CE-C3FA8DC36318@tzi.org> <ec03aa52-6aa1-0bd0-3638-c11bfc9d64dd@gmx.de> <9C9F3CE7-E269-4BE4-A6FB-D13101D1927D@tzi.org> <47edd9eb-6c96-aa9c-9709-73e054373d4a@gmx.de> <40CE7C2C-65E7-4A4C-B16E-BA4ED62C6FF4@tzi.org> <39aac336-0032-5b11-7d64-e73ed314b79c@gmx.de> <FBC30EFB-10ED-4682-86BD-6CE89E7CDA80@tzi.org> <bf0e22cf-2f90-18e7-cb09-c791aa872f49@gmx.de> <3F93A1FB-0475-4DC5-96B1-A832880779CD@tzi.org>
From: Julian Reschke <julian.reschke@gmx.de>
Message-ID: <50c61df3-64e7-d81a-a955-ef2854f05919@gmx.de>
Date: Mon, 1 Mar 2021 14:35:59 +0100
User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:78.0) Gecko/20100101 Thunderbird/78.8.0
MIME-Version: 1.0
In-Reply-To: <3F93A1FB-0475-4DC5-96B1-A832880779CD@tzi.org>
Archived-At: <https://mailarchive.ietf.org/arch/msg/xml2rfc/-npoxcqoanuzELGDYyEndfGgie4>
Subject: Re: [xml2rfc] assuming that period (.) ends a sentence is sometimes wrong

On 01.03.2021 at 14:19, Carsten Bormann wrote:
> On 2021-03-01, at 13:57, Julian Reschke <julian.reschke@gmx.de> wrote:
>>
>> On 01.03.2021 at 13:46, Carsten Bormann wrote:
>>> On 2021-03-01, at 13:16, Julian Reschke <julian.reschke@gmx.de> wrote:
>>>>
>>>> It means that a sequence of whitespace characters can be collapsed to a
>>>> single space character.
>>>
>>> That is XSLT normalize-space.
>>
>> XPath, to be pedantic.
>>
>> It's not exactly the same thing in that leading / trailing space in
>> content can be significant (at least if you consider it to apply to text
>> child nodes of <t>). For instance,
>>
>>   <t>see <xref target="foo"/></t>
>>
>> is not equivalent to
>>
>>   <t>see<xref target="foo"/></t>
>>
>> At this point I'm not sure anymore what we're discussing here.
>
> I’m sorry, I’m trying to elicit what you think xml:space=preserve does (or rather, what it doesn’t do that =default does).  It does not mesh with what I learned about XML.

xml:space=preserve: all whitespace is significant; as used in HTML's
<pre> element.

xml:space=default: whatever the "application" deems significant (AFAICT
from the spec). In our case, the "application" is the xml2rfc processor.
Also, AFAICT, when M. T. Rose developed this, he used the same
processing model as HTML.
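
To make the distinction concrete, here is a minimal Python sketch. It is
not xml2rfc's actual code: the helper names (collapse_ws, normalize_space,
render_text) are made up for illustration, and the "default" behaviour it
models is only an assumption based on the collapsing rule described
earlier in this thread.

```python
import re
import xml.etree.ElementTree as ET

# XML white space (production S in XML 1.0): space, tab, CR, LF.
_XML_WS = re.compile(r"[ \t\r\n]+")
# ElementTree exposes xml:space under the predefined XML namespace.
XML_SPACE = "{http://www.w3.org/XML/1998/namespace}space"


def collapse_ws(text):
    """Collapse each white-space run to one space; keep leading/trailing space."""
    return _XML_WS.sub(" ", text)


def normalize_space(text):
    """Mimic XPath normalize-space(): collapse runs AND strip both ends."""
    return _XML_WS.sub(" ", text).strip(" ")


def render_text(elem, preserve=False):
    """Concatenate the text under elem, honouring xml:space (assumed model)."""
    mode = elem.get(XML_SPACE)
    if mode == "preserve":
        preserve = True
    elif mode == "default":
        preserve = False
    fix = (lambda s: s) if preserve else collapse_ws
    parts = [fix(elem.text or "")]
    for child in elem:
        parts.append(render_text(child, preserve))
        # A child's tail text sits in the parent's xml:space scope.
        parts.append(fix(child.tail or ""))
    return "".join(parts)


t = ET.fromstring('<t>see \n   <xref target="foo"/></t>')
print(repr(render_text(t)))          # 'see '  (trailing space survives)
print(repr(normalize_space(t.text))) # 'see'   (trailing space lost)

pre = ET.fromstring('<t xml:space="preserve">a \n  b</t>')
print(repr(render_text(pre)))        # 'a \n  b'
```

The point of the sketch is only that collapsing runs of whitespace and
applying normalize-space() give different results for the text node
before <xref>, which is why the two <t> examples quoted above are not
equivalent.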

>> If your point is that RFC 7991bis should be clearer about what XML calls
>> the "default whitespace handling mode", I agree.
>
> Well, what *DOES* XML call the default whitespace handling mode?
>
> https://www.w3.org/TR/2008/REC-xml-20081126/#sec-white-space
>
>> The value "default" signals that applications' default white-space processing modes are acceptable for this element; the value "preserve" indicates the intent that applications preserve all the white space.
>
>
> (This is in a context of:
>
>> In editing XML documents, it is often convenient to use "white space" (spaces, tabs, and blank lines) to set apart the markup for greater readability. Such white space is typically not intended for inclusion in the delivered version of the document. On the other hand, "significant" white space that should be preserved in the delivered version is common, for example in poetry and source code.
>
> The intent certainly is not to get rid of all whitespace.  There is no talk about the whitespace normalization you have in mind, either.)
> ...

Yes. Again, if your point is that the default handling should be
described in RFC 7991bis, I'm totally with you.

Best regards, Julian