Re: [xml2rfc] rfc2629-xhtml.ent warning

Julian Reschke <julian.reschke@gmx.de> Tue, 16 May 2023 14:31 UTC

Return-Path: <julian.reschke@gmx.de>
X-Original-To: xml2rfc@ietfa.amsl.com
Delivered-To: xml2rfc@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 018E1C151554; Tue, 16 May 2023 07:31:21 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.797
X-Spam-Level:
X-Spam-Status: No, score=-2.797 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, FREEMAIL_FROM=0.001, NICE_REPLY_A=-0.001, RCVD_IN_DNSWL_LOW=-0.7, RCVD_IN_MSPIKE_H2=-0.001, RCVD_IN_ZEN_BLOCKED_OPENDNS=0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001, URIBL_DBL_BLOCKED_OPENDNS=0.001, URIBL_ZEN_BLOCKED_OPENDNS=0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=gmx.de
Received: from mail.ietf.org ([50.223.129.194]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 5TfmD17vq6vt; Tue, 16 May 2023 07:31:16 -0700 (PDT)
Received: from mout.gmx.net (mout.gmx.net [212.227.15.19]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange ECDHE (P-256) server-signature RSA-PSS (2048 bits) server-digest SHA256) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 8ED80C151542; Tue, 16 May 2023 07:31:15 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=gmx.de; s=s31663417; t=1684247473; i=julian.reschke@gmx.de; bh=s6coyr8JQvaZGmjR92XO3WEw6j7Evntu62MMWtDssAs=; h=X-UI-Sender-Class:Date:Subject:To:Cc:References:From:In-Reply-To; b=V0bGah6WO1FYROq3ZaPlGCBhwg2KoL++pfztRLweAQlPYFhatjPWgYFqLAGrIi7t7 0FusyM174j/FfuFGjBW1TL3LNeU9jVzKP0bXZ64J5PXzKjU1zAk4AUXBFYfuWVvPHm WghXKRv3QiKIr9nRa34EXMzqYq49kO/rtxIop8+JsuROkkmz2vUdBxze1XS0FdJTDw rMPudfExHPLWpJcCL5cvoyq14RZEZ3ANHvlye5ZwWAE0nlMlIEi7M/YLo56vYbVA4O DtbBCr8Zc85sEIYglgdQmFeHKkI+lm9ah2EF2cLOq02wxdDxVk2WqrCwqHBgiMuI/h E6sHDLJ4gDh3w==
X-UI-Sender-Class: 724b4f7f-cbec-4199-ad4e-598c01a50d3a
Received: from [10.173.37.15] ([193.105.139.131]) by mail.gmx.net (mrgmx004 [212.227.17.190]) with ESMTPSA (Nemesis) id 1MFsZ3-1px21G1eqF-00HRxg; Tue, 16 May 2023 16:31:13 +0200
Message-ID: <49c86fec-70ef-ab8d-f522-2669520b2dfb@gmx.de>
Date: Tue, 16 May 2023 16:31:13 +0200
MIME-Version: 1.0
User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:102.0) Gecko/20100101 Thunderbird/102.11.0
Content-Language: en-US
To: Jay Daley <exec-director@ietf.org>
Cc: xml2rfc@ietf.org
References: <b09e9607-966a-6250-db4b-5d541b1e690d@htt-consult.com> <EA51BFDA-0A1C-49DA-8301-FDF9BD805D2E@ietf.org> <9c8e932b-f6eb-5cf9-2dae-3018f0a217c0@gmx.de> <EA50DA32-4D0A-4FB9-B749-CAB07EF18B33@ietf.org>
From: Julian Reschke <julian.reschke@gmx.de>
In-Reply-To: <EA50DA32-4D0A-4FB9-B749-CAB07EF18B33@ietf.org>
Content-Type: text/plain; charset="UTF-8"; format="flowed"
Content-Transfer-Encoding: quoted-printable
X-Provags-ID: V03:K1:+C7euNK4h5551zDYyvaaDfkdf1PmubPa08VfeZIJNcUbDxZ/VZY 3QwUfH5wdL4WyOaUG0d0Cso7+9ic3LUNf/j3DLhdzJRU3nC8HgdwQndlqd4/fxn7zqCNkhT 9sbuXNVZ9JN6i6lTGedd0UjkOZC3J2tEhsEtqN6Op+AcZZsPdmhS6SRleYAsslADlkH/CIj nE7uqxGcHb6Lc0yQ1yq3w==
UI-OutboundReport: notjunk:1;M01:P0:8Pa3SWKymU4=;QVPG1tHdA2bL8E3ELS/QlEoKw3s yU1CaJRisfHOjFCUtudUaaw9w1kzOZ3lWl1G0+3MRm0kvHuHkLMY51hOYLOcUlKI/0apke6Un U2g/ubGx09MQrQwPfy8TMGAffM6GO64RTqmvLWPciFVdVL7IBCwXOxThGqrx3cleijdk0gP7Z fb7boIkFYxuqeWQwfvPXidyslABlFxXzLKDiUfCN/TJOs/tBYgjft8eV9qTEHjZpMMxq/QzPX NyhKta++vfau/oepso55RUCWjDxMNPoCBlGqoT8QAtSr46joDZ91auQ+dhMPe4m1HWn3hR7WM vHG4Z0RYn3OT81lbTAhl4rdwx10IO3+bkhJJahGOAtlLrzTjy1fEi7BQCZHb1Sx22k3TtiOih ZO4glCiyDkMBHXHsv0spXV4HeBpQUz/8XSR5UP5TFc1x1ksE4nvn0RV3SOEafhBE+OqkqZeM5 tQ5uhAYZMimjJoQ3ToRgoAy0wrahTa/2d4TwHTUNLXyiegkF0uJSVhbPjJZqG4xxqLw1XFroC 3hyfggU75Bxj8CWN6wCm5utT+30iW9YxTc1ED62da25evE57EOvTDsWgf3LOm3AYjZj0Y14vM HLAhx3OMECz9Rg5YUYsE7UfRVfiZyJdyj8CMoOk7vx76WSKGS9BlVLnqwnkxp2YlFSNYH8BYf 8dcotizl1jrmGeReU81EnfalJQ9wNnGfOst3gpAFh9mwFIkJTG9zDfLSINPdRqnEeNaX19JQU FS41QqBYiDUea1hollkhVV09zvzAB060SzqWKtHSxqkY5bY3xli/cQVkmzgv9vBfXLyRlO0HP stllDUwxJPFH4R7umRTlvbRLuHyRyeQnno+/hoAgWqyTjIeZlatQhCFyCm7xjpqklBS549Oq2 K67W0ImyYRjbWCdlH4mZuGHlS91Bzn7C6KILkabKOG+mGVkLPDNuZaUs4IrCowXaZqhPHx0FY z0eBWg==
Archived-At: <https://mailarchive.ietf.org/arch/msg/xml2rfc/x9EWzHwvyF4JuXwzAdsh_N15WSE>
Subject: Re: [xml2rfc] rfc2629-xhtml.ent warning
X-BeenThere: xml2rfc@ietf.org
X-Mailman-Version: 2.1.39
Precedence: list
List-Id: XML2RFC discussion list <xml2rfc.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/xml2rfc>, <mailto:xml2rfc-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/xml2rfc/>
List-Post: <mailto:xml2rfc@ietf.org>
List-Help: <mailto:xml2rfc-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/xml2rfc>, <mailto:xml2rfc-request@ietf.org?subject=subscribe>
X-List-Received-Date: Tue, 16 May 2023 14:31:21 -0000

On 16.05.2023 16:23, Jay Daley wrote:
>
>
>> On 16 May 2023, at 14:37, Julian Reschke <julian.reschke@gmx.de> wrote:
>>
>> On 16.05.2023 15:19, Jay Daley wrote:
>>>
>>>> On 16 May 2023, at 14:12, Robert Moskowitz <rgm@htt-consult.com> wrote:
>>>>
>>>> It has been a few weeks since my last run of xml2rfc, and that was
>>>> on what I had installed on my old Fedora 35 notebook, now on my
>>>> updated Fedora 38 system...
>>>>
>>>> So this morning I got (4 times):
>>>>
>>>> Warning:
>>>> /home/rgm/data/htt/Projects/Critical/drafts/rfc2629-xhtml.ent is no
>>>> longer needed as the special processing of non-ASCII characters has
>>>> been superseded by direct support for non-ASCII characters in RFCXML.
>>>>
>>>>
>>>> The only use of rfc2629 in my xml is:
>>>>
>>>> <!DOCTYPE rfc SYSTEM "rfc2629-xhtml.ent">
>>>> <?xml-stylesheet type="text/xsl" href="rfc2629.xslt" ?>
>>>>
>>>> So obviously there has been a change that I did not catch.  No
>>>> comments from the rfc editor on working with my drafts a couple
>>>> months back for rfcs 9373 & 9374 that had these lines.
>>>>
>>>> So what is the current for !DOCTYPE and xml-stylesheet?
>>>
>>> A DOCTYPE is not recommended because the grammar is specified in
>>> Compact RelaxNG not any other schema format.   The recommended method
>>> to associate the schema with the XML is
>>>
>>> <?xml-model href="rfc7991bis.rnc"?>
>>>
>>> Note, this will require you to have rfc7991bis.rnc in your build path.
>>>
>>> This is fully explained in the authors documentation here:
>>>
>>> https://authors.ietf.org/en/templates-and-schemas
>>>
>>> That same page also links to the "official" templates that can be
>>> used as starting points.
>>>
>>> Finally, the authors documentation also lists various XML editors and
>>> how the interact with the schema
>>>
>>> https://authors.ietf.org/en/drafting-in-xml
>>>
>>> Jay
>>
>> Jay,
>>
>> I don't think this is helpful.
>>
>> Robert's DOCTYPE declaration is used to define named character entities.
>> RelaxNG doesn't help with these at all.
>
> I was trying to address two things at once, but yes I can see that I
> only confused things.
>
> The page I pointed to explains this correctly
> https://authors.ietf.org/en/templates-and-schemas#character-entities
> <https://authors.ietf.org/en/templates-and-schemas#character-entities>
>
>>
>> The warning message from xml2rfc is completely misleading, as it
>> conflates entity declarations with validation.
>
> No it’s correct, the full rfc2629-xhtml.ent file is no longer required
> for exactly the reason explained in the warning.

The warning is misleading. It was never required.

This is an XML question, not an RFCXML question.

If you want named entities, you need to declare them. Either in an
external resource, or in the internal subset. Or, instead, given an
editor with proper Unicode support, you can type them directly (which,
depending on the code point, may or may not be a good idea).

This is true for all versions of x2mlrfc (the tool) and RFCXML (the
language). It has nothing to do with whether you use DTDs or RelaxNG for
validation, or whether you validate at all.

And yes, RFCXML since version 3 supports *outputting* non-ASCII
characters in HTML and TXT. But this has nothing to do with the way they
are *inserted* into the XML file.

Best regards, Julian