Re: [Gendispatch] Making all RFCs UTF-8, with questions about RFC 20

John Levine <johnl@taugh.com> Mon, 13 March 2023 23:09 UTC

Return-Path: <johnl@iecc.com>
X-Original-To: gendispatch@ietfa.amsl.com
Delivered-To: gendispatch@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 74F73C1516FF for <gendispatch@ietfa.amsl.com>; Mon, 13 Mar 2023 16:09:22 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -6.847
X-Spam-Level:
X-Spam-Status: No, score=-6.847 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, HEADER_FROM_DIFFERENT_DOMAINS=0.25, RCVD_IN_DNSWL_HI=-5, RCVD_IN_ZEN_BLOCKED_OPENDNS=0.001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001, URIBL_DBL_BLOCKED_OPENDNS=0.001, URIBL_ZEN_BLOCKED_OPENDNS=0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=iecc.com header.b="u2eTjp25"; dkim=pass (2048-bit key) header.d=taugh.com header.b="bJWpelIb"
Received: from mail.ietf.org ([50.223.129.194]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id dYiBSJ_sqdeS for <gendispatch@ietfa.amsl.com>; Mon, 13 Mar 2023 16:09:18 -0700 (PDT)
Received: from gal.iecc.com (gal.iecc.com [IPv6:2001:470:1f07:1126:0:43:6f73:7461]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature RSA-PSS (2048 bits) server-digest SHA256) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id B8806C151535 for <gendispatch@ietf.org>; Mon, 13 Mar 2023 16:09:17 -0700 (PDT)
Received: (qmail 42053 invoked from network); 13 Mar 2023 23:09:16 -0000
DKIM-Signature: v=1; a=rsa-sha256; c=simple; d=iecc.com; h=date:message-id:from:to:cc:subject:in-reply-to:mime-version:content-type:content-transfer-encoding:cleverness; s=a442.640fad1c.k2303; bh=iHZxe5zCoCtvh3XV0RxUjPK9tesvNGmpxawq/V1Doy8=; b=u2eTjp25yVKVmql6BYBaBhq2R+DwiMOx3uSO5XXMk7MyJw8vK+9Hi1GXKy6v0NcjVavVRoIGuHawpdl6KFxRnTcTE8rHyyxap1BOvjsu6oYj9l1Zo3Lbl5Vvh0Iq650yAYalc8Lfay2KQkXHdt6hb2I8lAGDSpXOBp475JRJjvm9dIUy2zO120KBL84acGiDh+3bRIXS3d211VwRVz0gla1RBxx6sjb7xUOiKGJJlg0W7nuLY0yyaPYwJ3Q6iZfOkDuC+8TykfF/3JwglbxefJT/YzKE/b0i5rwA13UzbUFGUTRilCtuxz7Gx+m2dBDOQbJ5uefd3cRgJ/vJOMcUrA==
DKIM-Signature: v=1; a=rsa-sha256; c=simple; d=taugh.com; h=date:message-id:from:to:cc:subject:in-reply-to:mime-version:content-type:content-transfer-encoding:cleverness; s=a442.640fad1c.k2303; bh=iHZxe5zCoCtvh3XV0RxUjPK9tesvNGmpxawq/V1Doy8=; b=bJWpelIbx0QLR5N1Lgoy2yndKsCfwvoD4jMqlG7UHJbHOoyLdWXfV9Bi7A02vbIWGoiJJiKuQewMWJQO0xeK5iobHfckqG6tArwYIrRa8E5kFsbPyNpHALyk95RDnHUP7DTSoKTIHSCq4M9iBxQ3K3BYZMeSlKYzGOicqnZrn6Waid6bFW6RNpGbhiI/w3z5Y0VPb4Pnm3H7GxBqtjkZ15wexe1hex9vzjNyvLYsuX6K9b7A693HZU7wpVO/m2gz4AmrAKiUwGk8WnREXbV7cgWfHA393VNbcsZtELbVYpj9yG+Zq0CRvEJG33PfJSiwRIcChI/fd4I4m5iz/Xc9hw==
Received: from ary.local ([IPv6:2001:470:1f07:1126::78:696d:6170]) by imap.iecc.com ([IPv6:2001:470:1f07:1126::78:696d:6170]) with ESMTPS (TLS1.3 ECDHE-RSA AES-256-GCM AEAD) via TCP6; 13 Mar 2023 23:09:15 -0000
Received: by ary.local (Postfix, from userid 501) id 62378B0D49F0; Mon, 13 Mar 2023 18:09:14 -0500 (EST)
Date: Mon, 13 Mar 2023 18:09:14 -0500
Message-Id: <20230313230914.62378B0D49F0@ary.local>
From: John Levine <johnl@taugh.com>
To: gendispatch@ietf.org
Cc: sayrer@gmail.com
In-Reply-To: <CAChr6SxsD7k83ad2HLNcbOEDF7j_Rfit-zM+zsH_gHbUqeOyUg@mail.gmail.com>
Organization: Taughannock Networks
X-Headerized: yes
Cleverness: minimal
Mime-Version: 1.0
Content-type: text/plain; charset="utf-8"
Content-transfer-encoding: 8bit
Archived-At: <https://mailarchive.ietf.org/arch/msg/gendispatch/dG6xgQHk4cfqQP8OKS1kO6Slayo>
Subject: Re: [Gendispatch] Making all RFCs UTF-8, with questions about RFC 20
X-BeenThere: gendispatch@ietf.org
X-Mailman-Version: 2.1.39
Precedence: list
List-Id: General Area Dispatch <gendispatch.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/gendispatch>, <mailto:gendispatch-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/gendispatch/>
List-Post: <mailto:gendispatch@ietf.org>
List-Help: <mailto:gendispatch-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/gendispatch>, <mailto:gendispatch-request@ietf.org?subject=subscribe>
X-List-Received-Date: Mon, 13 Mar 2023 23:09:22 -0000

It appears that Rob Sayre  <sayrer@gmail.com> said:
>-=-=-=-=-=-
>
>Hi,
>
>I was looking at making all RFCs UTF-8, which would be a brief document.
>The old ones are mostly ASCII, but that's kind of the point of UTF-8.

There's a lot of random non-ASCII junk in old RFCs, I think mostly
8859-1. I do not think that is a problem, nor should we try to change them,

>I think RFC 10000 should say "RFCs are now encoded in UTF-8"

Since RFC 7990 said that seven years ago, I don't see the point.

R's,
John

PS: here's some of the old random junk
rfc101.txt 753 b'      [into the online RFC archives by Kelly Tardif, Viag\xe9nie 10/99]\n'
rfc1305.txt 2551 b'.01% or <F128M>\xe6<F255D>100 ppm. Its contents are designated y in the\n'
rfc1305.txt 3547 b'3, reachability okay (peer.reach <F128M>\xd6<F255D> 0)\n'
rfc1305.txt 4634 b'seconds, but varies about <F128M>\xe6<F255D>30 ms throughout the year due\n'
rfc1305.txt 5236 b'<F128M>\xe6<F255D>128 ms, called the aperture, which guarantees the seconds\n'
rfc1305.txt 5282 b'<F128M>\xe6<F255D>500 ppm. Even larger ranges may be required in the case\n'
rfc1305.txt 5299 b'the range <F128M>\xe6<F255D>100 ppm with an intrinsic oscillator frequency\n'
rfc1305.txt 5300 b'error as great as <F128M>\xe6<F255D>100 ppm. Figure 11<$&fig11> shows the\n'
rfc1305.txt 6018 b'(shown by the <F128>\xec<F255> symbol) and confidence interval shown for\n'
rfc177.txt 485 b'    [into the online RFC archives by Kelly Tardif, Viag\xe9nie 12/1999]\n'
rfc178.txt 607 b'        [into the online RFC archives by Kelly Tardif,Viag\xe9nie 11/99]\n'
rfc182.txt 52 b'     [into the online RFC archives by Kelly Tardif, Viag\xe9nie 10/1999]\n'
rfc2166.txt 641 b'   formats see the \x93Frame Formats\x94 section.\n'
rfc2166.txt 660 b'   the \x93Frame Formats\x94 section.\n'
rfc2166.txt 759 b'   which implement these multicast enhancements will carry a \x93Multicast\n'
rfc2166.txt 760 b'   Capabilities\x94 Control Vector in their capabilities exchange (see RFC\n'
rfc2166.txt 876 b'   generically refers to this process as \x93address resolution\x94.\n'
rfc2166.txt 1036 b'   and explorer retries.  Accordingly, the \x93Retry\x94 section in the table\n'
rfc2166.txt 1312 b'   indicated in the table above with an \x93Yes\x94 in the \x93Retry\x94 column.\n'
rfc2166.txt 1489 b'   implementations in the future can expect that the DLSw \x93Version\n'
rfc2166.txt 1490 b'   Number\x94 is found in byte one and that the following bytes describe\n'
rfc2166.txt 1501 b"   1) The \x93Version Number\x94 field is set to x'32' (ASCII '2') and now\n"
rfc227.txt 70 b'     [into the online RFC archives by Kelly Tardif, Viag\xe9nie 10/99]\n'
rfc234.txt 53 b'      [into the online RFC archives by Kelly Tardif, Viag\xe9nie 10/99]\n'
rfc235.txt 243 b'     [into the online RFC archives by Kelly Tardif, Viag\xe9nie 10/99]\n'
rfc237.txt 50 b'     [into the online RFC archives by H\xe9l\xe8ne Morin, Viag\xe9nie 10/99]\n'
rfc243.txt 353 b'     [into the online RFC archives by Kelly Tardif, Viag\xe9nie 10/1999]\n'
rfc2497.txt 210 b'   [EUI64]   "64-Bit Global Identifier Format Tutorial", http://stan\xad\n'
rfc2557.txt 813 b'      E with acute accent becomes \xc9.<br>\n'
rfc270.txt 46 b'      [into the online RFC archives by Kelly Tardif, Viag\xe9nie 10/99]\n'
rfc2708.txt 521 b'   assigned ID\xc6s, there is a limited amount of clear text information\n'
rfc282.txt 446 b'     [into the online RFC archives by Kelly Tardif, Viag\xe9nie 10/99]\n'
rfc2875.txt 754 b'   TBS: the \xf4text\xf6 for computing the SHA-1 HMAC.\n'
rfc2875.txt 926 b'   Signature verification requires CA\xc6s private key, the CA certificate\n'
rfc288.txt 211 b'     [into the online RFC archives by Kelly Tardif, Viag\xe9nie 10/99]\n'
rfc290.txt 828 b'     [into the online RFC archives by Kelly Tardif, Viag\xe9nie 10/99]\n'
rfc292.txt 529 b'      [into the online RFC archives by Kelly Tardif, Viag\xe9nie 10/99]\n'
rfc303.txt 607 b'     [into the online RFC archives by Kelly Tardif, Viag\xe9nie 10/99]\n'
rfc306.txt 189 b'     [into the online RFC archives by H\xe9l\xe8ne Morin, Viag\xe9nie 10/99]\n'
rfc307.txt 331 b'    [into the online RFC archives by H\xe9l\xe8ne Morin, Viag\xe9nie, 12/99]\n'
rfc310.txt 389 b'     [into the online RFC archives by H\xe9l\xe8ne Morin, Viag\xe9nie 10/99]\n'
rfc313.txt 440 b'     [into the online RFC archives by H\xe9l\xe8ne Morin, Viag\xe9nie 10/99]\n'
rfc315.txt 181 b'      [into the online RFC archives by H\xe9l\xe8ne Morin, Viag\xe9nie, 12/99]\n'
rfc316.txt 369 b'     [into the online RFC archives by H\xe9l\xe8ne Morin, Viag\xe9nie 10/99]\n'
rfc317.txt 51 b'     [into the online RFC archives by H\xe9l\xe8ne Morin, Viag\xe9nie 10/99]\n'
rfc323.txt 499 b'    [into the online RFC archives by H\xe9l\xe8ne Morin, Viag\xe9nie, 12/99]\n'
rfc327.txt 277 b'     [into the online RFC archives by H\xe9l\xe8ne Morin, Viag\xe9nie 10/99]\n'
rfc367.txt 184 b'    [into the online RFC archives by H\xe9l\xe8ne Morin, Viag\xe9nie, 12/99]\n'
rfc369.txt 602 b'     [into the online RFC archives by H\xe9l\xe8ne Morin, Viag\xe9nie 12/99]\n'
rfc441.txt 234 b'      U + 3 --> X\xa0\n'
rfc441.txt 389 b'      [into the online RFC archives by H\xe9l\xe8ne Morin, Viag\xe9nie, 12/99]\n'
rfc64.txt 65 b'message 4 bits to the left. This takes approximately 12 \xb5sec per double\n'