Re: support for non-ASCII in strings, was: signatures vs sf-date

Poul-Henning Kamp <phk@phk.freebsd.dk> Fri, 02 December 2022 19:18 UTC

Return-Path: <ietf-http-wg-request+bounce-httpbisa-archive-bis2juki=lists.ie@listhub.w3.org>
X-Original-To: ietfarch-httpbisa-archive-bis2Juki@ietfa.amsl.com
Delivered-To: ietfarch-httpbisa-archive-bis2Juki@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 325C5C14CF05 for <ietfarch-httpbisa-archive-bis2Juki@ietfa.amsl.com>; Fri, 2 Dec 2022 11:18:41 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -4.951
X-Spam-Level:
X-Spam-Status: No, score=-4.951 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, HEADER_FROM_DIFFERENT_DOMAINS=0.25, MAILING_LIST_MULTI=-1, RCVD_IN_DNSWL_MED=-2.3, RCVD_IN_MSPIKE_H2=-0.001, RCVD_IN_ZEN_BLOCKED_OPENDNS=0.001, SPF_PASS=-0.001] autolearn=ham autolearn_force=no
Received: from mail.ietf.org ([50.223.129.194]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id A1HvBfPvNtnV for <ietfarch-httpbisa-archive-bis2Juki@ietfa.amsl.com>; Fri, 2 Dec 2022 11:18:39 -0800 (PST)
Received: from lyra.w3.org (lyra.w3.org [128.30.52.18]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange ECDHE (P-256) server-signature RSA-PSS (2048 bits) server-digest SHA256) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id EAAF3C14F719 for <httpbisa-archive-bis2Juki@lists.ietf.org>; Fri, 2 Dec 2022 11:18:39 -0800 (PST)
Received: from lists by lyra.w3.org with local (Exim 4.94.2) (envelope-from <ietf-http-wg-request@listhub.w3.org>) id 1p1BYL-00A3Tu-00 for ietf-http-wg-dist@listhub.w3.org; Fri, 02 Dec 2022 19:18:29 +0000
Resent-Date: Fri, 02 Dec 2022 19:18:29 +0000
Resent-Message-Id: <E1p1BYL-00A3Tu-00@lyra.w3.org>
Received: from mimas.w3.org ([128.30.52.79]) by lyra.w3.org with esmtps (TLS1.3) tls TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384 (Exim 4.94.2) (envelope-from <phk@critter.freebsd.dk>) id 1p1BYI-00A3Sw-KL for ietf-http-wg@listhub.w3.org; Fri, 02 Dec 2022 19:18:26 +0000
Received: from phk.freebsd.dk ([130.225.244.222]) by mimas.w3.org with esmtps (TLS1.3) tls TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384 (Exim 4.94.2) (envelope-from <phk@critter.freebsd.dk>) id 1p1BYH-005xQA-6p for ietf-http-wg@w3.org; Fri, 02 Dec 2022 19:18:26 +0000
Received: from critter.freebsd.dk (unknown [192.168.55.3]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature RSA-PSS (2048 bits) server-digest SHA256) (No client certificate requested) by phk.freebsd.dk (Postfix) with ESMTPS id D51B189284; Fri, 2 Dec 2022 19:18:12 +0000 (UTC)
Received: from critter.freebsd.dk (localhost [127.0.0.1]) by critter.freebsd.dk (8.16.1/8.16.1) with ESMTPS id 2B2JICpq007229 (version=TLSv1.3 cipher=TLS_AES_256_GCM_SHA384 bits=256 verify=NO); Fri, 2 Dec 2022 19:18:12 GMT (envelope-from phk@critter.freebsd.dk)
Received: (from phk@localhost) by critter.freebsd.dk (8.16.1/8.16.1/Submit) id 2B2JIBHC007228; Fri, 2 Dec 2022 19:18:11 GMT (envelope-from phk)
Message-Id: <202212021918.2B2JIBHC007228@critter.freebsd.dk>
To: "Roy T. Fielding" <fielding@gbiv.com>
cc: Julian Reschke <julian.reschke@gmx.de>, ietf-http-wg@w3.org
In-reply-to: <841DC85E-F936-4350-A74F-170D22E6ADCE@gbiv.com>
From: Poul-Henning Kamp <phk@phk.freebsd.dk>
References: <2070c8e0-98d6-7b63-77c3-550bcd661397@gmx.de> <202212011735.2B1HZYgm004808@critter.freebsd.dk> <e580db7e-c0ec-0f1a-17af-5719ab09468c@gmx.de> <202212020810.2B28ALnL004331@critter.freebsd.dk> <eee5a787-da37-feb1-098a-7d2d9c0f1d37@it.aoyama.ac.jp> <202212020848.2B28mGbc004600@critter.freebsd.dk> <4e251954-afb6-fa08-616c-db95e23ad1fd@gmx.de> <202212020946.2B29kSe6004829@critter.freebsd.dk> <75dad0c0-e3bb-1189-0c16-e8275d3879ff@gmx.de> <202212021016.2B2AGvEP004972@critter.freebsd.dk> <9990b393-93ff-75af-4e14-de4f6ba3366c@gmx.de> <7b7f714d-890e-db90-4922-cfbc46b3e999@gmx.de> <202212021129.2B2BTY9f005362@critter.freebsd.dk> <b1d3af79-373f-a9af-7ff9-39f5f44915f0@gmx.de> <202212021214.2B2CEUQx005654@critter.freebsd.dk> <7a93fa17-38fe-5fa8-54ed-a726ab9d5a39@gmx.de> <841DC85E-F936-4350-A74F-170D22E6ADCE@gbiv.com>
MIME-Version: 1.0
Content-Type: text/plain; charset="us-ascii"
Content-ID: <7226.1670008691.1@critter.freebsd.dk>
Date: Fri, 02 Dec 2022 19:18:11 +0000
Received-SPF: pass client-ip=130.225.244.222; envelope-from=phk@critter.freebsd.dk; helo=phk.freebsd.dk
X-W3C-Hub-Spam-Status: No, score=-4.9
X-W3C-Hub-Spam-Report: BAYES_00=-1.9, SPF_HELO_PASS=-0.001, SPF_PASS=-0.001, W3C_AA=-1, W3C_IRA=-1, W3C_WL=-1
X-W3C-Scan-Sig: mimas.w3.org 1p1BYH-005xQA-6p d4c63dc0c7c3a76938c0140a64077795
X-Original-To: ietf-http-wg@w3.org
Subject: Re: support for non-ASCII in strings, was: signatures vs sf-date
Archived-At: <https://www.w3.org/mid/202212021918.2B2JIBHC007228@critter.freebsd.dk>
Resent-From: ietf-http-wg@w3.org
X-Mailing-List: <ietf-http-wg@w3.org> archive/latest/40626
X-Loop: ietf-http-wg@w3.org
Resent-Sender: ietf-http-wg-request@w3.org
Precedence: list
List-Id: <ietf-http-wg.w3.org>
List-Help: <https://www.w3.org/Mail/>
List-Post: <mailto:ietf-http-wg@w3.org>
List-Unsubscribe: <mailto:ietf-http-wg-request@w3.org?subject=unsubscribe>

--------
Roy T. Fielding writes:

> FWIW, there is nothing technical preventing a requirement that sf fields
> support UTF-8 by default.

(... provided they are somehow escaped into USASCII visibility.)

Yes, and I have absolutely no problem with SF being used to transport
UTF-8, X.509, BJSON or LU6.2 that way, that is literally why I put
sf-binary in:  To make it easy for people to move /any/ binary data,
but at the same time trying to stop them from inventing their own
obscure escaping mechanisms.

But none of the data SF serializes or parses have /semantic/ meaning
at the layer in the cake where SF lives, and that is not by accident:

For me it was an explicit design goal, that SF would be totally
unaware of Code-Tables, UniCode, Mime, Country-codes, TLDs, Timezones
and all other time-variant registries of human policy or fashion.

When somebody wants their headers to have encoded strings, they can
decide how to escape them and safely move them about as sf-token,
sf-string or sf-binary.

But just like all other sf-items in their header, they will have
to explain how to interpret, in their header defining document,
what those sf-items mean.

Poul-Henning

-- 
Poul-Henning Kamp       | UNIX since Zilog Zeus 3.20
phk@FreeBSD.ORG         | TCP/IP since RFC 956
FreeBSD committer       | BSD since 4.3-tahoe    
Never attribute to malice what can adequately be explained by incompetence.