Re: support for non-ASCII in strings, was: signatures vs sf-date

Poul-Henning Kamp <phk@phk.freebsd.dk> Fri, 02 December 2022 11:03 UTC

Return-Path: <ietf-http-wg-request+bounce-httpbisa-archive-bis2juki=lists.ie@listhub.w3.org>
X-Original-To: ietfarch-httpbisa-archive-bis2Juki@ietfa.amsl.com
Delivered-To: ietfarch-httpbisa-archive-bis2Juki@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id C78E6C14CEE4 for <ietfarch-httpbisa-archive-bis2Juki@ietfa.amsl.com>; Fri, 2 Dec 2022 03:03:10 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -4.948
X-Spam-Level:
X-Spam-Status: No, score=-4.948 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, HEADER_FROM_DIFFERENT_DOMAINS=0.25, MAILING_LIST_MULTI=-1, RCVD_IN_DNSWL_MED=-2.3, RCVD_IN_MSPIKE_H2=-0.001, RCVD_IN_ZEN_BLOCKED_OPENDNS=0.001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001, URIBL_DBL_BLOCKED_OPENDNS=0.001, URIBL_ZEN_BLOCKED_OPENDNS=0.001] autolearn=ham autolearn_force=no
Received: from mail.ietf.org ([50.223.129.194]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id hQ3qkv6og3Ay for <ietfarch-httpbisa-archive-bis2Juki@ietfa.amsl.com>; Fri, 2 Dec 2022 03:03:07 -0800 (PST)
Received: from lyra.w3.org (lyra.w3.org [128.30.52.18]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange ECDHE (P-256) server-signature RSA-PSS (2048 bits) server-digest SHA256) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 2533DC14CEE2 for <httpbisa-archive-bis2Juki@lists.ietf.org>; Fri, 2 Dec 2022 03:03:06 -0800 (PST)
Received: from lists by lyra.w3.org with local (Exim 4.94.2) (envelope-from <ietf-http-wg-request@listhub.w3.org>) id 1p13of-008lwA-Nv for ietf-http-wg-dist@listhub.w3.org; Fri, 02 Dec 2022 11:02:49 +0000
Resent-Date: Fri, 02 Dec 2022 11:02:49 +0000
Resent-Message-Id: <E1p13of-008lwA-Nv@lyra.w3.org>
Received: from titan.w3.org ([128.30.52.76]) by lyra.w3.org with esmtps (TLS1.3) tls TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384 (Exim 4.94.2) (envelope-from <phk@critter.freebsd.dk>) id 1p13od-008lus-No for ietf-http-wg@listhub.w3.org; Fri, 02 Dec 2022 11:02:47 +0000
Received: from phk.freebsd.dk ([130.225.244.222]) by titan.w3.org with esmtps (TLS1.3) tls TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384 (Exim 4.94.2) (envelope-from <phk@critter.freebsd.dk>) id 1p13oc-0046io-6n for ietf-http-wg@w3.org; Fri, 02 Dec 2022 11:02:47 +0000
Received: from critter.freebsd.dk (unknown [192.168.55.3]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature RSA-PSS (2048 bits) server-digest SHA256) (No client certificate requested) by phk.freebsd.dk (Postfix) with ESMTPS id 8CFEC89284; Fri, 2 Dec 2022 11:02:34 +0000 (UTC)
Received: from critter.freebsd.dk (localhost [127.0.0.1]) by critter.freebsd.dk (8.16.1/8.16.1) with ESMTPS id 2B2B2YXa005237 (version=TLSv1.3 cipher=TLS_AES_256_GCM_SHA384 bits=256 verify=NO); Fri, 2 Dec 2022 11:02:34 GMT (envelope-from phk@critter.freebsd.dk)
Received: (from phk@localhost) by critter.freebsd.dk (8.16.1/8.16.1/Submit) id 2B2B2Yu7005236; Fri, 2 Dec 2022 11:02:34 GMT (envelope-from phk)
Message-Id: <202212021102.2B2B2Yu7005236@critter.freebsd.dk>
To: Julian Reschke <julian.reschke@gmx.de>
cc: ietf-http-wg@w3.org
In-reply-to: <9990b393-93ff-75af-4e14-de4f6ba3366c@gmx.de>
From: Poul-Henning Kamp <phk@phk.freebsd.dk>
References: <2070c8e0-98d6-7b63-77c3-550bcd661397@gmx.de> <202212011735.2B1HZYgm004808@critter.freebsd.dk> <e580db7e-c0ec-0f1a-17af-5719ab09468c@gmx.de> <202212020810.2B28ALnL004331@critter.freebsd.dk> <eee5a787-da37-feb1-098a-7d2d9c0f1d37@it.aoyama.ac.jp> <202212020848.2B28mGbc004600@critter.freebsd.dk> <4e251954-afb6-fa08-616c-db95e23ad1fd@gmx.de> <202212020946.2B29kSe6004829@critter.freebsd.dk> <75dad0c0-e3bb-1189-0c16-e8275d3879ff@gmx.de> <202212021016.2B2AGvEP004972@critter.freebsd.dk> <9990b393-93ff-75af-4e14-de4f6ba3366c@gmx.de>
MIME-Version: 1.0
Content-Type: text/plain; charset="us-ascii"
Content-ID: <5234.1669978954.1@critter.freebsd.dk>
Content-Transfer-Encoding: quoted-printable
Date: Fri, 02 Dec 2022 11:02:34 +0000
Received-SPF: pass client-ip=130.225.244.222; envelope-from=phk@critter.freebsd.dk; helo=phk.freebsd.dk
X-W3C-Hub-Spam-Status: No, score=-4.9
X-W3C-Hub-Spam-Report: BAYES_00=-1.9, SPF_HELO_PASS=-0.001, SPF_PASS=-0.001, W3C_AA=-1, W3C_IRA=-1, W3C_WL=-1
X-W3C-Scan-Sig: titan.w3.org 1p13oc-0046io-6n 07ef783e032002844ec4e511a8ddf127
X-Original-To: ietf-http-wg@w3.org
Subject: Re: support for non-ASCII in strings, was: signatures vs sf-date
Archived-At: <https://www.w3.org/mid/202212021102.2B2B2Yu7005236@critter.freebsd.dk>
Resent-From: ietf-http-wg@w3.org
X-Mailing-List: <ietf-http-wg@w3.org> archive/latest/40607
X-Loop: ietf-http-wg@w3.org
Resent-Sender: ietf-http-wg-request@w3.org
Precedence: list
List-Id: <ietf-http-wg.w3.org>
List-Help: <https://www.w3.org/Mail/>
List-Post: <mailto:ietf-http-wg@w3.org>
List-Unsubscribe: <mailto:ietf-http-wg-request@w3.org?subject=unsubscribe>

--------
Julian Reschke writes:

> The key question here is: do we need to support strings containing human
> language in HTTP fields?
> <https://www.ietf.org/archive/id/draft-ietf-httpapi-rfc7807bis-04.html#section-4>
> does that, and says:

Didn't we just agree that sf(bis) can already transport I18N strings /at least/ two different ways ?

RFC5987 or sf-binary + implicit or explicit encoding information ?

I cannot see any way we "need to support strings [...]" on top of that,
and I am a big beliver in Gettys rules of design:

	1. Do not add new functionality unless an implementor cannot 
	   complete a real application without it.

	2. It is as important to decide what a system is not as to
	   decide what it is. Do not serve all the world's needs;
	   rather, make the system extensible so that additional
	   needs can be met in an upwardly compatible fashion.

	3. The only thing worse than generalizing from one example is
           generalizing from no examples at all.  (Phil Karlton)

So for all I care:  We're done with that subject in context of sfbis.

> JSON, transferred over 7 bit transports, being broken because people got
> escapes wrong?

This is way of topic by now, but the most recent one had danish
national characters encoding as ISO8859-1 characters, the one
before that had the escapes as '%5c%78[hexchar][hexchar]' and
the hex value was in ISO8859-15.

-- 
Poul-Henning Kamp       | UNIX since Zilog Zeus 3.20
phk@FreeBSD.ORG         | TCP/IP since RFC 956
FreeBSD committer       | BSD since 4.3-tahoe    
Never attribute to malice what can adequately be explained by incompetence.