Re: support for non-ASCII in strings, was: signatures vs sf-date

Julian Reschke <julian.reschke@gmx.de> Sat, 03 December 2022 15:09 UTC

Return-Path: <ietf-http-wg-request+bounce-httpbisa-archive-bis2juki=lists.ie@listhub.w3.org>
X-Original-To: ietfarch-httpbisa-archive-bis2Juki@ietfa.amsl.com
Delivered-To: ietfarch-httpbisa-archive-bis2Juki@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 69408C1522C2 for <ietfarch-httpbisa-archive-bis2Juki@ietfa.amsl.com>; Sat, 3 Dec 2022 07:09:13 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -5.049
X-Spam-Level:
X-Spam-Status: No, score=-5.049 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, HEADER_FROM_DIFFERENT_DOMAINS=0.25, MAILING_LIST_MULTI=-1, NICE_REPLY_A=-0.001, RCVD_IN_DNSWL_MED=-2.3, RCVD_IN_MSPIKE_H2=-0.001, RCVD_IN_ZEN_BLOCKED_OPENDNS=0.001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001, URIBL_DBL_BLOCKED_OPENDNS=0.001, URIBL_ZEN_BLOCKED_OPENDNS=0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=gmx.de
Received: from mail.ietf.org ([50.223.129.194]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id NgR6ggSZrLfR for <ietfarch-httpbisa-archive-bis2Juki@ietfa.amsl.com>; Sat, 3 Dec 2022 07:09:07 -0800 (PST)
Received: from lyra.w3.org (lyra.w3.org [128.30.52.18]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange ECDHE (P-256) server-signature RSA-PSS (2048 bits) server-digest SHA256) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 89F74C1522BD for <httpbisa-archive-bis2Juki@lists.ietf.org>; Sat, 3 Dec 2022 07:09:07 -0800 (PST)
Received: from lists by lyra.w3.org with local (Exim 4.94.2) (envelope-from <ietf-http-wg-request@listhub.w3.org>) id 1p1U8N-00C9D4-U7 for ietf-http-wg-dist@listhub.w3.org; Sat, 03 Dec 2022 15:08:55 +0000
Resent-Date: Sat, 03 Dec 2022 15:08:55 +0000
Resent-Message-Id: <E1p1U8N-00C9D4-U7@lyra.w3.org>
Received: from titan.w3.org ([128.30.52.76]) by lyra.w3.org with esmtps (TLS1.3) tls TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384 (Exim 4.94.2) (envelope-from <julian.reschke@gmx.de>) id 1p1U8L-00C9Bb-PV for ietf-http-wg@listhub.w3.org; Sat, 03 Dec 2022 15:08:53 +0000
Received: from mout.gmx.net ([212.227.15.15]) by titan.w3.org with esmtps (TLS1.3) tls TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384 (Exim 4.94.2) (envelope-from <julian.reschke@gmx.de>) id 1p1U8K-004jwx-8f for ietf-http-wg@w3.org; Sat, 03 Dec 2022 15:08:53 +0000
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=gmx.de; s=s31663417; t=1670080120; bh=Et7lvmQTpqXqpyp/FbYgLT2ct5NsCPSWhC5tlDzzZv8=; h=X-UI-Sender-Class:Date:Subject:To:References:From:In-Reply-To; b=MTHvploln3bRVIgiKmKPx5zVGYhHQZHbjTukaqG7h1yuVmvt3cymWEml1cBfiUNg3 lURMlU6kmzZy9eFxnR/9MDu4xii6vG70M4Brpu5yBKxxRMAAqUtR83cDxVBd5BFOQk NPZG7g1FfWe6/8Cr2bYvzWqIRgCKqmTAQyLsUbssSYxFgGDOWPROiEcilpCYUkIIk4 PyuG2Za7WTEObra1vKhYYcZIINaTCS0YGF82n4wgXAYcHOlRkGPO/t3dyU1BvRwaCs MJLpyVsZNtRSRDv/0wyjV+cs8xEAJXdarykwjTJDbrZ+XpmzI4y/ZOuoRJwi62HgWy u8Ssd5v/Q0v2w==
X-UI-Sender-Class: 724b4f7f-cbec-4199-ad4e-598c01a50d3a
Received: from [192.168.178.20] ([84.171.152.225]) by mail.gmx.net (mrgmx004 [212.227.17.190]) with ESMTPSA (Nemesis) id 1MtwUw-1oirLg2bU4-00uIVm for <ietf-http-wg@w3.org>; Sat, 03 Dec 2022 16:08:40 +0100
Message-ID: <39485ee5-2091-c873-3d06-82fd89e255a2@gmx.de>
Date: Sat, 03 Dec 2022 16:08:35 +0100
MIME-Version: 1.0
User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:102.0) Gecko/20100101 Thunderbird/102.5.0
Content-Language: en-US
To: ietf-http-wg@w3.org
References: <2070c8e0-98d6-7b63-77c3-550bcd661397@gmx.de> <202212011735.2B1HZYgm004808@critter.freebsd.dk> <e580db7e-c0ec-0f1a-17af-5719ab09468c@gmx.de> <202212020810.2B28ALnL004331@critter.freebsd.dk> <eee5a787-da37-feb1-098a-7d2d9c0f1d37@it.aoyama.ac.jp> <202212020848.2B28mGbc004600@critter.freebsd.dk> <4e251954-afb6-fa08-616c-db95e23ad1fd@gmx.de> <202212020946.2B29kSe6004829@critter.freebsd.dk> <75dad0c0-e3bb-1189-0c16-e8275d3879ff@gmx.de> <202212021016.2B2AGvEP004972@critter.freebsd.dk> <9990b393-93ff-75af-4e14-de4f6ba3366c@gmx.de>
From: Julian Reschke <julian.reschke@gmx.de>
In-Reply-To: <9990b393-93ff-75af-4e14-de4f6ba3366c@gmx.de>
Content-Type: text/plain; charset="UTF-8"; format="flowed"
Content-Transfer-Encoding: quoted-printable
X-Provags-ID: V03:K1:9zvW98w0PAmauZ1/93b6y19sWCvCzhpQuSs+I9Gv6MClXQhu40Z wYPUq2PC0trCYolSLb+TqAHlAYAdbly5o5YRVhP2KNTGFEizHugZNzrTUjOX6SoF8PIaDMU QGOEce5CSmQPkGcGPwgq8exvuWlyJnMUh4SylI8OauqAvVkOVuGtGYnc6S1NDqhLGRRPGeN I+nkETovB4PGGQdZ+eApQ==
UI-OutboundReport: notjunk:1;M01:P0:t5/TSySLhD8=;PkQyqeEjwkoH6JN84omZ6KcgKkK 137mguEhyd81ygNCsja0TL4L/N0K124KwmfkYGtwveiJ+PvDyEDA36Wsf9sE5eMhCCGGjlmhP UzUKiPljT65DoASBI9c2fkaTTrLfsb/aGeXRlroDODZphoeVulwaX8Bh/YDpQTapjmfQPyu0V rTclUgAolKO5IcQ5EpSmEtMVQ3MYILJvTkKYTse6ju+RNx+74Ok7y2620meZZxOslCrhxnf4Y kmZLDrC/vrq3tYv21kZbDLUOF7O9dkIplChcSvKcZZle4FbvIPPKrImtS34L9a4jRX0Ne/x+v YSSPbcrYC2rJKzuwmQWrlnSlESHswzRjK0Mt9Y5w0ABseNWWvDM+hKEJodKf3nWpymRaZ/oUH gGB/5j2stDY182h3lz6NKQA7Ex1zeRrwSU5YcONG2YVh0LaMT3e5pIK+gzxC97p+nR70InJIb Y2sX/HB8LuDFmQPWcnyvjZCjYcNWA0V6mWkrraM6p0wQORoyxbjFWQxXbImi9r7nxl1chXsnU wIu2tGdrpb/nc8VTd39YJsa33NnE+iBOFhTvnCOOboAjJhf3szrDjw9gY1u4zSeNDR4C7W8Iy Iz8YrAi6KAEHwBgDLxpLTdyObomBh4hNw9GQHSZPtkqs1uu0/XIxJRizjA+T/u1I2BHptgqoe uOL7beKMKYm7QJf6wx3FCv1bpB2zVtqTYgzAVF83JsbFCnI7RXGPLmSKsAWTUDwjdutj20+PP f7PXPkLCp4OGvjIkZFDwanv6cgAKfiENzputdE3zOEB+8JI1kyOVwxm8NQBC/+stV2b1bklYj AyfPOoOlPrOE1nww/KznCXzoDKwnyW15au31Go58XbDLZKRHEhqUiV2kpygXt325j3uCrwfV/ H/LWWV0GHKtMf8WedD1gdHt0wNG/f/nNAeo31YEg8d0idV91SlJtxqZtmL7GlAq/ysm5o/lKv jA1qUwZ2RkGY/Pdro0eXUNyxmm8=
Received-SPF: pass client-ip=212.227.15.15; envelope-from=julian.reschke@gmx.de; helo=mout.gmx.net
X-W3C-Hub-DKIM-Status: validation passed: (address=julian.reschke@gmx.de domain=gmx.de), signature is good
X-W3C-Hub-Spam-Status: No, score=-6.1
X-W3C-Hub-Spam-Report: BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, FREEMAIL_FROM=0.001, NICE_REPLY_A=-0.265, RCVD_IN_DNSWL_LOW=-0.7, RCVD_IN_MSPIKE_H2=-0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, W3C_AA=-1, W3C_IRA=-1, W3C_WL=-1
X-W3C-Scan-Sig: titan.w3.org 1p1U8K-004jwx-8f 49c30889fe1a8d5e3744f2d1cfeeaee2
X-Original-To: ietf-http-wg@w3.org
Subject: Re: support for non-ASCII in strings, was: signatures vs sf-date
Archived-At: <https://www.w3.org/mid/39485ee5-2091-c873-3d06-82fd89e255a2@gmx.de>
Resent-From: ietf-http-wg@w3.org
X-Mailing-List: <ietf-http-wg@w3.org> archive/latest/40640
X-Loop: ietf-http-wg@w3.org
Resent-Sender: ietf-http-wg-request@w3.org
Precedence: list
List-Id: <ietf-http-wg.w3.org>
List-Help: <https://www.w3.org/Mail/>
List-Post: <mailto:ietf-http-wg@w3.org>
List-Unsubscribe: <mailto:ietf-http-wg-request@w3.org?subject=unsubscribe>

On 02.12.2022 11:32, Julian Reschke wrote:
>  ...

Trying to structure the discussion somehow, I have opened
<https://github.com/httpwg/http-extensions/issues/2343>, currently saying:

> RFC 8941 does not support non-ASCII characters in sf-string (but recommends to use sf-binary when non-ASCII characters are needed).
>
> The solution space is roughly:
>
>     1. No support at all
>     2. No support, but addings hints about workarounds, such as
>     2.1 Using sf-binary (approach in RFC 8941)
>     2.2 Using RFC8187 parameter encoding (works only for parameters at it uses the "*" marker)
>     2.3 Using an escaping mechanism inside sf-string (with out-of-band info to indicate that it is in use) - currently proposed for the "Problem" header field (see ietf-wg-httpapi/rfc7807bis@ebf9465)
>     3. Direct support
>     3.1 By extending sf-string to allow native UTF-8
>     3.2 By extending sf-string to support a new escaping mechanism (for instance, by introducing new backslash escapes)
>     3.3 By adding a new type ("sf-ustring") that somehow supports non-ASCII characters
>
> Note that the question whether non-ASCII octets appear in the field value (obs-text) is somehat orthogonal, expect if 3.1 would be chosen.

The purpose is to get to a common understanding of where we are with RFC
8941, and potential approaches if we wanted to do more.

Feel free to edit the description to add more details, options, but
please put *discussions* into comments.

Best regards, Julian