Re: support for non-ASCII in strings, was: signatures vs sf-date
Julian Reschke <julian.reschke@gmx.de> Sat, 03 December 2022 07:47 UTC
Return-Path: <ietf-http-wg-request+bounce-httpbisa-archive-bis2juki=lists.ie@listhub.w3.org>
X-Original-To: ietfarch-httpbisa-archive-bis2Juki@ietfa.amsl.com
Delivered-To: ietfarch-httpbisa-archive-bis2Juki@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 1D522C14CE2B for <ietfarch-httpbisa-archive-bis2Juki@ietfa.amsl.com>; Fri, 2 Dec 2022 23:47:43 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -7.749
X-Spam-Level:
X-Spam-Status: No, score=-7.749 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, HEADER_FROM_DIFFERENT_DOMAINS=0.25, MAILING_LIST_MULTI=-1, NICE_REPLY_A=-0.001, RCVD_IN_DNSWL_HI=-5, RCVD_IN_MSPIKE_H2=-0.001, RCVD_IN_ZEN_BLOCKED_OPENDNS=0.001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001, URIBL_DBL_BLOCKED_OPENDNS=0.001, URIBL_ZEN_BLOCKED_OPENDNS=0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=gmx.de
Received: from mail.ietf.org ([50.223.129.194]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id ccn-iQR4rqoA for <ietfarch-httpbisa-archive-bis2Juki@ietfa.amsl.com>; Fri, 2 Dec 2022 23:47:39 -0800 (PST)
Received: from lyra.w3.org (lyra.w3.org [128.30.52.18]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange ECDHE (P-256) server-signature RSA-PSS (2048 bits) server-digest SHA256) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 394F1C14F737 for <httpbisa-archive-bis2Juki@lists.ietf.org>; Fri, 2 Dec 2022 23:47:38 -0800 (PST)
Received: from lists by lyra.w3.org with local (Exim 4.94.2) (envelope-from <ietf-http-wg-request@listhub.w3.org>) id 1p1NF8-00BTc8-IS for ietf-http-wg-dist@listhub.w3.org; Sat, 03 Dec 2022 07:47:26 +0000
Resent-Date: Sat, 03 Dec 2022 07:47:26 +0000
Resent-Message-Id: <E1p1NF8-00BTc8-IS@lyra.w3.org>
Received: from mimas.w3.org ([128.30.52.79]) by lyra.w3.org with esmtps (TLS1.3) tls TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384 (Exim 4.94.2) (envelope-from <julian.reschke@gmx.de>) id 1p1NF6-00BTaf-FZ for ietf-http-wg@listhub.w3.org; Sat, 03 Dec 2022 07:47:24 +0000
Received: from mout.gmx.net ([212.227.15.15]) by mimas.w3.org with esmtps (TLS1.3) tls TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384 (Exim 4.94.2) (envelope-from <julian.reschke@gmx.de>) id 1p1NF4-006HDU-W6 for ietf-http-wg@w3.org; Sat, 03 Dec 2022 07:47:24 +0000
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=gmx.de; s=s31663417; t=1670053631; bh=HW7PtkXJr2JnpDjvj2F6VYymjCOkz7hC8TheBFXBsds=; h=X-UI-Sender-Class:Date:Subject:To:References:From:In-Reply-To; b=qpEzyvgwDVELCUoATrutovPOxoyO85G1Zd3f+xLzwduVPhanXnTDed2HcHvXwe27Y NiJz7rU7D/Y7bpHEBkDtHvS1jXI7VME22n3O5E6sucSlbJVh8XjhPnniKKJhbMQC+t /mWPJfWlu+K8srg2auPlodEZZID4UcV+Xapiy3eZG8VaIi8T0Fr1eywOouWCvRrcHM G8vehzh5xkY2BPiDv+42WaWLXaTBDHCo+HB0DE2/f3XVNImaQw6IdjO/0sTBWNDGeP F8oObKtJOZepVkOie7KZFAdH5butwFpK2fBqTAJpkkWpvm4o0l072tRfjwZc8NVfBP WJioTTvO9itVA==
X-UI-Sender-Class: 724b4f7f-cbec-4199-ad4e-598c01a50d3a
Received: from [192.168.178.20] ([84.171.152.225]) by mail.gmx.net (mrgmx005 [212.227.17.190]) with ESMTPSA (Nemesis) id 1M4s0j-1p2C3E0pXz-001zD4 for <ietf-http-wg@w3.org>; Sat, 03 Dec 2022 08:47:11 +0100
Message-ID: <c6b41b93-23b0-f3b8-5d7f-05e52614070a@gmx.de>
Date: Sat, 03 Dec 2022 08:47:10 +0100
MIME-Version: 1.0
User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:102.0) Gecko/20100101 Thunderbird/102.5.0
Content-Language: en-US
To: ietf-http-wg@w3.org
References: <2070c8e0-98d6-7b63-77c3-550bcd661397@gmx.de> <e580db7e-c0ec-0f1a-17af-5719ab09468c@gmx.de> <202212020810.2B28ALnL004331@critter.freebsd.dk> <eee5a787-da37-feb1-098a-7d2d9c0f1d37@it.aoyama.ac.jp> <202212020848.2B28mGbc004600@critter.freebsd.dk> <4e251954-afb6-fa08-616c-db95e23ad1fd@gmx.de> <202212020946.2B29kSe6004829@critter.freebsd.dk> <75dad0c0-e3bb-1189-0c16-e8275d3879ff@gmx.de> <202212021016.2B2AGvEP004972@critter.freebsd.dk> <9990b393-93ff-75af-4e14-de4f6ba3366c@gmx.de> <7b7f714d-890e-db90-4922-cfbc46b3e999@gmx.de> <202212021129.2B2BTY9f005362@critter.freebsd.dk> <b1d3af79-373f-a9af-7ff9-39f5f44915f0@gmx.de> <202212021214.2B2CEUQx005654@critter.freebsd.dk> <7a93fa17-38fe-5fa8-54ed-a726ab9d5a39@gmx.de> <841DC85E-F936-4350-A74F-170D22E6ADCE@gbiv.com> <202212021918.2B2JIBHC007228@critter.freebsd.dk> <65070e79-5429-a4cd-abe2-667b526badf1@gmx.de> <202212022147.2B2LlcqP008154@critter.freebsd.dk> <53D8E497-284A-4B2C-91D8-367542AA0A7C@mnot.net>
From: Julian Reschke <julian.reschke@gmx.de>
In-Reply-To: <53D8E497-284A-4B2C-91D8-367542AA0A7C@mnot.net>
Content-Type: text/plain; charset="UTF-8"; format="flowed"
Content-Transfer-Encoding: quoted-printable
X-Provags-ID: V03:K1:Z2DA6kO+h2okLFY5hrYHoqJxjbLxyfL0AAkJvpWzTSB4CZIX8Pi C9bOVZlvukSCENYW/sHaMQuWh/geHBPg6CccCkcAJEzZzMLhjuih73BfVxhX+wdfes5/WrA NjNCi27ogiqX6bxAHfwpti+iSJdRPz6MpBJLZchAdmps7xsxhqc2HMtAmPwt25nQdDbazeI tixw/320TvapeFw7qJJdg==
UI-OutboundReport: notjunk:1;M01:P0:leLKOZ/rxbg=;TXe6aiYg2Nk7xkv4YIgXcqxuss5 SBnf5keYxWoy8zPxdbzMBLKBSy3Unk/YCeo8M73P2gblhNwRMukYOedL9niePamNTx4IzQ6Kq DcOeJGzH2JiAmVn7Rnav7w6r6m9XzS5KqIZPANom+n/fqQ6WCEEz5IRRCGhJW9JNc6e8WFd4p fpkg7As6JJfhYrr3REelJ592f0f7xmjuOJNn4DYqeaMnvWd3mL8S3fFufgTaCv6fzLi9s8JRN DPpAWpX7kLBemzd3j4Nf+dLxev17L7VAUBRXSYFJKrg4fTxVuKn201ZucKNsFd+CDYbkOcBmp jzYy+7mHn4+DFqfVkW/+rBQTkIXLA6nEgwq88HQ8TK8PDXeyrYPtC5vl2XxtmXJBDIlcOdiGt zYEzzTGVaKU1IOqC6GRDk7kneafizAzuO2EskqdIp3cMGotoubhHPFg5KbnWXitQaPhIWiu7B W4ldDhZXRFxDhBDQxJ5Vk4wtwwtRDj8O4CnH7nfu/vUGN2NbbB9vYwGsxW6TCY/IG4G54ehwk RXuOxmfWfYpv7ozaR5Q0a5dKa/gKBczosZZG7BOlcTbr1uN1viMDvngPbAyrUVO3kxWaS3Iv2 RGUx0tZXonuJpneqqA8FFRB774CzLNWRPkIQdNVWOmOBJb9ox5qOpMpGuhWe9phOulk1oThnZ qUmlkH0DzPxM9kVcJ9RvXVVQB+MiiOsM37tkiyLKACP8X+toM886HONgsAc/jJdxwtmj0WFWq LvPlHPZZRXBI3Vs8GkerJL7BnoOcgDUJBlUBwPBIt6ytoWpM+EafyM4czDL+wp4gSkDipsJA6 l/4qEVUkXZB9d41n1eDY1/6Eh/RoE/LbTdZiSJ6+40En6Ti4XgFradh5v8uWkhFaPkoHvcgWl MDaIy7tTcsNMrMILIfgY8aQakFgRJKTRdzZqWywjIUVnJc1eZW7KMHudlPWzQwJcW4fh+abn3 TQaGJ10hl1VJPZ24Bx9VTfoakN4=
Received-SPF: pass client-ip=212.227.15.15; envelope-from=julian.reschke@gmx.de; helo=mout.gmx.net
X-W3C-Hub-DKIM-Status: validation passed: (address=julian.reschke@gmx.de domain=gmx.de), signature is good
X-W3C-Hub-Spam-Status: No, score=-6.1
X-W3C-Hub-Spam-Report: BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, FREEMAIL_FROM=0.001, NICE_REPLY_A=-0.265, RCVD_IN_DNSWL_LOW=-0.7, RCVD_IN_MSPIKE_H2=-0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, W3C_AA=-1, W3C_IRA=-1, W3C_WL=-1
X-W3C-Scan-Sig: mimas.w3.org 1p1NF4-006HDU-W6 cb8a08f28cb8b56f2d3a030a5839cf74
X-Original-To: ietf-http-wg@w3.org
Subject: Re: support for non-ASCII in strings, was: signatures vs sf-date
Archived-At: <https://www.w3.org/mid/c6b41b93-23b0-f3b8-5d7f-05e52614070a@gmx.de>
Resent-From: ietf-http-wg@w3.org
X-Mailing-List: <ietf-http-wg@w3.org> archive/latest/40635
X-Loop: ietf-http-wg@w3.org
Resent-Sender: ietf-http-wg-request@w3.org
Precedence: list
List-Id: <ietf-http-wg.w3.org>
List-Help: <https://www.w3.org/Mail/>
List-Post: <mailto:ietf-http-wg@w3.org>
List-Unsubscribe: <mailto:ietf-http-wg-request@w3.org?subject=unsubscribe>
On 03.12.2022 05:48, Mark Nottingham wrote: > My, you've all been busy while I slept. Catching up on this thread with my (for now, personal) thoughts. > > 1) We're adding Dates because it's a fairly common data type and generic software can do potentially interesting things when presenting / manipulating them. It's not a _strong_ motivation, but it seems to have got us over the wire. That's my understanding as well. And I argue that for the same reason, a common and on-the-wire-detectable encoding of non-ASCII characters would be good. > 2) I added %-encoded strings to Problem because the other encoding didn't fit cleanly into SF-land. However, we should _not_ add non-ASCII strings to SF because they're a footgun that for _most_ cases, will cause more trouble than they're worth. > > In the protocol (not content), most strings are intended for machines, not people, and ASCII strings can be processed fairly unambiguously; that's not true when you open things up to full Unicode. We're discussing this for header fields that *do* carry human-presentable content. Content-Disposition, Link, and now Problem. If you're serious about human presentable text not belonging here, why do we add that to "Problem" right now? > There are some cases where non-ASCII strings are needed in header fields; mostly, when you're presenting something to a human from the fields. Those cases are not as common. However, there's a catch to adding them: if full unicode strings were available in the protocol, many designers will understandably use them because it's been drilled into all our heads that unicode is what you use for strings. > > Hence, footgun. I would appreciate if you would explain why there is a problem we need to prevent, and what exactly that problem is. Do you have an example? > By leaving full unicode support out of the spec and forcing designers to take positive steps to support it, the (relatively small) barrier to adoption makes them stop and think whether they need it. I think that's a good thing. I also know that will make some i18n folks unhappy, and I'm sorry for that; unfortunately we're working in an area where protocol artefacts intended for humans and machines are mixed, and so it gets difficult. I continue to disagree. By not supporting non-ASCII in the base definition, we force people to come up with ad hoc definitions which in general will be worse than a common extension we can define here. > All of that said, once the algorithms are stable (as Julian has pointed out, they contain some errors), I wouldn't object to including the %-encoding text as an appendix in sf-bis with appropriate warnings, if other folks are amenable. That would be a good step into the right direction. I still think we need an on-the-wire signal that the encoding is in place, for the same reasons why we're doing this revision in the first place (tooling support for special-casing integers that happen to represent dates). > ... Best regards, Julian
- signatures vs sf-date Julian Reschke
- Re: signatures vs sf-date Poul-Henning Kamp
- Re: signatures vs sf-date Julian Reschke
- Re: signatures vs sf-date Poul-Henning Kamp
- Re: signatures vs sf-date Martin J. Dürst
- Re: signatures vs sf-date Poul-Henning Kamp
- Re: signatures vs sf-date Julian Reschke
- Re: signatures vs sf-date Poul-Henning Kamp
- Re: signatures vs sf-date Julian Reschke
- Re: signatures vs sf-date Poul-Henning Kamp
- support for non-ASCII in strings, was: signatures… Julian Reschke
- Re: support for non-ASCII in strings, was: signat… Julian Reschke
- Re: support for non-ASCII in strings, was: signat… Poul-Henning Kamp
- Re: support for non-ASCII in strings, was: signat… Julian Reschke
- Re: support for non-ASCII in strings, was: signat… Poul-Henning Kamp
- Re: support for non-ASCII in strings, was: signat… Poul-Henning Kamp
- Re: support for non-ASCII in strings, was: signat… Julian Reschke
- Re: support for non-ASCII in strings, was: signat… Julian Reschke
- Re: support for non-ASCII in strings, was: signat… Poul-Henning Kamp
- Re: support for non-ASCII in strings, was: signat… Poul-Henning Kamp
- Re: support for non-ASCII in strings, was: signat… Julian Reschke
- Re: support for non-ASCII in strings, was: signat… Carsten Bormann
- Re: support for non-ASCII in strings, was: signat… Poul-Henning Kamp
- Re: signatures vs sf-date Justin Richer
- Re: signatures vs sf-date Julian Reschke
- Re: signatures vs sf-date Ilari Liusvaara
- Re: signatures vs sf-date Poul-Henning Kamp
- Re: support for non-ASCII in strings, was: signat… Roy T. Fielding
- Re: support for non-ASCII in strings, was: signat… Poul-Henning Kamp
- Re: support for non-ASCII in strings, was: signat… Roy T. Fielding
- Re: support for non-ASCII in strings, was: signat… Poul-Henning Kamp
- Re: signatures vs sf-date Justin Richer
- Re: support for non-ASCII in strings, was: signat… Julian Reschke
- Re: signatures vs sf-date Julian Reschke
- Re: support for non-ASCII in strings, was: signat… Poul-Henning Kamp
- Re: support for non-ASCII in strings, was: signat… Mark Nottingham
- Re: support for non-ASCII in strings, was: signat… Julian Reschke
- Re: support for non-ASCII in strings, was: signat… Julian Reschke
- Re: support for non-ASCII in strings, was: signat… Mark Nottingham
- Re: support for non-ASCII in strings, was: signat… Julian Reschke
- Re: support for non-ASCII in strings, was: signat… Willy Tarreau
- Re: support for non-ASCII in strings, was: signat… Julian Reschke
- Re: support for non-ASCII in strings, was: signat… Julian Reschke
- Re: signatures vs sf-date Julian Reschke
- Re: signatures vs sf-date Mark Nottingham
- Re: signatures vs sf-date Julian Reschke
- Re: signatures vs sf-date Lucas Pardue
- Re: signatures vs sf-date Julian Reschke
- Re: signatures vs sf-date Ilari Liusvaara
- Re: signatures vs sf-date Lucas Pardue
- Re: signatures vs sf-date Mark Nottingham
- Re: signatures vs sf-date Lucas Pardue
- Re: signatures vs sf-date Mark Nottingham
- Re: signatures vs sf-date Julian Reschke
- Re: signatures vs sf-date Mark Nottingham
- Re: signatures vs sf-date Julian Reschke
- Re: signatures vs sf-date Watson Ladd
- Re: signatures vs sf-date Julian Reschke
- Re: signatures vs sf-date Watson Ladd
- Re: signatures vs sf-date Julian Reschke