Re: Unicode escape sequence | Re: draft-ietf-httpbis-header-structure-00, unicode range

"Poul-Henning Kamp" <phk@phk.freebsd.dk> Fri, 23 December 2016 16:17 UTC

To: Mark Nottingham <mnot@mnot.net>
cc: Martin Thomson <martin.thomson@gmail.com>, "Julian F. Reschke" <julian.reschke@gmx.de>, Alexey Melnikov <alexey.melnikov@isode.com>, Matthew Kerwin <matthew@kerwin.net.au>, Kari Hurtta <hurtta-ietf@elmme-mailer.org>, Ilari Liusvaara <ilariliusvaara@welho.com>, HTTP working group mailing list <ietf-http-wg@w3.org>, Poul-Henning Kamp <phk@varnish-cache.org>
In-reply-to: <C8F21FA8-8B03-4E9C-B0E8-CD3C9CF028CE@mnot.net>
From: Poul-Henning Kamp <phk@phk.freebsd.dk>
References: <20161213173327.C1F7D1714B@welho-filter2.welho.com> <20161213175419.GA7943@LK-Perkele-V2.elisa-laajakaista.fi> <25434.1481665395@critter.freebsd.dk> <201612140628.uBE6SO3L025885@shell.siilo.fmi.fi> <36792.1481701328@critter.freebsd.dk> <CACweHNDKgWQewZHb=Kz3_2=41M58sY5472Q5OwpqPLxorvkzHQ@mail.gmail.com> <37223.1481707288@critter.freebsd.dk> <3a65ca44-f652-3b14-6d64-46f35b32df57@isode.com> <55880.1481711031@critter.freebsd.dk> <95057a05-6714-9154-8cf8-7cd302c86715@gmx.de> <60914.1481712680@critter.freebsd.dk> <CABkgnnWzOhkznH2HzweNegYo4dDHE+DT0PM=eCSvVr+-Wkup1A@mail.gmail.com> <63340.1481720099@critter.freebsd.dk> <C8F21FA8-8B03-4E9C-B0E8-CD3C9CF028CE@mnot.net>
MIME-Version: 1.0
Content-Type: text/plain; charset="us-ascii"
Content-ID: <56062.1482509609.1@critter.freebsd.dk>
Content-Transfer-Encoding: quoted-printable
Date: Fri, 23 Dec 2016 16:13:29 +0000
Message-ID: <56063.1482509609@critter.freebsd.dk>
Subject: Re: Unicode escape sequence | Re: draft-ietf-httpbis-header-structure-00, unicode range
Archived-At: <http://www.w3.org/mid/56063.1482509609@critter.freebsd.dk>

--------
In message <C8F21FA8-8B03-4E9C-B0E8-CD3C9CF028CE@mnot.net>, Mark Nottingham
writes:

>And, as discussed previously, there aren't a lot of use cases for 
>non-ASCII header values in standards (because few have a payload that's 
>exposed to end users), so the reward for taking that risk is 
>questionable.

But isn't that an almost circular[1] argument, when there is no
safe standardized way to put non-ASCII into header values in the
first place?

Anyway, here is my current thinking:

a) Remove the UTF8 option
-------------------------

Having UTF8 as an option for the H1 serialization is unlikely to
significantly increase the number of paths on which you can use it.

The subset where it will work will be the same subset with or without
our blessing:  End-to-End fully confined and controlled paths.


b) Add BCP137's recommendation unchanged (%x5C.75.27 4*6HEXDIG %x27)
--------------------------------------------------------------------

A number of alternatives have been proposed, but I do not think the
relatively minor increase in HPACK and H1 efficiency they offer
warrants "needlessly multiplying entities"[2].
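
For concreteness, here is a rough Python sketch of what that ABNF
amounts to: a backslash, a lowercase "u", an apostrophe, 4 to 6 hex
digits and a closing apostrophe.  The function names are made up for
illustration, and the sketch assumes that a literal backslash in the
input needs no special treatment, which a real specification would
have to settle.

```python
import re

# BCP137 form: %x5C.75.27 4*6HEXDIG %x27, i.e. \u'XXXX' with 4 to 6 hex digits.
_ESCAPE = re.compile(r"\\u'([0-9A-Fa-f]{4,6})'")


def escape_non_ascii(text):
    """Replace every non-ASCII code point with a \\u'XXXX' escape (hypothetical helper)."""
    out = []
    for ch in text:
        cp = ord(ch)
        if cp < 0x80:
            out.append(ch)
        else:
            # At least 4 hex digits, more only when the code point needs them.
            out.append("\\u'%04X'" % cp)
    return "".join(out)


def unescape(text):
    """Turn \\u'XXXX' escapes back into code points (hypothetical helper)."""
    def _replace(match):
        cp = int(match.group(1), 16)
        # Six hex digits can exceed the Unicode range; surrogates are not characters.
        if cp > 0x10FFFF or 0xD800 <= cp <= 0xDFFF:
            raise ValueError("not a Unicode scalar value: %X" % cp)
        return chr(cp)
    return _ESCAPE.sub(_replace, text)
```

The escaped form is pure ASCII, so it passes through any path that
carries today's header values unharmed.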


Absent really convincing arguments, that will be my next edit.

Poul-Henning


[1] "Almost circular" is a lot more complex than most people imagine.
Highly recommended reading: http://press.princeton.edu/titles/8624.html

[2] Occam's Razor.

-- 
Poul-Henning Kamp       | UNIX since Zilog Zeus 3.20
phk@FreeBSD.ORG         | TCP/IP since RFC 956
FreeBSD committer       | BSD since 4.3-tahoe    
Never attribute to malice what can adequately be explained by incompetence.