Re: draft-ietf-httpbis-header-structure-00, unicode range

Julian Reschke <julian.reschke@gmx.de> Wed, 14 December 2016 10:18 UTC

To: Poul-Henning Kamp <phk@phk.freebsd.dk>, Ilari Liusvaara <ilariliusvaara@welho.com>
References: <20161213173327.C1F7D1714B@welho-filter2.welho.com> <20161213175419.GA7943@LK-Perkele-V2.elisa-laajakaista.fi> <25434.1481665395@critter.freebsd.dk>
Cc: Kari Hurtta <hurtta-ietf@elmme-mailer.org>, HTTP working group mailing list <ietf-http-wg@w3.org>, Poul-Henning Kamp <phk@varnish-cache.org>
From: Julian Reschke <julian.reschke@gmx.de>
Message-ID: <c360d5bc-d22d-278e-952b-1fad78f73c44@gmx.de>
Date: Wed, 14 Dec 2016 11:15:28 +0100

On 2016-12-13 22:43, Poul-Henning Kamp wrote:
> --------
> In message <20161213175419.GA7943@LK-Perkele-V2.elisa-laajakaista.fi>,
> Ilari Liusvaara writes:
>
>>> 3.  HTTP/1 Serialization of HTTP Header Common Structure
>>> https://tools.ietf.org/html/draft-ietf-httpbis-header-structure-00#section-3
>
>
>> Well, that production lists UTF8-4, which is presumably 4-byte UTF-8
>> sequences, and all valid ones are astral plane codepoints.
>
> My impression was that UTF8 and 8-bit clean HTTP/1 got shot down
> in previous discussions, but I left UTF8 here for now, pending a
> more structured decision making on this.
>
> I see us having four options, in my order of preference:
>
> 1) Forbid Unicode in headers.
>
> 2) Take UTF8 out and leave all (non-ASCII) unicode to the \uxxxx
>    escape mechanism.
>
> 3) Leave UTF8 in, and make it clear that it may or may not work, so
>    that people can use it in controlled environments.
>
> 4) Leave UTF8 in, and specify how to indicate/negotiate if it can be used.
>
>> astral planes (and I hope the escape system there would be more sane
>> than the one JSON has...)
>
> Any suggestions ?

3) seems like the right choice to me.
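
For context on the two points quoted above (that every valid 4-byte UTF-8 sequence is an astral-plane code point, and that the JSON escape mechanism is awkward for those), here is a minimal Python sketch. The helper names utf8_len and json_escape are illustrative only and do not come from the draft; the escape shown is JSON's \uXXXX form as produced by Python's standard json module, not whatever escape syntax the draft ends up specifying.

```python
import json


def utf8_len(code_point: int) -> int:
    """Return the number of bytes UTF-8 uses for one code point."""
    return len(chr(code_point).encode("utf-8"))


def json_escape(text: str) -> str:
    """Serialize text as an ASCII-only JSON string (non-ASCII becomes \\uXXXX)."""
    return json.dumps(text, ensure_ascii=True)


# The last code point of the Basic Multilingual Plane still fits in 3 bytes;
# everything from U+10000 up to U+10FFFF (the astral planes) needs 4 bytes.
assert utf8_len(0xFFFF) == 3
assert utf8_len(0x10000) == 4
assert utf8_len(0x10FFFF) == 4

# JSON's \uXXXX escape carries only 16 bits, so an astral code point such as
# U+1F600 has to be written as a UTF-16 surrogate pair (two escapes).
astral = "\U0001F600"
escaped = json_escape(astral)
assert escaped == '"\\ud83d\\ude00"'

# The pair decodes back to the single original code point.
assert json.loads(escaped) == astral
```

The surrogate-pair requirement is presumably the part of the JSON scheme that a "more sane" escape would avoid, for example by allowing more than four hex digits per escape.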

Best regards, Julian