Re: Unicode sucks, get over it (Re: Delta Compression and UTF-8 Header Values)

Phillip Hallam-Baker <hallam@gmail.com> Mon, 11 February 2013 15:58 UTC

Return-Path: <ietf-http-wg-request@listhub.w3.org>
X-Original-To: ietfarch-httpbisa-archive-bis2Juki@ietfa.amsl.com
Delivered-To: ietfarch-httpbisa-archive-bis2Juki@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id E217521F887F for <ietfarch-httpbisa-archive-bis2Juki@ietfa.amsl.com>; Mon, 11 Feb 2013 07:58:45 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -9.116
X-Spam-Level:
X-Spam-Status: No, score=-9.116 tagged_above=-999 required=5 tests=[AWL=1.330, BAYES_00=-2.599, HTML_MESSAGE=0.001, RCVD_IN_DNSWL_HI=-8, SARE_SUB_ENC_UTF8=0.152]
Received: from mail.ietf.org ([64.170.98.30]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id On2sxu+70Dwp for <ietfarch-httpbisa-archive-bis2Juki@ietfa.amsl.com>; Mon, 11 Feb 2013 07:58:45 -0800 (PST)
Received: from frink.w3.org (frink.w3.org [128.30.52.56]) by ietfa.amsl.com (Postfix) with ESMTP id 1EBB421F8873 for <httpbisa-archive-bis2Juki@lists.ietf.org>; Mon, 11 Feb 2013 07:58:45 -0800 (PST)
Received: from lists by frink.w3.org with local (Exim 4.72) (envelope-from <ietf-http-wg-request@listhub.w3.org>) id 1U4vls-0003tm-29 for ietf-http-wg-dist@listhub.w3.org; Mon, 11 Feb 2013 15:58:20 +0000
Resent-Date: Mon, 11 Feb 2013 15:58:20 +0000
Resent-Message-Id: <E1U4vls-0003tm-29@frink.w3.org>
Received: from maggie.w3.org ([128.30.52.39]) by frink.w3.org with esmtp (Exim 4.72) (envelope-from <hallam@gmail.com>) id 1U4vlg-0003lK-SD for ietf-http-wg@listhub.w3.org; Mon, 11 Feb 2013 15:58:08 +0000
Received: from mail-we0-f173.google.com ([74.125.82.173]) by maggie.w3.org with esmtps (TLS1.0:RSA_ARCFOUR_SHA1:16) (Exim 4.72) (envelope-from <hallam@gmail.com>) id 1U4vkz-0005hL-1F for ietf-http-wg@w3.org; Mon, 11 Feb 2013 15:58:08 +0000
Received: by mail-we0-f173.google.com with SMTP id r5so4763887wey.4 for <ietf-http-wg@w3.org>; Mon, 11 Feb 2013 07:56:58 -0800 (PST)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:x-received:in-reply-to:references:date:message-id :subject:from:to:cc:content-type; bh=+hteZgD9+ur2WsvRxbuyJ+cJ/5E/L4DgKshBbNNeXUY=; b=0l9n45eKCwcCo1i3RXHESWisCXO4xts1B1LFowhO1QEQuMIDaZJ610mctWBvU8e41W RtTAcIPhGCVIiKBGlCUQoTKovw9F/YrEB1/Uvd64bRLsY+oZDi9WzRm4IndihGhneu47 57RY/h81v3SARqf14IH9Mwr6UdrHJqo7pM5+rXhpAy4pdAC3gPnYZl1LaSXqwjow9Pqs e//oQHv7zmi93Hxs/U45Q2TicOIPJmqg+eP+We0ONZ1C9XsAfieaNlaXucDSpEc8MLL9 eyFHd8dpYZyQCxb1uEcd4x7d6B+jytyp4RLre6TAhfobSH3CIYou//Hg2+WwjXHOo6ZN HXvA==
MIME-Version: 1.0
X-Received: by 10.180.108.3 with SMTP id hg3mr16875341wib.33.1360598218660; Mon, 11 Feb 2013 07:56:58 -0800 (PST)
Received: by 10.194.153.104 with HTTP; Mon, 11 Feb 2013 07:56:57 -0800 (PST)
In-Reply-To: <CAK3OfOiZBTSUMTNK+qfYKN2+D_qxko1eT778vcpBokq_kd0_HA@mail.gmail.com>
References: <CAK3OfOgYi-=W_QGJywf3hQbFMkfWv-ceXiJbYEdWM3-iaefP4Q@mail.gmail.com> <5118AD61.6030003@gmx.de> <CAK3OfOiZBTSUMTNK+qfYKN2+D_qxko1eT778vcpBokq_kd0_HA@mail.gmail.com>
Date: Mon, 11 Feb 2013 10:56:57 -0500
Message-ID: <CAMm+LwhVgCvBTYsQZALAveh4gD873Sb+Lz1YV_gksWzoRLikPQ@mail.gmail.com>
From: Phillip Hallam-Baker <hallam@gmail.com>
To: Nico Williams <nico@cryptonector.com>
Cc: Julian Reschke <julian.reschke@gmx.de>, Roberto Peon <grmocg@gmail.com>, Poul-Henning Kamp <phk@phk.freebsd.dk>, "\"Martin J. Dürst\"" <duerst@it.aoyama.ac.jp>, James M Snell <jasnell@gmail.com>, "ietf-http-wg@w3.org" <ietf-http-wg@w3.org>
Content-Type: multipart/alternative; boundary="e89a8f3ba6e3815b7704d574f594"
Received-SPF: pass client-ip=74.125.82.173; envelope-from=hallam@gmail.com; helo=mail-we0-f173.google.com
X-W3C-Hub-Spam-Status: No, score=-3.2
X-W3C-Hub-Spam-Report: AWL=-2.354, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, FREEMAIL_FROM=0.001, HTML_MESSAGE=0.001, RCVD_IN_DNSWL_LOW=-0.7, SPF_PASS=-0.001
X-W3C-Scan-Sig: maggie.w3.org 1U4vkz-0005hL-1F ef0e2576a590d38f2894ce65f0d059db
X-Original-To: ietf-http-wg@w3.org
Subject: Re: Unicode sucks, get over it (Re: Delta Compression and UTF-8 Header Values)
Archived-At: <http://www.w3.org/mid/CAMm+LwhVgCvBTYsQZALAveh4gD873Sb+Lz1YV_gksWzoRLikPQ@mail.gmail.com>
Resent-From: ietf-http-wg@w3.org
X-Mailing-List: <ietf-http-wg@w3.org> archive/latest/16558
X-Loop: ietf-http-wg@w3.org
Resent-Sender: ietf-http-wg-request@w3.org
Precedence: list
List-Id: <ietf-http-wg.w3.org>
List-Help: <http://www.w3.org/Mail/>
List-Post: <mailto:ietf-http-wg@w3.org>
List-Unsubscribe: <mailto:ietf-http-wg-request@w3.org?subject=unsubscribe>

On Mon, Feb 11, 2013 at 10:17 AM, Nico Williams <nico@cryptonector.com>wrote:

> On Mon, Feb 11, 2013 at 2:35 AM, Julian Reschke <julian.reschke@gmx.de>
> wrote:
> > On 2013-02-10 23:45, Nico Williams wrote:
> >> My proposal:
> >>
> >>   - All text values in HTTP/2.0 that are also present in HTTP/1.1
> >> should be sent as either UTF-8 or ISO8859-1, with a one-bit tag to
> >> indicate which it is.
> >> ...
> >
> > Why do we need two options?
>
> We probably don't.  The idea was that if you have a client and server
> speaking HTTP/1.1 and using ISO8559-1 (including non-ASCII
> codepoints), *and* HTTP/2.0 proxies were involved that wanted to
> rewrite the HTTP/1.1 as 2.0, well, they could do it and avoid
> re-encoding those ISO8859-1 strings.  Probably not worth it; better go
> with UTF-8 alone, period.
>

+1

I can't see a good reason why a HTTP2 proxy would not speak HTTP/1.1 for a
long time to come. If it is rewriting requests as HTTP2 then it probably
has a reason for doing so that would make UTF8 desirable as well.


-- 
Website: http://hallambaker.com/