Re: Unicode sucks, get over it (Re: Delta Compression and UTF-8 Header Values)

Nico Williams <nico@cryptonector.com> Mon, 11 February 2013 15:18 UTC

Return-Path: <ietf-http-wg-request@listhub.w3.org>
X-Original-To: ietfarch-httpbisa-archive-bis2Juki@ietfa.amsl.com
Delivered-To: ietfarch-httpbisa-archive-bis2Juki@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id C82F421F8A96 for <ietfarch-httpbisa-archive-bis2Juki@ietfa.amsl.com>; Mon, 11 Feb 2013 07:18:57 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -8.158
X-Spam-Level:
X-Spam-Status: No, score=-8.158 tagged_above=-999 required=5 tests=[AWL=1.667, BAYES_00=-2.599, FM_FORGED_GMAIL=0.622, RCVD_IN_DNSWL_HI=-8, SARE_SUB_ENC_UTF8=0.152]
Received: from mail.ietf.org ([64.170.98.30]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id I9SszW023fbZ for <ietfarch-httpbisa-archive-bis2Juki@ietfa.amsl.com>; Mon, 11 Feb 2013 07:18:57 -0800 (PST)
Received: from frink.w3.org (frink.w3.org [128.30.52.56]) by ietfa.amsl.com (Postfix) with ESMTP id 0513321F8A6F for <httpbisa-archive-bis2Juki@lists.ietf.org>; Mon, 11 Feb 2013 07:18:57 -0800 (PST)
Received: from lists by frink.w3.org with local (Exim 4.72) (envelope-from <ietf-http-wg-request@listhub.w3.org>) id 1U4v8e-0000vw-FA for ietf-http-wg-dist@listhub.w3.org; Mon, 11 Feb 2013 15:17:48 +0000
Resent-Date: Mon, 11 Feb 2013 15:17:48 +0000
Resent-Message-Id: <E1U4v8e-0000vw-FA@frink.w3.org>
Received: from maggie.w3.org ([128.30.52.39]) by frink.w3.org with esmtp (Exim 4.72) (envelope-from <nico@cryptonector.com>) id 1U4v8T-0000th-Cp for ietf-http-wg@listhub.w3.org; Mon, 11 Feb 2013 15:17:37 +0000
Received: from caiajhbdcbef.dreamhost.com ([208.97.132.145] helo=homiemail-a31.g.dreamhost.com) by maggie.w3.org with esmtp (Exim 4.72) (envelope-from <nico@cryptonector.com>) id 1U4v8R-0004Nz-Mg for ietf-http-wg@w3.org; Mon, 11 Feb 2013 15:17:37 +0000
Received: from homiemail-a31.g.dreamhost.com (localhost [127.0.0.1]) by homiemail-a31.g.dreamhost.com (Postfix) with ESMTP id EFEC7202044 for <ietf-http-wg@w3.org>; Mon, 11 Feb 2013 07:17:13 -0800 (PST)
DKIM-Signature: v=1; a=rsa-sha1; c=relaxed; d=cryptonector.com; h= mime-version:in-reply-to:references:date:message-id:subject:from :to:cc:content-type; s=cryptonector.com; bh=SzKzR2YfJR9vr94QGLWI ObaVrbI=; b=hEG3vXQyYbuSqMP0j6jrnx9hx5b0Ta+5J4LuS6HD+h+GKZ7IhsP7 G6kj2Xz2/0zPecjc4/q3MyhOJnagfQP0aulQRMHLx+ip3604YGOHPn1CXsNlvnw5 gXyqQGl+psC8zQaHVn9FfQWDNkkUZ8fIeRQTpBfyUd6XFC+6vlugH9A=
Received: from mail-wi0-f173.google.com (mail-wi0-f173.google.com [209.85.212.173]) (using TLSv1 with cipher RC4-SHA (128/128 bits)) (No client certificate requested) (Authenticated sender: nico@cryptonector.com) by homiemail-a31.g.dreamhost.com (Postfix) with ESMTPSA id 898CA202022 for <ietf-http-wg@w3.org>; Mon, 11 Feb 2013 07:17:13 -0800 (PST)
Received: by mail-wi0-f173.google.com with SMTP id hq4so3273368wib.0 for <ietf-http-wg@w3.org>; Mon, 11 Feb 2013 07:17:12 -0800 (PST)
MIME-Version: 1.0
X-Received: by 10.180.99.227 with SMTP id et3mr16859474wib.6.1360595832143; Mon, 11 Feb 2013 07:17:12 -0800 (PST)
Received: by 10.217.39.133 with HTTP; Mon, 11 Feb 2013 07:17:11 -0800 (PST)
In-Reply-To: <5118AD61.6030003@gmx.de>
References: <CAK3OfOgYi-=W_QGJywf3hQbFMkfWv-ceXiJbYEdWM3-iaefP4Q@mail.gmail.com> <5118AD61.6030003@gmx.de>
Date: Mon, 11 Feb 2013 09:17:11 -0600
Message-ID: <CAK3OfOiZBTSUMTNK+qfYKN2+D_qxko1eT778vcpBokq_kd0_HA@mail.gmail.com>
From: Nico Williams <nico@cryptonector.com>
To: Julian Reschke <julian.reschke@gmx.de>
Cc: Roberto Peon <grmocg@gmail.com>, Poul-Henning Kamp <phk@phk.freebsd.dk>, "\"Martin J. Dürst\"" <duerst@it.aoyama.ac.jp>, James M Snell <jasnell@gmail.com>, "ietf-http-wg@w3.org" <ietf-http-wg@w3.org>
Content-Type: text/plain; charset="UTF-8"
Received-SPF: none client-ip=208.97.132.145; envelope-from=nico@cryptonector.com; helo=homiemail-a31.g.dreamhost.com
X-W3C-Hub-Spam-Status: No, score=-3.5
X-W3C-Hub-Spam-Report: AWL=-3.449, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, RCVD_IN_DNSWL_NONE=-0.0001
X-W3C-Scan-Sig: maggie.w3.org 1U4v8R-0004Nz-Mg 1b9ecc924411df96f39f024bbd6319df
X-Original-To: ietf-http-wg@w3.org
Subject: Re: Unicode sucks, get over it (Re: Delta Compression and UTF-8 Header Values)
Archived-At: <http://www.w3.org/mid/CAK3OfOiZBTSUMTNK+qfYKN2+D_qxko1eT778vcpBokq_kd0_HA@mail.gmail.com>
Resent-From: ietf-http-wg@w3.org
X-Mailing-List: <ietf-http-wg@w3.org> archive/latest/16553
X-Loop: ietf-http-wg@w3.org
Resent-Sender: ietf-http-wg-request@w3.org
Precedence: list
List-Id: <ietf-http-wg.w3.org>
List-Help: <http://www.w3.org/Mail/>
List-Post: <mailto:ietf-http-wg@w3.org>
List-Unsubscribe: <mailto:ietf-http-wg-request@w3.org?subject=unsubscribe>

On Mon, Feb 11, 2013 at 2:35 AM, Julian Reschke <julian.reschke@gmx.de> wrote:
> On 2013-02-10 23:45, Nico Williams wrote:
>> My proposal:
>>
>>   - All text values in HTTP/2.0 that are also present in HTTP/1.1
>> should be sent as either UTF-8 or ISO8859-1, with a one-bit tag to
>> indicate which it is.
>> ...
>
> Why do we need two options?

We probably don't.  The idea was that if you have a client and server
speaking HTTP/1.1 and using ISO8559-1 (including non-ASCII
codepoints), *and* HTTP/2.0 proxies were involved that wanted to
rewrite the HTTP/1.1 as 2.0, well, they could do it and avoid
re-encoding those ISO8859-1 strings.  Probably not worth it; better go
with UTF-8 alone, period.

Nico
--