Re: Delta Compression and UTF-8 Header Values

Nico Williams <nico@cryptonector.com> Sat, 09 February 2013 17:50 UTC

Return-Path: <ietf-http-wg-request@listhub.w3.org>
X-Original-To: ietfarch-httpbisa-archive-bis2Juki@ietfa.amsl.com
Delivered-To: ietfarch-httpbisa-archive-bis2Juki@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 06F4021F863C for <ietfarch-httpbisa-archive-bis2Juki@ietfa.amsl.com>; Sat, 9 Feb 2013 09:50:32 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -7.508
X-Spam-Level:
X-Spam-Status: No, score=-7.508 tagged_above=-999 required=5 tests=[AWL=2.316, BAYES_00=-2.599, FM_FORGED_GMAIL=0.622, HTML_MESSAGE=0.001, RCVD_IN_DNSWL_HI=-8, SARE_SUB_ENC_UTF8=0.152]
Received: from mail.ietf.org ([64.170.98.30]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id g3sQmmg8zI-i for <ietfarch-httpbisa-archive-bis2Juki@ietfa.amsl.com>; Sat, 9 Feb 2013 09:50:31 -0800 (PST)
Received: from frink.w3.org (frink.w3.org [128.30.52.56]) by ietfa.amsl.com (Postfix) with ESMTP id 44B7F21F863B for <httpbisa-archive-bis2Juki@lists.ietf.org>; Sat, 9 Feb 2013 09:50:30 -0800 (PST)
Received: from lists by frink.w3.org with local (Exim 4.72) (envelope-from <ietf-http-wg-request@listhub.w3.org>) id 1U4EXF-0005zF-8R for ietf-http-wg-dist@listhub.w3.org; Sat, 09 Feb 2013 17:48:21 +0000
Resent-Date: Sat, 09 Feb 2013 17:48:21 +0000
Resent-Message-Id: <E1U4EXF-0005zF-8R@frink.w3.org>
Received: from maggie.w3.org ([128.30.52.39]) by frink.w3.org with esmtp (Exim 4.72) (envelope-from <nico@cryptonector.com>) id 1U4EX8-0005yQ-Ky for ietf-http-wg@listhub.w3.org; Sat, 09 Feb 2013 17:48:14 +0000
Received: from mailbigip.dreamhost.com ([208.97.132.5] helo=homiemail-a72.g.dreamhost.com) by maggie.w3.org with esmtp (Exim 4.72) (envelope-from <nico@cryptonector.com>) id 1U4EX7-0000kR-7E for ietf-http-wg@w3.org; Sat, 09 Feb 2013 17:48:14 +0000
Received: from homiemail-a72.g.dreamhost.com (localhost [127.0.0.1]) by homiemail-a72.g.dreamhost.com (Postfix) with ESMTP id BC3D96B007B for <ietf-http-wg@w3.org>; Sat, 9 Feb 2013 09:47:51 -0800 (PST)
DKIM-Signature: v=1; a=rsa-sha1; c=relaxed; d=cryptonector.com; h= mime-version:in-reply-to:references:date:message-id:subject:from :to:cc:content-type; s=cryptonector.com; bh=IkvCws+6NIDEP4SKALa/ mFpvk6w=; b=lsfF4ZZHy0TWfdGV3ggCXYeIL/op0Rg7tS81yswhieQfMrUVBjRy gl+zxfwYvU4wheHGrQ+fJk6EENNQzn2YZD9HZi1EKeX1ZhwHxTHTcfQVpm/Q7aMk Ji9vP74Oti+j3krAlhenTNO5jm2aDIAsmYLbyW3xxJBTr9s5KZsf8ys=
Received: from mail-we0-f172.google.com (mail-we0-f172.google.com [74.125.82.172]) (using TLSv1 with cipher RC4-SHA (128/128 bits)) (No client certificate requested) (Authenticated sender: nico@cryptonector.com) by homiemail-a72.g.dreamhost.com (Postfix) with ESMTPSA id 7066E6B0070 for <ietf-http-wg@w3.org>; Sat, 9 Feb 2013 09:47:51 -0800 (PST)
Received: by mail-we0-f172.google.com with SMTP id x10so3814640wey.31 for <ietf-http-wg@w3.org>; Sat, 09 Feb 2013 09:47:49 -0800 (PST)
MIME-Version: 1.0
X-Received: by 10.180.97.166 with SMTP id eb6mr8179429wib.20.1360432069805; Sat, 09 Feb 2013 09:47:49 -0800 (PST)
Received: by 10.217.39.133 with HTTP; Sat, 9 Feb 2013 09:47:48 -0800 (PST)
In-Reply-To: <op.wr8se6rpiw9drz@uranium.westinmy-starwoodgp.com>
References: <CABP7RbfRLXPpL4=wip=FvqD3DM7BM8PXi7uRswHAusXUmPO_xw@mail.gmail.com> <CE65E38D-A482-4EA9-BAF4-F6498F643A78@mnot.net> <511642E9.9010607@it.aoyama.ac.jp> <20130209133341.GA8712@1wt.eu> <op.wr8se6rpiw9drz@uranium.westinmy-starwoodgp.com>
Date: Sat, 09 Feb 2013 11:47:48 -0600
Message-ID: <CAK3OfOiU2PLFFzyQemPCUQ_Ss7MJkbmQsD=n+qq9RqmCr__mKA@mail.gmail.com>
From: Nico Williams <nico@cryptonector.com>
To: Martin Nilsson <nilsson@opera.com>
Cc: "ietf-http-wg@w3.org" <ietf-http-wg@w3.org>
Content-Type: multipart/alternative; boundary="f46d0443066443058704d54e4621"
Received-SPF: none client-ip=208.97.132.5; envelope-from=nico@cryptonector.com; helo=homiemail-a72.g.dreamhost.com
X-W3C-Hub-Spam-Status: No, score=-3.5
X-W3C-Hub-Spam-Report: AWL=-3.448, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, HTML_MESSAGE=0.001, RCVD_IN_DNSWL_NONE=-0.0001
X-W3C-Scan-Sig: maggie.w3.org 1U4EX7-0000kR-7E caf39f917e6b90bda65ee1df49615f97
X-Original-To: ietf-http-wg@w3.org
Subject: Re: Delta Compression and UTF-8 Header Values
Archived-At: <http://www.w3.org/mid/CAK3OfOiU2PLFFzyQemPCUQ_Ss7MJkbmQsD=n+qq9RqmCr__mKA@mail.gmail.com>
Resent-From: ietf-http-wg@w3.org
X-Mailing-List: <ietf-http-wg@w3.org> archive/latest/16488
X-Loop: ietf-http-wg@w3.org
Resent-Sender: ietf-http-wg-request@w3.org
Precedence: list
List-Id: <ietf-http-wg.w3.org>
List-Help: <http://www.w3.org/Mail/>
List-Post: <mailto:ietf-http-wg@w3.org>
List-Unsubscribe: <mailto:ietf-http-wg-request@w3.org?subject=unsubscribe>

On Saturday, February 9, 2013, Martin Nilsson wrote:

> On Sat, 09 Feb 2013 14:33:41 +0100, Willy Tarreau <w@1wt.eu> wrote:
>
>  Also, processing it is
>> particularly inefficient as you have to parse each and every byte to find
>> a length, making string comparisons quite slow.
>>
>
> You don't need to know the length in characters to compare strings. Just
> comparing byte on byte works fine. Null is encoded the same, and byte zero
> only appear as null in UTF-8, so strlen works fine. So far strings are
> hollerith encoded in HTTP/2, so it should be a moot point anyway.
>

<insert pedantic comment about Unicode normalization here>