Re: bohe implementation for compression tests
Roberto Peon <grmocg@gmail.com> Thu, 17 January 2013 01:10 UTC
Return-Path: <ietf-http-wg-request@listhub.w3.org>
X-Original-To: ietfarch-httpbisa-archive-bis2Juki@ietfa.amsl.com
Delivered-To: ietfarch-httpbisa-archive-bis2Juki@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 8E4D511E809C for <ietfarch-httpbisa-archive-bis2Juki@ietfa.amsl.com>; Wed, 16 Jan 2013 17:10:07 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -7.284
X-Spam-Level:
X-Spam-Status: No, score=-7.284 tagged_above=-999 required=5 tests=[AWL=-3.274, BAYES_00=-2.599, FB_WORD1_END_DOLLAR=3.294, FB_WORD2_END_DOLLAR=3.294, HTML_MESSAGE=0.001, RCVD_IN_DNSWL_HI=-8]
Received: from mail.ietf.org ([64.170.98.30]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id rMtysuJeFzYG for <ietfarch-httpbisa-archive-bis2Juki@ietfa.amsl.com>; Wed, 16 Jan 2013 17:10:06 -0800 (PST)
Received: from frink.w3.org (frink.w3.org [128.30.52.56]) by ietfa.amsl.com (Postfix) with ESMTP id C65AB11E809A for <httpbisa-archive-bis2Juki@lists.ietf.org>; Wed, 16 Jan 2013 17:10:05 -0800 (PST)
Received: from lists by frink.w3.org with local (Exim 4.72) (envelope-from <ietf-http-wg-request@listhub.w3.org>) id 1Tvdz4-0005X8-1g for ietf-http-wg-dist@listhub.w3.org; Thu, 17 Jan 2013 01:09:34 +0000
Resent-Date: Thu, 17 Jan 2013 01:09:34 +0000
Resent-Message-Id: <E1Tvdz4-0005X8-1g@frink.w3.org>
Received: from lisa.w3.org ([128.30.52.41]) by frink.w3.org with esmtp (Exim 4.72) (envelope-from <grmocg@gmail.com>) id 1Tvdyz-0005Vw-6L for ietf-http-wg@listhub.w3.org; Thu, 17 Jan 2013 01:09:29 +0000
Received: from mail-ie0-f181.google.com ([209.85.223.181]) by lisa.w3.org with esmtps (TLS1.0:RSA_ARCFOUR_SHA1:16) (Exim 4.72) (envelope-from <grmocg@gmail.com>) id 1Tvdyx-0006Iw-Rp for ietf-http-wg@w3.org; Thu, 17 Jan 2013 01:09:29 +0000
Received: by mail-ie0-f181.google.com with SMTP id 16so3775291iea.40 for <ietf-http-wg@w3.org>; Wed, 16 Jan 2013 17:09:02 -0800 (PST)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:x-received:in-reply-to:references:date:message-id :subject:from:to:cc:content-type; bh=KBUZyGwjgDARjj4Lk1hqdiuuRXEFM8KWRIblcscMrgo=; b=PP0VuBxtxtHp+w3DOccpBpXqHDo9wK3dxHFpGNgDgDhhw5WKjjNrLmWXMCnD4U4rmY RLznExwwG7kcMcssEVlpk9PmG/pT8QxuONsQQ/N0PDV71tJgyZbvyFxmeOqn312gppU7 xwXiFkbwZH+Kk1EW07MKX/+lpGiiAE79L517JbgAKvvO9VBEp/DrStdTyoSq+pVeGPZL UWAuTFnou8fWHfw0Vg1AhUMFAMDSvYZKVk0aJejJnRGaGO8m5vg/1b+oJBNRzXR7FLhK znc1J4/jfF2aPHWuS+0JteDjyknyHQEtHYi4Et7BHQfSrqSQjBE15FbqU0RaClcfXjU2 scrA==
MIME-Version: 1.0
X-Received: by 10.50.10.130 with SMTP id i2mr6319323igb.63.1358384942029; Wed, 16 Jan 2013 17:09:02 -0800 (PST)
Received: by 10.42.73.8 with HTTP; Wed, 16 Jan 2013 17:09:01 -0800 (PST)
In-Reply-To: <CABkgnnVdXwo5eiBMNMjONG88r8EvErxQ9_yrgdh5NFH=KQhfZQ@mail.gmail.com>
References: <CABP7Rbe-B89vVm8=OnHtAG0Y3G2UOysX+DKaTQ3+rAKBJBJyKA@mail.gmail.com> <CABP7RbdSurNdLxvVV1F-9Bx4LwdhrX2btEvfA+RV=rVZ0moXNA@mail.gmail.com> <CABkgnnVdXwo5eiBMNMjONG88r8EvErxQ9_yrgdh5NFH=KQhfZQ@mail.gmail.com>
Date: Wed, 16 Jan 2013 17:09:01 -0800
Message-ID: <CAP+FsNeq4dFYErW-uv3LQ4wESYN90T4pD6mVM8G+_H76LcBcJw@mail.gmail.com>
From: Roberto Peon <grmocg@gmail.com>
To: Martin Thomson <martin.thomson@gmail.com>
Cc: James M Snell <jasnell@gmail.com>, "ietf-http-wg@w3.org" <ietf-http-wg@w3.org>
Content-Type: multipart/alternative; boundary="14dae934058bf01ac204d371a39a"
Received-SPF: pass client-ip=209.85.223.181; envelope-from=grmocg@gmail.com; helo=mail-ie0-f181.google.com
X-W3C-Hub-Spam-Status: No, score=-4.4
X-W3C-Hub-Spam-Report: AWL=-1.654, BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, FREEMAIL_FROM=0.001, HTML_MESSAGE=0.001, RCVD_IN_DNSWL_LOW=-0.7, SPF_PASS=-0.001
X-W3C-Scan-Sig: lisa.w3.org 1Tvdyx-0006Iw-Rp d5f33329a5a474bdba074717a7239b7e
X-Original-To: ietf-http-wg@w3.org
Subject: Re: bohe implementation for compression tests
Archived-At: <http://www.w3.org/mid/CAP+FsNeq4dFYErW-uv3LQ4wESYN90T4pD6mVM8G+_H76LcBcJw@mail.gmail.com>
Resent-From: ietf-http-wg@w3.org
X-Mailing-List: <ietf-http-wg@w3.org> archive/latest/15938
X-Loop: ietf-http-wg@w3.org
Resent-Sender: ietf-http-wg-request@w3.org
Precedence: list
List-Id: <ietf-http-wg.w3.org>
List-Help: <http://www.w3.org/Mail/>
List-Post: <mailto:ietf-http-wg@w3.org>
List-Unsubscribe: <mailto:ietf-http-wg-request@w3.org?subject=unsubscribe>
Unfortunately python does not lend itself to CPU consumption comparisons (thus my writing the compressor in C++ the other day) -=R On Wed, Jan 16, 2013 at 5:06 PM, Martin Thomson <martin.thomson@gmail.com>wrote: > James, > > Any feel for how bohe compares on CPU cost? Ideally, the test harness > would provide that information alongside the rest of the data. > > On 11 January 2013 10:43, James M Snell <jasnell@gmail.com> wrote: > > Just continuing my investigation of various header compression strategies > > around the BOHE mechanism. In my personal github fork, I have just > checked > > in two bohe variations, one that implements selective-compression, the > other > > implements isolated-compression.. > > > > https://github.com/jasnell/compression-test/tree/master/compressor/bohe2 > > https://github.com/jasnell/compression-test/tree/master/compressor/bohe3 > > > > With bohe2 (selective-compression), a header block can consist of a > > compressed set of headers and an uncompressed set of headers. Specific > > headers such as Cookie, Set-Cookie, etc can be marked as "Do Not > Compress". > > These are dropped into the frame as-is and thus avoid the CRIME issue > > completely. The rest of the headers are compressed using gzip using the > > existing spdy3 dictionary. Obviously this is not ideal because Cookie > data > > then is passed around without any compression at all, making it far less > > efficient than any of the other options on the table. > > > > In bohe3 (isolated-compression), a header block can consist of two > separate > > compressed blocks generated using two separate stream compressor > instances. > > Selected headers (like Cookie) can be included in the secondary isolated > > block which would never contain general user-provided header data. > > > > **These are only experiments right now and are not intended as serious > > proposals for the spec**. Isolated-compression (bohe3) does show promise > > however. If we can successfully isolate potentially sensitive headers > into > > their own compression context generated independently of any general > > user-supplied data, we can effectively short-circuit the CRIME attack by > > making it impossible for an attacker to compare values based on the > > compression ratio... and since it still uses gzip compression, we > achieve a > > generally better compression ratio overall than we get with the proposed > > delta encoding. For now, tho, just consider this all just to be fodder > for > > discussion. There are still MANY issues with these experimental > approaches > > and I still need to go through delta in more detail to see if there is a > way > > bohe and delta can be used effectively together. > > > > Just for example... > > > > james-snells-macbook-pro:compression-test james$ > ./compare_compressors.py -c > > bohe3 -c bohe2 -c bohe -c delta -t > > /Users/james/git/http_samples/mnot/amazon.com.har > > 732 req messages processed > > compressed | ratio min max std > > req bohe 26,035 | 0.13 0.03 0.68 0.08 > > req bohe2 44,195 | 0.23 0.07 0.71 0.13 > > req bohe3 30,944 | 0.16 0.05 0.74 0.08 > > req delta 33,955 | 0.17 0.02 0.71 0.09 > > req http1 195,386 | 1.00 1.00 1.00 0.00 > > > > 732 res messages processed > > compressed | ratio min max std > > res bohe 39,525 | 0.25 0.04 0.67 0.07 > > res bohe2 47,157 | 0.29 0.12 0.71 0.08 > > res bohe3 44,843 | 0.28 0.06 0.70 0.07 > > res delta 44,499 | 0.28 0.02 0.65 0.09 > > res http1 159,968 | 1.00 1.00 1.00 0.00 > > > > - James > > > > > > > > On Thu, Jan 10, 2013 at 11:08 AM, James M Snell <jasnell@gmail.com> > wrote: > >> > >> I have an initial bohe implementation for the compression tests... it's > >> very preliminary and uses the same gzip compression as the current > spdy3. > >> I'm going to be playing around with the delta compression mechanism as > well > >> and see how much of an impact that has. Initial results are very > promising > >> but I haven't done much debugging yet. Just wanted folks to know that > this > >> work was underway... > >> > >> https://github.com/jasnell/compression-test/tree/master/compressor/bohe > >> > >> Some test runs.... > >> > >> ./compare_compressors.py -c bohe -c spdy3 -c delta > >> ../http_samples/mnot/amazon.com.har > >> 732 req messages processed > >> compressed | ratio min max std > >> req bohe 26,122 | 0.13 0.04 0.70 0.08 > >> req delta 33,955 | 0.17 0.02 0.71 0.09 > >> req http1 195,386 | 1.00 1.00 1.00 0.00 > >> req spdy3 27,238 | 0.14 0.04 0.71 0.08 > >> > >> 732 res messages processed > >> compressed | ratio min max std > >> res bohe 39,628 | 0.25 0.04 0.66 0.07 > >> res delta 44,499 | 0.28 0.02 0.65 0.09 > >> res http1 159,968 | 1.00 1.00 1.00 0.00 > >> res spdy3 41,325 | 0.26 0.04 0.67 0.08 > >> > >> > >> ./compare_compressors.py -c bohe -c spdy3 -c delta > >> ../http_samples/mnot/craigslist.org.har > >> 66 req messages processed > >> compressed | ratio min max std > >> req bohe 1,948 | 0.15 0.06 0.73 0.11 > >> req delta 2,036 | 0.16 0.07 0.71 0.11 > >> req http1 12,894 | 1.00 1.00 1.00 0.00 > >> req spdy3 2,016 | 0.16 0.07 0.75 0.11 > >> > >> 66 res messages processed > >> compressed | ratio min max std > >> res bohe 1,786 | 0.18 0.07 0.77 0.13 > >> res delta 2,858 | 0.28 0.08 0.69 0.12 > >> res http1 10,147 | 1.00 1.00 1.00 0.00 > >> res spdy3 1,869 | 0.18 0.09 0.78 0.13 > >> > >> > >> ./compare_compressors.py -c bohe -c spdy3 -c delta > >> ../http_samples/mnot/flickr.com.har > >> 438 req messages processed > >> compressed | ratio min max std > >> req bohe 11,988 | 0.10 0.02 0.69 0.07 > >> req delta 26,372 | 0.22 0.01 0.71 0.14 > >> req http1 121,854 | 1.00 1.00 1.00 0.00 > >> req spdy3 12,550 | 0.10 0.02 0.71 0.07 > >> > >> 438 res messages processed > >> compressed | ratio min max std > >> res bohe 13,073 | 0.09 0.05 0.66 0.06 > >> res delta 25,236 | 0.18 0.02 0.70 0.11 > >> res http1 140,457 | 1.00 1.00 1.00 0.00 > >> res spdy3 14,142 | 0.10 0.05 0.66 0.06 > >> > >> > >> ./compare_compressors.py -c bohe -c spdy3 -c delta > >> ../http_samples/mnot/facebook.com.har > >> 234 req messages processed > >> compressed | ratio min max std > >> req bohe 6,091 | 0.15 0.06 0.78 0.07 > >> req delta 7,800 | 0.19 0.02 0.70 0.07 > >> req http1 41,980 | 1.00 1.00 1.00 0.00 > >> req spdy3 6,301 | 0.15 0.06 0.77 0.07 > >> > >> 234 res messages processed > >> compressed | ratio min max std > >> res bohe 9,458 | 0.23 0.07 0.68 0.07 > >> res delta 12,045 | 0.30 0.13 0.60 0.08 > >> res http1 40,252 | 1.00 1.00 1.00 0.00 > >> res spdy3 9,788 | 0.24 0.07 0.69 0.07 > >> > >> > >> > >> > > > >
- bohe implementation for compression tests James M Snell
- Re: bohe implementation for compression tests Roberto Peon
- Re: bohe implementation for compression tests James M Snell
- Re: bohe implementation for compression tests James M Snell
- Re: bohe implementation for compression tests James M Snell
- Re: bohe implementation for compression tests Roberto Peon
- Re: bohe implementation for compression tests James M Snell
- Re: bohe implementation for compression tests Roberto Peon
- Re: bohe implementation for compression tests James M Snell
- Re: bohe implementation for compression tests Martin Thomson
- Re: bohe implementation for compression tests Roberto Peon
- Re: bohe implementation for compression tests Mark Nottingham
- Re: bohe implementation for compression tests Martin Thomson
- Re: bohe implementation for compression tests Roberto Peon
- Re: bohe implementation for compression tests Mark Nottingham
- Re: bohe implementation for compression tests Mark Nottingham