Re: bohe and delta experimentation...

Willy Tarreau <w@1wt.eu> Fri, 18 January 2013 14:13 UTC

Date: Fri, 18 Jan 2013 15:11:30 +0100
From: Willy Tarreau <w@1wt.eu>
To: RUELLAN Herve <Herve.Ruellan@crf.canon.fr>
Cc: Mark Nottingham <mnot@mnot.net>, "Martin J. Dürst" <duerst@it.aoyama.ac.jp>, Roberto Peon <grmocg@gmail.com>, Nico Williams <nico@cryptonector.com>, James M Snell <jasnell@gmail.com>, "ietf-http-wg@w3.org" <ietf-http-wg@w3.org>
Message-ID: <20130118141130.GF3163@1wt.eu>
References: <DD2EFC9F-5201-4829-9E6F-BD9CF0307BB0@mnot.net> <CAK3OfOj1O82WqO0L0rNpq2qeKJoT9E0ZQrV6Y=ULETtACpYMag@mail.gmail.com> <CAK3OfOgOGFNbve_QrTrCesqrrAQRH5qWgvebBxAhoMD7_MjhjQ@mail.gmail.com> <0A36AEB6-09B9-462F-B2E8-90B67FE69980@mnot.net> <CAK3OfOhewuVdjxu7UUp49g8B33YZNJ_N-PkASkHLP213+8gquA@mail.gmail.com> <CAP+FsNdi4=Am7pZdKySHZESp79BzRzPaR3UGQM2dsOM-yAxBOA@mail.gmail.com> <CAP+FsNf++RVVAyqweCsGG45wWQyjRrT7LEyWbv+QOd7Z2XdXwg@mail.gmail.com> <50F8F44E.9040401@it.aoyama.ac.jp> <0D1ABADB-E17F-46D3-9B6F-5CDC99FC06B9@mnot.net> <6C71876BDCCD01488E70A2399529D5E52E13E4@ADELE.crf.canon.fr>
In-Reply-To: <6C71876BDCCD01488E70A2399529D5E52E13E4@ADELE.crf.canon.fr>
Subject: Re: bohe and delta experimentation...
Archived-At: <http://www.w3.org/mid/20130118141130.GF3163@1wt.eu>

Hi Hervé,

On Fri, Jan 18, 2013 at 01:58:12PM +0000, RUELLAN Herve wrote:
> I'll take a shot at the URLs. Experimental data show that URLs often share the same beginning: for requests targeting a web site, the URLs will usually start with the same scheme, host, and possibly port. The beginning of the path is also usually shared by several URLs.
> 
> Therefore an efficient encoding for a URL is a delta from a previous URL: the number of characters shared at the beginning, followed by the new characters. To reduce the state that needs to be stored, it is possible to use only the previous URL as a reference.

Some testing was already done with this several months ago and showed
promising results. However, I didn't have enough captures to play with,
and I recall that Roberto was more reserved about whether this would
work all the time or not.

I must admit that another issue is the amount of state that
intermediaries have to keep between two requests. Some unusually long
URIs can be several kB long, which would be a real pain if they have to
persist during idle time.
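
To make the state question concrete, here is a rough sketch in Python of
the prefix-delta scheme described in the quoted text, using only the
previous URL as the reference. The function names and the example.com
URLs are made up for illustration and do not come from any draft or
implementation. Note that both ends have to keep the whole previous URL
around between requests, which is exactly the per-connection state that
worries me.

```python
from typing import Tuple


def encode_delta(prev_url: str, url: str) -> Tuple[int, str]:
    """Encode url as (shared prefix length, remaining characters)."""
    shared = 0
    limit = min(len(prev_url), len(url))
    # Count the leading characters that both URLs have in common.
    while shared < limit and prev_url[shared] == url[shared]:
        shared += 1
    return shared, url[shared:]


def decode_delta(prev_url: str, delta: Tuple[int, str]) -> str:
    """Rebuild the URL from the previous URL and the delta."""
    shared, suffix = delta
    return prev_url[:shared] + suffix


# Hypothetical URLs, for illustration only.
prev = "http://example.com/img/a.png"
cur = "http://example.com/img/b.png"

delta = encode_delta(prev, cur)
print(delta)                      # (23, 'b.png')
print(decode_delta(prev, delta))  # http://example.com/img/b.png
```

With an empty previous URL the delta degenerates to (0, full URL), so
the first request on a connection costs the same as sending it in full.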

Regards,
Willy