Re: HTTP router point-of-view concerns

Roberto Peon <grmocg@gmail.com> Thu, 11 July 2013 19:58 UTC

In-Reply-To: <9AF548E8-D4CD-426B-9F6F-F390476821AA@gmail.com>
References: <CA+qvzFPUpcm6kUtJx+rTw8Dpp4Gtx4Bmr3XPDhjNsjchUfN9_w@mail.gmail.com> <51DE1E32.9010801@treenet.co.nz> <CAP+FsNdcYhA=V5Z+zbt70b5e7WmcmXgjG5M9L3vfXeXfTwmRnw@mail.gmail.com> <51DE327C.7010901@treenet.co.nz> <CABkgnnXeqD6wh0dcJ1Dz=4PLAJNkDeGcCuzMr9ATd_7xS7nbGQ@mail.gmail.com> <CABP7RbcUkLf3CTAB4jwicnsiKWLGVY6=hX0k=0256SR_gcVt9A@mail.gmail.com> <092D65A8-8CB7-419D-B6A4-77CAE40A0026@gmail.com> <CAP+FsNfpHY-Eai7T+vW01LRPweKmSfVhWO-Tj0ii4wWzX6fwUg@mail.gmail.com> <9AF548E8-D4CD-426B-9F6F-F390476821AA@gmail.com>
Date: Thu, 11 Jul 2013 12:56:55 -0700
Message-ID: <CAP+FsNev6zz2VHyj7KTBwHLMagP=n6EOiM_5UFvm13y25Bmx_Q@mail.gmail.com>
From: Roberto Peon <grmocg@gmail.com>
To: Sam Pullara <spullara@gmail.com>
Cc: James M Snell <jasnell@gmail.com>, Martin Thomson <martin.thomson@gmail.com>, Amos Jeffries <squid3@treenet.co.nz>, HTTP Working Group <ietf-http-wg@w3.org>
Archived-At: <http://www.w3.org/mid/CAP+FsNev6zz2VHyj7KTBwHLMagP=n6EOiM_5UFvm13y25Bmx_Q@mail.gmail.com>

A fair bit of this quantitative analysis was published with the SPDY
whitepapers.

Yes, packets matter.
Yes, RTT matters most.
The number of packets is highly correlated with the number of bytes on the wire.

The encoders/compressors here were developed because:
1) any stream compressor is subject to the CRIME attack;
2) gzip uses more memory/CPU than these more specialized schemes;
3) these schemes were designed to give intermediaries much more control over
the size and cost of the compression state.
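
To make point 3 concrete, here is a minimal sketch (Python; the names, the
32-byte per-entry overhead, and the FIFO eviction are invented for
illustration and are not the draft's actual data structure or wire format)
of a header table whose memory ceiling the receiving side picks, including a
ceiling of zero:

    from collections import deque

    class BoundedHeaderTable:
        """Illustrative header table whose stored bytes never exceed max_size."""

        def __init__(self, max_size):
            self.max_size = max_size   # receiver/intermediary-chosen ceiling, may be 0
            self.entries = deque()     # oldest entries first
            self.size = 0

        @staticmethod
        def entry_size(name, value):
            # Hypothetical accounting: name + value length plus a fixed overhead.
            return len(name) + len(value) + 32

        def add(self, name, value):
            needed = self.entry_size(name, value)
            if needed > self.max_size:
                return False           # too big to index at this ceiling
            while self.size + needed > self.max_size:
                old_name, old_value = self.entries.popleft()   # evict oldest
                self.size -= self.entry_size(old_name, old_value)
            self.entries.append((name, value))
            self.size += needed
            return True

    # An intermediary that wants zero compression state per connection:
    table = BoundedHeaderTable(max_size=0)
    assert table.add("cookie", "x" * 4096) is False   # nothing is ever stored

The point is only that the worst-case memory is an explicit number the
receiving side chooses up front, rather than a property of a general-purpose
stream compressor.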

The upstream path is often very limited.
If we want server push (or similar) to be competitive with inlining, we need
the cost of that metadata to be low.
-=R


On Thu, Jul 11, 2013 at 12:51 PM, Sam Pullara <spullara@gmail.com> wrote:

> It would be great to have a quantitative analysis of the benefit we can
> expect to get on various types of links and header sets so we could compare
> various implementations. I'm unconvinced these solutions are much better
> for real requests than gzip with an initial dictionary. Also, isn't bytes
> on the wire the wrong metric? Aren't these slow links much more sensitive
> to the number of packets / round trips?
>
> Sam
>
> On Jul 11, 2013, at 12:37 PM, Roberto Peon <grmocg@gmail.com> wrote:
>
> If one doesn't care about the number of bytes on the wire, or about
> user-perceived latency, then obviously compression is a waste.
> If one does care, then, especially on slower links, header compression
> does a great deal to reduce latency as the HTTP metadata eats up a
> significant fraction of available bandwidth on those links.
>
> -=R
>
>
> On Thu, Jul 11, 2013 at 10:21 AM, Sam Pullara <spullara@gmail.com> wrote:
>
>> How sure are we that the entire idea of header compression isn't a bad
>> idea? I implemented something similar in the WebLogic T3 protocol
>> (BubblingAbbrevTable, probably still in there) and it was mostly just a
>> pain. If I were to go back I would just use gzip with some agreed-upon seed
>> dictionary. I thought I would bring this up since it seems like it is a very
>> controversial feature to begin with.
>>
>> Sam
>>
>> On Jul 11, 2013, at 10:14 AM, James M Snell <jasnell@gmail.com> wrote:
>>
>> > Yes, the ability to set compression context size to 0 is very useful.
>> > My fears around this area are:
>> >
>> > 1. In order to achieve maximum throughput, intermediaries may opt to
>> > *always* set the compression context to 0, forcing headers to always be
>> > passed as Literals and killing the utility of having the header
>> > compression mechanism there in the first place.
>> >
>> > 2. The assumption of a non-zero default compression context size when
>> > the connection is established opens a race condition that a malicious
>> > sender could exploit in a denial of service attack. Yes, the receiver
>> > could opt to terminate the connection once it detects bad behavior,
>> > but there is still a window of time in which the receiver could be
>> > forced to do significant additional work.
>> >
>> >  (This is particularly bad given that header continuations are
>> > unbounded.)
>> >
>> > 3. Setting the compression context size to 0 does not stop the sender
>> > from sending the Indexed Literal instructions anyway. The receiving
>> > endpoint would still be required to process those instructions even if
>> > the data is not actually being indexed, causing CPU cycles to be
>> > consumed. For any individual block of headers it may not be a
>> > significant load, but it's something that needs to be addressed.
>> >
>> >  (This can be fixed in the spec by stating that any attempt to Index
>> > any individual (name,value) whose size is greater than the available
>> > header table size results in a Compression Error. Making this change
>> > would mean that when Compression Context size is 0, the only operation
>> > that would not result in an error is Literal without Indexing. This
>> > was discussed on the list but as far as I can tell it's not yet
>> > captured in the spec).
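
(To make the proposed rule concrete, a rough decoder-side sketch in Python;
the function name is invented and the table is the BoundedHeaderTable sketch
from earlier in this thread, not anything in the draft:)

    class CompressionError(Exception):
        """Signals that the peer violated the indexing rule described above."""

    def apply_incremental_indexing(table, name, value):
        # 'table' is any object shaped like the BoundedHeaderTable sketch
        # (max_size, entry_size(), add()).
        if table.max_size == 0:
            # With a zero compression context, only "Literal without Indexing"
            # should be legal; an indexing attempt is an error, not a no-op.
            raise CompressionError("indexing attempted with context size 0")
        if table.entry_size(name, value) > table.max_size:
            raise CompressionError("(name, value) larger than the header table")
        table.add(name, value)
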
>> >
>> > 4. The fact that header continuations can be unbounded is deeply
>> > troubling, especially given that the endpoint is required to buffer
>> > and process the complete header block (well... that's only half true:
>> > the encoding does allow incremental processing of the HEADERS frame
>> > payloads, but the spec requires that the complete header block always be
>> > processed). Sure, the recipient is free to terminate the connection as
>> > soon as it detects bad behavior, but the sender could end up forcing the
>> > recipient to do a significant amount of extra processing with a
>> > never-ending sequence of HEADERS frames. Smart
>> > implementations will know how to deal with this, yes, but overall it
>> > adds to the already growing list of "New Complex Things" that an
>> > HTTP/2 implementer needs to know about.
>> >
>> >  (In the implementation I've done, I provide a configuration
>> > parameter that allows a developer to cap the number of continuations
>> > and the total size of the header block.)
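
(A rough sketch of that kind of cap, in Python; the parameter names and the
default limits are invented, not taken from any implementation or from the
draft:)

    class HeaderBlockLimits:
        """Hypothetical receive-side caps on a HEADERS + CONTINUATION sequence."""

        def __init__(self, max_continuations=8, max_block_bytes=16 * 1024):
            self.max_continuations = max_continuations
            self.max_block_bytes = max_block_bytes

    def assemble_header_block(fragments, limits):
        """Buffer header-block fragments (the HEADERS payload followed by any
        CONTINUATION payloads), aborting as soon as either cap is exceeded."""
        block = bytearray()
        for i, fragment in enumerate(fragments):
            if i > limits.max_continuations:
                raise ConnectionError("too many CONTINUATION frames in one block")
            block.extend(fragment)
            if len(block) > limits.max_block_bytes:
                raise ConnectionError("header block exceeds configured size cap")
        return bytes(block)
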
>> >
>> > I know that we're in "implementation" phase right now and that
>> > everyone is busy getting their code ready for testing in August, but
>> > after updating my implementation to the latest version of the draft,
>> > my concerns with regard to stateful header compression definitely
>> > remain.
>> >
>> > On Thu, Jul 11, 2013 at 9:36 AM, Martin Thomson
>> > <martin.thomson@gmail.com> wrote:
>> >> On 10 July 2013 21:20, Amos Jeffries <squid3@treenet.co.nz> wrote:
>> >>> It seems not to be negotiable from the recipient's side.
>> >>
>> >> Compression context size = 0 is entirely negotiable from the recipient
>> >> end, with a small wrinkle that I know some folks are working on:
>> >> a client can start using a default compression context size prior to
>> >> learning that a server has no space (substitute intermediary as
>> >> appropriate there).
>> >>
>> >
>>
>>
>>
>
>