Re: [tcpm] TCP Tuning for HTTP - update

Matthew Kerwin <matthew@kerwin.net.au> Wed, 17 August 2016 23:39 UTC

Return-Path: <ietf-http-wg-request+bounce-httpbisa-archive-bis2juki=lists.ie@listhub.w3.org>
X-Original-To: ietfarch-httpbisa-archive-bis2Juki@ietfa.amsl.com
Delivered-To: ietfarch-httpbisa-archive-bis2Juki@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 951BC12D669 for <ietfarch-httpbisa-archive-bis2Juki@ietfa.amsl.com>; Wed, 17 Aug 2016 16:39:53 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -8.167
X-Spam-Level:
X-Spam-Status: No, score=-8.167 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, HEADER_FROM_DIFFERENT_DOMAINS=0.001, HTML_MESSAGE=0.001, RCVD_IN_DNSWL_HI=-5, RCVD_IN_MSPIKE_H3=-0.01, RCVD_IN_MSPIKE_WL=-0.01, RP_MATCHES_RCVD=-1.247, SPF_HELO_PASS=-0.001, SPF_PASS=-0.001] autolearn=unavailable autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id HFhlUsG9XFmL for <ietfarch-httpbisa-archive-bis2Juki@ietfa.amsl.com>; Wed, 17 Aug 2016 16:39:51 -0700 (PDT)
Received: from frink.w3.org (frink.w3.org [128.30.52.56]) (using TLSv1.2 with cipher DHE-RSA-AES128-SHA (128/128 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 1AC1712D159 for <httpbisa-archive-bis2Juki@lists.ietf.org>; Wed, 17 Aug 2016 16:39:50 -0700 (PDT)
Received: from lists by frink.w3.org with local (Exim 4.80) (envelope-from <ietf-http-wg-request@listhub.w3.org>) id 1baANE-0000qU-M7 for ietf-http-wg-dist@listhub.w3.org; Wed, 17 Aug 2016 23:35:52 +0000
Resent-Date: Wed, 17 Aug 2016 23:35:52 +0000
Resent-Message-Id: <E1baANE-0000qU-M7@frink.w3.org>
Received: from maggie.w3.org ([128.30.52.39]) by frink.w3.org with esmtps (TLS1.2:DHE_RSA_AES_128_CBC_SHA1:128) (Exim 4.80) (envelope-from <phluid61@gmail.com>) id 1baAN7-0000p5-0d for ietf-http-wg@listhub.w3.org; Wed, 17 Aug 2016 23:35:45 +0000
Received: from mail-io0-f194.google.com ([209.85.223.194]) by maggie.w3.org with esmtps (TLS1.2:RSA_AES_128_CBC_SHA1:128) (Exim 4.80) (envelope-from <phluid61@gmail.com>) id 1baAN2-0000vx-Ry for ietf-http-wg@w3.org; Wed, 17 Aug 2016 23:35:44 +0000
Received: by mail-io0-f194.google.com with SMTP id y195so989786iod.0 for <ietf-http-wg@w3.org>; Wed, 17 Aug 2016 16:35:20 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:sender:in-reply-to:references:from:date:message-id :subject:to:cc; bh=IquHlWw+16SLhck91t990eDgI3BmqBiOvybP6ROKhfA=; b=ERAiDezbgTFjO9lg8IxxPbPCMuhuPdn2qgPok8W6i57Eh1yt4lfaUdsKr3cP82Yz4e L00FSxAqwyrsDWIpz30Aa8RydGhCXSRrVE0iCnWrHrT/0pZvdIbJIljwym33S5RXmhex MFQ3r5fOkVTYifhJddafnOvNCl/mTp8KiJOQCZ872/DOlShKif8O4/C+7NhG/Pl2yKDq 6S7KuVMOSeFxMDBc4eE329/jC+m5L9Jk+AFI86rX9mNGdAs9NY4+5T3c30aF90jObtfZ LeAVk0fKoq9TFu+FeJzq/6JWusOevo4Bea9vK5dQ2zGmp4a9i6ujhkTVZ19Vm6v+dVZj FbcQ==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:mime-version:sender:in-reply-to:references:from :date:message-id:subject:to:cc; bh=IquHlWw+16SLhck91t990eDgI3BmqBiOvybP6ROKhfA=; b=eTMoC24U5jiy961YHzBld7VW1scSFLmIn3Z9EtRP/LODcwrAFSBGIQi+YnTvqYVIzM K4xsIoJj7dQ3TORF6969i+4xOte1SO8KVEPfnqMRsgtaJTN82cnyvfTus+E/u1lBkQXa zDo+IoAiIiuRsyAUbe8w+gF67iFU0E5OsQQBzTHC3nm+K0Hehf0E+kxtknX41nhqN+GY R71dovwXhQpG4Esyxnw/lE7bm+Ko6h5er29zuBBc3D9XEc+Xn6pQNkiGDiFyPrd002H8 ISg26nMtcyTjlMAGlcxUWuAihPE9a/4rP6WrsdNsCZfUKFE5IRy8dpkNUCZknTk7/1zd ljxg==
X-Gm-Message-State: AEkooutWjuuN9Qd8pm2So0miNsbyz4okN/Ek58ghlr58WOLASZqsdr6FoOU86GQio/pIiaq1cMZktKXlbkkPoA==
X-Received: by 10.107.128.210 with SMTP id k79mr50243507ioi.3.1471476914297; Wed, 17 Aug 2016 16:35:14 -0700 (PDT)
MIME-Version: 1.0
Sender: phluid61@gmail.com
Received: by 10.107.158.207 with HTTP; Wed, 17 Aug 2016 16:35:13 -0700 (PDT)
In-Reply-To: <c928d1ca-fc89-d0b0-4e1a-8a0bd960d2bb@isi.edu>
References: <0CC24FC1-37E1-4125-9627-05726A9D9406@mnot.net> <7fa95741-ac58-3183-1b92-238bd4b4dae6@isi.edu> <5CD67877-19E3-4E79-BBF2-3E270343A378@mnot.net> <2197232f-10d7-28cb-fcc9-05bd495e3c22@isi.edu> <20160817064545.GD16017@1wt.eu> <7f7b129c-f156-d067-bef8-4a2213f461ac@isi.edu> <20160817180802.GA16773@1wt.eu> <4ab7c5b0-3722-1346-f481-a8d76de70034@isi.edu> <20160817211317.GA16929@1wt.eu> <c928d1ca-fc89-d0b0-4e1a-8a0bd960d2bb@isi.edu>
From: Matthew Kerwin <matthew@kerwin.net.au>
Date: Thu, 18 Aug 2016 09:35:13 +1000
X-Google-Sender-Auth: aCRU2QWp0nVaOguTPQVQW63KmpE
Message-ID: <CACweHNC1qFH5DMnZRE87bAE5sk_P+1z1Fzm-9YEu=E2DULkaYQ@mail.gmail.com>
To: Joe Touch <touch@isi.edu>
Cc: Willy Tarreau <w@1wt.eu>, Mark Nottingham <mnot@mnot.net>, tcpm@ietf.org, HTTP Working Group <ietf-http-wg@w3.org>, Patrick McManus <pmcmanus@mozilla.com>, Daniel Stenberg <daniel@haxx.se>
Content-Type: multipart/alternative; boundary="001a113f8aaec5a037053a4cea10"
Received-SPF: pass client-ip=209.85.223.194; envelope-from=phluid61@gmail.com; helo=mail-io0-f194.google.com
X-W3C-Hub-Spam-Status: No, score=-5.1
X-W3C-Hub-Spam-Report: AWL=-0.786, BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, FREEMAIL_ENVFROM_END_DIGIT=0.25, FREEMAIL_FROM=0.001, HTML_MESSAGE=0.001, RCVD_IN_DNSWL_LOW=-0.7, SPF_PASS=-0.001, W3C_AA=-1, W3C_WL=-1
X-W3C-Scan-Sig: maggie.w3.org 1baAN2-0000vx-Ry 4d6c19f39aa7a77359eb288a14deb4a8
X-Original-To: ietf-http-wg@w3.org
Subject: Re: [tcpm] TCP Tuning for HTTP - update
Archived-At: <http://www.w3.org/mid/CACweHNC1qFH5DMnZRE87bAE5sk_P+1z1Fzm-9YEu=E2DULkaYQ@mail.gmail.com>
Resent-From: ietf-http-wg@w3.org
X-Mailing-List: <ietf-http-wg@w3.org> archive/latest/32296
X-Loop: ietf-http-wg@w3.org
Resent-Sender: ietf-http-wg-request@w3.org
Precedence: list
List-Id: <ietf-http-wg.w3.org>
List-Help: <http://www.w3.org/Mail/>
List-Post: <mailto:ietf-http-wg@w3.org>
List-Unsubscribe: <mailto:ietf-http-wg-request@w3.org?subject=unsubscribe>

Hi folks, I'm stepping in here on just a couple of points. I'll snip the
bits I can't or won't talk to.

On 18 August 2016 at 08:23, Joe Touch <touch@isi.edu> wrote:

>
>
> On 8/17/2016 2:13 PM, Willy Tarreau wrote:
> > On Wed, Aug 17, 2016 at 11:31:33AM -0700, Joe Touch wrote:
> >>> It can be cited in new RFCs
> >>> to justify certain choices.
> >> Hmm. Like the refs I gave could be cited in this doc to justify *its*
> >> choices? :-)
> > I think it would be nice that this is cited, but to be clear on one
> > point, I've never heard about your papers before you advertised them
> > here in this thread,
>
> A search engine on the terms "TCP HTTP interaction" would have popped
> them up rather quickly.
>
> > and yet I've been dealing with timewait issues
> > for 15 years like many people facing moderate to large web sites
> > nowadays.
>
> "
> ​​
> timewait issues" and we're the 5th hit in Google.
>
> ​
Google, unless it's changed again recently, tailors search results for the
user. My first page of hits for that query are all serverfault.com,
superuser.com, serverframework.com, stackoverflow.com, etc.

Guess where I get most of my advice.
​


>
> >>>> Yes, and discussing those issues would be useful - but not in this
> >>>> document either.
> >>> Why ? Lots of admins don't understand why the time_wait timeout remains
> >>> at 240 seconds on Solaris with people saying "if you want to be
> conservative
> >>> don't touch it but if you want to be modern simply shrink it to 30
> seconds
> >>> or so". People need to understand why advices have changed over 3
> decades.
> >> The advice hasn't really changed - the advice was given in the 99 ref,
> >> which includes some cases where it can still be appropriate to decrease
> >> that timer.
> > Most people see it the other way around : they see no valid case to
> *increase*
> > it beyond a few seconds, because for them the default value should be
> extremely
> > low (ie this firewall vendor several years ago trying to insist on one
> second).
> > Yes that's really sad but that's reality. And you can tell them to read
> 6191
> > they won't care.
>
> Most people's servers don't need to run fast enough to care (note that
> nearly everyone runs some sort of web server on nearly every device,
> whether for control or configuration). The only issue are high-volume
> servers (the kind sysadmins deal with), and those people tend to already
> know what the tradeoffs are and accept the risks.
>
>
​Your sysadmins are not like my sysadmins. But these are generalisations
and anecdotes.​



> >
> >>>   - TCP timestamps: what they provide, what are the risks (some people
> in
> >>>     banking environments refuse to enable them so that they cannot be
> used
> >>>     as an oracle to help in timing attacks).
> >> That's already covered in the security considerations of RFC 7323. How
> >> is HTTP different, if at all, from any other app?
> > HTTP is special in that it is fairly common to have to deal with tens of
> > thousands of connections per second between one client and one server
> when
> > you are on the server side, because you place a number of gateways (also
> > called reverse-proxies) which combine all of the possible issues you can
> > think of at a single place.
>
> There are lots of services that have that many transactions - DNS
> servers (even local ones), remote databases, etc.
>
> The point is that HTTP doesn't make the problem different, so this isn't
> an HTTP issue. It's a high rate server issue.
>
>
​What makes HTTP different is that I expect most high-rate applications
would exist in a context where the people running the servers and
applications have some amount of specialist experience and knowledge, or at
least an expectation that they're doing something that requires such
knowledge, in high-rate throughput.

HTTP is ubiquitous, and resides all along the traffic scale, from my
website (~no bits per second) to google.com (all the bits); and the slide
-- or sudden jump -- up that scale doesn't always correspond with acquiring
expertise in TCP stack tuning.​



>
> > So probably you're starting to see the benefit of having a single doc
> > to concentrate all this.
> The same reason it's useful to have this all in one place is the reason
> we already do - there are books and courses on this.
>
>
​My users aren't getting my content right at the time my site is booming.
I've just been slashdotted/hackernewsed/whatever. Do I enroll in a course?
Buy a textbook and swot up (which I haven't done since I finished my IT
degree 15+ years ago)? Or do I hit up serverfault.com and bash the keyboard
until the fires go out?

Summary information is really important. Having it published by the same
people who published the protocol spec adds some serious cred, at least in
my eyes.​



> >  You provided at least 3 different articles
> > to read and 2 or 3 different RFCs in addition to the original ones,
> > of course. A hosting provider whose web sites are down due to a lack
> > of tuning doesn't start to read many very long articles and even less
> > the most scientific ones, they need to find quick responses that they
> > can apply immediately (matter of minutes). So they launch google, they
> > type "web site dead, time-wait overflow" and they get plenty of
> > responses on stackoverflow and serverfault, many from people having
> > done the same in the past and repeating the same mistakes over and over.
>
> These people don't read RFCs to fix problems. They take online courses
> or read "how to" books - which do already exist in this space.
>
>
​Which people? I tend to google the error and see if there's some sort of
consensus on stackoverflow. And then, as often as not, I have to advise my
sysadmins of a course of action because they know as much as me, or they
don't want to deal with my situation, or some other reason.​



> > A document validated by several people and giving links for further
> > reading can help improve this situation.
> Those are the books and courses I'm talking about already.
> ​​
>
> > People rediscover wheels because it's hard to find simple and accurate
> > information on the net.
> Nobody looks to RFCs to solve that problem...
>
> >  Basically you have the choice :
> >   - either uneducated blog posts saying "how I saved my web site using 2
> >     sysctls"
> >   - or academic papers which are only understandable by scientific people
> >     having enough time
>
> ... that's what net FAQs are for, as well as courses and books.
>
> > At least the first ones have the merit of being easy to test, and since
> > they appear to work they are viral.
> >
> >>>  All of them became issues for
> >>> many web server admins who just copy-paste random settings from various
> >>> blogs found on the net who just copy the same stupidities over and over
> >>> resulting in the same trouble being caused to each of their reader.
> >> This doc is all over the place.
> >>
> >> If you want a doc to advise web admins, do so.
> > That's *exactly* what Daniel started to do when you told him he shouldn't
> > do it.
>
> I didn't say a doc to advise web admins wasn't useful. I said it wasn't
> an RFC.
>
> It's a web FAQ, a book, etc.
>
>
​Here's the crux of the issue. What do you think an RFC is, that we
(apparently) don't? Why is an informational RFC not allowed to present the
same sort of information as an FAQ? (Isn't that what a BCP is?)

Personally I'd be happy if it was written up exactly like an FAQ, and
published as an informational RFC; because that tells me that this FAQ was
published by the IETF. It's not some random dude's unreliable blog full of
cargo-cult advice and anecdotes; it met the consensus of the organisation
that published the HTTP protocol spec. It's legitimate and reliable. And
one day, when it's out of date, it'll be updated or obsoleted by the new
consensus wisdom of the IETF.

That, as far as I know, doesn't defy the official definition of what an RFC
is*, nor does it devalue any existing (or future) RFCs.

And Google indexes RFCs. If this ends up being a really useful document
(which I imagine it would), with lots of inbound links and references,
people won't need to "look to RFCs to solve [their] problem", they'll do
what they already do -- look to Google, and Google will point them to this
RFC.


* heh​



>
> > and I find it fantastic to see that this protocol
> > still scales so well.
> > But we need to consider modern usages of this protocol
> > for the web, and not just academic research and e-mail.
> You might consider that TCPM and TSVWG don't exist for just "academic
> research and e-mail". What do you think we've been doing for the past 40
> years?
>
>
​Dunno, I was only born 35-odd years ago. ;) Shall we talk about generation
gaps? Greybeards vs millennials? (Of which I'm neither, BTW. Not yet, at
least.)

Where we come from, how we find information to solve our problems, the way
we view the IETF and its RFCs are all different. If this argument is just
that this stuff doesn't belong in an RFC, that's a cultural issue and not
one I think we can resolve in this one technical working group.

Cheers
-- 
  Matthew Kerwin
  http://matthew.kerwin.net.au/