Re: [tcpm] Tuning TCP parameters for the 21st century

Jerry Chu <hkchu@google.com> Fri, 17 July 2009 02:35 UTC

Return-Path: <hkchu@google.com>
X-Original-To: tcpm@core3.amsl.com
Delivered-To: tcpm@core3.amsl.com
Received: from localhost (localhost [127.0.0.1]) by core3.amsl.com (Postfix) with ESMTP id 522613A6BFB for <tcpm@core3.amsl.com>; Thu, 16 Jul 2009 19:35:58 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -101.977
X-Spam-Level:
X-Spam-Status: No, score=-101.977 tagged_above=-999 required=5 tests=[BAYES_00=-2.599, FM_FORGED_GMAIL=0.622, USER_IN_WHITELIST=-100]
Received: from mail.ietf.org ([64.170.98.32]) by localhost (core3.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id QYOgQ8E1zTOh for <tcpm@core3.amsl.com>; Thu, 16 Jul 2009 19:35:57 -0700 (PDT)
Received: from smtp-out.google.com (smtp-out.google.com [216.239.45.13]) by core3.amsl.com (Postfix) with ESMTP id 535363A6E12 for <tcpm@ietf.org>; Thu, 16 Jul 2009 19:35:56 -0700 (PDT)
Received: from spaceape24.eur.corp.google.com (spaceape24.eur.corp.google.com [172.28.16.76]) by smtp-out.google.com with ESMTP id n6H2aQcd022090 for <tcpm@ietf.org>; Thu, 16 Jul 2009 19:36:27 -0700
DKIM-Signature: v=1; a=rsa-sha1; c=relaxed/relaxed; d=google.com; s=beta; t=1247798188; bh=1mTquiY9b5vz66Uq5P4SOVULspg=; h=DomainKey-Signature:MIME-Version:In-Reply-To:References:Date: Message-ID:Subject:From:To:Cc:Content-Type: Content-Transfer-Encoding:X-System-Of-Record; b=jrQzg8LDyVgg5HWiGo nAWqyQwXCQ/SG0CG1vv8+Lr1of5iN32GHdrIF6obVK2zHCDbEtGcgUxtyJT+lBozMdO w==
DomainKey-Signature: a=rsa-sha1; s=beta; d=google.com; c=nofws; q=dns; h=mime-version:in-reply-to:references:date:message-id:subject:from:to: cc:content-type:content-transfer-encoding:x-system-of-record; b=LnZP5afD7xD69v6r7Bq5GW6ik24C3o1KZfPbv4yKv9IlUCSGcPSjThYC//OrUfQ91 m41LQDbOpHXy+Qnrd3s6A==
Received: from yxe42 (yxe42.prod.google.com [10.190.2.42]) by spaceape24.eur.corp.google.com with ESMTP id n6H2aN6I015393 for <tcpm@ietf.org>; Thu, 16 Jul 2009 19:36:24 -0700
Received: by yxe42 with SMTP id 42so973007yxe.13 for <tcpm@ietf.org>; Thu, 16 Jul 2009 19:36:23 -0700 (PDT)
MIME-Version: 1.0
Received: by 10.100.96.12 with SMTP id t12mr840204anb.4.1247798183710; Thu, 16 Jul 2009 19:36:23 -0700 (PDT)
In-Reply-To: <1247733589.4184.56.camel@localhost.localdomain>
References: <d1c2719f0907131619t1a80997ep4080a3a721ef3627@mail.gmail.com> <d1c2719f0907141241p73e605adqc1d2e6f0db4eb3aa@mail.gmail.com> <4A5CE3D0.5000904@isi.edu> <d1c2719f0907141532i31d2b740hfa32209a8ccb156@mail.gmail.com> <4A5D0E8F.1040402@isi.edu> <d1c2719f0907141743n4952c9far54e3be36668577ed@mail.gmail.com> <4A5DE6BC.3090904@isi.edu> <7261633966bb7a55cebd4ce313069082.squirrel@webmail.uio.no> <4A5E018C.8030604@isi.edu> <1247733589.4184.56.camel@localhost.localdomain>
Date: Thu, 16 Jul 2009 19:36:23 -0700
Message-ID: <d1c2719f0907161936y5d46d883q25507cfe935b17cc@mail.gmail.com>
From: Jerry Chu <hkchu@google.com>
To: Michael Welzl <michawe@ifi.uio.no>
Content-Type: text/plain; charset="ISO-8859-1"
Content-Transfer-Encoding: quoted-printable
X-System-Of-Record: true
Cc: tcpm@ietf.org
Subject: Re: [tcpm] Tuning TCP parameters for the 21st century
X-BeenThere: tcpm@ietf.org
X-Mailman-Version: 2.1.9
Precedence: list
List-Id: TCP Maintenance and Minor Extensions Working Group <tcpm.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/listinfo/tcpm>, <mailto:tcpm-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/tcpm>
List-Post: <mailto:tcpm@ietf.org>
List-Help: <mailto:tcpm-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/tcpm>, <mailto:tcpm-request@ietf.org?subject=subscribe>
X-List-Received-Date: Fri, 17 Jul 2009 02:35:58 -0000

On Thu, Jul 16, 2009 at 1:39 AM, Michael Welzl<michawe@ifi.uio.no> wrote:
>> > - but what about retransmitting SYN/ACKs faster?
>> > Since the other side already sent a SYN, it probably
>> > isn't overloaded
>>
>> That presumes the other side already sent a SYN and it's been received,
>> and that the SYN/ACK is what's getting lost/delayed/retransmitted.  If
>> not, that won't help.
>
> Right -
>
>
>> First find where things are going wrong empirically ;-)
>
> We've done that. See, for example, fig. 2 (a) of
> http://www.welzl.at/research/publications/networking2009.pdf
> (page 7 of the pdf)
>
> Since our goal in this paper was to document general failures
> of connection setup, we did not strictly split our results
> into the case of lost SYNs or lost SYN/ACKs in it.
> Section 3 (end of page 3 / beginning of page 4) explains
> how lost SYN/ACKs were incorporated in our measurements.
>
>
> This paper is based on data produced by a student. The
> data are more thoroughly documented in his (alas, German)
> bachelor thesis. This document is here:
> http://www.welzl.at/research/tools/syn-retransmit/benjamin_kaser_bak1.pdf
> and the most relevant information is table 2 on page 47.
> (Table 1 is similar, just for SYNs, not SYN/ACKs).
>
> If you look at it, with or without German knowledge
> you will be able to see the connection to table 1 in
> the paper, where we exchanged columns with lines and
> omitted some details - apparently just the crucial
> ones  :-(    Anyway, the listed measurements are just
> the same, and hence you can read the details in the
> paper if you're interested in how we obtained our
> data.
>
> Table 2 on page 47 of the bachelor thesis has several
> details about duplicate SYN/ACKs that we've seen,
> e.g. classified by eventually successful and unsuccessful
> connections. As a quick summary, in 3432744 connection
> attempts, we have seen 56586 duplicate SYN/ACKs in total
> (this is a cumulative number, i.e. 4 SYN/ACKs are counted
> as 3). Counting only the occurrence of duplicate SYN/ACKs,
> we saw them for 0,55% of the 3432744 connection attempts.
> In general, the difference between duplicate SYN/ACK
> occurrences and duplicate SYN occurrences is negligible
> in our measurements.
>
> As I just looked at the data again, I noticed a mistake:
> in the paper, we wrote that the problem only happens for
> around 0.5% of the connections, but this is really only
> from the SYN table, not the SYN/ACK table in the bachelor
> thesis. The SYN/ACK table covers the same measurement sets.
> Hence, in total, it seems that we also saw > 1% lost SYNs
> *or* SYN/ACKs, which exactly matches what Jerry Chu said.
>
> BTW I maintain a small website about the whole thing:
> http://www.welzl.at/research/tools/syn-retransmit/index.html

Our overall pkt retransmission rate often goes over 1%. I was
wondering if SYN/SYN-ACK pkts are less likely to be dropped
by some routers due to their smaller size so we collected traces
and computed SYN-ACK retransmissions rate on some servers.
We confirmed it to be consistent with the overall pkt drop rate,
i.e., > 1% often.

FYI,

Jerry

>
> Cheers,
> Michael
>
>
> _______________________________________________
> tcpm mailing list
> tcpm@ietf.org
> https://www.ietf.org/mailman/listinfo/tcpm
>