[tcpm] Increasing TCP's Initial Congestion
"Scheffenegger, Richard" <rs@netapp.com> Wed, 10 November 2010 19:29 UTC
Return-Path: <rs@netapp.com>
X-Original-To: tcpm@core3.amsl.com
Delivered-To: tcpm@core3.amsl.com
Received: from localhost (localhost [127.0.0.1]) by core3.amsl.com (Postfix) with ESMTP id E385E3A69D8 for <tcpm@core3.amsl.com>; Wed, 10 Nov 2010 11:29:21 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -10.479
X-Spam-Level:
X-Spam-Status: No, score=-10.479 tagged_above=-999 required=5 tests=[AWL=0.120, BAYES_00=-2.599, RCVD_IN_DNSWL_HI=-8]
Received: from mail.ietf.org ([64.170.98.32]) by localhost (core3.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id BKxmrlJ9cn0R for <tcpm@core3.amsl.com>; Wed, 10 Nov 2010 11:29:21 -0800 (PST)
Received: from mx4.netapp.com (mx4.netapp.com [217.70.210.8]) by core3.amsl.com (Postfix) with ESMTP id A69F43A6821 for <tcpm@ietf.org>; Wed, 10 Nov 2010 11:29:20 -0800 (PST)
X-IronPort-AV: E=Sophos;i="4.59,179,1288594800"; d="scan'208";a="225635505"
Received: from smtp3.europe.netapp.com ([10.64.2.67]) by mx4-out.netapp.com with ESMTP; 10 Nov 2010 11:29:47 -0800
Received: from ldcrsexc1-prd.hq.netapp.com (emeaexchrs.hq.netapp.com [10.65.251.109]) by smtp3.europe.netapp.com (8.13.1/8.13.1/NTAP-1.6) with ESMTP id oAAJTl0o019918; Wed, 10 Nov 2010 11:29:47 -0800 (PST)
Received: from LDCMVEXC1-PRD.hq.netapp.com ([10.65.251.108]) by ldcrsexc1-prd.hq.netapp.com with Microsoft SMTPSVC(6.0.3790.3959); Wed, 10 Nov 2010 19:29:46 +0000
X-MimeOLE: Produced By Microsoft Exchange V6.5
Content-class: urn:content-classes:message
MIME-Version: 1.0
Content-Type: text/plain; charset="us-ascii"
Content-Transfer-Encoding: quoted-printable
Date: Wed, 10 Nov 2010 19:27:29 -0000
Message-ID: <5FDC413D5FA246468C200652D63E627A0B65C814@LDCMVEXC1-PRD.hq.netapp.com>
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
Thread-Topic: [tcpm] Increasing TCP's Initial Congestion
Thread-Index: AcuBDVCuBA27FU5TSZaEq60S69mRKg==
From: "Scheffenegger, Richard" <rs@netapp.com>
To: Jerry Chu <hkchu@google.com>, Yaogong Wang <ywang15@ncsu.edu>
X-OriginalArrivalTime: 10 Nov 2010 19:29:46.0905 (UTC) FILETIME=[A2618890:01CB810D]
Cc: Arvind Jain <arvind@google.com>, tcpm <tcpm@ietf.org>, Matt Mathis <mattmathis@google.com>
Subject: [tcpm] Increasing TCP's Initial Congestion
X-BeenThere: tcpm@ietf.org
X-Mailman-Version: 2.1.9
Precedence: list
List-Id: TCP Maintenance and Minor Extensions Working Group <tcpm.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/listinfo/tcpm>, <mailto:tcpm-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/tcpm>
List-Post: <mailto:tcpm@ietf.org>
List-Help: <mailto:tcpm-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/tcpm>, <mailto:tcpm-request@ietf.org?subject=subscribe>
X-List-Received-Date: Wed, 10 Nov 2010 19:29:22 -0000
Hi Jerry, Yaogong, Reading your slides (i won't be online when you give your talk): http://www.ietf.org/proceedings/79/slides/tcpm-0.pdf | Is SACK required for IW10 to Perform? | | * From the testbed at Google | - SACK does help reducing UCT but only by a small | percentage for both IW10 and IW3 | * 24 hours, 3-way parallel experiment at Google's | frontend servers | - IW10+NewReno still beats IW3+SACK | | Photos download Avg response time Retransmission rate | IW10+SACK 2.6secs 4.1% | IW10+NewReno 2.8secs 4.1% | IW3+SACK 3.0secs 3.3% | November 11, 2010 79th IETF, Beijing China 19 First, let me thank you for conducting these experiments with SACK and NewReno. My comment at IETF78 was that Linux SACK (since 2.4.xx) can detect lost retransmissions, and recover from them; On the 2nd slide you paraphrase this: | How does IW10 perform | when SACK is either not available, or not adequately | implemented? Did you disable this aspect of Linux SACK during your tests (to have a RFC3517 compliant SACK sender)? Also, IIRC, Linux 2.6.2x already implements draft-jarvinen-tcpm-sack-recovery-entry-02.txt // RFC3517bis. I assume the fraction of lost retransmissions and early recovery entries was small (afaik, Linux records these events individually). If these counters were zero during your the testing, I'm fully satisfied. I'm just worried that a good fraction of loss recovery was due to these recent features (not widely available on other stacks), causing grief if one simply tries to do IW10 without these other modifications. Also, this comment on slide 13 indicates, that some flows had to do RTO-based loss recovery, correct? | * Under heavy load IW10 lost UCT advantage due to high PLR | * IW10 UCT exhibits long tails Did you get traces off your testbed to investigate these long tail flows further? One of the reasons might be the recent observation, that with short flows, NewReno can sometimes recover more speedy than SACK ( http://www.ietf.org/mail-archive/web/tcpm/current/msg05875.html ). (RecoveryPoint is close to the end-of-stream, and at least the last segment prior to RP was lost). The other reason could be, that SACK retransmissions are lost, but Linux can not recover them any more. (RecoveryPoint is at the end-of-stream). There might be additional cases causing RTO, though... I would be very interested in looking at these long tail (high latency) flow traces myself. Richard Scheffenegger
- [tcpm] Increasing TCP's Initial Congestion Scheffenegger, Richard
- Re: [tcpm] Increasing TCP's Initial Congestion Jerry Chu