RE: [tcpm] revising 2581: setting ssthresh on RTOs

"Murari Sridharan" <muraris@microsoft.com> Mon, 31 July 2006 22:06 UTC

Received: from [127.0.0.1] (helo=stiedprmman1.va.neustar.com) by megatron.ietf.org with esmtp (Exim 4.43) id 1G7ftz-0003n6-QT; Mon, 31 Jul 2006 18:06:19 -0400
Received: from [10.91.34.44] (helo=ietf-mx.ietf.org) by megatron.ietf.org with esmtp (Exim 4.43) id 1G7fty-0003ja-Gi for tcpm@ietf.org; Mon, 31 Jul 2006 18:06:18 -0400
Received: from maila.microsoft.com ([131.107.1.7] helo=mail2.microsoft.com) by ietf-mx.ietf.org with esmtp (Exim 4.43) id 1G7ftx-00019E-4j for tcpm@ietf.org; Mon, 31 Jul 2006 18:06:18 -0400
Received: from mailout5.microsoft.com ([157.54.69.148]) by mail2.microsoft.com with Microsoft SMTPSVC(6.0.3790.1830); Mon, 31 Jul 2006 15:06:16 -0700
Received: from RED-MSG-52.redmond.corp.microsoft.com ([157.54.61.154]) by mailout5.microsoft.com with Microsoft SMTPSVC(6.0.3790.2725); Mon, 31 Jul 2006 15:06:15 -0700
x-mimeole: Produced By Microsoft Exchange V6.5
Content-class: urn:content-classes:message
MIME-Version: 1.0
Content-Type: text/plain; charset="us-ascii"
Content-Transfer-Encoding: quoted-printable
Subject: RE: [tcpm] revising 2581: setting ssthresh on RTOs
Date: Mon, 31 Jul 2006 15:06:06 -0700
Message-ID: <C1F4EAB8CA894E4AB1419EEC5224326F09288F2E@RED-MSG-52.redmond.corp.microsoft.com>
In-Reply-To: <20060731114729.4E9524453E7@lawyers.icir.org>
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
Thread-Topic: [tcpm] revising 2581: setting ssthresh on RTOs
Thread-Index: Aca0l1KPgwHtyyuWR5KCpLY/72JbJQAVcC7g
From: Murari Sridharan <muraris@microsoft.com>
To: mallman@icir.org, tcpm@ietf.org
X-OriginalArrivalTime: 31 Jul 2006 22:06:15.0895 (UTC) FILETIME=[8B007670:01C6B4ED]
X-Spam-Score: 0.0 (/)
X-Scan-Signature: 14582b0692e7f70ce7111d04db3781c8
Cc:
X-BeenThere: tcpm@ietf.org
X-Mailman-Version: 2.1.5
Precedence: list
List-Id: TCP Maintenance and Minor Extensions Working Group <tcpm.ietf.org>
List-Unsubscribe: <https://www1.ietf.org/mailman/listinfo/tcpm>, <mailto:tcpm-request@ietf.org?subject=unsubscribe>
List-Post: <mailto:tcpm@ietf.org>
List-Help: <mailto:tcpm-request@ietf.org?subject=help>
List-Subscribe: <https://www1.ietf.org/mailman/listinfo/tcpm>, <mailto:tcpm-request@ietf.org?subject=subscribe>
Errors-To: tcpm-bounces@ietf.org

#2 definitely looks reasonable. 

Although given the scenario you are targeting "hiccup was caused by some
small network issue like a handoff" which seems specific to a wireless
setting, or even if it not, a sender that implements F-RTO can detect
that the RTO is spurious and in which case could potentially revert the
congestion control parameters to pre-loss thresholds. 

Murari 

-----Original Message-----
From: Mark Allman [mailto:mallman@icir.org] 
Sent: Monday, July 31, 2006 4:47 AM
To: tcpm@ietf.org
Subject: [tcpm] revising 2581: setting ssthresh on RTOs

 
Folks-

We have talked about setting ssthresh after RTOs on this list a number
of times and we talked about it in Montreal.  I'd like to verify what
seemed the general mood in Montreal here on the list.  I think this is
the last bit we need to work out on 2581bis.  If you have something else
that you think needs done to the document, please yell.

RFC 2581 says that on each RTO a TCP reduces ssthresh to FlightSize / 2.
Consider a lost retransmit.  Say we RTO on segment X, cut ssthresh to Y
segments (Y > 2 - just for the example), cwnd becomes 1 segment and the
RTO is backed off.  Now, say we RTO on segment X again.  The cwnd will
stay at one, but the FlightSize is now 1 and so ssthresh takes its
minimum value of 2 segments.

The observation is that this forces linear growth for a potentially long
time if this loss hiccup was caused by some small network issue like a
handoff.  In that case, it'd be nice to be able to keep ssthresh higher
and use slow start when packets started flowing again.

Also, as a practical matter I don't think the above scenario is the way
we intended things to work.  Rather, I think we envisioned (but, alas,
did not write) suggestion #1:

Suggestion #1: On the first RTO for some segment, set ssthresh to
FlightSize/2.  On each subsequent RTO for the given segment halve
ssthresh (ssthresh =/ 2).

Basically, this slowly degrades ssthresh as the RTO gets backed off,
such that the longer TCP has been transmitting into a lousy network the
less the TCP gets to use exponential increase when packets start flowing
again.

In addition, another variant has been suggested in the meantime, ...

Suggestion #2: On the first RTO for some segment, set ssthresh to
FlightSize/2.  On each subsequent RTO for the given segment do not
adjust ssthresh at all.

This variant means that a TCP always gets to re-probe with slow start
based on the pre-loss conditions no matter how long it took to fix the
loss. 

Both #1 and #2 are quite safe.  If the network is in a really lousy
state then the TCP is going to continue to get losses even after getting
out of RTO backoff without increasing the congestion window all that
much.  And, if that happens then ssthresh will get further reduced
(probably to its minimum).  Essentially, if the network is heavily
loaded all of the sudden then this additional loss isn't really going to
be exacerbated by the first couple RTTs of slow start.  If the backoff
was caused by something other than a suddenly massively congested
network then this tweak lets TCP get back to a reasonable operating
point more rapidly.

So, a couple of questions ... and, the authors current hits ...

(1) Is this change in-scope for 2581bis?  We said "no algorithmic
    tweaks" and so one view is that this should be cooked elsewhere and
    rolled in later.

    The author's hit on this is that the behavior of slamming ssthresh
    down on the first backed off RTO is not our intent and so tweaking
    this seems in-scope.  

(2) Assuming folks are fine with making a change then which change
    should we make?  Suggestion #1 or #2?

    The author's chatted and we feel like #2 is fine.  As noted above,
    neither case really aggravates the state of the network in suddenly
    heavily loaded situations.  So, #2 seems OK.

What do people think?  Is #2 OK?  Or, something else?

Thanks in advance for the feedback!

allman




_______________________________________________
tcpm mailing list
tcpm@ietf.org
https://www1.ietf.org/mailman/listinfo/tcpm