Re: quick failover in SCTP

Preethi Natarajan <prenatar@cisco.com> Thu, 09 December 2010 21:08 UTC

Return-Path: <prenatar@cisco.com>
X-Original-To: tsvwg@core3.amsl.com
Delivered-To: tsvwg@core3.amsl.com
Received: from localhost (localhost [127.0.0.1]) by core3.amsl.com (Postfix) with ESMTP id 7E65E3A6BD8 for <tsvwg@core3.amsl.com>; Thu, 9 Dec 2010 13:08:32 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -5.636
X-Spam-Level:
X-Spam-Status: No, score=-5.636 tagged_above=-999 required=5 tests=[BAYES_00=-2.599, J_CHICKENPOX_43=0.6, J_CHICKENPOX_48=0.6, MIME_8BIT_HEADER=0.3, MIME_QP_LONG_LINE=1.396, RCVD_IN_DNSWL_HI=-8, RCVD_NUMERIC_HELO=2.067]
Received: from mail.ietf.org ([64.170.98.32]) by localhost (core3.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id qYvlcw1xGfTb for <tsvwg@core3.amsl.com>; Thu, 9 Dec 2010 13:08:31 -0800 (PST)
Received: from rtp-iport-1.cisco.com (rtp-iport-1.cisco.com [64.102.122.148]) by core3.amsl.com (Postfix) with ESMTP id 524CD3A6BD7 for <tsvwg@ietf.org>; Thu, 9 Dec 2010 13:08:31 -0800 (PST)
Authentication-Results: rtp-iport-1.cisco.com; dkim=neutral (message not signed) header.i=none
X-IronPort-Anti-Spam-Filtered: true
X-IronPort-Anti-Spam-Result: ApQIAP7TAE2tJV2c/2dsb2JhbACjFGkCeKUamxKFSgSEZIYVgxqEcQ
X-IronPort-AV: E=Sophos;i="4.59,322,1288569600"; d="scan'208";a="190978543"
Received: from rcdn-core-5.cisco.com ([173.37.93.156]) by rtp-iport-1.cisco.com with ESMTP; 09 Dec 2010 21:10:01 +0000
Received: from xbh-rcd-301.cisco.com (xbh-rcd-301.cisco.com [72.163.63.8]) by rcdn-core-5.cisco.com (8.14.3/8.14.3) with ESMTP id oB9LA08g022875; Thu, 9 Dec 2010 21:10:00 GMT
Received: from xmb-rcd-208.cisco.com ([72.163.62.215]) by xbh-rcd-301.cisco.com with Microsoft SMTPSVC(6.0.3790.4675); Thu, 9 Dec 2010 15:10:00 -0600
Received: from 171.70.225.252 ([171.70.225.252]) by XMB-RCD-208.cisco.com ([72.163.62.215]) with Microsoft Exchange Server HTTP-DAV ; Thu, 9 Dec 2010 21:10:00 +0000
User-Agent: Microsoft-Entourage/12.26.0.100708
Subject: Re: quick failover in SCTP
From: Preethi Natarajan <prenatar@cisco.com>
To: Michael Tüxen <Michael.Tuexen@lurchi.franken.de>, tsvwg@ietf.org
Message-ID: <C92685B2.B488%prenatar@cisco.com>
Thread-Topic: quick failover in SCTP
Thread-Index: ActpXb1cT6+HGFabQ0Owkx0dQ+FirQuh7kIa
In-Reply-To: <C8D88334.93F9%prenatar@cisco.com>
Mime-version: 1.0
Content-type: text/plain; charset="ISO-8859-1"
Content-transfer-encoding: quoted-printable
X-OriginalArrivalTime: 09 Dec 2010 21:10:00.0900 (UTC) FILETIME=[70FB1840:01CB97E5]
X-Mailman-Approved-At: Mon, 03 Jan 2011 08:09:15 -0800
X-BeenThere: tsvwg@ietf.org
X-Mailman-Version: 2.1.9
Precedence: list
List-Id: Transport Area Working Group <tsvwg.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/listinfo/tsvwg>, <mailto:tsvwg-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/tsvwg>
List-Post: <mailto:tsvwg@ietf.org>
List-Help: <mailto:tsvwg-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/tsvwg>, <mailto:tsvwg-request@ietf.org?subject=subscribe>
Date: Thu, 09 Dec 2010 21:08:32 -0000
X-Original-Date: Thu, 09 Dec 2010 13:10:10 -0800
X-List-Received-Date: Thu, 09 Dec 2010 21:08:32 -0000

Hello all,

A new version of the quick failover draft
(draft-nishida-tsvwg-sctp-failover-01) is now available at
http://www.ietf.org/id/draft-nishida-tsvwg-sctp-failover-01.txt.

The failover algorithm details have been updated in Section 4.3 based on
feedback from TSVWG.

Please let us know if you have questions/comments.

Thanks,
Preethi

On 10/11/10 9:02 AM, "Preethi Natarajan" <prenatar@cisco.com> wrote:

> Michael,
> 
> Thanks for the insightful comments/suggestions. Please see my responses below.
> 
> 
> On 10/9/10 5:34 AM, "Michael Tüxen" <Michael.Tuexen@lurchi.franken.de> wrote:
> 
>> On Oct 9, 2010, at 12:04 PM, Yoshifumi Nishida wrote:
>> 
>>> Hello Michael,
>>> 
>>> Thanks for your response.
>>> 
>>> 2010/10/8 Michael Tüxen <Michael.Tuexen@lurchi.franken.de>:
>>>> Hello Nishida-san,
>>>> 
>>>> OK, I thought about this some time and think it would be good
>>>> to specify a way for a quick failover which can be implemented
>>>> at the sender only.
>>>> I would like to see some extensions to you suggestion:
>>>> * Introduce a threshold (call it PFMR for now).
>>>>  Then use
>>>>  if (error counter > PMR)
>>>>     the path is inactive.
>>>>  if (error counter <= PMR) && (error counter > PFMR)
>>>>     the path is potentially failed.
>>>>  Using PFMR = 0 is what you suggest, PFMR=PMR gives
>>>>  the old behavior.
>>> 
>>> I thought similar thing and I agree with providing a way to disable PF.
>>> I tend to agree with this idea, but one thing I'm not very sure is how
>>> PFMR != 0 && PFMR < PMR can be useful.
>> I could image someone wanting to call a path potentially failed
>> after 2 consecutive timer based retransmission instead of 1.
>> Just being a bit more conservative. This might help deploying
>> such a feature in SIGTRAN networks where CMT is not deployed.
>> For CMT traffic, PFMR==0 is the right choice, I guess.
>> But I think PF is also very helpful in non-CMT scenarios.
> 
> Preethi: Good point. What you are suggesting is a tunable PFMR threshold while
> the current draft always assumes PFMR=0, the most aggressive behavior. I don't
> see how conservative PFMR values can be helpful but it may be a good idea to
> have it just in case.
> 
> 
>>>> * Specify that you start sending HBs when the path is
>>>>  PF. Explicitly allow PFHB.interval=0, which
>>>>  I think is a good choice. Maybe we can just remove PFHB.interval.
>>> 
>>> Hmm. Sorry. I might not quite follow this point.
>>> Does PFHB.interval=0 mean suppress sending HB during PF state?
>> No. I mean just to send them every RTO.
>> Having a specific interval allows this by setting PFHB.interval=0.
>> I was just thinking whether one can remove the parameter and send
>> the HB every RTO (and doubling it)... The same as using PFHB.interval=0.
> 
> Preethi: I agree. I don't think we need a PFHB.interval. In PF state, HBs are
> sent once every RTO.
> 
> 
>>>> * Make sure that the following works: The application disabled HBs.
>>>>  When a path enters PF (or failed) HB are sent to get the path
>>>>  active again. If it is active, no HB should be send (since
>>>>  the application disables them).
> 
> Preethi: Michael can you clarify this a bit? Looks like we want to confirm the
> following -- apps can control only those HBs that get sent (or not sent)
> during path failure. Apps do not have control over PF HBs; i.e., apps cannot
> enable/disable PF HBs. Is that right?
> 
>>>> * Provide a way that applications are not bothered with
>>>>  state change notification related to PF when not explicitly
>>>>  subscribed.
> 
> Preethi: Good point. To clarify: when PF is on, apps will be notified only
> about PF-to-failure and failure-to-active state changes. Apps will not be
> notified about active-to-PF state change.
> 
>>>> 
>>>> * Make clear what to do when all paths are PF.
>>>> 
>>>> * Make clear what to do when all paths are failed.
>>>> 
> 
> Preethi: I agree with both points. We can clarify these in the draft.
> 
> Thanks,
> Preethi