Re: bgp4-17 Cease subcode

andrewl@exodus.net Fri, 18 January 2002 01:49 UTC

Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id UAA29822 for <idr-archive@nic.merit.edu>; Thu, 17 Jan 2002 20:49:39 -0500 (EST)
Received: by trapdoor.merit.edu (Postfix) id ECCC491301; Thu, 17 Jan 2002 20:49:19 -0500 (EST)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id BA27591302; Thu, 17 Jan 2002 20:49:19 -0500 (EST)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id D0ECE91301 for <idr@trapdoor.merit.edu>; Thu, 17 Jan 2002 20:49:18 -0500 (EST)
Received: by segue.merit.edu (Postfix) id A46905DE04; Thu, 17 Jan 2002 20:49:18 -0500 (EST)
Delivered-To: idr@merit.edu
Received: from demiurge.exodus.net (unknown [216.32.171.82]) by segue.merit.edu (Postfix) with ESMTP id 28BD05DDB9 for <idr@merit.edu>; Thu, 17 Jan 2002 20:49:18 -0500 (EST)
Received: (from andrewl@localhost) by demiurge.exodus.net (8.9.3+Sun/8.9.3) id RAA12068; Thu, 17 Jan 2002 17:46:30 -0800 (PST)
Date: Thu, 17 Jan 2002 17:46:30 -0800
From: andrewl@exodus.net
To: Susan Hares <skh@nexthop.com>
Cc: Alex Zinin <azinin@nexsi.com>, randy Bush <randy@psg.com>, fenner@research.att.com, idr@merit.edu
Subject: Re: bgp4-17 Cease subcode
Message-ID: <20020117174630.J19424@demiurge.exodus.net>
References: <5.0.0.25.0.20020117083423.0252ef28@mail.nexthop.com> <5.0.0.25.0.20020116090028.039d2fa8@mail.nexthop.com> <20020115140711.GA23937@opentransit.net> <20020114123700.C7761@nexthop.com> <200201141750.g0EHo3634958@merlot.juniper.net> <20020115140711.GA23937@opentransit.net> <5.0.0.25.0.20020116090028.039d2fa8@mail.nexthop.com> <5.0.0.25.0.20020117083423.0252ef28@mail.nexthop.com> <32264605142.20020117091413@nexsi.com> <5.0.0.25.0.20020117124225.042f96e0@mail.nexthop.com>
Mime-Version: 1.0
Content-Type: text/plain; charset="us-ascii"
Content-Disposition: inline
User-Agent: Mutt/1.2.5i
In-Reply-To: <5.0.0.25.0.20020117124225.042f96e0@mail.nexthop.com>; from skh@nexthop.com on Thu, Jan 17, 2002 at 01:28:40PM -0500
Sender: owner-idr@merit.edu
Precedence: bulk

> 3)bgp exponential back-off

One operator's view:

The vendor we use for our peering and backbone boxes doesn't implement this 
feature.  However, we would like it.  What usually happens when we have a 
flapping BGP session, due to say, a bad circuit, is someone is forced to come in
and shut it down manually, then bring it back up at some point when the
circuit is stable again.  With exponential backoff the lengths of the 
retrys would rapidly get to the point where human intervention to shut
the session down manually would not be required.  And the box would still try to
bring up the session occasionally, eliminating the "Oops, I forgot to bring
that back up" problem.

As for the FSM, were we to have this feature we would demand a command from 
the vendor that would indicate what the status of the backoff is.  So when
we did a "sh ip bgp summ" we could see either that it was transitioned to
idle for x reason and it had y seconds/minutes/hours remaining.  The 
proposal to call that state "IdleHold" is a sound one.  Although if
it were called ("Idle - Hold Flag Set" or something that would be fine too.)
And yes, we would want both the state and the time remaining in the MIB.
Preferably, if the state were IdleHold, we would have a "try #, # of seconds
remaining, # of seconds in this retry interval.  Eg: BackoffAttemptNumber: 5
BackoffSecondsRemaining: 38 BackoffSecondsTotalThisAttempt: 960

And being the generally knob-happy types we operators are :), a max-backoff
value or the like would be good as an enhancement.

Andrew