Re: [GROW] Potential route flap with bgp-diverse-path

Jakob Heitz <jakob.heitz@ericsson.com> Sun, 27 March 2011 21:04 UTC

Return-Path: <jakob.heitz@ericsson.com>
X-Original-To: grow@core3.amsl.com
Delivered-To: grow@core3.amsl.com
Received: from localhost (localhost [127.0.0.1]) by core3.amsl.com (Postfix) with ESMTP id B91F43A6917; Sun, 27 Mar 2011 14:04:21 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -6.33
X-Spam-Level:
X-Spam-Status: No, score=-6.33 tagged_above=-999 required=5 tests=[AWL=0.269, BAYES_00=-2.599, RCVD_IN_DNSWL_MED=-4]
Received: from mail.ietf.org ([64.170.98.32]) by localhost (core3.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id LvxoL2hOjHwP; Sun, 27 Mar 2011 14:04:21 -0700 (PDT)
Received: from imr4.ericy.com (imr4.ericy.com [198.24.6.8]) by core3.amsl.com (Postfix) with ESMTP id CFA913A6906; Sun, 27 Mar 2011 14:04:20 -0700 (PDT)
Received: from eusaamw0712.eamcs.ericsson.se ([147.117.20.181]) by imr4.ericy.com (8.14.3/8.14.3/Debian-9.1ubuntu1) with ESMTP id p2RL5udS005936; Sun, 27 Mar 2011 16:05:57 -0500
Received: from EUSAACMS0701.eamcs.ericsson.se ([169.254.2.141]) by eusaamw0712.eamcs.ericsson.se ([147.117.20.181]) with mapi; Sun, 27 Mar 2011 17:05:50 -0400
From: Jakob Heitz <jakob.heitz@ericsson.com>
To: "raszuk@cisco.com" <raszuk@cisco.com>
Date: Sun, 27 Mar 2011 17:05:48 -0400
Thread-Topic: [GROW] Potential route flap with bgp-diverse-path
Thread-Index: AcvswAZnGPY/FHqGTUi5P7cOoKt9MQAAUDCA
Message-ID: <7309FCBCAE981B43ABBE69B31C8D21390E3F5BECC3@EUSAACMS0701.eamcs.ericsson.se>
References: <7309FCBCAE981B43ABBE69B31C8D21390E3F5BEB34@EUSAACMS0701.eamcs.ericsson.se> <4D8DA854.6060401@cisco.com> <7309FCBCAE981B43ABBE69B31C8D21390E3F5BECAF@EUSAACMS0701.eamcs.ericsson.se> <4D8FA215.7020809@cisco.com>
In-Reply-To: <4D8FA215.7020809@cisco.com>
Accept-Language: en-US
Content-Language: en-US
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
acceptlanguage: en-US
Content-Type: text/plain; charset="us-ascii"
Content-Transfer-Encoding: quoted-printable
MIME-Version: 1.0
Cc: IETF IDR <idr@ietf.org>, "grow@ietf.org" <grow@ietf.org>
Subject: Re: [GROW] Potential route flap with bgp-diverse-path
X-BeenThere: grow@ietf.org
X-Mailman-Version: 2.1.9
Precedence: list
List-Id: Grow Working Group Mailing List <grow.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/listinfo/grow>, <mailto:grow-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/grow>
List-Post: <mailto:grow@ietf.org>
List-Help: <mailto:grow-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/grow>, <mailto:grow-request@ietf.org?subject=subscribe>
X-List-Received-Date: Sun, 27 Mar 2011 21:04:21 -0000

The "corner" case happens 1/2 the time.

Initial:
Plane1 advertises P1.
Plane2 advertises P2.

P1 goes down:
Plane1 advertises P2.
Plane2 advertises P3.

The case that the client receives Plane2 message first
is 50% likely. During the flap, it has:

Plane1: P1
Plane2: P3

P1 is unreachable, so it chooses P3.

Originally, I proposed a delay on Plane2 advertisements,
but you didn't like it. How "small" should the delay be?
I think longer than MRAI, since Plane1 may be running
that timer for its P2 advertisement.

What is the "extra session"?
Is it a 3rd RR plane?

--
Jakob Heitz.
 

> -----Original Message-----
> From: Robert Raszuk [mailto:raszuk@cisco.com] 
> Sent: Sunday, March 27, 2011 1:46 PM
> To: Jakob Heitz
> Cc: grow@ietf.org; IETF IDR
> Subject: Re: [GROW] Potential route flap with bgp-diverse-path
> 
> Hello Jakob,
> 
> > If there are 3 paths: P1, P2 and P3
> > and 2 RR planes.
> > Plane1 advertises best, Plan2 advertizes 2nd best.
> 
> Ok.
> 
> > When the bestpath P1 goes away, the client can
> > flap to the 3rd best path, P3, before coming
> > back to the 2nd bset path P2.
> 
> Nope it will not .. see below on comments to your description.
> 
> > Here is the initial condition:
> > P1 is best, P2 2nd best, P3 3rd best.
> > Plane1 advertises P1, Plane2 advertises P2
> 
> Perfect !
> 
> > Because the advertisements from the RR planes are
> > not synchronized, the order of events could be:
> >
> > 1. P1 goes down: P2 becomes best, P3 becomes 2nd best.
> 
> Ok.
> 
> > 2. Client receives IGP message of P1 down: Client chooses P2.
> 
> Ok.
> 
> > 3. Client receives P3 from Plane2:         Client chooses P3.
> 
> This is only when client would not have P2 at all. IMHO 
> corner case (see 
> below why)
> 
> > 4. Client receives P2 from Plane1:         Client chooses P2.
> 
> P2 should not go away from client.
> 
> > Possible fix: Recommend at least as many RR planes
> > as possible paths for any destination.
> 
> Nope.
> 
> There are two simple solution here:
> 
> - Presence of an extra session from client to either RR to make sure 
> that your second best is not removed before overall best is advertised
> 
> - Small delay on the RR when advertising 2nd best to clients to make 
> sure overall best is there first in some implementations just 
> configuring per session mrai may help here.
> 
> Cheers,
> R.
> 
>