Re: [OSPF] Query regarding behavior of OSPF DR-Other's neighbor-State with BDR when DR fails, when DR down detection is delayed at DR-Other.

Bharath R <rbharath@juniper.net> Fri, 05 July 2013 12:06 UTC

Return-Path: <rbharath@juniper.net>
X-Original-To: ospf@ietfa.amsl.com
Delivered-To: ospf@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id DCE9F21F997B for <ospf@ietfa.amsl.com>; Fri, 5 Jul 2013 05:06:18 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: 0.134
X-Spam-Level:
X-Spam-Status: No, score=0.134 tagged_above=-999 required=5 tests=[BAYES_00=-2.599, HTML_MESSAGE=0.001, J_CHICKENPOX_33=0.6, RCVD_IN_DNSWL_LOW=-1, UNRESOLVED_TEMPLATE=3.132]
Received: from mail.ietf.org ([12.22.58.30]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id hMt7bSz+YQwg for <ospf@ietfa.amsl.com>; Fri, 5 Jul 2013 05:06:13 -0700 (PDT)
Received: from ch1outboundpool.messaging.microsoft.com (ch1ehsobe001.messaging.microsoft.com [216.32.181.181]) by ietfa.amsl.com (Postfix) with ESMTP id 9845321F9EA8 for <OSPF@ietf.org>; Fri, 5 Jul 2013 05:06:12 -0700 (PDT)
Received: from mail19-ch1-R.bigfish.com (10.43.68.252) by CH1EHSOBE005.bigfish.com (10.43.70.55) with Microsoft SMTP Server id 14.1.225.22; Fri, 5 Jul 2013 12:06:09 +0000
Received: from mail19-ch1 (localhost [127.0.0.1]) by mail19-ch1-R.bigfish.com (Postfix) with ESMTP id F276A4801FF for <OSPF@ietf.org>; Fri, 5 Jul 2013 12:06:08 +0000 (UTC)
X-Forefront-Antispam-Report: CIP:66.129.224.50; KIP:(null); UIP:(null); IPV:NLI; H:P-EMHUB03-HQ.jnpr.net; RD:none; EFVD:NLI
X-SpamScore: -23
X-BigFish: VPS-23(zz98dI9371Ic85fh4015Izz1f42h1ee6h1de0h1fdah2073h1202h1e76h1d1ah1d2ah1fc6hzz8275ch1d7338h1033IL17326ah18c673h1c8fb4h8275bh8275dhz2fh2a8h683h839hd25hf0ah1288h12a5h12bdh137ah1441h1504h1537h153bh15d0h162dh1631h1758h18e1h1946h19b5h19ceh1ad9h1b0ah1bceh1d07h1d0ch1d2eh1d3fh1de9h1dfeh1dffh1e1dh1155h)
Received-SPF: pass (mail19-ch1: domain of juniper.net designates 66.129.224.50 as permitted sender) client-ip=66.129.224.50; envelope-from=rbharath@juniper.net; helo=P-EMHUB03-HQ.jnpr.net ; -HQ.jnpr.net ;
X-Forefront-Antispam-Report-Untrusted: CIP:132.245.2.21; KIP:(null); UIP:(null); (null); H:BN1PRD0512HT004.namprd05.prod.outlook.com; R:internal; EFV:INT
Received: from mail19-ch1 (localhost.localdomain [127.0.0.1]) by mail19-ch1 (MessageSwitch) id 1373025966245085_12743; Fri, 5 Jul 2013 12:06:06 +0000 (UTC)
Received: from CH1EHSMHS015.bigfish.com (snatpool1.int.messaging.microsoft.com [10.43.68.250]) by mail19-ch1.bigfish.com (Postfix) with ESMTP id 281BAE0043 for <OSPF@ietf.org>; Fri, 5 Jul 2013 12:06:06 +0000 (UTC)
Received: from P-EMHUB03-HQ.jnpr.net (66.129.224.50) by CH1EHSMHS015.bigfish.com (10.43.70.15) with Microsoft SMTP Server (TLS) id 14.1.225.23; Fri, 5 Jul 2013 12:05:59 +0000
Received: from P-CLDFE01-HQ.jnpr.net (172.24.192.59) by P-EMHUB03-HQ.jnpr.net (172.24.192.37) with Microsoft SMTP Server (TLS) id 8.3.213.0; Fri, 5 Jul 2013 05:05:58 -0700
Received: from o365mail.juniper.net (207.17.137.149) by o365mail.juniper.net (172.24.192.59) with Microsoft SMTP Server id 14.1.355.2; Fri, 5 Jul 2013 05:05:57 -0700
Received: from va3outboundpool.messaging.microsoft.com (216.32.180.16) by o365mail.juniper.net (207.17.137.149) with Microsoft SMTP Server (TLS) id 14.1.355.2; Fri, 5 Jul 2013 05:09:52 -0700
Received: from mail218-va3-R.bigfish.com (10.7.14.251) by VA3EHSOBE013.bigfish.com (10.7.40.63) with Microsoft SMTP Server id 14.1.225.22; Fri, 5 Jul 2013 12:05:56 +0000
Received: from mail218-va3 (localhost [127.0.0.1]) by mail218-va3-R.bigfish.com (Postfix) with ESMTP id 44D9A9A0702 for <OSPF@ietf.org.FOPE.CONNECTOR.OVERRIDE>; Fri, 5 Jul 2013 12:05:56 +0000 (UTC)
Received: from mail218-va3 (localhost.localdomain [127.0.0.1]) by mail218-va3 (MessageSwitch) id 1373025953693221_23757; Fri, 5 Jul 2013 12:05:53 +0000 (UTC)
Received: from VA3EHSMHS005.bigfish.com (unknown [10.7.14.230]) by mail218-va3.bigfish.com (Postfix) with ESMTP id 9D0F1D8006F; Fri, 5 Jul 2013 12:05:53 +0000 (UTC)
Received: from BN1PRD0512HT004.namprd05.prod.outlook.com (132.245.2.21) by VA3EHSMHS005.bigfish.com (10.7.99.15) with Microsoft SMTP Server (TLS) id 14.1.225.23; Fri, 5 Jul 2013 12:05:52 +0000
Received: from BN1PRD0512MB620.namprd05.prod.outlook.com ([169.254.16.70]) by BN1PRD0512HT004.namprd05.prod.outlook.com ([10.255.193.37]) with mapi id 14.16.0324.000; Fri, 5 Jul 2013 12:05:52 +0000
From: Bharath R <rbharath@juniper.net>
To: Acee Lindem <acee.lindem@ericsson.com>, Tanmoy Kundu <tanmoycs@gmail.com>
Thread-Topic: [OSPF] Query regarding behavior of OSPF DR-Other's neighbor-State with BDR when DR fails, when DR down detection is delayed at DR-Other.
Thread-Index: Ac5sGnznNwBh+oLtRnGrauEoRknkhf//8RyAgA6LkAD/9rGzgA==
Date: Fri, 05 Jul 2013 12:05:51 +0000
Message-ID: <B39429F1A3EA1A4A9110CEC3D99263EA14C866CA@BN1PRD0512MB620.namprd05.prod.outlook.com>
References: <94A203EA12AECE4BA92D42DBFFE0AE4718CDEF@eusaamb101.ericsson.se>
In-Reply-To: <94A203EA12AECE4BA92D42DBFFE0AE4718CDEF@eusaamb101.ericsson.se>
Accept-Language: en-US
Content-Language: en-US
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
x-originating-ip: [116.197.178.83]
Content-Type: multipart/alternative; boundary="_000_B39429F1A3EA1A4A9110CEC3D99263EA14C866CABN1PRD0512MB620_"
MIME-Version: 1.0
X-FOPE-CONNECTOR: Id%0$Dn%*$RO%0$TLS%0$FQDN%$TlsDn%
X-FOPE-CONNECTOR: Id%12219$Dn%ERICSSON.COM$RO%2$TLS%5$FQDN%onpremiseedge-1018244.customer.frontbridge.com$TlsDn%o365mail.juniper.net
X-FOPE-CONNECTOR: Id%12219$Dn%GMAIL.COM$RO%2$TLS%5$FQDN%onpremiseedge-1018244.customer.frontbridge.com$TlsDn%o365mail.juniper.net
X-FOPE-CONNECTOR: Id%12219$Dn%IETF.ORG$RO%2$TLS%5$FQDN%onpremiseedge-1018244.customer.frontbridge.com$TlsDn%o365mail.juniper.net
X-OriginatorOrg: juniper.net
Cc: "OSPF@ietf.org" <OSPF@ietf.org>
Subject: Re: [OSPF] Query regarding behavior of OSPF DR-Other's neighbor-State with BDR when DR fails, when DR down detection is delayed at DR-Other.
X-BeenThere: ospf@ietf.org
X-Mailman-Version: 2.1.12
Precedence: list
List-Id: The Official IETF OSPG WG Mailing List <ospf.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/ospf>, <mailto:ospf-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/ospf>
List-Post: <mailto:ospf@ietf.org>
List-Help: <mailto:ospf-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/ospf>, <mailto:ospf-request@ietf.org?subject=subscribe>
X-List-Received-Date: Fri, 05 Jul 2013 12:06:19 -0000

Hi Acee,
Thank you for your response.

Sorry If my mail was unclear and longish.

Can you please clarify on following questions of mine:

If the DR-interface goes down, adjacency of old DR with DR-Others will go down. This is expected.
But, is it expected for adjacency of DR-Others with BDR(now the new DR) also flap as well(transition to 2-WAY and again back to FULL)?
(Like, in cases wherein DR-Others have detect DR down much after BDR has detected DR down and sent it's Hello to DR-Others).
If this adjacency does flap with BDR, do we not loose out on the advantage of having a BDR?


Also I was hoping to have clarity on the section 9.2 of RFC 2328of RFC 2328 about triggers for NeighborChange event:
            o   One of the bidirectional neighbors is newly declaring
                itself as either Designated Router or Backup Designated
                Router.  This is detected through examination of that
                neighbor's Hello Packets.

            o   One of the bidirectional neighbors is no longer
                declaring itself as Designated Router, or is no longer
                declaring itself as Backup Designated Router.  This is
                again detected through examination of that neighbor's
                Hello Packets.

Whether these two bullets are mentioning transition from {DR to (BDR or DR-Other)}, and {BDR to (DR or DR-other)}
Or does it just mean a transition from (Non-DR-Other to DR-Other) and (DR-Other to Non-DR-Other).
Where Non-DR-Other means (DR and BDR).


Thanks and Regards,
Bharath R.


From: Acee Lindem [mailto:acee.lindem@ericsson.com]
Sent: Thursday, June 27, 2013 8:00 PM
To: Tanmoy Kundu
Cc: Bharath R; OSPF@ietf.org; Dileep Singh
Subject: Re: [OSPF] Query regarding behavior of OSPF DR-Other's neighbor-State with BDR when DR fails, when DR down detection is delayed at DR-Other.

I agree with Tanmoy that this is proper behavior.
Thanks,
Acee
On Jun 18, 2013, at 10:00 AM, Tanmoy Kundu wrote:


Hi Bharath,
Few queries. As you mentioned "R2(The current BDR) detects the DR-down.", how did R2 sensed that R1 is down ?
    a. first possible option is dead timer expiry in R2. In that case R3 should also get the expiry soon and till that time the network won't converge. isn't that expected?
    b. Another option is having BFD session between R1 and R2, hence it comes to know. Why don't we run BFD between all the routers in the network ? As we know in OSPF the DR and BDR is not guaranteed.

This typical scenario is due to the DR-other is with priority zero. But when received hello packet from DR, both BDR and DR-Other should reset the dead timer. Even if we consider the link transmission delay and ASIC processing, the dead timer expiry difference between R2 and R3 should not be more than milisecond, isn't it ?


*         Is  it right to trigger the NeighborChange event at R3? - I feel yes, other than few typical scenarios the network will at least not be used, until its converged. If Nbr-chg event is not sent then all DR other feels that DR is still active and the same will be used for forwarding the traffic. If someone in the network sensed that DR/BDR is down, why don't tell others immediately?
*         Is this transition from FULL to 2-WAY  is expected? - As per RFC, DR others should not be FULL with other routers than DR and BDR, hence YES. It is expected.
*         Can DR-Others flap adjacency with BDR if DR down detection  happens later than reception of new Hello from  the new DR? -
*         Intuitively, it may seem desirable to continue to be adjacent to a neighbor as long as it is still DR or BDR. Is this a fair call? - As mentioned above, its not fair to use a disturbed or unsettled network for forwarding. Due to backlink check failure the LAN wont be used for forwarding during SPF. Hence as per me its proper behavior.


Thanks,
Tanmoy


On Tue, Jun 18, 2013 at 5:26 PM, Bharath R <rbharath@juniper.net<mailto:rbharath@juniper.net>> wrote:
Hi,
Please consider the scenario:

R1(DR)                                            R2(BDR)
|  I1                                               I2   |
|                                                            |
|______________________ |
                                |
                                |  I3
                                |
                      R3(DR-Other)

Here:
R1 is the DR.
R2 is the BDR
R3 is DR-Other, with DR-priority 0 and hence ineligible to become DR/BDR.


Following are the is the set of operations and a sequence of events:

1.       Interface I1 is disabled. So, DR connectivity is lost to the rest of the network.
2.       R2(The current BDR) detects the DR-down.
a.       Declares itself the DR.
b.      Declares BDR as NULL.
c.       Sends out a Hello with DR as I2 interface address, and BDR as 0.0.0.0
3.       R3(DR-Other) receives Hello from R2.
As per, section 9.2 of RFC 2328:
            o   One of the bidirectional neighbors is newly declaring
                itself as either Designated Router or Backup Designated
                Router.  This is detected through examination of that
                neighbor's Hello Packets.

            o   One of the bidirectional neighbors is no longer
                declaring itself as Designated Router, or is no longer
                declaring itself as Backup Designated Router.  This is
                again detected through examination of that neighbor's
                Hello Packets.

Triggers a NeighborChange event, which in-turn results in a DR election at R3.
Please note that at this time R3 has not yet detected R1 down.
Now result of DR-election at R3:
DR: R1
BDR: 0.0.0.0
Since R2 is no longer the BDR, R3 transitions from FULL to 2-WAY with R2(the new DR).
Of course, on detection of DR down at R3, R3 will elect R2 as DR and then again transition to ExStart, to Exchange to Full with R2.

Can you please let me know:
*         Is  it right to trigger the NeighborChange event at R3 ?
*         Is this transition from FULL to 2-WAY  is expected?
*         Can DR-Others flap adjacency with BDR if DR down detection  happens later than reception of new Hello from  the new DR?
*         Intuitively, it may seem desirable to continue to be adjacent to a neighbor as long as it is still DR or BDR. Is this a fair call?

Please correct me if I have missed out something.


Thanks and Regards,
Bharath R.






_______________________________________________
OSPF mailing list
OSPF@ietf.org<mailto:OSPF@ietf.org>
https://www.ietf.org/mailman/listinfo/ospf

_______________________________________________
OSPF mailing list
OSPF@ietf.org<mailto:OSPF@ietf.org>
https://www.ietf.org/mailman/listinfo/ospf