Re: Service Redundancy using BFD

Ashesh Mishra <> Tue, 28 November 2017 12:04 UTC

Return-Path: <>
Received: from localhost (localhost []) by (Postfix) with ESMTP id A4C0D126C2F for <>; Tue, 28 Nov 2017 04:04:44 -0800 (PST)
X-Virus-Scanned: amavisd-new at
X-Spam-Flag: NO
X-Spam-Score: -1.998
X-Spam-Status: No, score=-1.998 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, FREEMAIL_FROM=0.001, HTML_MESSAGE=0.001, RCVD_IN_DNSWL_NONE=-0.0001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=ham autolearn_force=no
Authentication-Results: (amavisd-new); dkim=pass (2048-bit key)
Received: from ([]) by localhost ( []) (amavisd-new, port 10024) with ESMTP id b8bkeI097wbV for <>; Tue, 28 Nov 2017 04:04:42 -0800 (PST)
Received: from ( []) (using TLSv1.2 with cipher ECDHE-RSA-AES256-SHA384 (256/256 bits)) (No client certificate requested) by (Postfix) with ESMTPS id 0567C120725 for <>; Tue, 28 Nov 2017 04:04:41 -0800 (PST)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed;; s=selector1; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version; bh=o3LdzcwHJfmz1Z94D3tFBaqAJ7kv9nEUP0Rfv7YWbjI=; b=aXNl3Pxh7JnnhjMSeQQ0JyhTFq4rMQfFXycZR+SWaLpwBFe+Scg4PGDVAWbAtCcJwo8WjICYvnO4sVdaia7rdHdZonauoOaegGKYCLcBcBo6CwpPE0HOsKlITh/8vC0dvB9U9qFKijFpHLgJmOjALp2OlmUbcS6RZJAhZpBTnWQf255GcjkbIyYNNR5WaRDZ1mo/Yl330WEN1xSyV6MOXsk0ZzRlrzIgWKt0UEQL4bWgfI3DtrEN38DlRwGbx8sRdLK+gWseNRuYyR7e2NikBkLDpChTAljzGlDksXVXNspYYAx9Ta6JQ7nYwg52dnLV5kTek+WaB55O4FK7P0vVaA==
Received: from ( by ( with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_CBC_SHA384_P384) id; Tue, 28 Nov 2017 12:04:41 +0000
Received: from ( by ( with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_128_CBC_SHA256_P256) id via Frontend Transport; Tue, 28 Nov 2017 12:04:41 +0000
Received: from ([]) by ([]) with mapi id 15.20.0260.006; Tue, 28 Nov 2017 12:04:41 +0000
From: Ashesh Mishra <>
To: Ankur Dubey <>, "" <>
CC: Reshad Rahman <>, Sami Boutros <>
Subject: Re: Service Redundancy using BFD
Thread-Topic: Service Redundancy using BFD
Thread-Index: AQHTZ/M82vlye4FAgk+FmQYa86ul9aMpXmmA
Date: Tue, 28 Nov 2017 12:04:40 +0000
Message-ID: <>
References: <>
In-Reply-To: <>
Accept-Language: en-US
Content-Language: en-US
x-incomingtopheadermarker: OriginalChecksum:A658D0C8DD02F9FA2CC810377F0E59442F7CDD8E35F8F9C26CB24174D974ADED; UpperCasedChecksum:D3B3639DE798E7AFFE0E09D2548F6302F83C82D100EE93D71175F40476D031E7; SizeAsReceived:6980; Count:47
x-ms-exchange-messagesentrepresentingtype: 1
x-tmn: [dW1aup5ahUYHe+REtlWsqqaDmB6zbjnz]
x-ms-publictraffictype: Email
x-microsoft-exchange-diagnostics: 1; CO1NAM03HT089; 6:tYJPtVtz8YVoII4jV3anpdRcskvAHDwmoo6jG3+5xgYR/nVaSgxT7v+7/CRpA5tFDfyClJ8tti+vV5XUJ2PRcn1m78VGNAdNebYu6BHLzVu8NHZR5uRM4p4E9l03DpPS8NCodxkYsMTEyjUXCTi7XFgg/sQ4uVT2/NUEyyOy1p9fObLH2EvDct0r+UqlU9ICNkxeWmTiLu15KM6IkJHIaC0DW9CURqmMD57GeBnaJfRqmESCKYpS+n+tRo1vn28eLKQJMtjSUyt/KD0wSq0F2pT+uCbe1ejO9vIAOw9G3xgOpktoV5944feT4Ra/TE/Iu0FzhpabhK6gXfGK/KIMoaeVXcdeE57t+rQ4wGZijC4=; 5:ANf2PXRpHq5/s6RFVPfwa6WMzcAku3IF9arIKXtKBwdTem7Vs4yAmoshqPWtZrlFqLmmf039kCXPpEI0pko8WkB/qGEB+sEi2qzEK3zBEwrBDId8l0bXCDRDebukzvckUIshmqVVcWWL48jkrOOZhR1No1iZScxfPtqSzqy1Y7c=; 24:Tfu0clRfKyHisndELZcDY9Ne3coeTZgJHqlo2MYsuEoWrXSGZdN5X7iAgp1O5fmYcTWkA5PYDLnPuI5o6tOxkNcKxpxPMtcOJ3ZaKsSyQaU=; 7:P3BoovYF6iUJbrDYoAaOPWxIMVZq62bmNBGjtevAOd4XyR5D2t5r80aWOt6DXN+tzVr9hP1f06TpprtNQylYzEd1uZNGCLSJBNsNKqMQZZgB8j/1DyKyUjc1biufqTXHORepAoVIMpHVlV+VUtYldbZRxmWezikJeDVGn+Jb2c3hHJ/lH3jeHJIIwSDSyLBhHLcZRnzqvBqSMJwEeFU57CTHHAPBKRb7AJY5HDRyfk2fU9rMs2uTsM0+V7JC6MH4
x-incomingheadercount: 47
x-eopattributedmessage: 0
x-microsoft-antispam: UriScan:; BCL:0; PCL:0; RULEID:(201702061074)(5061506573)(5061507331)(1603103135)(2017031320274)(2017031324274)(2017031323274)(2017031322404)(1601125374)(1603101448)(1701031045); SRVR:CO1NAM03HT089;
x-ms-traffictypediagnostic: CO1NAM03HT089:
x-ms-office365-filtering-correlation-id: 7f678b1e-e3d5-4d74-4c95-08d53658349a
x-exchange-antispam-report-cfa-test: BCL:0; PCL:0; RULEID:(444000031); SRVR:CO1NAM03HT089; BCL:0; PCL:0; RULEID:(100000803101)(100110400095); SRVR:CO1NAM03HT089;
x-forefront-prvs: 0505147DDB
x-forefront-antispam-report: SFV:NSPM; SFS:(7070007)(98901004); DIR:OUT; SFP:1901; SCL:1; SRVR:CO1NAM03HT089;; FPR:; SPF:None; LANG:;
spamdiagnosticoutput: 1:99
spamdiagnosticmetadata: NSPM
Content-Type: multipart/alternative; boundary="_000_76804F3563BB46A0A74C9E41B2C213B4outlookcom_"
MIME-Version: 1.0
X-MS-Exchange-CrossTenant-Network-Message-Id: 7f678b1e-e3d5-4d74-4c95-08d53658349a
X-MS-Exchange-CrossTenant-originalarrivaltime: 28 Nov 2017 12:04:40.9248 (UTC)
X-MS-Exchange-CrossTenant-fromentityheader: Internet
X-MS-Exchange-CrossTenant-id: 84df9e7f-e9f6-40af-b435-aaaaaaaaaaaa
X-MS-Exchange-Transport-CrossTenantHeadersStamped: CO1NAM03HT089
Archived-At: <>
X-Mailman-Version: 2.1.22
Precedence: list
List-Id: "RTG Area: Bidirectional Forwarding Detection DT" <>
List-Unsubscribe: <>, <>
List-Archive: <>
List-Post: <>
List-Help: <>
List-Subscribe: <>, <>
X-List-Received-Date: Tue, 28 Nov 2017 12:04:45 -0000

Hi Ankur,

This is a good proposal to pursue within the BFD-wg.

Couple of comments:

-          BFD can only signal this diag code for the interface that it is monitoring (the IP next hop, MPLS LSP, etc.). You mention per-service (which I assume means per-service-per-interface) failover in the draft but it may be worthwhile defining behavior on per-service-type-per-interface as well.

-          There still needs to be a method for the primary and backup pairs (two BFD end-points on primary service and two on backup service) to communicate with each other (primary-to-primary and backup-to-backup) if the service is active or standby. This is useful in the scenario when the primary cannot communicate with backup nodes (it is a failure condition after all).

Again, at 10k ft, I like the idea of signaling active/standby using BFD.


From: Rtg-bfd <>; on behalf of Ankur Dubey <>;
Date: Monday, November 27, 2017 at 9:47 PM
To: ""; <>;
Cc: Reshad Rahman <>;, Sami Boutros <>;
Subject: Service Redundancy using BFD

Hi all,

Please review and provide comments for the following draft:

Summary of draft:

This draft proposes a new BFD diag code via which a node running a BFD session with another node, can inform the other node after a BFD session times out, that it didn’t go down and did live through the failure.

Such notification is useful for a set of nodes providing Active/Standby redundancy. When these nodes are running multiple L2/L3/L4-L7 services  in non-revertive mode of redundancy, the standby node taking over as active for non-revertive services after BFD times out needs to indicate in the BFD packet that it outlived the other failed old active node. The new diag code will be used for this purpose. When this diag code is set in the BFD packets, it will provide an indication to the failed old active node that it MUST NOT activate the non-revertive services when it comes up.

For providing a per service level failover, a node activating certain non-revertive services needs to indicate that it is Active ONLY for those non-revertive services. This can be done by using a unique bitmap where each bit position is uniquely identifying a service. This unique bitmap is configured on all nodes by a network controller. When there is at least one non-revertive service for which a node is not active AND it is active for at least 1 non-revertive service, this node will set bits identifying the active services in the bitmap and send it in the payload of the BFD packet.