RE: BFD stability follow-up from IETF-91

Gregory Mirsky <gregory.mirsky@ericsson.com> Sat, 29 November 2014 03:18 UTC

Return-Path: <gregory.mirsky@ericsson.com>
X-Original-To: rtg-bfd@ietfa.amsl.com
Delivered-To: rtg-bfd@ietfa.amsl.com
Received: from localhost (ietfa.amsl.com [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 013141A0395 for <rtg-bfd@ietfa.amsl.com>; Fri, 28 Nov 2014 19:18:00 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -104.201
X-Spam-Level:
X-Spam-Status: No, score=-104.201 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, RCVD_IN_DNSWL_MED=-2.3, SPF_PASS=-0.001, USER_IN_WHITELIST=-100] autolearn=ham
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id ZTxRWu45uuGk for <rtg-bfd@ietfa.amsl.com>; Fri, 28 Nov 2014 19:17:58 -0800 (PST)
Received: from usevmg21.ericsson.net (usevmg21.ericsson.net [198.24.6.65]) (using TLSv1 with cipher DHE-RSA-AES256-SHA (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id D84051A0404 for <rtg-bfd@ietf.org>; Fri, 28 Nov 2014 19:17:57 -0800 (PST)
X-AuditID: c6180641-f79916d00000623a-e0-5478dee9e300
Received: from EUSAAHC007.ericsson.se (Unknown_Domain [147.117.188.93]) by usevmg21.ericsson.net (Symantec Mail Security) with SMTP id 49.34.25146.9EED8745; Fri, 28 Nov 2014 21:45:30 +0100 (CET)
Received: from EUSAAMB103.ericsson.se ([147.117.188.120]) by EUSAAHC007.ericsson.se ([147.117.188.93]) with mapi id 14.03.0195.001; Fri, 28 Nov 2014 22:17:55 -0500
From: Gregory Mirsky <gregory.mirsky@ericsson.com>
To: Jeffrey Haas <jhaas@pfrc.org>
Subject: RE: BFD stability follow-up from IETF-91
Thread-Topic: BFD stability follow-up from IETF-91
Thread-Index: AQHQCQ6qQ6amXjPuFEmbPIOgLl4p2pxyzfeAgAAfxYCAAAeVAIAAjSUAgAD7mQCAAEe3gIABbj6AgAARfwCAAAtmAIAAF9PggABjhoCAACJJkA==
Date: Sat, 29 Nov 2014 03:17:54 +0000
Message-ID: <7347100B5761DC41A166AC17F22DF1121B89C865@eusaamb103.ericsson.se>
References: <007701d00af9$28719050$7954b0f0$@chinamobile.com> <D09E5FAC.27C51%mmudigon@cisco.com> <007e01d00b07$9c02cc10$d4086430$@chinamobile.com> <7347100B5761DC41A166AC17F22DF1121B8998E7@eusaamb103.ericsson.se> <20141128195536.GG1274@pfrc>
In-Reply-To: <20141128195536.GG1274@pfrc>
Accept-Language: en-US
Content-Language: en-US
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
x-originating-ip: [147.117.188.11]
Content-Type: text/plain; charset="us-ascii"
Content-Transfer-Encoding: quoted-printable
MIME-Version: 1.0
X-Brightmail-Tracker: H4sIAAAAAAAAA+NgFvrGLMWRmVeSWpSXmKPExsUyuXRPrO6rexUhBs9Oc1isb3jBbrH/4FtW iyOvjjFbfP6zjdGBxWPehYVsHlN+b2T1WLLkJ5PH5d6trAEsUVw2Kak5mWWpRfp2CVwZvz7c ZClYJ1VxtK+LrYHxkGgXIyeHhICJxOeG38wQtpjEhXvr2boYuTiEBI4wSnxv3sYK4SxnlPjV t4ARpIpNwEjixcYedhBbREBRYv7/TjYQm1mgg1Gi+2opiC0sYCixqnsxVI2RxLEZc6HsOol3 T3+wgtgsAqoSJ28eApvJK+Ar8evEFEaIZb8YJfrOfgQbyimgKTFhbwvYeYxA530/tYYJYpm4 xK0n85kgzhaQWLLnPNQLohIvH/9jhbCVJD7+ns8OUa8jsWD3J6hDtSWWLXzNDLFYUOLkzCcs ExjFZiEZOwtJyywkLbOQtCxgZFnFyFFanFqWm25kuIkRGFHHJNgcdzAu+GR5iFGAg1GJh/fD mvIQIdbEsuLK3EOM0hwsSuK8mtXzgoUE0hNLUrNTUwtSi+KLSnNSiw8xMnFwSjUwVm3dszvp 7ged0y5z6p7vvxJ5M7k15PpMuaYsJ/5gvySXbQ8mvMlMP6J9WbPwZqbxzliHL9EmV4Nkm5f9 /yhnE3y1pGOxA0fwygSz3Jv5MZfNI75+k9KVbw+fNd2HPz2G4ynPY0GmlNvSAg4nPc4bxbl5 VLB578yt5Hx3/5BS9cV+YaXmh2uUWIozEg21mIuKEwFErn/LiQIAAA==
Archived-At: http://mailarchive.ietf.org/arch/msg/rtg-bfd/GLsseB9xZZTg0aJw-z75RqS34n8
Cc: "rtg-bfd@ietf.org" <rtg-bfd@ietf.org>
X-BeenThere: rtg-bfd@ietf.org
X-Mailman-Version: 2.1.15
Precedence: list
List-Id: "RTG Area: Bidirectional Forwarding Detection DT" <rtg-bfd.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/rtg-bfd>, <mailto:rtg-bfd-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/rtg-bfd/>
List-Post: <mailto:rtg-bfd@ietf.org>
List-Help: <mailto:rtg-bfd-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/rtg-bfd>, <mailto:rtg-bfd-request@ietf.org?subject=subscribe>
X-List-Received-Date: Sat, 29 Nov 2014 03:18:00 -0000

Hi Jeff,
I absolutely agree with continuing discussion and sharing experiences from real-life deployments and interoperability of BFD in IP and MPLS networks, single-, multi-hop and over LAG constituents. From that we're learning and that helps us make our implementations more robust, reliable, and interoperable. And certainly, where it benefits the community informational RFCs, like BFD intervals or RFC 7325 MPLS Forwarding Compliance and Performance Requirements, to document the cases and our recommendations. But I'm somewhat concerned with anything that targets standard status, even as optional functionality, even though it is not proven that the problem is in the standard, not in an implementation.

	Regards,
		Greg

-----Original Message-----
From: Jeffrey Haas [mailto:jhaas@pfrc.org] 
Sent: Friday, November 28, 2014 11:56 AM
To: Gregory Mirsky
Cc: Fan, Peng; 'MALLIK MUDIGONDA (mmudigon)'; rtg-bfd@ietf.org
Subject: Re: BFD stability follow-up from IETF-91

[Speaking as an individual contributor...]

On Fri, Nov 28, 2014 at 07:36:30PM +0000, Gregory Mirsky wrote:
> this is very interesting scenario. I think that if BFD experiences ~30% packet loss, then highly likely so are affected other applications. Then it is not just BFD issue but condition that should be detected  by performance measurement method, whether active or passive packet loss measurement.
> I'm convinced that overloading BFD with performance measurement provisions is counter-productive and is inappropriate.

My opinion is about halfway between your opinion, Greg.

I agree that we wish to be very cautious about overloading BFD with components that are in other OAM mechanisms.  Among my desire for such caution is that IEEE has expressed interest in not having us step on their technologies and this would create paperwork for the chairs. :-)

But where I think we diverge slightly comes from experience in helping the working group and vendors wend their way through implementing BFD for LAG.
During that discussion, it was very clear that depending on the vendor, the architecture and sometimes specific chipsets that "BFD" lived in very different pieces of underlying architecture.

What this means is that trying to do very tight timing things will run into practical issues in having to figure out what the perspective of the timings are.  Is it some underlying L2? L3? Something between?  At what point do you realize you are measuring contradictory things?

But similarly, when trying to measure and account for loss, having some data is useful simply because it helps you determine that the component that is *responsible for BFD* may be experiencing loss.  Depending on your architecture, this may be the underlying layer-1, layer-2 or something else.
In such cases, the lower-layer OAM is better to troubleshoot.  But in cases where your lower-layer OAM doesn't indicate the loss, you still need to understand that there is BFD-level loss.

I encourage participants in this discussion to remember this detail: We are trying to help measure BFD loss.  Trying to read too much detail into what that means outside of BFD may lead you to erroneous conclusions depending on a given implementation.

Thus, consider what is best for BFD.

-- Jeff