Re: TWAMP analysis for assisting BFD debuggin (was Re: BFD stability follow-up from IETF-91)

Jeffrey Haas <jhaas@pfrc.org> Mon, 22 December 2014 15:15 UTC

Return-Path: <jhaas@slice.pfrc.org>
X-Original-To: rtg-bfd@ietfa.amsl.com
Delivered-To: rtg-bfd@ietfa.amsl.com
Received: from localhost (ietfa.amsl.com [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 25FAB1A19EC for <rtg-bfd@ietfa.amsl.com>; Mon, 22 Dec 2014 07:15:27 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.578
X-Spam-Level:
X-Spam-Status: No, score=-1.578 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, IP_NOT_FRIENDLY=0.334, SPF_HELO_PASS=-0.001, SPF_PASS=-0.001, T_RP_MATCHES_RCVD=-0.01] autolearn=no
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id PZLVIAwGbqng for <rtg-bfd@ietfa.amsl.com>; Mon, 22 Dec 2014 07:15:26 -0800 (PST)
Received: from slice.pfrc.org (slice.pfrc.org [67.207.130.108]) by ietfa.amsl.com (Postfix) with ESMTP id 323231A1A04 for <rtg-bfd@ietf.org>; Mon, 22 Dec 2014 07:15:25 -0800 (PST)
Received: by slice.pfrc.org (Postfix, from userid 1001) id A8F26C265; Mon, 22 Dec 2014 10:15:24 -0500 (EST)
Date: Mon, 22 Dec 2014 10:15:24 -0500
From: Jeffrey Haas <jhaas@pfrc.org>
To: Gregory Mirsky <gregory.mirsky@ericsson.com>
Subject: Re: TWAMP analysis for assisting BFD debuggin (was Re: BFD stability follow-up from IETF-91)
Message-ID: <20141222151524.GP16279@pfrc>
References: <09CD6B2F-4DCC-429F-848B-223C72A0F171@gmail.com> <7347100B5761DC41A166AC17F22DF1121B8AAA24@eusaamb103.ericsson.se> <CO2PR0501MB8231A4913DEB31323847CA8B3780@CO2PR0501MB823.namprd05.prod.outlook.com> <7347100B5761DC41A166AC17F22DF1121B8AAC0D@eusaamb103.ericsson.se> <CAG1kdoiquWYaAz5ti14VrmiqXmph-SpjgYs=m8AuQGdKGo2xXQ@mail.gmail.com> <7347100B5761DC41A166AC17F22DF1121B8AACDB@eusaamb103.ericsson.se> <20141219210222.GJ16279@pfrc> <7347100B5761DC41A166AC17F22DF1121B8C5EA0@eusaamb103.ericsson.se> <F81ED3EF-5818-45D8-9F5F-7CE8910B6B43@gmail.com> <7347100B5761DC41A166AC17F22DF1121B8C6959@eusaamb103.ericsson.se>
MIME-Version: 1.0
Content-Type: text/plain; charset="us-ascii"
Content-Disposition: inline
In-Reply-To: <7347100B5761DC41A166AC17F22DF1121B8C6959@eusaamb103.ericsson.se>
User-Agent: Mutt/1.5.21 (2010-09-15)
Archived-At: http://mailarchive.ietf.org/arch/msg/rtg-bfd/OB6H5D38QOHB6vJqgDdDd2sb85Y
Cc: "rtg-bfd@ietf.org" <rtg-bfd@ietf.org>
X-BeenThere: rtg-bfd@ietf.org
X-Mailman-Version: 2.1.15
Precedence: list
List-Id: "RTG Area: Bidirectional Forwarding Detection DT" <rtg-bfd.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/rtg-bfd>, <mailto:rtg-bfd-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/rtg-bfd/>
List-Post: <mailto:rtg-bfd@ietf.org>
List-Help: <mailto:rtg-bfd-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/rtg-bfd>, <mailto:rtg-bfd-request@ietf.org?subject=subscribe>
X-List-Received-Date: Mon, 22 Dec 2014 15:15:27 -0000

Greg,

On Mon, Dec 22, 2014 at 06:46:26AM +0000, Gregory Mirsky wrote:
> if you assume that BFD flap is more likely because of implementation or
> overloading system rather than because of network condition, then I???d
> like to understand how intermittent congestion in a source end-node can be
> differentiated from intermittent congestion in the network without PM.

Congestion is one of the scenarios covered by tracking things at the BFD
level.  Another is simply whether packets are being generated or not or even
the simple timings within protocol packet generation.

While very lightweight, BFD still has implementation scaling choices that
can lead to unusual behaviors.  Packets are inherently jittered within the
context of the protocol, but to some extent it's simply a practical
consideration that maintaining tight timings is difficult.  Similarly, if a
packet simply is not sent because an internal timer ran late and it decided
to send the next packet instead, that's a consideration.

Being able to tie BFD issues to either issues within implementation vs.
issues happening on the wire is useful, but the tool sets will overlap in
some cases.

> I???m not questioning value of improved debugability of an implementation,
> BFD in this case, but I???d prefer it not to have adverse impact on the
> protocol, on its ability to perform its main task most efficiently.
> Debugging, I believe, been and still is, proprietary, implementation
> specific. Problems you???ve described, I believe, unlikely to be
> experienced in HW based BFD implementation. 

Unlikely to be experienced in HW?  I suspect you'll get a much different set
of feedback off-list where people think they can speak a bit more bluntly
than on-list.

> Thus, what benefit, changes
> you propose, would bring for such implementations?

Even in a situation where a given implementation was perfect, we care about
any pair of implementations.  When services flap due to BFD, you now have
two devices to troubleshoot and blame.

-- Jeff