TWAMP analysis for assisting BFD debuggin (was Re: BFD stability follow-up from IETF-91)

Jeffrey Haas <jhaas@pfrc.org> Fri, 19 December 2014 21:02 UTC

Return-Path: <jhaas@slice.pfrc.org>
X-Original-To: rtg-bfd@ietfa.amsl.com
Delivered-To: rtg-bfd@ietfa.amsl.com
Received: from localhost (ietfa.amsl.com [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 8F28B1A7023 for <rtg-bfd@ietfa.amsl.com>; Fri, 19 Dec 2014 13:02:23 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.578
X-Spam-Level:
X-Spam-Status: No, score=-1.578 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, IP_NOT_FRIENDLY=0.334, SPF_HELO_PASS=-0.001, SPF_PASS=-0.001, T_RP_MATCHES_RCVD=-0.01] autolearn=no
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id yrQynUtv2L_y for <rtg-bfd@ietfa.amsl.com>; Fri, 19 Dec 2014 13:02:22 -0800 (PST)
Received: from slice.pfrc.org (slice.pfrc.org [67.207.130.108]) by ietfa.amsl.com (Postfix) with ESMTP id B854A1A7011 for <rtg-bfd@ietf.org>; Fri, 19 Dec 2014 13:02:22 -0800 (PST)
Received: by slice.pfrc.org (Postfix, from userid 1001) id 7043DC19C; Fri, 19 Dec 2014 16:02:22 -0500 (EST)
Date: Fri, 19 Dec 2014 16:02:22 -0500
From: Jeffrey Haas <jhaas@pfrc.org>
To: Gregory Mirsky <gregory.mirsky@ericsson.com>
Subject: TWAMP analysis for assisting BFD debuggin (was Re: BFD stability follow-up from IETF-91)
Message-ID: <20141219210222.GJ16279@pfrc>
References: <00a001d00d64$7735ce50$65a16af0$@chinamobile.com> <7347100B5761DC41A166AC17F22DF1121B8A87E6@eusaamb103.ericsson.se> <730769BB-D021-4E22-878A-2C289822A156@gmail.com> <7347100B5761DC41A166AC17F22DF1121B8AA754@eusaamb103.ericsson.se> <09CD6B2F-4DCC-429F-848B-223C72A0F171@gmail.com> <7347100B5761DC41A166AC17F22DF1121B8AAA24@eusaamb103.ericsson.se> <CO2PR0501MB8231A4913DEB31323847CA8B3780@CO2PR0501MB823.namprd05.prod.outlook.com> <7347100B5761DC41A166AC17F22DF1121B8AAC0D@eusaamb103.ericsson.se> <CAG1kdoiquWYaAz5ti14VrmiqXmph-SpjgYs=m8AuQGdKGo2xXQ@mail.gmail.com> <7347100B5761DC41A166AC17F22DF1121B8AACDB@eusaamb103.ericsson.se>
MIME-Version: 1.0
Content-Type: text/plain; charset="us-ascii"
Content-Disposition: inline
In-Reply-To: <7347100B5761DC41A166AC17F22DF1121B8AACDB@eusaamb103.ericsson.se>
User-Agent: Mutt/1.5.21 (2010-09-15)
Archived-At: http://mailarchive.ietf.org/arch/msg/rtg-bfd/LwBCEXuJG9RztoG0XFmVOXLpwZM
Cc: "rtg-bfd@ietf.org" <rtg-bfd@ietf.org>
X-BeenThere: rtg-bfd@ietf.org
X-Mailman-Version: 2.1.15
Precedence: list
List-Id: "RTG Area: Bidirectional Forwarding Detection DT" <rtg-bfd.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/rtg-bfd>, <mailto:rtg-bfd-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/rtg-bfd/>
List-Post: <mailto:rtg-bfd@ietf.org>
List-Help: <mailto:rtg-bfd-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/rtg-bfd>, <mailto:rtg-bfd-request@ietf.org?subject=subscribe>
X-List-Received-Date: Fri, 19 Dec 2014 21:02:23 -0000

[I'm running behind in discussions as usual]

On Thu, Dec 04, 2014 at 04:33:17PM +0000, Gregory Mirsky wrote:
> Hi Manav,
> I hope you don???t expect me to give a lecture on how to design and implement debugable implementation using logging and packet tracing.

For what it's worth, I always find such "drive-by" comments about "well,
there already exists something that does <foo>, it's here - go do your
homework!" to be a bit frustrating myself. :-)  

Not being personally familiar with TWAMP, I spend a few minutes digging
through the spec for TWAMP and OWAMP for some details about the control and
test traffic.  It has some properties that are problematic for the pieces of
the ecosystem covered by various portions of BFD deployment:

TCP control channel.  The implication is not only bidirectional
reachability, but a control IP stack that may not involve the nodes being
tested.  (Nodes in this case potentially being anything from host systems
all the way down to line card components.)

The data channel is UDP, which is good.  The problem though is there's no
guarantee it'll cover the layers of the hardware stack covered by the BFD
sessions.  As one example, BFD on LAG, we not only may not have a full IP
stack running at the point it's coming up, there's a chance that the
receiver isn't really much of an IP speaker.  It may be effectively using
the IP+UDP framing as dumb framing rather than protocol in order to
bootstrap the session.  IP may not come up until constituent LAG elements
have been put into service.

There's also the matter that to get TWAMP running at the appropriate layer,
it would be similarly necessary to embed it in the same general paths that
BFD covers.

Thus, at first very coarse analysis, TWAMP neither seems to share enough of
the forwarding fate on a guaranteed basis as we'd want to supplement BFD
debugging.  The control channel also seems a bit heavy-weight, but it's
potentially no worse than architecture of some LACP implementations I'm
aware of.

Is there something missing or incorrect in the above thinking?

-- Jeff