RE: BFD stability follow-up from IETF-91

Gregory Mirsky <gregory.mirsky@ericsson.com> Wed, 26 November 2014 18:20 UTC

Return-Path: <gregory.mirsky@ericsson.com>
X-Original-To: rtg-bfd@ietfa.amsl.com
Delivered-To: rtg-bfd@ietfa.amsl.com
Received: from localhost (ietfa.amsl.com [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 227B71A1A72 for <rtg-bfd@ietfa.amsl.com>; Wed, 26 Nov 2014 10:20:01 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -104.2
X-Spam-Level:
X-Spam-Status: No, score=-104.2 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, HTML_MESSAGE=0.001, RCVD_IN_DNSWL_MED=-2.3, SPF_PASS=-0.001, USER_IN_WHITELIST=-100] autolearn=ham
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id y1VAqb-Zo3cd for <rtg-bfd@ietfa.amsl.com>; Wed, 26 Nov 2014 10:19:58 -0800 (PST)
Received: from usevmg20.ericsson.net (usevmg20.ericsson.net [198.24.6.45]) (using TLSv1 with cipher DHE-RSA-AES256-SHA (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id DEC111A1A6B for <rtg-bfd@ietf.org>; Wed, 26 Nov 2014 10:19:57 -0800 (PST)
X-AuditID: c618062d-f79206d0000014d2-79-5475c050f4a0
Received: from EUSAAHC006.ericsson.se (Unknown_Domain [147.117.188.90]) by usevmg20.ericsson.net (Symantec Mail Security) with SMTP id 76.31.05330.050C5745; Wed, 26 Nov 2014 12:58:09 +0100 (CET)
Received: from EUSAAMB103.ericsson.se ([147.117.188.120]) by EUSAAHC006.ericsson.se ([147.117.188.90]) with mapi id 14.03.0195.001; Wed, 26 Nov 2014 13:19:47 -0500
From: Gregory Mirsky <gregory.mirsky@ericsson.com>
To: Mach Chen <mach.chen@huawei.com>, Marc Binderberger <marc@sniff.de>, Manav Bhatia <manavbhatia@gmail.com>
Subject: RE: BFD stability follow-up from IETF-91
Thread-Topic: BFD stability follow-up from IETF-91
Thread-Index: AQHQCQ6qQ6amXjPuFEmbPIOgLl4p2pxyzfeAgAAfxYCAAAeVAIAAQXYg
Date: Wed, 26 Nov 2014 18:19:46 +0000
Message-ID: <7347100B5761DC41A166AC17F22DF1121B898C13@eusaamb103.ericsson.se>
References: <20141126001931.GJ20330@pfrc> <CAG1kdoghcA=xSaXmkr68qduH2t8oC=-ZazoQztj8JK12SazKsw@mail.gmail.com> <20141126005023981392.0c488535@sniff.de> <F73A3CB31E8BE34FA1BBE3C8F0CB2AE28B2D9A97@SZXEMA510-MBX.china.huawei.com>
In-Reply-To: <F73A3CB31E8BE34FA1BBE3C8F0CB2AE28B2D9A97@SZXEMA510-MBX.china.huawei.com>
Accept-Language: en-US
Content-Language: en-US
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
x-originating-ip: [147.117.188.11]
Content-Type: multipart/alternative; boundary="_000_7347100B5761DC41A166AC17F22DF1121B898C13eusaamb103erics_"
MIME-Version: 1.0
X-Brightmail-Tracker: H4sIAAAAAAAAA+NgFuphkeLIzCtJLcpLzFFi42KZXLonSjfwQGmIwb6rbBYX1gpbXJ7Uxm4x +8p/ZovPf7YxOrB47Jx1l92j5chbVo8lS34yebSu7mYJYInisklJzcksSy3St0vgylh+7wRz wbSiise3WBoYd8Z2MXJySAiYSLxb38QOYYtJXLi3nq2LkYtDSOAIo8Sfq9tYIJzljBJXVpxj A6liEzCSeLGxB6xDRKBQ4suONjCbWUBTounEZzBbWMBQYlX3YqgaI4ljM+ZC2W4S85a+YAWx WQRUJa7MPgUW5xXwlVg3dzY7xLLnjBI3Tk4FK+IUCJOYe3ItC4jNCHTe91NrmCCWiUvcejKf CeJsAYkle84zQ9iiEi8f/2OFsJUkPv6eD3VcvsSBSTeglglKnJz5hGUCo+gsJKNmISmbhaQM Iq4jsWD3JzYIW1ti2cLXzDD2mQOPmZDFFzCyr2LkKC1OLctNNzLYxAiMvmMSbLo7GPe8tDzE KMDBqMTDu+FiSYgQa2JZcWXuIUZpDhYlcd5ZtfOChQTSE0tSs1NTC1KL4otKc1KLDzEycXBK NTAueHHhkseb/ufayTwvCrjnxdusj00oOzWpaNbVHfkfkq+HHWWs/PRh/qTXvUmHKp+t6Z79 zPUuq1nON8Z+5ZqTW41PLlV4tii+vcFtX2XeCvXq5lO6ZXMdWHb+2vlgH+Pr1bd1CzUMI6zV 9N4z7F7L3OAiOHu5+Y+EMz8mtM1rEIj+KrB8s1u3EktxRqKhFnNRcSIAriHf9Z8CAAA=
Archived-At: http://mailarchive.ietf.org/arch/msg/rtg-bfd/z48idxjW9zQBHJAnX8p1PEafamU
Cc: "rtg-bfd@ietf.org" <rtg-bfd@ietf.org>
X-BeenThere: rtg-bfd@ietf.org
X-Mailman-Version: 2.1.15
Precedence: list
List-Id: "RTG Area: Bidirectional Forwarding Detection DT" <rtg-bfd.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/rtg-bfd>, <mailto:rtg-bfd-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/rtg-bfd/>
List-Post: <mailto:rtg-bfd@ietf.org>
List-Help: <mailto:rtg-bfd-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/rtg-bfd>, <mailto:rtg-bfd-request@ietf.org?subject=subscribe>
X-List-Received-Date: Wed, 26 Nov 2014 18:20:01 -0000

Dear All,
are we turning BFD into 1588? It does seem like that by the extent of our discussion of timestamping issue.
The PTP has two methods of measuring Resident Time, time a packet spent in a node (I feel that is what we're trying to measure by collecting timestamps):
*       one-step - resident time measured on a packet and reported in the same packet;
*       two-step - more elaborate as it requires source port identification, sequence number and proper type of a packet. Residence time measured on one packet but reported in the follow-up of appropriate type.
Does that sounds like what we're discussing?

        Regards,
                Greg

-----Original Message-----
From: Rtg-bfd [mailto:rtg-bfd-bounces@ietf.org] On Behalf Of Mach Chen
Sent: Wednesday, November 26, 2014 1:18 AM
To: Marc Binderberger; Manav Bhatia
Cc: rtg-bfd@ietf.org
Subject: RE: BFD stability follow-up from IETF-91

Hi Marc and Manav,

> -----Original Message-----
> From: Rtg-bfd [mailto:rtg-bfd-bounces@ietf.org] On Behalf Of Marc
> Binderberger
> Sent: Wednesday, November 26, 2014 4:50 PM
> To: Manav Bhatia
> Cc: rtg-bfd@ietf.org<mailto:rtg-bfd@ietf.org>
> Subject: Re: BFD stability follow-up from IETF-91
>
> Hello Manav,
>
> > I believe the work is important and addresses something thats really
> > required (spent too much time debugging why BFD flapped!).
>
> agree :-) we should keep the discussion alive.
>
>
> > side Time stamping would have helped in debugging whether the BFD
> > packet was sent late, or whether the packet was sent on time and
> > also arrived on time but was delayed when passing it up the BFD
> > stack/processor (lay in the RX buffer for tad too long)
>
> well, I can see a point in having the Tx timestamps in the packet
> mainly for the purpose of knowing "this" packet was okay/not okay on
> the Tx side and to correlate it with your local Rx measurement.

Yes, this is one solution if people think BFD delay is needed. If allow to have Tx timestamps to be carried in the packets, seems it should be no problem to leave a seat for the Rx timestamps as well :-). After all, with both Tx and Rx timestamp, it may simplify the implementation.

>
> And even this point is less relevant with sequence numbers as this
> number allows the identification of packets and thus the correlation
> of information from the Tx and Rx system.

Indeed, the sequence number helps a lot for the correlation between the Tx and Rx system.

This triggers me think out there should be another solution for getting the Tx and Rx timestamps without encoding the timestamps in the BFD packets. For example, the Tx and Rx systems could just save timestamps locally or send them to a centralized entity and then use the sequence numbers to correlate them for further analyzing.

Best regards,
Mach

>
>
> Regards, Marc
>
>
>
>
>
>
> On Wed, 26 Nov 2014 12:26:41 +0530, Manav Bhatia wrote:
> > Hi Jeff,
> >
> > I vividly remember the original intent of the stability draft was to
> > help debug BFD failures -- to isolate the issue at the RX or the TX
> > side Time stamping would have helped in debugging whether the BFD
> > packet was sent late, or whether the packet was sent on time and
> > also arrived on time but was delayed when passing it up the BFD
> > stack/processor (lay in the RX buffer for tad too long), etc. But
> > then time stamping came with its own set of issues, and was hence
> > dropped from the original draft.
> >
> > Can the authors send a summary on the list on why time stamping was
> > dropped so that we're all clear on that one.
> >
> > The current proposal does help but is not complete.
> >
> > Assume that the RX end loses a BFD session and learns later that it
> > did eventually receive the missing BFD packets (based on the seq #).
> > How would it know which end was misbehaving? Was it a delay at the
> > TX side, or was it the RX that delayed passing the packets to the
> > BFD process(or). This is usually what we want to debug and i want to
> > understand how this draft with sequence numbers can unequivocally
> > tell me that.
> >
> > I believe the work is important and addresses something thats really
> > required (spent too much time debugging why BFD flapped!). Clearly
> > what would help is putting a small section that describes how we can
> > use the sequence numbers to debug what and where things went wrong.
> >
> > Cheers, Manav
> >
> >
> > On Wed, Nov 26, 2014 at 5:49 AM, Jeffrey Haas <jhaas@pfrc.org<mailto:jhaas@pfrc.org>> wrote:
> >> draft-ashesh-bfd-stability-01 was presented again during IETF-91 in
> >> Honolulu.  The slides can be viewed here:
> >>
> >> http://www.ietf.org/proceedings/91/slides/slides-91-bfd-4.pptx
> >>
> >> To attempt to simplify the presentation, the contentious portion of
> >> the timers were removed from the proposal, leaving only the
> >> sequence numbering for detecting loss of BFD async packets.
> >>
> >> When the room was polled to see whether the draft should be adopted
> >> as a WG item, the sense of the room was very quiet.  As promised,
> >> this is to inquire for support for this draft on the WG mailing
> >> list to make sure the whole group has a voice.
> >>
> >> It should be noted that post-meeting discussion on the fate of this
> >> draft noted that BFD authentication code points are plentiful and
> >> are available with expert review.  Should the draft authors wish to
> >> continue this work as Experimental, that is an option.
> >>
> >> -- Jeff
> >>
> >