RE: BFD stability follow-up from IETF-91

Santosh P K <santoshpk@juniper.net> Thu, 27 November 2014 14:59 UTC

Return-Path: <santoshpk@juniper.net>
X-Original-To: rtg-bfd@ietfa.amsl.com
Delivered-To: rtg-bfd@ietfa.amsl.com
Received: from localhost (ietfa.amsl.com [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 3F2521A0027 for <rtg-bfd@ietfa.amsl.com>; Thu, 27 Nov 2014 06:59:35 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.902
X-Spam-Level:
X-Spam-Status: No, score=-1.902 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, SPF_HELO_PASS=-0.001, SPF_PASS=-0.001] autolearn=ham
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id J9B9guDw6oy6 for <rtg-bfd@ietfa.amsl.com>; Thu, 27 Nov 2014 06:59:32 -0800 (PST)
Received: from na01-bn1-obe.outbound.protection.outlook.com (mail-bn1bon0798.outbound.protection.outlook.com [IPv6:2a01:111:f400:fc10::1:798]) (using TLSv1 with cipher ECDHE-RSA-AES256-SHA (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 5C54B1A0025 for <rtg-bfd@ietf.org>; Thu, 27 Nov 2014 06:59:32 -0800 (PST)
Received: from CO2PR0501MB823.namprd05.prod.outlook.com (10.141.244.145) by CO2PR0501MB823.namprd05.prod.outlook.com (10.141.244.145) with Microsoft SMTP Server (TLS) id 15.1.26.15; Thu, 27 Nov 2014 14:59:09 +0000
Received: from CO2PR0501MB823.namprd05.prod.outlook.com ([10.141.244.145]) by CO2PR0501MB823.namprd05.prod.outlook.com ([10.141.244.145]) with mapi id 15.01.0026.003; Thu, 27 Nov 2014 14:59:09 +0000
From: Santosh P K <santoshpk@juniper.net>
To: Manav Bhatia <manavbhatia@gmail.com>, Mach Chen <mach.chen@huawei.com>
Subject: RE: BFD stability follow-up from IETF-91
Thread-Topic: BFD stability follow-up from IETF-91
Thread-Index: AQHQCQ6qzE1oSJ0IQ0C/g/DAGkzw7JxyeiWAgAAfxYCAAAeWAIAAjSQAgAD7mQCAAE4wgIAAGjBQ
Date: Thu, 27 Nov 2014 14:59:08 +0000
Message-ID: <39e5a096e91046d9bfe58a64de42d9c5@CO2PR0501MB823.namprd05.prod.outlook.com>
References: <20141126001931.GJ20330@pfrc> <CAG1kdoghcA=xSaXmkr68qduH2t8oC=-ZazoQztj8JK12SazKsw@mail.gmail.com> <20141126005023981392.0c488535@sniff.de> <F73A3CB31E8BE34FA1BBE3C8F0CB2AE28B2D9A97@SZXEMA510-MBX.china.huawei.com> <20141126094242449051.c8abfe39@sniff.de> <F73A3CB31E8BE34FA1BBE3C8F0CB2AE28B2DB0BD@SZXEMA510-MBX.china.huawei.com> <CAG1kdogS6tYsF5BrOZvvHc-r-druOMroni_jkDXyVPrbbvh5OQ@mail.gmail.com>
In-Reply-To: <CAG1kdogS6tYsF5BrOZvvHc-r-druOMroni_jkDXyVPrbbvh5OQ@mail.gmail.com>
Accept-Language: en-US
Content-Language: en-US
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
x-originating-ip: [116.197.184.10]
x-microsoft-antispam: BCL:0;PCL:0;RULEID:;SRVR:CO2PR0501MB823;
x-exchange-antispam-report-test: UriScan:;
x-exchange-antispam-report-cfa-test: BCL:0;PCL:0;RULEID:;SRVR:CO2PR0501MB823;
x-forefront-prvs: 040866B734
x-forefront-antispam-report: SFV:NSPM; SFS:(10019020)(6009001)(377454003)(199003)(189002)(24454002)(13464003)(51704005)(86362001)(4396001)(31966008)(33646002)(122556002)(40100003)(20776003)(561944003)(64706001)(76576001)(66066001)(19580405001)(19580395003)(15202345003)(120916001)(107046002)(97736003)(105586002)(99396003)(101416001)(106356001)(106116001)(95666004)(99286002)(2656002)(87936001)(92566001)(15975445006)(77156002)(62966003)(1720100001)(108616004)(50986999)(21056001)(74316001)(54356999)(76176999)(46102003)(24736002); DIR:OUT; SFP:1102; SCL:1; SRVR:CO2PR0501MB823; H:CO2PR0501MB823.namprd05.prod.outlook.com; FPR:; SPF:None; MLV:sfv; PTR:InfoNoRecords; A:1; MX:1; LANG:en;
Content-Type: text/plain; charset="utf-8"
Content-Transfer-Encoding: base64
MIME-Version: 1.0
X-OriginatorOrg: juniper.net
Archived-At: http://mailarchive.ietf.org/arch/msg/rtg-bfd/jj3YenAV6gtKWr_E-PChY3Mv1Fg
Cc: "rtg-bfd@ietf.org" <rtg-bfd@ietf.org>
X-BeenThere: rtg-bfd@ietf.org
X-Mailman-Version: 2.1.15
Precedence: list
List-Id: "RTG Area: Bidirectional Forwarding Detection DT" <rtg-bfd.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/rtg-bfd>, <mailto:rtg-bfd-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/rtg-bfd/>
List-Post: <mailto:rtg-bfd@ietf.org>
List-Help: <mailto:rtg-bfd-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/rtg-bfd>, <mailto:rtg-bfd-request@ietf.org?subject=subscribe>
X-List-Received-Date: Thu, 27 Nov 2014 14:59:35 -0000

Manav and Mach,
    Thanks a lot for reference and explanation. So with this we don’t really need time stamping to be carried with in the BFD packet.  Yes, we can add this text in the draft. 

Thanks
Santos h P K 

> -----Original Message-----
> From: Rtg-bfd [mailto:rtg-bfd-bounces@ietf.org] On Behalf Of Manav Bhatia
> Sent: Thursday, November 27, 2014 6:53 PM
> To: Mach Chen
> Cc: rtg-bfd@ietf.org
> Subject: Re: BFD stability follow-up from IETF-91
> 
> Ok so this helps. Can the authors add this to the draft?
> 
> Cheers, Manav
> 
> On Thu, Nov 27, 2014 at 2:13 PM, Mach Chen <mach.chen@huawei.com>
> wrote:
> > Hi Marc,
> >
> >> -----Original Message-----
> >> From: Marc Binderberger [mailto:marc@sniff.de]
> >> Sent: Thursday, November 27, 2014 1:43 AM
> >> To: Mach Chen
> >> Cc: Manav Bhatia; rtg-bfd@ietf.org
> >> Subject: RE: BFD stability follow-up from IETF-91
> >>
> >> Hello Mach,
> >>
> >> > This triggers me think out there should be another solution for
> >> > getting the Tx and Rx timestamps without encoding the timestamps in
> >> > the BFD
> >> packets.
> >> > For example, the Tx and Rx systems could just save timestamps
> >> > locally or send them to a centralized entity and then use the
> >> > sequence numbers to correlate them for further analyzing.
> >>
> >> I remember some discussion on NVO3 about how many bits it takes ;-) -
> >> could you send the links/draft names you are working on to this list?
> >> May be useful for further discussions.
> >
> > Sure, here is the link(http://tools.ietf.org/html/draft-chen-ippm-coloring-
> based-ipfpm-framework-02) for the reference.
> >
> > But here I want to say is that since we have sequence number, we may not
> need the marking based solution. Suppose that someone want to monitor
> the delay of a BFD packet , just record and save the timestamp at the Tx side,
> which indexed by the sequence number. Similarly, do the same at the Rx
> side. Then based on the timestamps from both Tx and Rx, and using the
> sequence number to correlate the timestamps, it can also provide a way to
> monitor the delay of the BFD packet.
> >
> > That means, only if there is sequence number, even if without carrying the
> timestamp in the BFD packet, BFD packet delay can be measured.
> >
> > Best regards,
> > Mach
> >
> >>
> >>
> >> Thanks & Regards,
> >> Marc
> >>
> >>
> >>
> >> On Wed, 26 Nov 2014 09:17:32 +0000, Mach Chen wrote:
> >> > Hi Marc and Manav,
> >> >
> >> >> -----Original Message-----
> >> >> From: Rtg-bfd [mailto:rtg-bfd-bounces@ietf.org] On Behalf Of Marc
> >> >> Binderberger
> >> >> Sent: Wednesday, November 26, 2014 4:50 PM
> >> >> To: Manav Bhatia
> >> >> Cc: rtg-bfd@ietf.org
> >> >> Subject: Re: BFD stability follow-up from IETF-91
> >> >>
> >> >> Hello Manav,
> >> >>
> >> >>> I believe the work is important and addresses something thats
> >> >>> really required (spent too much time debugging why BFD flapped!).
> >> >>
> >> >> agree :-) we should keep the discussion alive.
> >> >>
> >> >>
> >> >>> side Time stamping would have helped in debugging whether the
> BFD
> >> >>> packet was sent late, or whether the packet was sent on time and
> >> >>> also arrived on time but was delayed when passing it up the BFD
> >> >>> stack/processor (lay in the RX buffer for tad too long)
> >> >>
> >> >> well, I can see a point in having the Tx timestamps in the packet
> >> >> mainly for the purpose of knowing "this" packet was okay/not okay
> >> >> on the Tx side and to correlate it with your local Rx measurement.
> >> >
> >> > Yes, this is one solution if people think BFD delay is needed. If
> >> > allow to have Tx timestamps to be carried in the packets, seems it
> >> > should be no problem to leave a seat for the Rx timestamps as well
> >> > :-). After all, with both Tx and Rx timestamp, it may simplify the
> >> implementation.
> >> >
> >> >>
> >> >> And even this point is less relevant with sequence numbers as this
> >> >> number allows the identification of packets and thus the
> >> >> correlation of information from the Tx and Rx system.
> >> >
> >> > Indeed, the sequence number helps a lot for the correlation between
> >> > the Tx and Rx system.
> >> >
> >> > This triggers me think out there should be another solution for
> >> > getting the Tx and Rx timestamps without encoding the timestamps in
> >> > the BFD
> >> packets.
> >> > For example, the Tx and Rx systems could just save timestamps
> >> > locally or send them to a centralized entity and then use the
> >> > sequence numbers to correlate them for further analyzing.
> >> >
> >> > Best regards,
> >> > Mach
> >> >
> >> >>
> >> >>
> >> >> Regards, Marc
> >> >>
> >> >>
> >> >>
> >> >>
> >> >>
> >> >>
> >> >> On Wed, 26 Nov 2014 12:26:41 +0530, Manav Bhatia wrote:
> >> >>> Hi Jeff,
> >> >>>
> >> >>> I vividly remember the original intent of the stability draft was
> >> >>> to help debug BFD failures -- to isolate the issue at the RX or
> >> >>> the TX side Time stamping would have helped in debugging whether
> >> >>> the BFD packet was sent late, or whether the packet was sent on
> >> >>> time and also arrived on time but was delayed when passing it up
> >> >>> the BFD stack/processor (lay in the RX buffer for tad too long),
> >> >>> etc. But then time stamping came with its own set of issues, and
> >> >>> was hence dropped from the original draft.
> >> >>>
> >> >>> Can the authors send a summary on the list on why time stamping
> >> >>> was dropped so that we're all clear on that one.
> >> >>>
> >> >>> The current proposal does help but is not complete.
> >> >>>
> >> >>> Assume that the RX end loses a BFD session and learns later that
> >> >>> it did eventually receive the missing BFD packets (based on the seq
> #).
> >> >>> How would it know which end was misbehaving? Was it a delay at
> >> >>> the TX side, or was it the RX that delayed passing the packets to
> >> >>> the BFD process(or). This is usually what we want to debug and i
> >> >>> want to understand how this draft with sequence numbers can
> >> >>> unequivocally tell me that.
> >> >>>
> >> >>> I believe the work is important and addresses something thats
> >> >>> really required (spent too much time debugging why BFD flapped!).
> >> >>> Clearly what would help is putting a small section that describes
> >> >>> how we can use the sequence numbers to debug what and where
> things went wrong.
> >> >>>
> >> >>> Cheers, Manav
> >> >>>
> >> >>>
> >> >>> On Wed, Nov 26, 2014 at 5:49 AM, Jeffrey Haas <jhaas@pfrc.org>
> wrote:
> >> >>>> draft-ashesh-bfd-stability-01 was presented again during IETF-91
> >> >>>> in Honolulu.  The slides can be viewed here:
> >> >>>>
> >> >>>> http://www.ietf.org/proceedings/91/slides/slides-91-bfd-4.pptx
> >> >>>>
> >> >>>> To attempt to simplify the presentation, the contentious portion
> >> >>>> of the timers were removed from the proposal, leaving only the
> >> >>>> sequence numbering for detecting loss of BFD async packets.
> >> >>>>
> >> >>>> When the room was polled to see whether the draft should be
> >> >>>> adopted as a WG item, the sense of the room was very quiet.  As
> >> >>>> promised, this is to inquire for support for this draft on the
> >> >>>> WG mailing list to make sure the whole group has a voice.
> >> >>>>
> >> >>>> It should be noted that post-meeting discussion on the fate of
> >> >>>> this draft noted that BFD authentication code points are
> >> >>>> plentiful and are available with expert review.  Should the
> >> >>>> draft authors wish to continue this work as Experimental, that is an
> option.
> >> >>>>
> >> >>>> -- Jeff
> >> >>>>
> >> >>>
> >> >