Re: BFD stability follow-up from IETF-91

Marc Binderberger <marc@sniff.de> Mon, 08 December 2014 18:14 UTC

Return-Path: <marc@sniff.de>
X-Original-To: rtg-bfd@ietfa.amsl.com
Delivered-To: rtg-bfd@ietfa.amsl.com
Received: from localhost (ietfa.amsl.com [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 5A7561A87B2 for <rtg-bfd@ietfa.amsl.com>; Mon, 8 Dec 2014 10:14:59 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.56
X-Spam-Level:
X-Spam-Status: No, score=-1.56 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, HELO_EQ_DE=0.35, T_RP_MATCHES_RCVD=-0.01] autolearn=no
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 4U75xPngvx9q for <rtg-bfd@ietfa.amsl.com>; Mon, 8 Dec 2014 10:14:57 -0800 (PST)
Received: from door.sniff.de (door.sniff.de [IPv6:2001:6f8:94f:1::1]) by ietfa.amsl.com (Postfix) with ESMTP id 1872D1ACD74 for <rtg-bfd@ietf.org>; Mon, 8 Dec 2014 10:14:57 -0800 (PST)
Received: from [IPv6:::1] (localhost.sniff.de [127.0.0.1]) by door.sniff.de (Postfix) with ESMTP id D6FEB2AA0F; Mon, 8 Dec 2014 18:14:54 +0000 (GMT)
Date: Mon, 08 Dec 2014 10:18:40 -0800
From: Marc Binderberger <marc@sniff.de>
To: Manav Bhatia <manavbhatia@gmail.com>
Message-ID: <20141208101840770153.4d0c983f@sniff.de>
In-Reply-To: <CAG1kdog4Cu+vhKZq1UdESOHe5kMEWG9=TuVf2KDzEyteQp_AaA@mail.gmail.com>
References: <CO2PR0501MB823C222B7D62779F4DF58CDB3780@CO2PR0501MB823.namprd05.prod.outlook.com> <D0A647C1.28843%mmudigon@cisco.com> <CO2PR0501MB8234A1BDDFD008EE12C847AB3780@CO2PR0501MB823.namprd05.prod.outlook.com> <CECE764681BE964CBE1DFF78F3CDD3943F5AE38D@xmb-aln-x01.cisco.com> <CAG1kdogkUr2YyodeUPWOqea+2jqOkmdYnPywVHCw8j1+=9eM6A@mail.gmail.com> <CECE764681BE964CBE1DFF78F3CDD3943F5AE4AE@xmb-aln-x01.cisco.com> <CAG1kdoh5DwdKrJWK_aSvo4KQ6Xu5ZaTObe_PLhV66YZ4yQozmg@mail.gmail.com> <20141207193610211284.1f098741@sniff.de> <CAG1kdojxdDY0qXPYnZ5K67rizVaD7gHte1MdRA2q==K6SoRVsw@mail.gmail.com> <20141207212102448099.e2e4012a@sniff.de> <CAG1kdogRJ1PU+uaGAZkRP=QhrGv-KcY+8yAz23sobbykL2pLXQ@mail.gmail.com> <20141207234958554921.33ce21f7@sniff.de> <CAG1kdog4Cu+vhKZq1UdESOHe5kMEWG9=TuVf2KDzEyteQp_AaA@mail.gmail.com>
Subject: Re: BFD stability follow-up from IETF-91
MIME-Version: 1.0
Content-Type: text/plain; charset="us-ascii"
Content-Transfer-Encoding: 7bit
X-Mailer: GyazMail version 1.5.15
Archived-At: http://mailarchive.ietf.org/arch/msg/rtg-bfd/79RqiBUQSOvoJIzBz60eEOxzFLk
Cc: "rtg-bfd@ietf.org" <rtg-bfd@ietf.org>
X-BeenThere: rtg-bfd@ietf.org
X-Mailman-Version: 2.1.15
Precedence: list
List-Id: "RTG Area: Bidirectional Forwarding Detection DT" <rtg-bfd.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/rtg-bfd>, <mailto:rtg-bfd-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/rtg-bfd/>
List-Post: <mailto:rtg-bfd@ietf.org>
List-Help: <mailto:rtg-bfd-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/rtg-bfd>, <mailto:rtg-bfd-request@ietf.org?subject=subscribe>
X-List-Received-Date: Mon, 08 Dec 2014 18:15:00 -0000

Hello Manav,

> You never send control plane packets at a high rate - but now you send and 
> receive twice as many BFD packets as you were processing earlier. I would 
> wager that this would be a show stopper. 

that depends on implementation details how much impact this has. BFD itself 
is still sending/receiving largely the same number of packets.

The echo U-turn processing would happen in the data/forwarding plane and 
should (well, must) be simple, even with some stamping.


Regards, Marc



On Mon, 8 Dec 2014 18:13:18 +0530, Manav Bhatia wrote:
> Hi Marc,
> 
>>> .. and i dont like this. Are you saying that for each BFD session now you
>>> will run a parallel BFD echo session that would carry the debug data? 
>> Your
>>> scaling just went for a toss !!
>> 
>> Just by a factor of 2 ;-)
> 
> .. and thats quite bad. 
> 
>> 
>> But seriously: if you run echo at hight speed you would usually avoid 
>> running
>> the control packets at high speed too. Just what RFC5880 describes.
> 
> You never send control plane packets at a high rate - but now you send and 
> receive twice as many BFD packets as you were processing earlier. I would 
> wager that this would be a show stopper. 
> 
> Cheers, Manav
>  
>> 
>>> This would help if the flap is deterministic-ally reproducible. If its 
>> not,
>> 
>> Correct. Good for some problems, not good for others. As I said, 
>> interesting
>> option but not what I had in mind in first place.
>> 
>>> then how long do you run the Echo BFD? And what happens to the original 
>> BFD
>>> session? You run the two parallel-ly?
>> 
>> The "original" session?  I'm talking here about an echo session that is
>> controlled by the corresponding control packet exchange. As described in
>> RFC5880. Echo not defined for 5884 or 7130? Ah well, one probably could.
>> 
>> Echo seems at least an option to me, in the context of debugging, that can 
>> be
>> discussed. The freedom of defining the packet is interesting, some 
>> parameters
>> like RTT are measured easily. Other aspects, like Multihop, are a problem, 
>> no
>> doubt. I was simply surprised how quickly this idea was dismissed ... .
>> 
>> 
>> Regards, Marc
>> 
>> 
>> 
>> 
>> 
>>>
>>>>
>>>>
>>>>> Or are you suggesting that once BFD flaps we will start sending Echoes
>>>>> overloaded with debug information to detect the issue?
>>>>
>>>> interesting idea - that would be an alternative use, collecting forensic
>>>> data. Maybe we should support that too!
>>>
>>> This would help if the flap is deterministic-ally reproducible. If its 
>> not,
>>> then how long do you run the Echo BFD? And what happens to the original 
>> BFD
>>> session? You run the two parallel-ly?
>>>
>>> Cheers, Manav
>>>
>>>>
>>>>
>>>> My biggest problem with the echo idea is so far BFD-over-LAG. But maybe 
>> it
>>>> is
>>>> not a real problem, any echo stamping/updating in the forwarding path 
>> would
>>>> require an hardware update (or reprogramming, if your hardware allows) 
>> and
>>>> in
>>>> this case one could boldly state that the echo packet must leave via the
>>>> ingress port :-)
>>>>
>>>>
>>>> Regards, Marc
>>>>
>>>>
>>>>
>>>>
>>>>
>>>> On Mon, 8 Dec 2014 09:33:05 +0530, Manav Bhatia wrote:
>>>>> Hi Marc,
>>>>>
>>>>>>
>>>>>>
>>>>>> * Greg's echo idea is of course do-able - when you think timestamping 
>> in
>>>>>> hardware can be done then it can be done in the forwarding path for
>>>> echos
>>>>>> as
>>>>>> well. Depends on your hardware :-) and on an agreed (minimal) format 
>> for
>>>>>> echo. As mentioned BFD echo is not defined/used for multiple BFD
>>>> features,
>>>>>> which limits it's use though.
>>>>>>
>>>>>
>>>>> For the echo mechanism to work, do you agree that you would have to
>>>>> continuously send Echos so that you can detect the issue?
>>>>>
>>>>> Or are you suggesting that once BFD flaps we will start sending Echoes
>>>>> overloaded with debug information to detect the issue?
>>>>>
>>>>> I'd like to understand this before the mailing list sees a barrage of
>>>>> emails. Alternatively, we can also take it offline and only report the
>>>>> summary of our discussion to the list.
>>>>>
>>>>> Cheers, Manav
>>>
>