Re: draft-palanivelan-bfd-v2-gr-02

David Ward <dward@cisco.com> Tue, 21 July 2009 00:12 UTC

Return-Path: <dward@cisco.com>
X-Original-To: rtg-bfd@core3.amsl.com
Delivered-To: rtg-bfd@core3.amsl.com
Received: from localhost (localhost [127.0.0.1]) by core3.amsl.com (Postfix) with ESMTP id 445CC3A6B60 for <rtg-bfd@core3.amsl.com>; Mon, 20 Jul 2009 17:12:12 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -5.319
X-Spam-Level:
X-Spam-Status: No, score=-5.319 tagged_above=-999 required=5 tests=[AWL=0.150, BAYES_00=-2.599, DNS_FROM_OPENWHOIS=1.13, RCVD_IN_DNSWL_MED=-4]
Received: from mail.ietf.org ([64.170.98.32]) by localhost (core3.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id mod+CjtCDZnt for <rtg-bfd@core3.amsl.com>; Mon, 20 Jul 2009 17:12:11 -0700 (PDT)
Received: from sj-iport-6.cisco.com (sj-iport-6.cisco.com [171.71.176.117]) by core3.amsl.com (Postfix) with ESMTP id 17C4C3A6B16 for <rtg-bfd@ietf.org>; Mon, 20 Jul 2009 17:12:11 -0700 (PDT)
X-IronPort-AV: E=Sophos;i="4.43,237,1246838400"; d="scan'208";a="350491797"
Received: from sj-dkim-2.cisco.com ([171.71.179.186]) by sj-iport-6.cisco.com with ESMTP; 21 Jul 2009 00:11:41 +0000
Received: from sj-core-2.cisco.com (sj-core-2.cisco.com [171.71.177.254]) by sj-dkim-2.cisco.com (8.12.11/8.12.11) with ESMTP id n6L0Bfku028253; Mon, 20 Jul 2009 17:11:41 -0700
Received: from xbh-rtp-211.amer.cisco.com (xbh-rtp-211.cisco.com [64.102.31.102]) by sj-core-2.cisco.com (8.13.8/8.14.3) with ESMTP id n6L0BfJS029821; Tue, 21 Jul 2009 00:11:41 GMT
Received: from xmb-rtp-202.amer.cisco.com ([64.102.31.52]) by xbh-rtp-211.amer.cisco.com with Microsoft SMTPSVC(6.0.3790.3959); Mon, 20 Jul 2009 20:11:41 -0400
Received: from [127.0.0.1] ([171.68.225.134]) by xmb-rtp-202.amer.cisco.com with Microsoft SMTPSVC(6.0.3790.3959); Mon, 20 Jul 2009 20:11:39 -0400
Message-Id: <E06C7408-7FA5-40F1-902D-140E6B34B0FA@cisco.com>
From: David Ward <dward@cisco.com>
To: Vishwas Manral <vishwas.ietf@gmail.com>
In-Reply-To: <77ead0ec0907201701w18ccd526qbb4612e592b4ee24@mail.gmail.com>
Content-Type: text/plain; charset="US-ASCII"; format="flowed"; delsp="yes"
Content-Transfer-Encoding: 7bit
Mime-Version: 1.0 (Apple Message framework v935.3)
Subject: Re: draft-palanivelan-bfd-v2-gr-02
Date: Mon, 20 Jul 2009 19:11:37 -0500
References: <A96E7D1872FEC94BB90A3A92CEEC7F3A663D6B@XMB-BGL-41E.cisco.com> <C6849A09.57521%nitinb@juniper.net> <77ead0ec0907160940k5300d401i688dead2c22342f4@mail.gmail.com> <00BFE893-FDB3-44E6-B10F-28BC5A795AC4@cisco.com> <77ead0ec0907201421t356d3838ge222266b93ba77b6@mail.gmail.com> <F5410BD3-CBBC-40C2-AE42-FE310D49BB24@cisco.com> <77ead0ec0907201701w18ccd526qbb4612e592b4ee24@mail.gmail.com>
X-Mailer: Apple Mail (2.935.3)
X-OriginalArrivalTime: 21 Jul 2009 00:11:41.0001 (UTC) FILETIME=[D2834390:01CA0997]
DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; l=5336; t=1248135101; x=1248999101; c=relaxed/simple; s=sjdkim2002; h=Content-Type:From:Subject:Content-Transfer-Encoding:MIME-Version; d=cisco.com; i=dward@cisco.com; z=From:=20David=20Ward=20<dward@cisco.com> |Subject:=20Re=3A=20draft-palanivelan-bfd-v2-gr-02 |Sender:=20; bh=5w4dZcUXjAU1983TK0fuJFu8WFzalbVAMq2gCSaZFww=; b=C2K5+Myp9ms6bvW35xDCm/ao9+Ej8ONudnpKwEvEpFR7vhL0A0Jy4geteS 1Y+5aY/T9IZHkywunZdljW/fc4Hq5cYlRi5MY6cOTNi4HIzbUjJGYj9iCaTn 41SgnzfXIO;
Authentication-Results: sj-dkim-2; header.From=dward@cisco.com; dkim=pass ( sig from cisco.com/sjdkim2002 verified; );
Cc: "rtg-bfd@ietf.org" <rtg-bfd@ietf.org>, "Palanivelan A (apvelan)" <apvelan@cisco.com>, David Ward <dward@cisco.com>
X-BeenThere: rtg-bfd@ietf.org
X-Mailman-Version: 2.1.9
Precedence: list
List-Id: "RTG Area: Bidirectional Forwarding Detection DT" <rtg-bfd.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/listinfo/rtg-bfd>, <mailto:rtg-bfd-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/rtg-bfd>
List-Post: <mailto:rtg-bfd@ietf.org>
List-Help: <mailto:rtg-bfd-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/rtg-bfd>, <mailto:rtg-bfd-request@ietf.org?subject=subscribe>
X-List-Received-Date: Tue, 21 Jul 2009 00:12:12 -0000

Could be both, directions are negotiated separately. What should be  
obvious is that it is impossible to tell what sort of congestion is  
being witnessed. Could be on the wire, could be a scheduler (if SW  
based implementation) ... either way, response is the same.

-DWard

On Jul 20, 2009, at 7:01 PM, Vishwas Manral wrote:

> Hi David,
>
> Thats correct, detection can only be done in the rx direction.
>
> But reaction can be to slow the rate of packet send/ receive or both.
> I was wondering what would be the suggested action?
>
> Thanks,
> Vishwas
>
> On Mon, Jul 20, 2009 at 3:33 PM, David Ward<dward@cisco.com> wrote:
>>
>> V -
>>
>> The method to detect persistent congestion will have enough rules  
>> for the
>> BFD packet loss over a stated interval. A system can only usefully  
>> detect in
>> the rx direction.
>>
>> -DWard
>>
>>
>> On Jul 20, 2009, at 4:21 PM, Vishwas Manral wrote:
>>
>>> Hi David,
>>>
>>> Thanks for sharing the updates with us.
>>>
>>> So is the mechanism to do detect 'persistent congestion' also
>>> mentioned, or is that left out of scope. Also during congestion do  
>>> we
>>> increase the 'Minimum receive interval' or the 'transmit interval'  
>>> or
>>> both. So can the congestion assumed to be in both directions or only
>>> in the direction of the incoming flow?
>>>
>>> Thanks again,
>>> Vishwas
>>>
>>> On Thu, Jul 16, 2009 at 5:42 PM, David Ward<dward@cisco.com> wrote:
>>>>
>>>> One change to the BFD base spec as requested by the IESG is a  
>>>> congestion
>>>> control mechanism. We've come to conclusion with the transport,  
>>>> internet
>>>> and
>>>> routing ADs on the algorithm and you will see it in the last rev  
>>>> of the
>>>> base
>>>> spec. In a very short summary, if one can detect that there is  
>>>> persistent
>>>> congestion then use the adaptive timers to send fewer packets and  
>>>> see if
>>>> the
>>>> congestion clears. Then ramp them back up slowly until self- 
>>>> stabilized
>>>> and
>>>> no observable, persistent congestion is detected.
>>>>
>>>> In the case described below, there doesn't need to be protocol  
>>>> extensions
>>>> to
>>>> cover this issue.  There are many ways system design choices can  
>>>> impact
>>>> your
>>>> implementation.
>>>>
>>>> -DWard
>>>>
>>>> On Jul 16, 2009, at 11:40 AM, Vishwas Manral wrote:
>>>>
>>>>> Hi A. Palanivelan,
>>>>>
>>>>> I agree with Nitin here. If BFD packets are not prioritized over
>>>>> others, we need to fix that instead of any other changes.
>>>>>
>>>>> Thanks,
>>>>> Vishwas
>>>>>
>>>>> On Thu, Jul 16, 2009 at 8:49 AM, Nitin Bahadur<nitinb@juniper.net>
>>>>> wrote:
>>>>>>
>>>>>> Hi,
>>>>>>
>>>>>>    I would think that if there are multiple things happening at  
>>>>>> the
>>>>>> same
>>>>>> time...then the system needs to prioritize
>>>>>> BFD over other things in some way or offload bfd to some place  
>>>>>> that
>>>>>> would
>>>>>> prevent bfd from getting affected.
>>>>>>
>>>>>> Thanks
>>>>>> Nitin
>>>>>>
>>>>>> On 7/16/09 2:11 AM, "Palanivelan A (apvelan)" <apvelan@cisco.com>
>>>>>> wrote:
>>>>>>
>>>>>> Hi Nitin,
>>>>>> It is very true that BFD works well in planned restarts and we  
>>>>>> don't
>>>>>> need
>>>>>> GR extensions there.
>>>>>> But, we have seen issues when bfd is working along with other  
>>>>>> time
>>>>>> intensive, high priority events.
>>>>>>
>>>>>> In such a scenario, for GR, we are likely to hit bfd timer  
>>>>>> expiry which
>>>>>> will be treated as a failure, thus bringing down the adjacencies
>>>>>> (ISIS/OSPF).
>>>>>> One such example is when there are scaled number of PPPoE  
>>>>>> sessions on a
>>>>>> SP router that also has bfd enabled (with strict timers).
>>>>>>
>>>>>> This would mean looking for a workaround at the architecture  
>>>>>> level of
>>>>>> your router to make this work.
>>>>>>
>>>>>> In fact, this sort of experience let me into writing this draft.
>>>>>> Do revert back for more discussion.
>>>>>> PS: sorry for that late reply. I had a recent road accident  
>>>>>> that put me
>>>>>> off for a longer time.
>>>>>> Regards,
>>>>>> A.Palanivelan
>>>>>> To: <apvelan at cisco.com <mailto:apvelan@DOMAIN.HIDDEN> >,  
>>>>>> <rtg-bfd at
>>>>>> ietf.org <mailto:rtg-bfd@DOMAIN.HIDDEN> >
>>>>>>
>>>>>> ________________________________
>>>>>>
>>>>>> Hi,
>>>>>>
>>>>>>  You really do not need GR extensions to BFD to help with
>>>>>> planned restart. There is enough text in draft-ietf-bfd-generic
>>>>>> (Section 4.3.2.2) to help you accomplish what you want.
>>>>>>
>>>>>>  At the top of my head, I can think of 2 ways:
>>>>>>
>>>>>> 1) Restarting router brings down BFD session with diag code of
>>>>>>  ADMIN_DOWN. ADMIN_DOWN should not bring down the adjacency
>>>>>>  on the peer. So the peer BFD session will go down but ISIS/OSPF
>>>>>>  will not treat it as an adjacency down event.
>>>>>>
>>>>>> 2) Restarting router increases the BFD session timer to XXX
>>>>>>  seconds (XXX > restart time). It can save some state locally
>>>>>>  to note this. After restart, the restarting router reads the  
>>>>>> local
>>>>>>  state and continues the BFD session from the UP state.
>>>>>>
>>>>>> Thanks
>>>>>> Nitin
>>>>>>
>>>>
>>>>
>>
>>