Re: [Idr] draft-uttaro-idr-bgp-persistence-00

<bruno.decraene@orange.com> Wed, 23 November 2011 11:18 UTC

Return-Path: <bruno.decraene@orange.com>
X-Original-To: idr@ietfa.amsl.com
Delivered-To: idr@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 6434E21F8B37 for <idr@ietfa.amsl.com>; Wed, 23 Nov 2011 03:18:35 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -4.716
X-Spam-Level:
X-Spam-Status: No, score=-4.716 tagged_above=-999 required=5 tests=[AWL=1.533, BAYES_00=-2.599, HELO_EQ_FR=0.35, RCVD_IN_DNSWL_MED=-4]
Received: from mail.ietf.org ([12.22.58.30]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id SiO-N4FVuZhj for <idr@ietfa.amsl.com>; Wed, 23 Nov 2011 03:18:34 -0800 (PST)
Received: from p-mail1.rd.francetelecom.com (p-mail1.rd.francetelecom.com [195.101.245.15]) by ietfa.amsl.com (Postfix) with ESMTP id 551AF21F8B2B for <idr@ietf.org>; Wed, 23 Nov 2011 03:18:34 -0800 (PST)
Received: from p-mail1.rd.francetelecom.com (localhost.localdomain [127.0.0.1]) by localhost (Postfix) with SMTP id 3DF2D8F0003; Wed, 23 Nov 2011 12:19:38 +0100 (CET)
Received: from ftrdsmtp2.rd.francetelecom.fr (unknown [10.192.128.47]) by p-mail1.rd.francetelecom.com (Postfix) with ESMTP id 32C178B8001; Wed, 23 Nov 2011 12:19:38 +0100 (CET)
Received: from ftrdmel0.rd.francetelecom.fr ([10.192.128.56]) by ftrdsmtp2.rd.francetelecom.fr with Microsoft SMTPSVC(6.0.3790.4675); Wed, 23 Nov 2011 12:18:33 +0100
X-MimeOLE: Produced By Microsoft Exchange V6.5
Content-class: urn:content-classes:message
MIME-Version: 1.0
Content-Type: text/plain; charset="US-ASCII"
Content-Transfer-Encoding: quoted-printable
Date: Wed, 23 Nov 2011 12:18:31 +0100
Message-ID: <FE8F6A65A433A744964C65B6EDFDC24002A15882@ftrdmel0.rd.francetelecom.fr>
In-Reply-To: <4ECBE495.4070006@raszuk.net>
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
Thread-Topic: [Idr] draft-uttaro-idr-bgp-persistence-00
Thread-Index: AcypQW9wfR2AkjRdRK+AhV1B9ZuOtwAiUr2g
References: <4EA1F0FB.3090100@raszuk.net> <4EA487E4.2040201@raszuk.net><B17A6910EEDD1F45980687268941550FA20750@MISOUT7MSGUSR9I.ITServices.sbc.com><4EA84254.9000400@raszuk.net> <4EA8A91C.4090305@cisco.com><B17A6910EEDD1F45980687268941550FA20BB8@MISOUT7MSGUSR9I.ITServices.sbc.com><4EAA496C.9070605@cisco.com><B17A6910EEDD1F45980687268941550FA21F96@MISOUT7MSGUSR9I.ITServices.sbc.com><B17A6910EEDD1F45980687268941550FA324FA@MISOUT7MSGUSR9I.ITServices.sbc.com><4EC21062.5020504@raszuk.net><B17A6910EEDD1F45980687268941550FA32664@MISOUT7MSGUSR9I.ITServices.sbc.com><4EC28B45.1040509@raszuk.net> <4EC32C36.8070902@riw.us> <FE8F6A65A433A744964C65B6EDFDC24002A155DF@ftrdmel0.rd.francetelecom.fr> <4ECBE495.4070006@raszuk.net>
From: bruno.decraene@orange.com
To: robert@raszuk.net
X-OriginalArrivalTime: 23 Nov 2011 11:18:33.0250 (UTC) FILETIME=[A2DC3020:01CCA9D1]
Cc: idr@ietf.org, ju1738@att.com
Subject: Re: [Idr] draft-uttaro-idr-bgp-persistence-00
X-BeenThere: idr@ietf.org
X-Mailman-Version: 2.1.12
Precedence: list
List-Id: Inter-Domain Routing <idr.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/idr>, <mailto:idr-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/idr>
List-Post: <mailto:idr@ietf.org>
List-Help: <mailto:idr-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/idr>, <mailto:idr-request@ietf.org?subject=subscribe>
X-List-Received-Date: Wed, 23 Nov 2011 11:18:35 -0000

Hi Robert,

>Hi Bruno,
>
>Great comparison between GR and persistent !
>
>
>However there is significant two points which need to be highlighted
...
>
>1.
>
>- time t for GR is max expressed as 12 bits in seconds which is 68 min
>... if session does not come up for that long I think it is wise to
>clean up
>
>- time t for persistence is indicated in the draft to be in case of
>L2VPN: "The persist-timer should be set to a large value on the order
of
>days to infinity." In case of L3VPN: "The persist-timer should be set
to
>a large value on the order of hours to a few days."

If you believe 68 minutes is the right max value,
- IMHO Max timer duration should be AS and application specific. There
is no right value. That's also the position taken by GR. Especially as
some routes may be rather static.
- Where does 68 minutes come from? Why would this be the right value?
Why are you more restrictive than the BGP RFC (Hold Time)?
- for consistency, I guess you will also argue against defining a long
timer value in the GR respin draft as has been proposed during the
meeting.

>2.
>
>In both GR and persistance the key to using the path on BGP speakers in
>the data plane is in next hop liveness detection. Be it in scope of the
>draft or out of scope does not really matter. Ingress as you call it
>should not care if RR says this path is suspicious if he can reach just
>fine the next hop of the prefix.

No.
BGP Next-Hop liveliness is important and need to be checked. But this
only one part of the route validity problem as this only checks the BGP
Next-Hop /ASBR. Any downstream BGP routers could withdraw the route.
You/egress/ingress would not know as the BGP session is down. Hence it's
important to mark this path (as "stale" or "not refreshed" if you
prefer) and ingress should care.

>So this is not that we like GR and we do not like persistence. All we
>are saying is that the only real delta between GR and persistance is
>this new signalling informing that path is "perhaps broken". This is
>this "perhaps" which many folks have a problem with.
>
>
>Best regards,
>R.
>
>
>> Russ,
>>
>>> From: Russ White, Sent: Wednesday, November 16, 2011 4:21 AM
>>>
>>>> I am personally completely not convinced that there is any value in
>>>> informing my peers that one of my BGP sessions went down. You
either
>>>> have reachability to next hop and can attract traffic or you do not
>> and
>>>> if so you withdraw.
>>>>
>>>> Telling peers that "I may be perhaps used to reach prefix X as last
>>>> resort" is of highly questionable value.
>>>
>>> I would go farther --this isn't questionable, it's really bad. If
you
>>> have a route you know exists, but you don't want people to use, set
>>> things so it's a "last resort" (wait for BGP, wait for LDP, etc). If
>> you
>>> don't know whether or not you really have reachability, don't
advertise
>> it.
>>
>> As you don't comment on Graceful Restart (GR), I assume that you are
>> fine with GR.
>> Are you?
>>
>> Then I think we need to make a distinction between the information we
>> have on a route, and the routing decision we make based on those
>> information.
>>
>> a) failure assumption:
>> let's assume a PE can lose both its iBGP session toward its redundant
RR
>> Note: the assumption is the same for GR and persistence.
>>
>> b) information available:
>> Once the BGP session(s) are down, we have the following information
>> available on the BGP router: cause of failure, route tags, time (t)
>> elapsed since the session failure, local configuration/preference
>> Note: idem for GR and persistence.
>>
>> At that point, I would say that once t>0, we have no certainty on the
>> validity of the route (from the BGP peer/BGP Next-Hop and from
>> downstream BGP routers). And the longer the duration, the less
>> certainty.
>> Note: for a given "t", this is the same for GR and persistence. Hence
I
>> don't get how you claim that "it's really bad" for persistence while
>> it's ok for GR.
>>
>> c) routing decision:
>> Given the above information and level of uncertainty, at a given "t"
>> time, currently 2 routing decisions are possible:
>> - withdraw the route.
>> 	- This is the regular BGP behavior.
>> 	- The reasoning is that the level of uncertainty is too high to
>> use that route.
>> - keep the route.
>> 	- This is the BGP Graceful Restart behavior.
>> 	- The reasoning is that the level of uncertainty is low enough
>> so that the route can still be considered perfectly valid.
>>
>>
>> But the router making the above decision is the BGP advertising
router
>> hence the egress router. It's lacking some information only available
on
>> the ingress router. Namely the availability of alternate paths/Next
Hop
>> on the ingress. I call that this is an important information to make
the
>> routing decision. Because in the end, each router/AS tries to take
the
>> _best_ decision among _available_ options. This is not a binary
decision
>> between "right/good" and "bad". So compared to GR, BGP persistence
>> proposes to give some additional information and routing decision on
>> upstream ingress BGP routers. (e.g. using "STALE" community, low BGP
>> local pref. But other vehicles could be considered).
>>
>> Could you elaborate on why this would be "really bad"?
>>
>> Thanks,
>> Regards,
>> Bruno
>>
>>> The possible problems involved in saying, "I might have a route to
x,
>>> just in case you don't have any other path there," will end up being
>>> really, really ugly.
>>>
>>> :-)
>>>
>>> Russ
>>> _______________________________________________
>>> Idr mailing list
>>> Idr@ietf.org
>>> https://www.ietf.org/mailman/listinfo/idr
>> _______________________________________________
>> Idr mailing list
>> Idr@ietf.org
>> https://www.ietf.org/mailman/listinfo/idr
>>
>>