RE: TCP state machine doubts

"Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com> Tue, 30 July 2002 21:49 UTC

Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by ietf.org (8.9.1a/8.9.1a) with ESMTP id RAA10316 for <idr-archive@ietf.org>; Tue, 30 Jul 2002 17:49:46 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id D54F991226; Tue, 30 Jul 2002 17:50:34 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id A31D5912C9; Tue, 30 Jul 2002 17:50:34 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 92DC091226 for <idr@trapdoor.merit.edu>; Tue, 30 Jul 2002 17:50:33 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 7636A5DE46; Tue, 30 Jul 2002 17:50:33 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from mailgate.pit.comms.marconi.com (mailgate.pit.comms.marconi.com [169.144.68.6]) by segue.merit.edu (Postfix) with ESMTP id 30D575DDEE for <idr@merit.edu>; Tue, 30 Jul 2002 17:50:33 -0400 (EDT)
Received: from mailman.pit.comms.marconi.com (mailman.pit.comms.marconi.com [169.144.2.12]) by mailgate.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id RAA24629; Tue, 30 Jul 2002 17:50:30 -0400 (EDT)
Received: from whq-msgrtr-01.pit.comms.marconi.com (whq-msgrtr-01.pit.comms.marconi.com [169.144.2.221]) by mailman.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id RAA26619; Tue, 30 Jul 2002 17:50:31 -0400 (EDT)
Received: by whq-msgrtr-01.pit.comms.marconi.com with Internet Mail Service (5.5.2650.21) id <3W99HLJV>; Tue, 30 Jul 2002 17:50:31 -0400
Message-ID: <39469E08BD83D411A3D900204840EC558227E1@vie-msgusr-01.dc.fore.com>
From: "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
To: 'Jeffrey Haas' <jhaas@nexthop.com>, "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
Cc: 'Susan Hares' <skh@nexthop.com>, Kunihiro Ishiguro <kunihiro@zebra.org>, BGP mailing list <idr@merit.edu>
Subject: RE: TCP state machine doubts
Date: Tue, 30 Jul 2002 17:50:29 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2650.21)
Content-Type: text/plain; charset="iso-8859-1"
Sender: owner-idr@merit.edu
Precedence: bulk

Jeff:
  The FSM state machine actions taken are internal sequenced procedures.
These
actions could get messy and possibly execute out of sequence when there is a
multi-threads implemenatation. Although the FSM is preoccupied mostly with
the setting up and tearing down the peering sessions, in the processing of
these states certain multi-threaded sensitive processing rules should be
defined.

For Example:
============
Define locks around the various RIB synchronization points and mentioned
when to take them and when to release them if a state transition has occured
via the various action statements. This will gaurantee that the RIB data is
protected from parallel thread execution. It really does not matter which
thread takes the lock or which thread releases it.  If RIB resource is
locked, what happens to the state machine when state changing events are
seen. (E.G. Session disconnected by TCP or Peer. What happens when an
operator configuration command shuts down a peer session or the whole BGP
engine? 

I know this is getting very detailed, but the FSM is a gray area between the
internal and external world.


Ben

 

> -----Original Message-----
> From: Jeffrey Haas [mailto:jhaas@nexthop.com]
> Sent: Tuesday, July 30, 2002 4:52 PM
> To: Abarbanel, Benjamin
> Cc: 'Susan Hares'; Kunihiro Ishiguro; BGP mailing list
> Subject: Re: TCP state machine doubts
> 
> 
> On Thu, Jul 25, 2002 at 11:19:11AM -0400, Abarbanel, Benjamin wrote:
> > There are various phases that BGP goes through that one
> > might want to use multiple threads. The inter-locking or 
> inter-working
> > between the threads could affect the state transitions. For 
> example, Say you
> > are in Establish state and something happens to the session 
> and it is
> > disconnected. In a single threaded environment it is easy, 
> first come, first
> > served. In a multi-threaded case with thread priorities and 
> interruptions,
> > it might get tricky at times. Some assumptions or 
> expectations by the state
> > machine on actions it should take with regards to semaphore 
> locks (RIB
> > Table, etc.), thread prioritization, processing, 
> pre-emption and so forth
> > might be helpfull. Maybe these actions should only be 
> suggestions not
> > required ways for implementations.
> 
> There are certain situations where the FSM suggests that things be
> done in certain events where an implementation *could* do things
> in a slightly different order.
> 
> For example, after the TCP connection has completed, you are supposed
> to send your messages *before* reading the socket.  In a 
> multi-threaded
> environment, you may obviously not do this.  Indeed, for passive or
> unconfigured peers, you may not even send your message immediately.
> 
> The FSM would suggest where your synchronization points need to go.
> Happily, most of this work seems to need to be done during 
> peering setup.
> 
> For the most part, like the 3-rib abstraction, you should be able
> to do whatever you want inside your black box so long as it looks
> right on the wire.  The FSM gives guidance on what probably is
> happening in the box to get things out as you expect them to.
> 
> More specific examples might be helpful.
> 
> > Ben
> 
> -- 
> Jeff Haas 
> NextHop Technologies
> 



Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id RAA23659 for <idr-archive@nic.merit.edu>; Tue, 30 Jul 2002 17:50:54 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id D54F991226; Tue, 30 Jul 2002 17:50:34 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id A31D5912C9; Tue, 30 Jul 2002 17:50:34 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 92DC091226 for <idr@trapdoor.merit.edu>; Tue, 30 Jul 2002 17:50:33 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 7636A5DE46; Tue, 30 Jul 2002 17:50:33 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from mailgate.pit.comms.marconi.com (mailgate.pit.comms.marconi.com [169.144.68.6]) by segue.merit.edu (Postfix) with ESMTP id 30D575DDEE for <idr@merit.edu>; Tue, 30 Jul 2002 17:50:33 -0400 (EDT)
Received: from mailman.pit.comms.marconi.com (mailman.pit.comms.marconi.com [169.144.2.12]) by mailgate.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id RAA24629; Tue, 30 Jul 2002 17:50:30 -0400 (EDT)
Received: from whq-msgrtr-01.pit.comms.marconi.com (whq-msgrtr-01.pit.comms.marconi.com [169.144.2.221]) by mailman.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id RAA26619; Tue, 30 Jul 2002 17:50:31 -0400 (EDT)
Received: by whq-msgrtr-01.pit.comms.marconi.com with Internet Mail Service (5.5.2650.21) id <3W99HLJV>; Tue, 30 Jul 2002 17:50:31 -0400
Message-ID: <39469E08BD83D411A3D900204840EC558227E1@vie-msgusr-01.dc.fore.com>
From: "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
To: "'Jeffrey Haas'" <jhaas@nexthop.com>, "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
Cc: "'Susan Hares'" <skh@nexthop.com>, Kunihiro Ishiguro <kunihiro@zebra.org>, BGP mailing list <idr@merit.edu>
Subject: RE: TCP state machine doubts
Date: Tue, 30 Jul 2002 17:50:29 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2650.21)
Content-Type: text/plain; charset="iso-8859-1"
Sender: owner-idr@merit.edu
Precedence: bulk

Jeff:
  The FSM state machine actions taken are internal sequenced procedures.
These
actions could get messy and possibly execute out of sequence when there is a
multi-threads implemenatation. Although the FSM is preoccupied mostly with
the setting up and tearing down the peering sessions, in the processing of
these states certain multi-threaded sensitive processing rules should be
defined.

For Example:
============
Define locks around the various RIB synchronization points and mentioned
when to take them and when to release them if a state transition has occured
via the various action statements. This will gaurantee that the RIB data is
protected from parallel thread execution. It really does not matter which
thread takes the lock or which thread releases it.  If RIB resource is
locked, what happens to the state machine when state changing events are
seen. (E.G. Session disconnected by TCP or Peer. What happens when an
operator configuration command shuts down a peer session or the whole BGP
engine? 

I know this is getting very detailed, but the FSM is a gray area between the
internal and external world.


Ben

 

> -----Original Message-----
> From: Jeffrey Haas [mailto:jhaas@nexthop.com]
> Sent: Tuesday, July 30, 2002 4:52 PM
> To: Abarbanel, Benjamin
> Cc: 'Susan Hares'; Kunihiro Ishiguro; BGP mailing list
> Subject: Re: TCP state machine doubts
> 
> 
> On Thu, Jul 25, 2002 at 11:19:11AM -0400, Abarbanel, Benjamin wrote:
> > There are various phases that BGP goes through that one
> > might want to use multiple threads. The inter-locking or 
> inter-working
> > between the threads could affect the state transitions. For 
> example, Say you
> > are in Establish state and something happens to the session 
> and it is
> > disconnected. In a single threaded environment it is easy, 
> first come, first
> > served. In a multi-threaded case with thread priorities and 
> interruptions,
> > it might get tricky at times. Some assumptions or 
> expectations by the state
> > machine on actions it should take with regards to semaphore 
> locks (RIB
> > Table, etc.), thread prioritization, processing, 
> pre-emption and so forth
> > might be helpfull. Maybe these actions should only be 
> suggestions not
> > required ways for implementations.
> 
> There are certain situations where the FSM suggests that things be
> done in certain events where an implementation *could* do things
> in a slightly different order.
> 
> For example, after the TCP connection has completed, you are supposed
> to send your messages *before* reading the socket.  In a 
> multi-threaded
> environment, you may obviously not do this.  Indeed, for passive or
> unconfigured peers, you may not even send your message immediately.
> 
> The FSM would suggest where your synchronization points need to go.
> Happily, most of this work seems to need to be done during 
> peering setup.
> 
> For the most part, like the 3-rib abstraction, you should be able
> to do whatever you want inside your black box so long as it looks
> right on the wire.  The FSM gives guidance on what probably is
> happening in the box to get things out as you expect them to.
> 
> More specific examples might be helpful.
> 
> > Ben
> 
> -- 
> Jeff Haas 
> NextHop Technologies
> 


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id RAA23601 for <idr-archive@nic.merit.edu>; Tue, 30 Jul 2002 17:49:04 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 74AB89122D; Tue, 30 Jul 2002 17:48:42 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 2E4BA91226; Tue, 30 Jul 2002 17:48:42 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 77067912C9 for <idr@trapdoor.merit.edu>; Tue, 30 Jul 2002 17:47:58 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 54E645DE42; Tue, 30 Jul 2002 17:47:58 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from dm3cnd.bell.ca (dm3cnd.bell.ca [206.47.0.146]) by segue.merit.edu (Postfix) with SMTP id 0CD705DDEE for <idr@merit.edu>; Tue, 30 Jul 2002 17:47:58 -0400 (EDT)
Received: from 206.47.0.152 by dm3cnd.bell.ca with ESMTP (Tumbleweed MMS SMTP Relay (MMS v4.7)); Tue, 30 Jul 2002 17:47:57 -0400
X-Server-Uuid: b85f21a3-cfd1-11d3-8401-00104bf46ab7
Received: from mta2.on.bell.ca (MTA2 [198.235.102.149]) by mta2.on.bell.ca with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13) id GMA090Q5; Tue, 30 Jul 2002 17:47:57 -0400
Received: FROM toroondc754.on.bell.ca BY mta2.on.bell.ca ; Tue Jul 30 17:47:56 2002 -0400
Received: by toroondc754.on.bell.ca with Internet Mail Service ( 5.5.2653.19) id <3T1HJA9X>; Tue, 30 Jul 2002 17:47:56 -0400
Message-ID: <7A733663D06C92499EF88AC4FD660EE3020849D4@TOROONDC755.on.bell.ca>
From: "Turk, Doughan A." <doughan.turk@bell.ca>
To: "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>, "Turk, Doughan A." <doughan.turk@bell.ca>, PamSri <pamsri01@yahoo.com>
Cc: idr@merit.edu
Subject: RE: Need your opinion of this.
Date: Tue, 30 Jul 2002 17:47:46 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
X-WSS-ID: 1159D607176905-01-01
Content-Type: text/plain;  charset=us-ascii
Content-Transfer-Encoding: 7bit
Sender: owner-idr@merit.edu
Precedence: bulk

Comments inline

-----Original Message-----
From: Abarbanel, Benjamin [mailto:Benjamin.Abarbanel@Marconi.com] 
Sent: Monday, July 29, 2002 10:06 AM
To: 'Turk, Doughan A.'; PamSri
Cc: idr@merit.edu
Subject: RE: Need your opinion of this.


Doughan:

Since some of the traffic that is trapped with the DoS traffic is going to
be redirected to the sinkhole router. would that cause those poor cutomers
to loose thier communications? 
*** Not at all, I'm not tampering with routing tables here***

What if the sinkhole router was smart enough to filter through the good
packets from the bad, could it then redirect the good packets back to their
intended destination? 
*** this is exactly what the draft is all about. Since the BGP community
only triggers a local routing change to the router that has the hooks for
this comm. The traffic slides through an MPLS tunnel for example. After your
smart sinkhole device cleans the traffic, you could have a default statement
on the sinkhole pointing to any interface to connected to your network, even
the same physical interface the tunnel was built on. This will guarantee
that traffic goes back to your customer. Since out side of the edge router
with the community hooks, no change to the legitimate next hop is done, it
is not an issue.***

If so, that implies the sinkhole routers have alot more cycles to handle all
the load that will be imposed on them by this service. 

*** choosing a powerful interface on the sinkhole router is completely your
choice, if you know what are the limits of that interface, you could limit
your ACLs or Policing statement not to compromise the sinkhole router.
Investing in a good sinkhole router can save you money of other parts of
your network.***

Is source and destination address the only things needed to filter good
packets from bad ones?

*** This is out of the scope of this draft and totally depends on the
circumstance. But to comment on this, I would say what if the attacker is
doing a great job in randomizing his source/ports how can one filter the
traffic. You need to find a distinguishing signature for the attack traffic.
This might or might not be an easy or possible task.***

My instinct tells me it is better to stop DoS traffic before it enters the
cloud on the border router, then try to manage it at some redirected
sinkhole router. 

*** See there are too parts to this draft, one that does exactly what you
are suggesting and this is at the beginning of the draft, and the other take
traffic through a tunnel for further analysis or treatment.***

Communities attribute although they tend to converge on some Next Hop would
tend to propagate the traffic through the cloud and thereby cause more
loading conditions on the intermediary routers.

*** Communities are used in many places and for different reasons, I can't
see a network outage cause by adding an extra comm. value to a prefix.***

Appreciate you input :-)
DT


Ben


> -----Original Message-----
> From: Turk, Doughan A. [mailto:doughan.turk@bell.ca]
> Sent: Friday, July 26, 2002 9:24 PM
> To: PamSri
> Cc: idr@merit.edu
> Subject: RE: Need your opinion of this.
> 
> 
> I am using BGP to help stopping DoS and DDoS traffic. BGP can
> send updates
> quickly and route-maps can translate communities to stop 
> traffic destined to
> a certain destination. TCP is not the issue here. Attackers 
> usually relay on
> the fact that most routers out there do destination based 
> routing without
> checking the validity of the source address in case it is spoofed.
> 
> DT
> 
> -----Original Message-----
> From: PamSri [mailto:pamsri01@yahoo.com]
> Sent: Friday, July 26, 2002 9:08 PM
> To: Turk, Doughan A.
> Subject: RE: Need your opinion of this.
> 
> 
> Hi Turk ,
>          Just started to read through your doc. I have
> a question :
> BGP is an application from the perspective of TCP. Why
> do you see the need for provding such security at the
> application level than
> TCP ? Do you think the security features in TCP as inadequate 
> ? It would be
> really a good idea to provide security at the underlying 
> layers than at this
> layer. 
> 
> sri
> 
> --- "Turk, Doughan A." <doughan.turk@bell.ca> wrote:
> > Benjamin,
> >  
> > BGP will be used to alter next-hop address for a
> > particular destination
> > address (representing a customer under attack) on
> > selected Routers. Instead
> > of using BGP to announce a different next-hop for
> > the destination address
> > everywhere in your network. we can use communities
> > to make it selective.
> > once that is achieved, your next hop can be a null interface or a 
> > Tunnel interface where the traffic is then pulled to the
> > sinkhole router.
> >  
> > Based on Attack signature you could do various
> > packet lever filtering ( with
> > hardware as a constrain),  simply rate limit the
> > flow or send it out through
> > a shared media (Ethernet) for sniffing after you
> > pull it to a sinkhole
> > router.
> >  
> > As for overheads on routers in the network, there is
> > hardly any since you're
> > only altering next-hop on border routers or edge
> > routers. MPLS tunnel are
> > what I used in my lab. as for the sinkhole router,
> > one can invest in a high
> > performance line card. The sinkhole router should be powerful enough 
> > to handle whatever filtering or policing you deem
> > required to massage the flow.
> > 
> >  
> > Scalability is high in my opinion. the route maps on
> > the borders can be a
> > part of your template. announcing the destination
> > address with the special
> > community would have to be done on only one router (sinkhole is a 
> > good idea).
> >  
> > As for where to place the route maps or policy
> > statements, IMHO, this should
> > be applied anywhere a large attack can enter from.
> > (I.e Transit links or to
> > Peers with large number of subscribers) I wouldn't
> > apply it on a customer
> > interface since a customer is in my reach and can be contacted
> > immediately in case of an attack initiated by him. Another place
> > to apply it on is at
> > peering interfaces where an access-list or policing
> > statement might bring
> > down interface performance. at this point traffic
> > can be pulled in the
> > sinkhole router for the policies to be applied.
> >  
> > This technique can be used in high volume attacks
> > where source based
> > filtering is not possible because of spoofing or
> > high number of source IP
> > addresses. In many cases ISP's are forced to kill
> > traffic destined to a
> > single /32 to relief the rest of the customer
> > network from suffering from
> > the high volume of traffic. I would also use this in
> > cases of attacks
> > targeting infrastructure.
> >  
> > As for the last question, yes I have built this in a
> > large lab and got the
> > result I expected.
> >  
> > Hope this answers your questions.
> >  
> > Regards
> > DT
> >  
> > 
> > -----Original Message-----
> > From: Abarbanel, Benjamin [mailto:Benjamin.Abarbanel@Marconi.com]
> > Sent: Friday, July 26, 2002 4:55 PM
> > To: 'Turk, Doughan A.'; idr@merit.edu
> > Subject: RE: Need your opinion of this.
> > 
> > 
> > Doughan:
> >   Let me see if I understand the fundamental point
> > of your draft. You want
> > BGP via communities to be the big MAGNET that
> > catches all the denial of
> > service attack routes and shoves them through some
> > sink hole router that
> > will keep statistics and knowledge of the source of
> > the attack and be able
> > to report it to the operator?
> >  
> > Did I summarize it properly?
> >  
> > My first reaction, is what kind of overhead will
> > these techniques impose on
> > the routers and the sink hole one?
> >  
> > Will it be scallable?
> >  
> > Can this be done only on border routers, Router
> > Reflectors? To minimize the
> > processing load needed?
> >  
> > What real life cases have you seen where this
> > mechanism was needed?
> >  
> > Have you prototyped this idea and gotten some
> > empirical data?
> >  
> > Ben
> > 
> > -----Original Message-----
> > From: Turk, Doughan A. [mailto:doughan.turk@bell.ca]
> > Sent: Friday, July 26, 2002 4:09 PM
> > To: idr@merit.edu
> > Cc: Turk, Doughan A.
> > Subject: Need your opinion of this.
> > 
> > 
> > Folks,
> >  
> > I would like to get some discussion started on a
> > draft I wrote. It was
> > suggested to me to have the draft worked on under
> > this group. I appreciate
> > any constructive comments and pointers. In summary,
> > the draft could be
> > classified as an operational one. It is an
> > enhancement of a technique that
> > already exists but relying more on BGP as a protocol
> > to help in network
> > security issues.
> >  
> > A copy of the draft could be found at
> >
> http://www.ietf.org/internet-drafts/draft-turk-ertb-00.txt
> >
> <http://www.ietf.org/internet-drafts/draft-turk-ertb-00.txt>
> > 
> >  
> > Regards
> > Doughan Turk
> >  
> >   _____
> > 
> > 
> >  <http://www.bell.ca/> Bell Canada Logo
> > 
> > Doughan Turk, M.Eng., CCIE#7274
> > Bell Canada, IP Engineering
> > Flr 3, 100 Wynford Dr.
> > Toronto M3C 1K4
> > 
> > Voice: (416) 353-0368
> > Fax: (416) 445-9893
> > Pager: (416) 685-0590
> >  <http://www.bell.ca/> Bell Canada
> > 
> >  Cisco Certified Internetwork Expert 
> > <http://www3.sympatico.ca/doughan/ccie_large.gif>
> > 
> >          "To Engineer and Integrate world class
> >                               IP Services and
> > Technologies for BCE" 	
> >  
> > 
> > 
> 
> 
> __________________________________________________
> Do You Yahoo!?
> Yahoo! Health - Feel better, live better http://health.yahoo.com
> 
> 




Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id RAA22658 for <idr-archive@nic.merit.edu>; Tue, 30 Jul 2002 17:19:02 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id B4A29912C8; Tue, 30 Jul 2002 17:18:40 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 7312B912CA; Tue, 30 Jul 2002 17:18:40 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id AC79D912C8 for <idr@trapdoor.merit.edu>; Tue, 30 Jul 2002 17:17:29 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 91C975DF33; Tue, 30 Jul 2002 17:17:29 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from dm3cn8.bell.ca (dm3cn8.bell.ca [206.47.0.145]) by segue.merit.edu (Postfix) with SMTP id 4C9E35DEBB for <idr@merit.edu>; Tue, 30 Jul 2002 17:17:29 -0400 (EDT)
Received: from 206.47.0.152 by dm3cn8.bell.ca with ESMTP (Tumbleweed MMS SMTP Relay (MMS v4.7)); Tue, 30 Jul 2002 17:17:28 -0400
X-Server-Uuid: b85f21a3-cfd1-11d3-8401-00104bf46ab7
Received: from mta2.on.bell.ca (MTA2 [198.235.102.149]) by mta2.on.bell.ca with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13) id GMA090KY; Tue, 30 Jul 2002 17:17:28 -0400
Received: FROM toroondc754.on.bell.ca BY mta2.on.bell.ca ; Tue Jul 30 17:17:28 2002 -0400
Received: by toroondc754.on.bell.ca with Internet Mail Service ( 5.5.2653.19) id <3T1H29SN>; Tue, 30 Jul 2002 17:17:28 -0400
Message-ID: <7A733663D06C92499EF88AC4FD660EE3020849D0@TOROONDC755.on.bell.ca>
From: "Turk, Doughan A." <doughan.turk@bell.ca>
To: andrewl@xix-w.bengi.exodus.net, "Turk, Doughan A." <doughan.turk@bell.ca>
Cc: idr@merit.edu
Subject: RE: Need your opinion of this.
Date: Tue, 30 Jul 2002 17:17:19 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
X-WSS-ID: 1159DDE225178-01-01
Content-Type: text/plain;  charset=us-ascii
Content-Transfer-Encoding: 7bit
Sender: owner-idr@merit.edu
Precedence: bulk

Andrew,

1) Abuse is not a possibility. While this technique adds no more risk to the
one you get from a fat finger. In the route map match statements you would
add a match for AS-path ^$ this will guarantee that no eBGP peer can
compromise your set up. If you are worried about some one in operations to
miss use it, then the technique adds no more risk.

2) We need a route that is statically sent to a null interface on your edge
routers, You can surely your any of your public IP space is they are of
abandons but logic says RFC 1918 should not be routed out side of your AS
and therefore many ISPs have already in place a set of static routes to drop
those routes. Again, Juniper does that be default in what they call Martian
I believe. ** Juniper folks, correct me if I'm wrong**

3) I'm using routing to effectively acts as an basic ACL to drop traffic. I
can't comment on whether or not this should be on this WG.

Thanks for your comments
DT

-----Original Message-----
From: andrewl@xix-w.bengi.exodus.net [mailto:andrewl@xix-w.bengi.exodus.net]

Sent: Tuesday, July 30, 2002 1:20 PM
To: Turk, Doughan A.
Cc: idr@merit.edu
Subject: Re: Need your opinion of this.


Hi Doughan,

Going through the draft, a couple of thoughts came up:

1) Why use BGP communities?  It seems like that just exposes the network to
the possibility of abuse of those communities.  The NOC or Security group
can just log into the router in question, and apply the NH statement.

2) Why use RFC 1918?  One could also use public space.

3) This probably belongs in ops, not routing, but that is an admin detail.

Andrew

On Fri, Jul 26, 2002 at 04:09:19PM -0400, Turk, Doughan A. wrote:
> Delivered-To: idr-outgoing@trapdoor.merit.edu
> Delivered-To: idr@trapdoor.merit.edu
> Delivered-To: idr@merit.edu
> X-Server-Uuid: b85f21a3-cfd1-11d3-8401-00104bf46ab7
> From: "Turk, Doughan A." <doughan.turk@bell.ca>
> To: idr@merit.edu
> Cc: "Turk, Doughan A." <doughan.turk@bell.ca>
> Subject: Need your opinion of this.
> Date: Fri, 26 Jul 2002 16:09:19 -0400
> X-Mailer: Internet Mail Service (5.5.2653.19)
> X-WSS-ID: 115F73FE134713-01-01
> Precedence: bulk
> X-OriginalArrivalTime: 26 Jul 2002 20:10:58.0281 (UTC) 
> FILETIME=[8E32D590:01C234E0]
> 
> Folks,
>  
> I would like to get some discussion started on a draft I wrote. It was 
> suggested to me to have the draft worked on under this group. I 
> appreciate any constructive comments and pointers. In summary, the 
> draft could be classified as an operational one. It is an enhancement 
> of a technique that already exists but relying more on BGP as a 
> protocol to help in network security issues.
>  
> A copy of the draft could be found at 
> http://www.ietf.org/internet-drafts/draft-turk-ertb-00.txt
> <http://www.ietf.org/internet-drafts/draft-turk-ertb-00.txt>
>  
> Regards
> Doughan Turk
>  
>   _____
> 
> 
>  <http://www.bell.ca/> Bell Canada Logo
> 
> Doughan Turk, M.Eng., CCIE#7274
> Bell Canada, IP Engineering
> Flr 3, 100 Wynford Dr.
> Toronto M3C 1K4
> 
> Voice: (416) 353-0368
> Fax: (416) 445-9893
> Pager: (416) 685-0590
>  <http://www.bell.ca/> Bell Canada
> 
>  Cisco Certified Internetwork Expert 
> <http://www3.sympatico.ca/doughan/ccie_large.gif>
> 
>          "To Engineer and Integrate world class
>                               IP Services and Technologies for BCE" 	
>  





Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id RAA22216 for <idr-archive@nic.merit.edu>; Tue, 30 Jul 2002 17:05:47 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 5C6F391222; Tue, 30 Jul 2002 17:05:29 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 201E3912BD; Tue, 30 Jul 2002 17:05:29 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id C50F391222 for <idr@trapdoor.merit.edu>; Tue, 30 Jul 2002 17:05:27 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id AF9DB5DEBB; Tue, 30 Jul 2002 17:05:27 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from dm3cn8.bell.ca (dm3cn8.bell.ca [206.47.0.145]) by segue.merit.edu (Postfix) with SMTP id 6F2CA5DE3C for <idr@merit.edu>; Tue, 30 Jul 2002 17:05:27 -0400 (EDT)
Received: from 206.47.0.152 by dm3cn8.bell.ca with ESMTP (Tumbleweed MMS SMTP Relay (MMS v4.7)); Tue, 30 Jul 2002 17:05:22 -0400
X-Server-Uuid: b85f21a3-cfd1-11d3-8401-00104bf46ab7
Received: from mta2.on.bell.ca (MTA2 [198.235.102.149]) by mta2.on.bell.ca with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13) id GMA090HY; Tue, 30 Jul 2002 17:05:22 -0400
Received: FROM toroondc754.on.bell.ca BY mta2.on.bell.ca ; Tue Jul 30 17:05:21 2002 -0400
Received: by toroondc754.on.bell.ca with Internet Mail Service ( 5.5.2653.19) id <3T1H2911>; Tue, 30 Jul 2002 17:05:22 -0400
Message-ID: <7A733663D06C92499EF88AC4FD660EE3020849CD@TOROONDC755.on.bell.ca>
From: "Turk, Doughan A." <doughan.turk@bell.ca>
To: "Jeffrey Haas" <jhaas@nexthop.com>, "Turk, Doughan A." <doughan.turk@bell.ca>
Cc: idr@merit.edu
Subject: RE: Need your opinion of this.
Date: Tue, 30 Jul 2002 17:05:15 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
X-WSS-ID: 1158201817437-01-01
Content-Type: text/plain;  charset=us-ascii
Content-Transfer-Encoding: 7bit
Sender: owner-idr@merit.edu
Precedence: bulk

Jeff, comments inline


> I would like to get some discussion started on a draft I wrote. It was 
> suggested to me to have the draft worked on under this group.

Although this is BGP, this seems more of something along the lines of
operations.  But I'm not going to debate that strongly.

*** As far as I know this technique could not be done using other routing
protocols. It is totally related to BGP ***

> I appreciate
> any constructive comments and pointers. In summary, the draft could be 
> classified as an operational one. It is an enhancement of a technique 
> that already exists but relying more on BGP as a protocol to help in 
> network security issues.

Did I see this as a presentation at a NANOG, or was that a UU-Net/Worldcom
presentation?  This sounds very familiar.

*** As explained in the draft, this is an enhancement to a technique which
in this case is something you saw in Nanog. The new technique builds on the
existing one. The technique adds a way to pull traffic into a tunnel, please
refer to the draft for more information.***

In any case, the technique seems to be pretty solid.  My biggest
observations about the draft:

1. Your sink addresses are always mentioned as RFC 1918.  Sure, you can do
this, but why not some prefix within your own network? 2. A little more talk
about filtering the internal control communities at the edges may be
warranted in the security considerations section.

*** In an ISP environment you would most likely see RFC 1918 routes
statically sent to a null interface on many routers especially edge routers.
On a Juniper box I believe this is the default behavior and you need not to
add any static routes. Therefore riding on the back on an existing common
practice make sense. 

Thanks for the comments
DT



Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id QAA21766 for <idr-archive@nic.merit.edu>; Tue, 30 Jul 2002 16:52:42 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 32533912C7; Tue, 30 Jul 2002 16:52:15 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 01AAB912C8; Tue, 30 Jul 2002 16:52:14 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id D5D86912C7 for <idr@trapdoor.merit.edu>; Tue, 30 Jul 2002 16:52:13 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id ADC7E5E088; Tue, 30 Jul 2002 16:52:13 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from presque.djinesys.com (dns.nexthop.com [64.211.218.216]) by segue.merit.edu (Postfix) with ESMTP id ED5015E081 for <idr@merit.edu>; Tue, 30 Jul 2002 16:52:12 -0400 (EDT)
Received: (from root@localhost) by presque.djinesys.com (8.11.3/8.11.1) id g6UKq2l56538; Tue, 30 Jul 2002 16:52:02 -0400 (EDT) (envelope-from jhaas@jhaas.nexthop.com)
Received: from jhaas.nexthop.com (jhaas.nexthop.com [64.211.218.31]) by presque.djinesys.com (8.11.3/8.11.1) with ESMTP id g6UKpx156531; Tue, 30 Jul 2002 16:51:59 -0400 (EDT) (envelope-from jhaas@jhaas.nexthop.com)
Received: (from jhaas@localhost) by jhaas.nexthop.com (8.11.3nb1/8.11.3) id g6UKpwC03031; Tue, 30 Jul 2002 16:51:58 -0400 (EDT)
Date: Tue, 30 Jul 2002 16:51:58 -0400
From: Jeffrey Haas <jhaas@nexthop.com>
To: "Abarbanel, Benjamin" <Benjamin.Abarbanel@marconi.com>
Cc: "'Susan Hares'" <skh@nexthop.com>, Kunihiro Ishiguro <kunihiro@zebra.org>, BGP mailing list <idr@merit.edu>
Subject: Re: TCP state machine doubts
Message-ID: <20020730165158.B1022@nexthop.com>
References: <39469E08BD83D411A3D900204840EC558227D9@vie-msgusr-01.dc.fore.com>
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
User-Agent: Mutt/1.2.5i
In-Reply-To: <39469E08BD83D411A3D900204840EC558227D9@vie-msgusr-01.dc.fore.com>; from Benjamin.Abarbanel@Marconi.com on Thu, Jul 25, 2002 at 11:19:11AM -0400
X-Virus-Scanned: by AMaViS perl-11
Sender: owner-idr@merit.edu
Precedence: bulk

On Thu, Jul 25, 2002 at 11:19:11AM -0400, Abarbanel, Benjamin wrote:
> There are various phases that BGP goes through that one
> might want to use multiple threads. The inter-locking or inter-working
> between the threads could affect the state transitions. For example, Say you
> are in Establish state and something happens to the session and it is
> disconnected. In a single threaded environment it is easy, first come, first
> served. In a multi-threaded case with thread priorities and interruptions,
> it might get tricky at times. Some assumptions or expectations by the state
> machine on actions it should take with regards to semaphore locks (RIB
> Table, etc.), thread prioritization, processing, pre-emption and so forth
> might be helpfull. Maybe these actions should only be suggestions not
> required ways for implementations.

There are certain situations where the FSM suggests that things be
done in certain events where an implementation *could* do things
in a slightly different order.

For example, after the TCP connection has completed, you are supposed
to send your messages *before* reading the socket.  In a multi-threaded
environment, you may obviously not do this.  Indeed, for passive or
unconfigured peers, you may not even send your message immediately.

The FSM would suggest where your synchronization points need to go.
Happily, most of this work seems to need to be done during peering setup.

For the most part, like the 3-rib abstraction, you should be able
to do whatever you want inside your black box so long as it looks
right on the wire.  The FSM gives guidance on what probably is
happening in the box to get things out as you expect them to.

More specific examples might be helpful.

> Ben

-- 
Jeff Haas 
NextHop Technologies


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id QAA21547 for <idr-archive@nic.merit.edu>; Tue, 30 Jul 2002 16:45:06 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 07312912C6; Tue, 30 Jul 2002 16:44:44 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id C499D912C7; Tue, 30 Jul 2002 16:44:43 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id ACB83912C6 for <idr@trapdoor.merit.edu>; Tue, 30 Jul 2002 16:44:42 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 8D3785DFA8; Tue, 30 Jul 2002 16:44:42 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from presque.djinesys.com (dns.nexthop.com [64.211.218.216]) by segue.merit.edu (Postfix) with ESMTP id 365C85DEC3 for <idr@merit.edu>; Tue, 30 Jul 2002 16:44:42 -0400 (EDT)
Received: (from root@localhost) by presque.djinesys.com (8.11.3/8.11.1) id g6UKiec56277; Tue, 30 Jul 2002 16:44:40 -0400 (EDT) (envelope-from jhaas@jhaas.nexthop.com)
Received: from jhaas.nexthop.com (jhaas.nexthop.com [64.211.218.31]) by presque.djinesys.com (8.11.3/8.11.1) with ESMTP id g6UKia156270; Tue, 30 Jul 2002 16:44:36 -0400 (EDT) (envelope-from jhaas@jhaas.nexthop.com)
Received: (from jhaas@localhost) by jhaas.nexthop.com (8.11.3nb1/8.11.3) id g6UKia101983; Tue, 30 Jul 2002 16:44:36 -0400 (EDT)
Date: Tue, 30 Jul 2002 16:44:36 -0400
From: Jeffrey Haas <jhaas@nexthop.com>
To: "Pusz, MateuszX" <MateuszX.Pusz@intel.com>
Cc: "'idr@merit.edu'" <idr@merit.edu>
Subject: Re: CEASE problems
Message-ID: <20020730164436.A1022@nexthop.com>
References: <413FBB0BA5AED1119122000083234B1A030A99FA@alpha.igk.intel.com>
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
User-Agent: Mutt/1.2.5i
In-Reply-To: <413FBB0BA5AED1119122000083234B1A030A99FA@alpha.igk.intel.com>; from MateuszX.Pusz@intel.com on Fri, Jul 26, 2002 at 09:07:42AM +0200
X-Virus-Scanned: by AMaViS perl-11
Sender: owner-idr@merit.edu
Precedence: bulk

On Fri, Jul 26, 2002 at 09:07:42AM +0200, Pusz, MateuszX wrote:
> I have one problem with CEASE message. 'draft-ietf-idr-bgp4-17' says in 6.7:
>    In absence of any fatal errors (that are indicated in this section),
>    a BGP peer may choose at any given time to close its BGP connection
>    by sending the NOTIFICATION message with Error Code Cease. However,
>    the Cease NOTIFICATION message must not be used when a fatal error
>    indicated by this section does exist.
> 
> but in section 8-Established state:
>    In response to a stop event initiated by an operator:
>       - release all resources (including deleting all routes),
>       - set ConnectRetryCnt to zero (0),
>       - set connect retry timer to zero (0), and
>       - transition to the Idle.
> 
> so how shuold it be? Should BGP router in such a situation send that
> notification message or not?

You are correct, the NOTIFY should be sent.

This is correctly documented in:
http://www.ietf.org/internet-drafts/draft-hares-bgp-statemt-01.txt

-- 
Jeff Haas 
NextHop Technologies


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id QAA21288 for <idr-archive@nic.merit.edu>; Tue, 30 Jul 2002 16:37:57 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 0017A91220; Tue, 30 Jul 2002 16:37:28 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id C82F1912C6; Tue, 30 Jul 2002 16:37:27 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id B3D6D91220 for <idr@trapdoor.merit.edu>; Tue, 30 Jul 2002 16:37:26 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 97A045DEC3; Tue, 30 Jul 2002 16:37:26 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from presque.djinesys.com (dns.nexthop.com [64.211.218.216]) by segue.merit.edu (Postfix) with ESMTP id 126B95DE72 for <idr@merit.edu>; Tue, 30 Jul 2002 16:37:26 -0400 (EDT)
Received: (from root@localhost) by presque.djinesys.com (8.11.3/8.11.1) id g6UKbHi55849; Tue, 30 Jul 2002 16:37:17 -0400 (EDT) (envelope-from jhaas@jhaas.nexthop.com)
Received: from jhaas.nexthop.com (jhaas.nexthop.com [64.211.218.31]) by presque.djinesys.com (8.11.3/8.11.1) with ESMTP id g6UKbC155842; Tue, 30 Jul 2002 16:37:12 -0400 (EDT) (envelope-from jhaas@jhaas.nexthop.com)
Received: (from jhaas@localhost) by jhaas.nexthop.com (8.11.3nb1/8.11.3) id g6UKbCa00908; Tue, 30 Jul 2002 16:37:12 -0400 (EDT)
Date: Tue, 30 Jul 2002 16:37:12 -0400
From: Jeffrey Haas <jhaas@nexthop.com>
To: andrewl@xix-w.bengi.exodus.net
Cc: "Turk, Doughan A." <doughan.turk@bell.ca>, idr@merit.edu
Subject: Re: Need your opinion of this.
Message-ID: <20020730163712.A29958@nexthop.com>
References: <7A733663D06C92499EF88AC4FD660EE302084641@TOROONDC755.on.bell.ca> <20020730101953.A18198@demiurge.exodus.net>
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
User-Agent: Mutt/1.2.5i
In-Reply-To: <20020730101953.A18198@demiurge.exodus.net>; from andrewl@xix-w.bengi.exodus.net on Tue, Jul 30, 2002 at 10:19:53AM -0700
X-Virus-Scanned: by AMaViS perl-11
Sender: owner-idr@merit.edu
Precedence: bulk

On Tue, Jul 30, 2002 at 10:19:53AM -0700, andrewl@xix-w.bengi.exodus.net wrote:
> 1) Why use BGP communities?  It seems like that just exposes the network
> to the possibility of abuse of those communities.  The NOC or Security
> group can just log into the router in question, and apply the NH statement.

The trick is to distribute it across your network.
Dropping something like this as a host route into your IGP could do
the trick - you could even tag the routes to let you know they're
"special".
However, what if you need to cross igp domains?

-- 
Jeff Haas 
NextHop Technologies


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id NAA14832 for <idr-archive@nic.merit.edu>; Tue, 30 Jul 2002 13:23:11 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 160B2912BA; Tue, 30 Jul 2002 13:22:52 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id CF3DD912BC; Tue, 30 Jul 2002 13:22:51 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id C782E912BA for <idr@trapdoor.merit.edu>; Tue, 30 Jul 2002 13:22:50 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id A9C445DE74; Tue, 30 Jul 2002 13:22:50 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from demiurge.exodus.net (demiurge.exodus.net [216.32.171.82]) by segue.merit.edu (Postfix) with ESMTP id 4A0F35DE04 for <idr@merit.edu>; Tue, 30 Jul 2002 13:22:50 -0400 (EDT)
Received: (from andrewl@localhost) by demiurge.exodus.net (8.9.3+Sun/8.9.3) id KAA18611; Tue, 30 Jul 2002 10:19:53 -0700 (PDT)
Date: Tue, 30 Jul 2002 10:19:53 -0700
From: andrewl@xix-w.bengi.exodus.net
To: "Turk, Doughan A." <doughan.turk@bell.ca>
Cc: idr@merit.edu
Subject: Re: Need your opinion of this.
Message-ID: <20020730101953.A18198@demiurge.exodus.net>
References: <7A733663D06C92499EF88AC4FD660EE302084641@TOROONDC755.on.bell.ca>
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
User-Agent: Mutt/1.2.5i
In-Reply-To: <7A733663D06C92499EF88AC4FD660EE302084641@TOROONDC755.on.bell.ca>; from doughan.turk@bell.ca on Fri, Jul 26, 2002 at 04:09:19PM -0400
Sender: owner-idr@merit.edu
Precedence: bulk

Hi Doughan,

Going through the draft, a couple of thoughts came up:

1) Why use BGP communities?  It seems like that just exposes the network
to the possibility of abuse of those communities.  The NOC or Security
group can just log into the router in question, and apply the NH statement.

2) Why use RFC 1918?  One could also use public space.

3) This probably belongs in ops, not routing, but that is an admin detail.

Andrew

On Fri, Jul 26, 2002 at 04:09:19PM -0400, Turk, Doughan A. wrote:
> Delivered-To: idr-outgoing@trapdoor.merit.edu
> Delivered-To: idr@trapdoor.merit.edu
> Delivered-To: idr@merit.edu
> X-Server-Uuid: b85f21a3-cfd1-11d3-8401-00104bf46ab7
> From: "Turk, Doughan A." <doughan.turk@bell.ca>
> To: idr@merit.edu
> Cc: "Turk, Doughan A." <doughan.turk@bell.ca>
> Subject: Need your opinion of this.
> Date: Fri, 26 Jul 2002 16:09:19 -0400
> X-Mailer: Internet Mail Service (5.5.2653.19)
> X-WSS-ID: 115F73FE134713-01-01
> Precedence: bulk
> X-OriginalArrivalTime: 26 Jul 2002 20:10:58.0281 (UTC) FILETIME=[8E32D590:01C234E0]
> 
> Folks, 
>  
> I would like to get some discussion started on a draft I wrote. It was
> suggested to me to have the draft worked on under this group. I appreciate
> any constructive comments and pointers. In summary, the draft could be
> classified as an operational one. It is an enhancement of a technique that
> already exists but relying more on BGP as a protocol to help in network
> security issues.
>  
> A copy of the draft could be found at
> http://www.ietf.org/internet-drafts/draft-turk-ertb-00.txt
> <http://www.ietf.org/internet-drafts/draft-turk-ertb-00.txt> 
>  
> Regards
> Doughan Turk
>  
>   _____  
> 
> 
>  <http://www.bell.ca/> Bell Canada Logo
> 
> Doughan Turk, M.Eng., CCIE#7274
> Bell Canada, IP Engineering
> Flr 3, 100 Wynford Dr. 
> Toronto M3C 1K4
> 
> Voice: (416) 353-0368
> Fax: (416) 445-9893
> Pager: (416) 685-0590
>  <http://www.bell.ca/> Bell Canada
> 
>  Cisco Certified Internetwork Expert
> <http://www3.sympatico.ca/doughan/ccie_large.gif> 
> 
>          "To Engineer and Integrate world class
>                               IP Services and Technologies for BCE" 	
>  



Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id LAA11006 for <idr-archive@nic.merit.edu>; Tue, 30 Jul 2002 11:29:50 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 7FBC0912B5; Tue, 30 Jul 2002 11:27:09 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 42D7B912B4; Tue, 30 Jul 2002 11:27:09 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 86B8C912B1 for <idr@trapdoor.merit.edu>; Tue, 30 Jul 2002 11:27:06 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 59A505DFE9; Tue, 30 Jul 2002 11:27:06 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from presque.djinesys.com (dns.nexthop.com [64.211.218.216]) by segue.merit.edu (Postfix) with ESMTP id D0F5F5DEAE for <idr@merit.edu>; Tue, 30 Jul 2002 11:27:05 -0400 (EDT)
Received: (from root@localhost) by presque.djinesys.com (8.11.3/8.11.1) id g6UFR4540318; Tue, 30 Jul 2002 11:27:04 -0400 (EDT) (envelope-from jhaas@jhaas.nexthop.com)
Received: from jhaas.nexthop.com (jhaas.nexthop.com [64.211.218.31]) by presque.djinesys.com (8.11.3/8.11.1) with ESMTP id g6UFR1140310; Tue, 30 Jul 2002 11:27:01 -0400 (EDT) (envelope-from jhaas@jhaas.nexthop.com)
Received: (from jhaas@localhost) by jhaas.nexthop.com (8.11.3nb1/8.11.3) id g6UFR1P16970; Tue, 30 Jul 2002 11:27:01 -0400 (EDT)
Date: Tue, 30 Jul 2002 11:27:01 -0400
From: Jeffrey Haas <jhaas@nexthop.com>
To: "Turk, Doughan A." <doughan.turk@bell.ca>
Cc: idr@merit.edu
Subject: Re: Need your opinion of this.
Message-ID: <20020730112701.D16595@nexthop.com>
References: <7A733663D06C92499EF88AC4FD660EE302084641@TOROONDC755.on.bell.ca>
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
User-Agent: Mutt/1.2.5i
In-Reply-To: <7A733663D06C92499EF88AC4FD660EE302084641@TOROONDC755.on.bell.ca>; from doughan.turk@bell.ca on Fri, Jul 26, 2002 at 04:09:19PM -0400
X-Virus-Scanned: by AMaViS perl-11
Sender: owner-idr@merit.edu
Precedence: bulk

On Fri, Jul 26, 2002 at 04:09:19PM -0400, Turk, Doughan A. wrote:
> I would like to get some discussion started on a draft I wrote. It was
> suggested to me to have the draft worked on under this group.

Although this is BGP, this seems more of something along the
lines of operations.  But I'm not going to debate that strongly.

> I appreciate
> any constructive comments and pointers. In summary, the draft could be
> classified as an operational one. It is an enhancement of a technique that
> already exists but relying more on BGP as a protocol to help in network
> security issues.

Did I see this as a presentation at a NANOG, or was that a UU-Net/Worldcom
presentation?  This sounds very familiar.

In any case, the technique seems to be pretty solid.  My biggest
observations about the draft:

1. Your sink addresses are always mentioned as RFC 1918.  Sure, you
can do this, but why not some prefix within your own network?
2. A little more talk about filtering the internal control communities
at the edges may be warranted in the security considerations section.

> Doughan Turk

-- 
Jeff Haas 
NextHop Technologies


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id VAA13845 for <idr-archive@nic.merit.edu>; Mon, 29 Jul 2002 21:44:34 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 3A92E91297; Mon, 29 Jul 2002 21:43:56 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 060FB91298; Mon, 29 Jul 2002 21:43:55 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 8B18B91297 for <idr@trapdoor.merit.edu>; Mon, 29 Jul 2002 21:43:54 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 6E35E5DEEB; Mon, 29 Jul 2002 21:43:54 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from mta1 (unknown [61.144.161.15]) by segue.merit.edu (Postfix) with ESMTP id 1308E5DDC8 for <idr@merit.edu>; Mon, 29 Jul 2002 21:43:54 -0400 (EDT)
Received: from lidefeng (localhost [127.0.0.1]) by mta1.huawei.com (iPlanet Messaging Server 5.2 HotFix 0.7 (built Jun 26 2002)) with ESMTPA id <0H0100EKRGS6CE@mta1.huawei.com> for idr@merit.edu; Tue, 30 Jul 2002 09:43:19 +0800 (CST)
Date: Thu, 11 Jul 2002 09:40:14 +0800
From: lidefeng <lidefeng@huawei.com>
Subject: about:draft-wilfong-ibgp-oscillations-00.txt
To: gtw@lucent.com, anindyabasu@lucent.com, bshep@lucent.com, Luke.Ong@comlab.ox.ac.uk, arasala@theory.lcs.mit.edu
Cc: idr@merit.edu
Message-id: <007801c2287b$e79e60c0$22436e0a@HUAWEI.COM>
MIME-version: 1.0
X-MIMEOLE: Produced By Microsoft MimeOLE V6.00.2600.0000
X-Mailer: Microsoft Outlook Express 6.00.2600.0000
Content-type: text/plain; charset=iso-8859-2
Content-transfer-encoding: 7BIT
X-Priority: 3
X-MSMail-priority: Normal
Sender: owner-idr@merit.edu
Precedence: bulk

Hi,

    In Section 3 An Extension to BGP of  draft-wilfong-ibgp-oscillations-00.txt,there is an example as to the relevant figure as follows:

 "  As an example, consider again the configuration shown in Figure 1.
   The modified BGP process will have router RR1 always announcing
   the route through C1 even though it will choose the route
   through C2 as its best route.
   Similarly, RR2 will choose the route through C3 and RR3 will
   choose the route through C1."

IMO, what you said  that RR1 will choose the route through C2,RR2 will choose the route through C3 and RR3 will
   choose the route through C1 are referred to IGP route , as to BGP,  in  the light of RFC 1771,the RR1  will reflect the route learned from C1 to
RR2 and RR3,the RR2  will reflect the route learned from C2 to RR1 and RR3, the RR3  will reflect the route learned from C3 to RR1 and RR2,
at last, every BGP route advertised by C1,C2 or C3 will reside in RR1,RR2,RR3,C1,C2,C3,and every  prefix will have three nexthops.

As to IGP, there are another scheme to compute the IGP route, if there aren't  route redistribution between IGP and BGP, the IGP route 
pattern and BGP route pattern are independent respectively, and there is little chance of route oscillation.

Li Defeng
 



Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id KAA20994 for <idr-archive@nic.merit.edu>; Mon, 29 Jul 2002 10:06:48 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id CF9C69123A; Mon, 29 Jul 2002 10:06:18 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 915179123F; Mon, 29 Jul 2002 10:06:18 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 6283F9123A for <idr@trapdoor.merit.edu>; Mon, 29 Jul 2002 10:06:17 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 3F2415DE65; Mon, 29 Jul 2002 10:06:17 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from mailgate.pit.comms.marconi.com (mailgate.pit.comms.marconi.com [169.144.68.6]) by segue.merit.edu (Postfix) with ESMTP id B75415DDBE for <idr@merit.edu>; Mon, 29 Jul 2002 10:06:16 -0400 (EDT)
Received: from mailman.pit.comms.marconi.com (mailman.pit.comms.marconi.com [169.144.2.12]) by mailgate.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id KAA16042; Mon, 29 Jul 2002 10:06:14 -0400 (EDT)
Received: from whq-msgrtr-01.pit.comms.marconi.com (whq-msgrtr-01.pit.comms.marconi.com [169.144.2.221]) by mailman.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id KAA01641; Mon, 29 Jul 2002 10:06:15 -0400 (EDT)
Received: by whq-msgrtr-01.pit.comms.marconi.com with Internet Mail Service (5.5.2650.21) id <3W991XTB>; Mon, 29 Jul 2002 10:06:14 -0400
Message-ID: <39469E08BD83D411A3D900204840EC558227DE@vie-msgusr-01.dc.fore.com>
From: "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
To: "'Turk, Doughan A.'" <doughan.turk@bell.ca>, PamSri <pamsri01@yahoo.com>
Cc: idr@merit.edu
Subject: RE: Need your opinion of this.
Date: Mon, 29 Jul 2002 10:06:14 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2650.21)
Content-Type: text/plain; charset="iso-8859-1"
Sender: owner-idr@merit.edu
Precedence: bulk

Doughan:

Since some of the traffic that is trapped with the DoS traffic is going to
be
redirected to the sinkhole router. would that cause those poor cutomers to
loose thier communications?

What if the sinkhole router was smart enough to filter through the good
packets from the bad, could it then redirect the good packets back to their
intended destination? 

If so, that implies the sinkhole routers have alot more cycles to handle all
the load that will be imposed on them by this service.

Is source and destination address the only things needed to filter good
packets from bad ones?

My instinct tells me it is better to stop DoS traffic before it enters the
cloud on the border router, then try to manage it at some redirected
sinkhole router. 

Communities attribute although they tend to converge on some Next Hop would
tend to propagate the traffic through the cloud and thereby cause more
loading conditions on the intermediary routers.


Ben


> -----Original Message-----
> From: Turk, Doughan A. [mailto:doughan.turk@bell.ca]
> Sent: Friday, July 26, 2002 9:24 PM
> To: PamSri
> Cc: idr@merit.edu
> Subject: RE: Need your opinion of this.
> 
> 
> I am using BGP to help stopping DoS and DDoS traffic. BGP can 
> send updates
> quickly and route-maps can translate communities to stop 
> traffic destined to
> a certain destination. TCP is not the issue here. Attackers 
> usually relay on
> the fact that most routers out there do destination based 
> routing without
> checking the validity of the source address in case it is spoofed.
> 
> DT
> 
> -----Original Message-----
> From: PamSri [mailto:pamsri01@yahoo.com] 
> Sent: Friday, July 26, 2002 9:08 PM
> To: Turk, Doughan A.
> Subject: RE: Need your opinion of this.
> 
> 
> Hi Turk ,
>          Just started to read through your doc. I have
> a question :
> BGP is an application from the perspective of TCP. Why
> do you see the need for provding such security at the 
> application level than
> TCP ? Do you think the security features in TCP as inadequate 
> ? It would be
> really a good idea to provide security at the underlying 
> layers than at this
> layer. 
> 
> sri
> 
> --- "Turk, Doughan A." <doughan.turk@bell.ca> wrote:
> > Benjamin,
> >  
> > BGP will be used to alter next-hop address for a
> > particular destination
> > address (representing a customer under attack) on
> > selected Routers. Instead
> > of using BGP to announce a different next-hop for
> > the destination address
> > everywhere in your network. we can use communities
> > to make it selective.
> > once that is achieved, your next hop can be a null
> > interface or a Tunnel
> > interface where the traffic is then pulled to the
> > sinkhole router.
> >  
> > Based on Attack signature you could do various
> > packet lever filtering ( with
> > hardware as a constrain),  simply rate limit the
> > flow or send it out through
> > a shared media (Ethernet) for sniffing after you
> > pull it to a sinkhole
> > router.
> >  
> > As for overheads on routers in the network, there is
> > hardly any since you're
> > only altering next-hop on border routers or edge
> > routers. MPLS tunnel are
> > what I used in my lab. as for the sinkhole router,
> > one can invest in a high
> > performance line card. The sinkhole router should be
> > powerful enough to
> > handle whatever filtering or policing you deem
> > required to massage the flow.
> > 
> >  
> > Scalability is high in my opinion. the route maps on
> > the borders can be a
> > part of your template. announcing the destination
> > address with the special
> > community would have to be done on only one router
> > (sinkhole is a good
> > idea).
> >  
> > As for where to place the route maps or policy
> > statements, IMHO, this should
> > be applied anywhere a large attack can enter from.
> > (I.e Transit links or to
> > Peers with large number of subscribers) I wouldn't
> > apply it on a customer
> > interface since a customer is in my reach and can be contacted 
> > immediately in case of an attack initiated by him. Another place
> > to apply it on is at
> > peering interfaces where an access-list or policing
> > statement might bring
> > down interface performance. at this point traffic
> > can be pulled in the
> > sinkhole router for the policies to be applied.
> >  
> > This technique can be used in high volume attacks
> > where source based
> > filtering is not possible because of spoofing or
> > high number of source IP
> > addresses. In many cases ISP's are forced to kill
> > traffic destined to a
> > single /32 to relief the rest of the customer
> > network from suffering from
> > the high volume of traffic. I would also use this in
> > cases of attacks
> > targeting infrastructure.
> >  
> > As for the last question, yes I have built this in a
> > large lab and got the
> > result I expected.
> >  
> > Hope this answers your questions.
> >  
> > Regards
> > DT
> >  
> > 
> > -----Original Message-----
> > From: Abarbanel, Benjamin [mailto:Benjamin.Abarbanel@Marconi.com]
> > Sent: Friday, July 26, 2002 4:55 PM
> > To: 'Turk, Doughan A.'; idr@merit.edu
> > Subject: RE: Need your opinion of this.
> > 
> > 
> > Doughan:
> >   Let me see if I understand the fundamental point
> > of your draft. You want
> > BGP via communities to be the big MAGNET that
> > catches all the denial of
> > service attack routes and shoves them through some
> > sink hole router that
> > will keep statistics and knowledge of the source of
> > the attack and be able
> > to report it to the operator?
> >  
> > Did I summarize it properly?
> >  
> > My first reaction, is what kind of overhead will
> > these techniques impose on
> > the routers and the sink hole one? 
> >  
> > Will it be scallable? 
> >  
> > Can this be done only on border routers, Router
> > Reflectors? To minimize the
> > processing load needed?
> >  
> > What real life cases have you seen where this
> > mechanism was needed?
> >  
> > Have you prototyped this idea and gotten some
> > empirical data?
> >  
> > Ben
> > 
> > -----Original Message-----
> > From: Turk, Doughan A. [mailto:doughan.turk@bell.ca]
> > Sent: Friday, July 26, 2002 4:09 PM
> > To: idr@merit.edu
> > Cc: Turk, Doughan A.
> > Subject: Need your opinion of this.
> > 
> > 
> > Folks, 
> >  
> > I would like to get some discussion started on a
> > draft I wrote. It was
> > suggested to me to have the draft worked on under
> > this group. I appreciate
> > any constructive comments and pointers. In summary,
> > the draft could be
> > classified as an operational one. It is an
> > enhancement of a technique that
> > already exists but relying more on BGP as a protocol
> > to help in network
> > security issues.
> >  
> > A copy of the draft could be found at
> >
> http://www.ietf.org/internet-drafts/draft-turk-ertb-00.txt
> >
> <http://www.ietf.org/internet-drafts/draft-turk-ertb-00.txt>
> > 
> >  
> > Regards
> > Doughan Turk
> >  
> >   _____  
> > 
> > 
> >  <http://www.bell.ca/> Bell Canada Logo
> > 
> > Doughan Turk, M.Eng., CCIE#7274
> > Bell Canada, IP Engineering
> > Flr 3, 100 Wynford Dr. 
> > Toronto M3C 1K4
> > 
> > Voice: (416) 353-0368
> > Fax: (416) 445-9893
> > Pager: (416) 685-0590
> >  <http://www.bell.ca/> Bell Canada
> > 
> >  Cisco Certified Internetwork Expert
> > <http://www3.sympatico.ca/doughan/ccie_large.gif> 
> > 
> >          "To Engineer and Integrate world class
> >                               IP Services and
> > Technologies for BCE" 	
> >  
> > 
> > 
> 
> 
> __________________________________________________
> Do You Yahoo!?
> Yahoo! Health - Feel better, live better
> http://health.yahoo.com
> 
> 


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id CAA07252 for <idr-archive@nic.merit.edu>; Mon, 29 Jul 2002 02:25:52 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id A384391272; Mon, 29 Jul 2002 02:25:24 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 7750E91275; Mon, 29 Jul 2002 02:25:24 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 15F6291272 for <idr@trapdoor.merit.edu>; Mon, 29 Jul 2002 02:25:23 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id F38BF5E055; Mon, 29 Jul 2002 02:25:22 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from mta0 (unknown [61.144.161.2]) by segue.merit.edu (Postfix) with ESMTP id A9B8D5DDDD for <idr@merit.edu>; Mon, 29 Jul 2002 02:25:18 -0400 (EDT)
Received: from lidefeng (localhost [127.0.0.1]) by mta0.huawei.com (iPlanet Messaging Server 5.2 HotFix 0.7 (built Jun 26 2002)) with ESMTPA id <0GZZ008NQZ2OR6@mta0.huawei.com> for idr@merit.edu; Mon, 29 Jul 2002 14:23:13 +0800 (CST)
Date: Wed, 10 Jul 2002 14:21:27 +0800
From: lidefeng <lidefeng@huawei.com>
Subject: about: draft-ietf-idr-route-oscillation-01.txt
To: danny@tcb.net, vijay@umbc.edu, dwalton@cisco.com, aretana@cisco.com
Cc: idr@merit.edu
Message-id: <003001c227da$063b5260$22436e0a@HUAWEI.COM>
MIME-version: 1.0
X-MIMEOLE: Produced By Microsoft MimeOLE V6.00.2600.0000
X-Mailer: Microsoft Outlook Express 6.00.2600.0000
Content-type: text/plain; charset=iso-8859-2
Content-transfer-encoding: 7BIT
X-Priority: 3
X-MSMail-priority: Normal
Sender: owner-idr@merit.edu
Precedence: bulk

Hi,

    In "Section 4.1. Route Reflection and Type I Churn" of draft-ietf-idr-route-oscillation-01.txt, you stated as follows: 

    1) Ra has the following installed in its BGP table with
        the path learned via AS2 marked best:

                          NEXT_HOP
           AS_PATH  MED   IGP Cost
           -----------------------
             6 100    1          4
          * 10 100   10       5

I have some questiion:
(1)  In figure 1, I can't find any AS numbered as AS2,
(2) I wonder why Ra learned the wrong route information?

Because I was confused from the beginning, so I can't grope along the draft, I hope you can clarity the above questions
so that I can advance along.

Li Defeng


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id XAA01070 for <idr-archive@nic.merit.edu>; Sun, 28 Jul 2002 23:10:28 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id F226F9126D; Sun, 28 Jul 2002 23:09:59 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id BBC7691288; Sun, 28 Jul 2002 23:09:58 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 525519126D for <idr@trapdoor.merit.edu>; Sun, 28 Jul 2002 23:09:57 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 3888F5E042; Sun, 28 Jul 2002 23:09:57 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from ep_mailout1 (unknown [203.254.224.24]) by segue.merit.edu (Postfix) with ESMTP id CCC7B5DDC4 for <idr@merit.edu>; Sun, 28 Jul 2002 23:09:56 -0400 (EDT)
Received: from custom-daemon.mailout1.samsung.com by mailout1.samsung.com (iPlanet Messaging Server 5.1 (built Sep  5 2001)) id <0GZZ00801Q8MY1@mailout1.samsung.com> for idr@merit.edu; Mon, 29 Jul 2002 12:12:22 +0900 (KST)
Received: from ep_mmp2 (localhost [127.0.0.1]) by mailout1.samsung.com (iPlanet Messaging Server 5.1 (built Sep  5 2001)) with ESMTP id <0GZZ0087GQ8LM6@mailout1.samsung.com> for idr@merit.edu; Mon, 29 Jul 2002 12:12:22 +0900 (KST)
Received: from Manav ([107.108.3.180]) by mmp2.samsung.com (iPlanet Messaging Server 5.1 (built Sep  5 2001)) with ESMTPA id <0GZZ00F7VQ8UNX@mmp2.samsung.com> for idr@merit.edu; Mon, 29 Jul 2002 12:12:32 +0900 (KST)
Date: Mon, 29 Jul 2002 08:35:19 +0530
From: Manav Bhatia <manav@samsung.com>
Subject: Re: draft-ietf-idr-bgp4-18.txt
To: lidefeng <lidefeng@huawei.com>, idr@merit.edu
Reply-To: Manav Bhatia <manav@samsung.com>
Message-id: <003401c236ac$cab1a5f0$b4036c6b@sisodomain.com>
MIME-version: 1.0
X-MIMEOLE: Produced By Microsoft MimeOLE V6.00.2600.0000
X-Mailer: Microsoft Outlook Express 6.00.2600.0000
Content-type: text/plain; charset=iso-8859-1
Content-transfer-encoding: 7BIT
X-Priority: 3
X-MSMail-priority: Normal
References: <006e01c23473$dfc9e870$b4036c6b@sisodomain.com> <002601c23568$1fdc5700$22436e0a@HUAWEI.COM>
Sender: owner-idr@merit.edu
Precedence: bulk

umm .. what i meant was that the suggested changes should go into the draft
mentioned in the subject i.e 18th.
apologies for the confusion .. i was referring to the draft 17 only!

----- Original Message -----
From: "lidefeng" <lidefeng@huawei.com>
To: "Manav Bhatia" <manav@samsung.com>; <idr@merit.edu>
Sent: Saturday, July 27, 2002 5:51 PM
Subject: Re: draft-ietf-idr-bgp4-18.txt


| hi,
|
|     I can only get the draft-ietf-idr-bgp4-17.txt from the
"Internet-Drafts Index ", and where can I get the newest
version:Internet-Drafts Index , which you are discussing.And I wonder why
"Internet-Drafts Index" doesn't list the latest version of draft. I am
always downloading the draft from there, and I am afraid I will drag behind
the discuss.
|
| Regards
|
| Li Defeng
|
|
| ----- Original Message -----
| From: "Manav Bhatia" <manav@samsung.com>
| To: <idr@merit.edu>
| Sent: Friday, July 26, 2002 3:12 PM
| Subject: draft-ietf-idr-bgp4-18.txt
|
|
| > sec 6.5 of the latest draft states - " If  system does not receive
| > successive KEEPALIVE and/or UPDATE and/or NOTIFICATION messages within
the
| > period specified in the Hold Time field of the OPEN message, then the
| > NOTIFICATION message with   Hold Timer Expired Error Code must be sent
and
| > the BGP connection closed"
| >
| > IMHO we must update the draft to also include other BGP messages (Route
| > Refresh, Dynamic Capability, INFORM) which when received by the local
| > system act as surrogate for the KEEPALIVE messages it would expect from
the
| > remote end. The local system upon receiving such messages must restart
its
| > Hold Timer
| >
| > Similarly the draft should mention that each time the local system
sends
| > such BGP messages it should restart its KeepAlive Timer, unless the
| > negotiated HoldTime value is zero.
| >
| > These changes can be reflected in the BGP draft or can be done in the
| > drafts introducing such BGP messages.
| >
| > Regards,
| > Manav
| >
| > ----
| > "When you are courting a nice girl an hour seems like a second. When
you
| > sit on a red-hot cinder a second seems like an hour. That's
relativity."
| >
| > -Albert Einstein, on relativity
| >
| >



Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id CAA22161 for <idr-archive@nic.merit.edu>; Sun, 28 Jul 2002 02:35:01 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id B7D2191293; Sun, 28 Jul 2002 02:34:28 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 815F091295; Sun, 28 Jul 2002 02:34:28 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 75A2491293 for <idr@trapdoor.merit.edu>; Sun, 28 Jul 2002 02:34:27 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 6110D5E002; Sun, 28 Jul 2002 02:34:27 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from prattle.redback.com (prattle.redback.com [155.53.12.9]) by segue.merit.edu (Postfix) with ESMTP id 345AB5DDE4 for <idr@merit.edu>; Sun, 28 Jul 2002 02:34:27 -0400 (EDT)
Received: from popserv1.redback.com (popserv1.redback.com [155.53.12.56]) by prattle.redback.com (Postfix) with ESMTP id 65E67473CD4; Sat, 27 Jul 2002 23:34:26 -0700 (PDT)
Received: from redback.com (fall.redback.com [155.53.36.220]) by popserv1.redback.com (Postfix) with ESMTP id 0797615D3C1; Sat, 27 Jul 2002 23:34:24 -0700 (PDT)
To: lidefeng <lidefeng@huawei.com>
Cc: qv@juniper.net, idr@merit.edu, enke@redback.com
Subject: Re: about draft-ietf-idr-as4bytes-05.txt 
In-Reply-To: Message from lidefeng <lidefeng@huawei.com>  of "Sat, 27 Jul 2002 20:58:22 +0800." <003601c2356d$4a48fd40$22436e0a@HUAWEI.COM> 
Date: Sat, 27 Jul 2002 23:34:24 -0700
From: Enke Chen <enke@redback.com>
Message-Id: <20020728063425.0797615D3C1@popserv1.redback.com>
Sender: owner-idr@merit.edu
Precedence: bulk

Hi, 

> Date: Sat, 27 Jul 2002 20:58:22 +0800
> From: lidefeng <lidefeng@huawei.com>
> Subject: about draft-ietf-idr-as4bytes-05.txt
> To: qv@juniper.net, enke@redback.com
> Cc: idr@merit.edu
> Message-id: <003601c2356d$4a48fd40$22436e0a@HUAWEI.COM>
> 
> hi,
> 
>    In "Section7. IANA Consideration" of  "draft-ietf-idr-as4bytes-05.txt" ,it
> states that "The AS number for AS_TRANS has been assigned by the IANA.", I want
> to know what is the AS_TRANS number assigned by the IANA,

The AS_TRAN is listed in http://www.iana.org/assignments/as-numbers:

    Specially designated Autonomous System Numbers:

    23456     AS_TRAN             [draft-ietf-idr-as4bytes-04.txt]

> where I can find the number assignment scheme,

Do not see any reason why the assignment scheme would be different
from the current policy.

> and all the NEW BGP SPEAKER user the only one number ?

No. (Use AS_TRAN only when a 4byte-ASN speaker talks to an OLD peer
that does not understand 4-byte AS.)

-- Enke

> if so, how to handle the situation when they update across several AS? 
> 
> Regards
> 
> Li Defeng 



Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id PAA00939 for <idr-archive@nic.merit.edu>; Sat, 27 Jul 2002 15:17:07 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 2B20A91281; Sat, 27 Jul 2002 15:16:42 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id DA8B591283; Sat, 27 Jul 2002 15:16:41 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 46E1991281 for <idr@trapdoor.merit.edu>; Sat, 27 Jul 2002 15:16:38 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 27D375DFA0; Sat, 27 Jul 2002 15:16:38 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from dm3cnd.bell.ca (dm3cnd.bell.ca [206.47.0.146]) by segue.merit.edu (Postfix) with SMTP id DD7635DDB3 for <idr@merit.edu>; Sat, 27 Jul 2002 15:16:37 -0400 (EDT)
Received: from 206.47.0.152 by dm3cnd.bell.ca with ESMTP (Tumbleweed MMS SMTP Relay (MMS v4.7)); Sat, 27 Jul 2002 15:16:37 -0400
X-Server-Uuid: b85f21a3-cfd1-11d3-8401-00104bf46ab7
Received: from mta2.on.bell.ca (MTA2 [198.235.102.149]) by mta2.on.bell.ca with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13) id GMA09DVM; Sat, 27 Jul 2002 15:16:37 -0400
Received: FROM toroondc754.on.bell.ca BY mta2.on.bell.ca ; Sat Jul 27 15:16:36 2002 -0400
Received: by toroondc754.on.bell.ca with Internet Mail Service ( 5.5.2653.19) id <3T1HFL9C>; Sat, 27 Jul 2002 15:16:36 -0400
Message-ID: <7A733663D06C92499EF88AC4FD660EE30208466A@TOROONDC755.on.bell.ca>
From: "Turk, Doughan A." <doughan.turk@bell.ca>
To: "TANDEL Sebastien" <standel@info.fundp.ac.be>, idr@merit.edu
Subject: RE: Need your opinion of this.
Date: Sat, 27 Jul 2002 15:16:36 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
X-WSS-ID: 115C2E1F143917-01-01
Content-Type: text/plain;  charset=iso-8859-1
Sender: owner-idr@merit.edu
Precedence: bulk
Content-Transfer-Encoding: 8bit
X-MIME-Autoconverted: from quoted-printable to 8bit by nic.merit.edu id PAA00939

Tandel,

no-advertise is used to avoid the advertisement of a prefix to all peers
including iBGP. The one that stops a prefix from going out to eBGP neighbors
is the no-export or no-export-subconfed. To keep things in prospective, a
router should not crash because of attack traffic transiting through it.
Routing protocols take precedence over regular traffic in my understanding.
Even if an attack is so big in volume it will never exceed your link
capacity and therefore your upstream provider will be dropping all excess
traffic. 

To clarify one thing, When you identify a destination address as a target
you do not stamp the entire network with the special community. Lets for
example take a case of a customer with a /24 of which one host /32 is under
attack at this point you announce the /32 in addition to the /24 where the
/32 has the special community but the /24 gets advertised normally.

Regards
DT

-----Original Message-----
From: TANDEL Sebastien [mailto:standel@info.fundp.ac.be] 
Sent: Saturday, July 27, 2002 10:08 AM
To: Turk, Doughan A.
Subject: Re: Need your opinion of this.


Hello,


   It's just a question about the use of the community...
   If I have understood your method, when you overwrite the community with
the well-known community no-advertise, it's to avoid the advertisement of
this community in the others eBGP sessions.

   So why don't you use the extended communities ?
   (IMHO) There are two reasons to use the extended communities and not the
community. With the extended community, you can set the non transitive bit.
This has two consequences:

   	1) this community is not advertised to other eBGP sessions
   	2) While there is an attack and if there is a drop session with
           the BGP router (by which the attack traffic goes through) in
           session with the border router (i.e. R1) which recognizes the
           regular community, the prefix under attack will not be
           advertised anymore by R1.
           So there is two cases :
   		a) this prefix is not reachable anymore. In this cases
                   there is a black hole (of course this is the case for
                   an attack routed to a network for an analysis, but
                   there will be no more analysis).
   		b) this prefix can be reach but by another route. And in
                   this case, you have to restart an other operation like
                   described in your draft on another border router.
   	   But if you are using the extended community with the non
           transitive bit set, this prefix will always be advertised in
           any cases and without the community value.

----------------------------
Tandel Sébastien
2ème licence en informatique
F.U.N.D.P.
standel@info.fundp.ac.be
----------------------------

On Fri, 26 Jul 2002, Turk, Doughan A. wrote:

> Folks,
>
> I would like to get some discussion started on a draft I wrote. It was 
> suggested to me to have the draft worked on under this group. I 
> appreciate any constructive comments and pointers. In summary, the 
> draft could be classified as an operational one. It is an enhancement 
> of a technique that already exists but relying more on BGP as a 
> protocol to help in network security issues.
>
> A copy of the draft could be found at 
> http://www.ietf.org/internet-drafts/draft-turk-ertb-00.txt
> <http://www.ietf.org/internet-drafts/draft-turk-ertb-00.txt>
>
> Regards
> Doughan Turk
>
>   _____
>
>
>  <http://www.bell.ca/> Bell Canada Logo
>
> Doughan Turk, M.Eng., CCIE#7274
> Bell Canada, IP Engineering
> Flr 3, 100 Wynford Dr.
> Toronto M3C 1K4
>
> Voice: (416) 353-0368
> Fax: (416) 445-9893
> Pager: (416) 685-0590
>  <http://www.bell.ca/> Bell Canada
>
>  Cisco Certified Internetwork Expert 
> <http://www3.sympatico.ca/doughan/ccie_large.gif>
>
>          "To Engineer and Integrate world class
>                               IP Services and Technologies for BCE"
>
>





Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id PAA00348 for <idr-archive@nic.merit.edu>; Sat, 27 Jul 2002 15:00:15 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 89D3A91280; Sat, 27 Jul 2002 14:59:50 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 5350F91281; Sat, 27 Jul 2002 14:59:50 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id A54EC91280 for <idr@trapdoor.merit.edu>; Sat, 27 Jul 2002 14:59:48 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 8E62E5DFA1; Sat, 27 Jul 2002 14:59:48 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from dm3cn8.bell.ca (dm3cn8.bell.ca [206.47.0.145]) by segue.merit.edu (Postfix) with SMTP id 501E35DFA0 for <idr@merit.edu>; Sat, 27 Jul 2002 14:59:48 -0400 (EDT)
Received: from 206.47.0.153 by dm3cn8.bell.ca with ESMTP (Tumbleweed MMS SMTP Relay (MMS v4.7)); Sat, 27 Jul 2002 14:59:47 -0400
X-Server-Uuid: b85f21a3-cfd1-11d3-8401-00104bf46ab7
Received: from mta1.on.bell.ca (MTA1 [198.235.102.150]) by mta1.on.bell.ca with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13) id G3BQR6TT; Sat, 27 Jul 2002 14:59:47 -0400
Received: FROM toroondc754.on.bell.ca BY mta1.on.bell.ca ; Sat Jul 27 14:59:47 2002 -0400
Received: by toroondc754.on.bell.ca with Internet Mail Service ( 5.5.2653.19) id <3T1HFLYT>; Sat, 27 Jul 2002 14:59:47 -0400
Message-ID: <7A733663D06C92499EF88AC4FD660EE302084668@TOROONDC755.on.bell.ca>
From: "Turk, Doughan A." <doughan.turk@bell.ca>
To: PamSri <pamsri01@yahoo.com>
Cc: idr@merit.edu
Subject: RE: Need your opinion of this.
Date: Sat, 27 Jul 2002 14:59:46 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
X-WSS-ID: 115C32297923-01-01
Content-Type: multipart/alternative;  boundary="----_=_NextPart_001_01C2359F.C6AF3AC0"
Sender: owner-idr@merit.edu
Precedence: bulk

This message is in MIME format. Since your mail reader does not understand
this format, some or all of this message may not be legible.

------_=_NextPart_001_01C2359F.C6AF3AC0
Content-Type: text/plain; 
 charset=us-ascii
Content-Transfer-Encoding: 7bit

Sri,
 
The draft is not addressing a box or a protocol security. I am using BGP
capabilities to assist in fighting DoS attacks. The reason why BGP is a
protocol suitable for this job is it's many attributes like "communities".
Since routing is the driver of traffic why not develop ways to use routing
protocols to assist in fighting attacks.
 
Regards
DT

-----Original Message-----
From: PamSri [mailto:pamsri01@yahoo.com] 
Sent: Saturday, July 27, 2002 11:19 AM
To: Turk, Doughan A.
Cc: idr@merit.edu
Subject: RE: Need your opinion of this.



Hi Turk , 


            Here are a few more questions : 


1. Is DoS a problem specific to BGP or the box ? 


2. So do we provide this kind of solution to all the routing protocols.



3. Here's what i think : DoS, Anti-Spoofing, Firewalls etc.. can be
catagorized as IP Services. Do we want to bring these concepts into IP
routing ? 


4. There are vendors who provide these IP Services along with routing by
clearly making a distinction between these two concepts. 


Regards, 


sri 


 "Turk, Doughan A." wrote: 


The draft does not suggest a way of verifying reverse path or validating
source address. My comment was just an answer to Sri's questions. BGP is the
only routing protocol I can think of with the capabilities to do what the
draft suggests. More information about the draft can be found at
http://www.ietf.org/internet-drafts/draft-turk-ertb-00.txt

DT

-----Original Message-----
From: Justin Fletcher [mailto:jfletcher@proficient.net] 
Sent: Friday, July 26, 2002 9:11 PM
To: Turk, Doughan A.
Cc: idr@merit.edu
Subject: RE: Need your opinion of this.


On Fri, 2002-07-26 at 18:24, Turk, Doughan A. wrote:
> I am using BGP to help stopping DoS and DDoS traffic. BGP can send 
> updates quickly and route-maps can translate communities to stop 
> traffic destined to a certain destination. TCP is not the issue here. 
> Attackers usually relay on the fact that most routers out there do 
> destination based routing without checking the validity of the source 
> address in case it is spoofed.
> 
> DT
> 

Is BGP the place to do this? Note this section from RFC1812:

5.3.8 Source Address Validation

A router SHOULD IMPLEMENT the ability to filter traffic based on a
comparison of the source address of a packet and the forwarding table
for a logical interface on which the packet was received. If this
filtering is enabled, the router MUST silently discard a packet if
the interface on which the packet was received is not the interface
on which a packet would be forwarded to reach the address contained
in the source address. In simpler terms, if a router wouldn't route
a packet containing this address through a particular interface, it
shouldn't believe the address if it appears as a source address in a
packet read from this interface.

--
Justin Fletcher
Proficient Networks, Inc.







  _____  

Do You Yahoo!?
Yahoo! Health <http://health.yahoo.com/>  - Feel better, live better


------_=_NextPart_001_01C2359F.C6AF3AC0
Content-Type: text/html; 
 charset=us-ascii
Content-Transfer-Encoding: 7bit

<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
<HTML><HEAD>
<META HTTP-EQUIV="Content-Type" CONTENT="text/html; charset=us-ascii">
<TITLE>Message</TITLE>

<META content="MSHTML 6.00.2716.2200" name=GENERATOR></HEAD>
<BODY>
<DIV><SPAN class=993394918-27072002><FONT face=Arial color=#0000ff 
size=2>Sri,</FONT></SPAN></DIV>
<DIV><SPAN class=993394918-27072002><FONT face=Arial color=#0000ff 
size=2></FONT></SPAN>&nbsp;</DIV>
<DIV><SPAN class=993394918-27072002><FONT face=Arial color=#0000ff size=2>The 
draft is not addressing a box or a protocol security. I am using BGP 
capabilities to assist in fighting DoS attacks. The reason why BGP is a protocol 
suitable&nbsp;for this job is it's many attributes like "communities". Since 
routing is the driver of traffic why not develop ways to use routing protocols 
to assist in fighting attacks.</FONT></SPAN></DIV>
<DIV><SPAN class=993394918-27072002><FONT face=Arial color=#0000ff 
size=2></FONT></SPAN>&nbsp;</DIV>
<DIV><SPAN class=993394918-27072002><FONT face=Arial color=#0000ff 
size=2>Regards</FONT></SPAN></DIV>
<DIV><SPAN class=993394918-27072002><FONT face=Arial color=#0000ff 
size=2>DT</FONT></SPAN></DIV>
<BLOCKQUOTE style="MARGIN-RIGHT: 0px">
  <DIV></DIV>
  <DIV class=OutlookMessageHeader lang=en-us dir=ltr align=left><FONT 
  face=Tahoma size=2>-----Original Message-----<BR><B>From:</B> PamSri 
  [mailto:pamsri01@yahoo.com] <BR><B>Sent:</B> Saturday, July 27, 2002 11:19 
  AM<BR><B>To:</B> Turk, Doughan A.<BR><B>Cc:</B> 
  idr@merit.edu<BR><B>Subject:</B> RE: Need your opinion of 
  this.<BR><BR></FONT></DIV>
  <P>Hi Turk , 
  <P>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;Here 
  are a few more questions : 
  <P>1. Is DoS a&nbsp;problem specific to BGP or the box ? 
  <P>2. So do we provide this kind of solution to all the routing 
  protocols.&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; 
  <P>3. Here's what i think : DoS, Anti-Spoofing, Firewalls etc.. can be 
  catagorized as IP Services. Do we want to bring these concepts into IP routing 
  ? 
  <P>4. There are vendors who provide these IP Services along with routing by 
  clearly making a distinction between these two concepts. 
  <P>Regards, 
  <P>sri 
  <P>&nbsp;<B><I>"Turk, Doughan A." <DOUGHAN.TURK@BELL.CA></I></B>wrote: 
  <BLOCKQUOTE 
  style="PADDING-LEFT: 5px; MARGIN-LEFT: 5px; BORDER-LEFT: #1010ff 2px solid">The 
    draft does not suggest a way of verifying reverse path or 
    validating<BR>source address. My comment was just an answer to Sri's 
    questions. BGP is the<BR>only routing protocol I can think of with the 
    capabilities to do what the<BR>draft suggests. More information about the 
    draft can be found 
    at<BR>http://www.ietf.org/internet-drafts/draft-turk-ertb-00.txt<BR><BR>DT<BR><BR>-----Original 
    Message-----<BR>From: Justin Fletcher [mailto:jfletcher@proficient.net] 
    <BR>Sent: Friday, July 26, 2002 9:11 PM<BR>To: Turk, Doughan A.<BR>Cc: 
    idr@merit.edu<BR>Subject: RE: Need your opinion of this.<BR><BR><BR>On Fri, 
    2002-07-26 at 18:24, Turk, Doughan A. wrote:<BR>&gt; I am using BGP to help 
    stopping DoS and DDoS traffic. BGP can send <BR>&gt; updates quickly and 
    route-maps can translate communities to stop <BR>&gt; traffic destined to a 
    certain destination. TCP is not the issue here. <BR>&gt; Attackers usually 
    relay on the fact that most routers out there do <BR>&gt; destination based 
    routing without checking the validity of the source <BR>&gt; address in case 
    it is spoofed.<BR>&gt; <BR>&gt; DT<BR>&gt; <BR><BR>Is BGP the place to do 
    this? Note this section from RFC1812:<BR><BR>5.3.8 Source Address 
    Validation<BR><BR>A router SHOULD IMPLEMENT the ability to filter traffic 
    based on a<BR>comparison of the source address of a packet and the 
    forwarding table<BR>for a logical interface on which the packet was 
    received. If this<BR>filtering is enabled, the router MUST silently discard 
    a packet if<BR>the interface on which the packet was received is not the 
    interface<BR>on which a packet would be forwarded to reach the address 
    contained<BR>in the source address. In simpler terms, if a router wouldn't 
    route<BR>a packet containing this address through a particular interface, 
    it<BR>shouldn't believe the address if it appears as a source address in 
    a<BR>packet read from this interface.<BR><BR>--<BR>Justin 
    Fletcher<BR>Proficient Networks, Inc.<BR><BR><BR></BLOCKQUOTE>
  <P><BR>
  <HR SIZE=1>
  <B>Do You Yahoo!?</B><BR><A href="http://health.yahoo.com/">Yahoo! Health</A> 
  - Feel better, live better</BLOCKQUOTE></BODY></HTML>

------_=_NextPart_001_01C2359F.C6AF3AC0--



Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id LAA23535 for <idr-archive@nic.merit.edu>; Sat, 27 Jul 2002 11:20:00 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 6E3BC91259; Sat, 27 Jul 2002 11:18:55 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 3DFE79135A; Sat, 27 Jul 2002 11:18:55 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 8A66091259 for <idr@trapdoor.merit.edu>; Sat, 27 Jul 2002 11:18:48 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 75CB05DF75; Sat, 27 Jul 2002 11:18:48 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from web14106.mail.yahoo.com (web14106.mail.yahoo.com [216.136.172.136]) by segue.merit.edu (Postfix) with SMTP id EB9235DF0D for <idr@merit.edu>; Sat, 27 Jul 2002 11:18:47 -0400 (EDT)
Message-ID: <20020727151847.81840.qmail@web14106.mail.yahoo.com>
Received: from [66.31.70.19] by web14106.mail.yahoo.com via HTTP; Sat, 27 Jul 2002 08:18:47 PDT
Date: Sat, 27 Jul 2002 08:18:47 -0700 (PDT)
From: PamSri <pamsri01@yahoo.com>
Subject: RE: Need your opinion of this.
To: "Turk, Doughan A." <doughan.turk@bell.ca>
Cc: idr@merit.edu
In-Reply-To: <7A733663D06C92499EF88AC4FD660EE302084665@TOROONDC755.on.bell.ca>
MIME-Version: 1.0
Content-Type: multipart/alternative; boundary="0-738754811-1027783127=:80858"
Sender: owner-idr@merit.edu
Precedence: bulk

--0-738754811-1027783127=:80858
Content-Type: text/plain; charset=us-ascii


Hi Turk ,
            Here are a few more questions :
1. Is DoS a problem specific to BGP or the box ? 
2. So do we provide this kind of solution to all the routing protocols.         
3. Here's what i think : DoS, Anti-Spoofing, Firewalls etc.. can be catagorized as IP Services. Do we want to bring these concepts into IP routing ?
4. There are vendors who provide these IP Services along with routing by clearly making a distinction between these two concepts.
Regards,
sri
 "Turk, Doughan A." wrote:The draft does not suggest a way of verifying reverse path or validating
source address. My comment was just an answer to Sri's questions. BGP is the
only routing protocol I can think of with the capabilities to do what the
draft suggests. More information about the draft can be found at
http://www.ietf.org/internet-drafts/draft-turk-ertb-00.txt

DT

-----Original Message-----
From: Justin Fletcher [mailto:jfletcher@proficient.net] 
Sent: Friday, July 26, 2002 9:11 PM
To: Turk, Doughan A.
Cc: idr@merit.edu
Subject: RE: Need your opinion of this.


On Fri, 2002-07-26 at 18:24, Turk, Doughan A. wrote:
> I am using BGP to help stopping DoS and DDoS traffic. BGP can send 
> updates quickly and route-maps can translate communities to stop 
> traffic destined to a certain destination. TCP is not the issue here. 
> Attackers usually relay on the fact that most routers out there do 
> destination based routing without checking the validity of the source 
> address in case it is spoofed.
> 
> DT
> 

Is BGP the place to do this? Note this section from RFC1812:

5.3.8 Source Address Validation

A router SHOULD IMPLEMENT the ability to filter traffic based on a
comparison of the source address of a packet and the forwarding table
for a logical interface on which the packet was received. If this
filtering is enabled, the router MUST silently discard a packet if
the interface on which the packet was received is not the interface
on which a packet would be forwarded to reach the address contained
in the source address. In simpler terms, if a router wouldn't route
a packet containing this address through a particular interface, it
shouldn't believe the address if it appears as a source address in a
packet read from this interface.

--
Justin Fletcher
Proficient Networks, Inc.





---------------------------------
Do You Yahoo!?
Yahoo! Health - Feel better, live better
--0-738754811-1027783127=:80858
Content-Type: text/html; charset=us-ascii

<P>Hi Turk ,
<P>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;Here are a few more questions :
<P>1. Is DoS a&nbsp;problem specific to BGP or the box ? 
<P>2. So do we provide this kind of solution to all the routing protocols.&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; 
<P>3. Here's what i think : DoS, Anti-Spoofing, Firewalls etc.. can be catagorized as IP Services. Do we want to bring these concepts into IP routing ?
<P>4. There are vendors who provide these IP Services along with routing by clearly making a distinction between these two concepts.
<P>Regards,
<P>sri
<P>&nbsp;<B><I>"Turk, Doughan A." <DOUGHAN.TURK@BELL.CA></I></B>wrote:
<BLOCKQUOTE style="PADDING-LEFT: 5px; MARGIN-LEFT: 5px; BORDER-LEFT: #1010ff 2px solid">The draft does not suggest a way of verifying reverse path or validating<BR>source address. My comment was just an answer to Sri's questions. BGP is the<BR>only routing protocol I can think of with the capabilities to do what the<BR>draft suggests. More information about the draft can be found at<BR>http://www.ietf.org/internet-drafts/draft-turk-ertb-00.txt<BR><BR>DT<BR><BR>-----Original Message-----<BR>From: Justin Fletcher [mailto:jfletcher@proficient.net] <BR>Sent: Friday, July 26, 2002 9:11 PM<BR>To: Turk, Doughan A.<BR>Cc: idr@merit.edu<BR>Subject: RE: Need your opinion of this.<BR><BR><BR>On Fri, 2002-07-26 at 18:24, Turk, Doughan A. wrote:<BR>&gt; I am using BGP to help stopping DoS and DDoS traffic. BGP can send <BR>&gt; updates quickly and route-maps can translate communities to stop <BR>&gt; traffic destined to a certain destination. TCP is not the issue here. <BR>&gt; Attackers usually relay on the fact that most routers out there do <BR>&gt; destination based routing without checking the validity of the source <BR>&gt; address in case it is spoofed.<BR>&gt; <BR>&gt; DT<BR>&gt; <BR><BR>Is BGP the place to do this? Note this section from RFC1812:<BR><BR>5.3.8 Source Address Validation<BR><BR>A router SHOULD IMPLEMENT the ability to filter traffic based on a<BR>comparison of the source address of a packet and the forwarding table<BR>for a logical interface on which the packet was received. If this<BR>filtering is enabled, the router MUST silently discard a packet if<BR>the interface on which the packet was received is not the interface<BR>on which a packet would be forwarded to reach the address contained<BR>in the source address. In simpler terms, if a router wouldn't route<BR>a packet containing this address through a particular interface, it<BR>shouldn't believe the address if it appears as a source address in a<BR>packet read from this interface.<BR><BR>--<BR>Justin Fletcher<BR>Proficient Networks, Inc.<BR><BR><BR></BLOCKQUOTE><p><br><hr size=1><b>Do You Yahoo!?</b><br>
<a href="http://health.yahoo.com/">Yahoo! Health</a> - Feel better, live better
--0-738754811-1027783127=:80858--


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id JAA19015 for <idr-archive@nic.merit.edu>; Sat, 27 Jul 2002 09:03:59 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id DD72491227; Sat, 27 Jul 2002 09:03:35 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id DC39791251; Sat, 27 Jul 2002 09:03:31 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 298C591227 for <idr@trapdoor.merit.edu>; Sat, 27 Jul 2002 09:01:41 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 1210B5DF31; Sat, 27 Jul 2002 09:01:41 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from mta0 (unknown [61.144.161.2]) by segue.merit.edu (Postfix) with ESMTP id B2A335DDA1 for <idr@merit.edu>; Sat, 27 Jul 2002 09:01:40 -0400 (EDT)
Received: from lidefeng (localhost [127.0.0.1]) by mta0.huawei.com (iPlanet Messaging Server 5.2 HotFix 0.7 (built Jun 26 2002)) with ESMTPA id <0GZW00CMMS3PHN@mta0.huawei.com> for idr@merit.edu; Sat, 27 Jul 2002 20:59:50 +0800 (CST)
Date: Sat, 27 Jul 2002 20:58:22 +0800
From: lidefeng <lidefeng@huawei.com>
Subject: about draft-ietf-idr-as4bytes-05.txt
To: qv@juniper.net, enke@redback.com
Cc: idr@merit.edu
Message-id: <003601c2356d$4a48fd40$22436e0a@HUAWEI.COM>
MIME-version: 1.0
X-MIMEOLE: Produced By Microsoft MimeOLE V6.00.2600.0000
X-Mailer: Microsoft Outlook Express 6.00.2600.0000
Content-type: text/plain; charset=iso-8859-2
Content-transfer-encoding: 7BIT
X-Priority: 3
X-MSMail-priority: Normal
Sender: owner-idr@merit.edu
Precedence: bulk

hi,

   In "Section7. IANA Consideration" of  "draft-ietf-idr-as4bytes-05.txt" ,it states that "The AS number for AS_TRANS has been assigned by the IANA.", I want to know what is the AS_TRANS number assigned by the IANA, where I can find the number assignment scheme, and all the NEW BGP SPEAKER user the only one number ? if so, how to handle the situation when they update across several AS? 

Regards

Li Defeng 


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id IAA17861 for <idr-archive@nic.merit.edu>; Sat, 27 Jul 2002 08:25:08 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id ACE1691221; Sat, 27 Jul 2002 08:24:39 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 74B6A91228; Sat, 27 Jul 2002 08:24:39 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id EEEE091221 for <idr@trapdoor.merit.edu>; Sat, 27 Jul 2002 08:24:37 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id D6F205DF41; Sat, 27 Jul 2002 08:24:37 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from mta0 (unknown [61.144.161.2]) by segue.merit.edu (Postfix) with ESMTP id 844FF5DF31 for <idr@merit.edu>; Sat, 27 Jul 2002 08:24:37 -0400 (EDT)
Received: from lidefeng (localhost [127.0.0.1]) by mta0.huawei.com (iPlanet Messaging Server 5.2 HotFix 0.7 (built Jun 26 2002)) with ESMTPA id <0GZW00CSGQE2HN@mta0.huawei.com> for idr@merit.edu; Sat, 27 Jul 2002 20:22:51 +0800 (CST)
Date: Sat, 27 Jul 2002 20:21:20 +0800
From: lidefeng <lidefeng@huawei.com>
Subject: Re: draft-ietf-idr-bgp4-18.txt
To: Manav Bhatia <manav@samsung.com>, idr@merit.edu
Message-id: <002601c23568$1fdc5700$22436e0a@HUAWEI.COM>
MIME-version: 1.0
X-MIMEOLE: Produced By Microsoft MimeOLE V6.00.2600.0000
X-Mailer: Microsoft Outlook Express 6.00.2600.0000
Content-type: text/plain; charset=iso-8859-1
Content-transfer-encoding: 7BIT
X-Priority: 3
X-MSMail-priority: Normal
References: <006e01c23473$dfc9e870$b4036c6b@sisodomain.com>
Sender: owner-idr@merit.edu
Precedence: bulk

hi,

    I can only get the draft-ietf-idr-bgp4-17.txt from the "Internet-Drafts Index ", and where can I get the newest version:Internet-Drafts Index , which you are discussing.And I wonder why "Internet-Drafts Index" doesn't list the latest version of draft. I am always downloading the draft from there, and I am afraid I will drag behind the discuss.

Regards

Li Defeng


----- Original Message ----- 
From: "Manav Bhatia" <manav@samsung.com>
To: <idr@merit.edu>
Sent: Friday, July 26, 2002 3:12 PM
Subject: draft-ietf-idr-bgp4-18.txt


> sec 6.5 of the latest draft states - " If  system does not receive
> successive KEEPALIVE and/or UPDATE and/or NOTIFICATION messages within the
> period specified in the Hold Time field of the OPEN message, then the
> NOTIFICATION message with   Hold Timer Expired Error Code must be sent and
> the BGP connection closed"
> 
> IMHO we must update the draft to also include other BGP messages (Route
> Refresh, Dynamic Capability, INFORM) which when received by the local
> system act as surrogate for the KEEPALIVE messages it would expect from the
> remote end. The local system upon receiving such messages must restart its
> Hold Timer
> 
> Similarly the draft should mention that each time the local system sends
> such BGP messages it should restart its KeepAlive Timer, unless the
> negotiated HoldTime value is zero.
> 
> These changes can be reflected in the BGP draft or can be done in the
> drafts introducing such BGP messages.
> 
> Regards,
> Manav
> 
> ----
> "When you are courting a nice girl an hour seems like a second. When you
> sit on a red-hot cinder a second seems like an hour. That's relativity."
> 
> -Albert Einstein, on relativity
> 
> 


Received: from trapdoor.merit.edu (trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id WAA28454 for <idr-archive@nic.merit.edu>; Fri, 26 Jul 2002 22:19:00 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 16B869121E; Fri, 26 Jul 2002 22:18:52 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id DEE019134D; Fri, 26 Jul 2002 22:18:51 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 815E69121E for <idr@trapdoor.merit.edu>; Fri, 26 Jul 2002 22:18:50 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 700DA5DE28; Fri, 26 Jul 2002 22:18:50 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from dm3cnd.bell.ca (dm3cnd.bell.ca [206.47.0.146]) by segue.merit.edu (Postfix) with SMTP id 326D55DE21 for <idr@merit.edu>; Fri, 26 Jul 2002 22:18:50 -0400 (EDT)
Received: from 206.47.0.153 by dm3cnd.bell.ca with ESMTP (Tumbleweed MMS SMTP Relay (MMS v4.7)); Fri, 26 Jul 2002 22:18:49 -0400
X-Server-Uuid: b85f21a3-cfd1-11d3-8401-00104bf46ab7
Received: from mta1.on.bell.ca (MTA1 [198.235.102.150]) by mta1.on.bell.ca with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13) id G3BQRT16; Fri, 26 Jul 2002 22:18:49 -0400
Received: FROM toroondc754.on.bell.ca BY mta1.on.bell.ca ; Fri Jul 26 22:18:48 2002 -0400
Received: by toroondc754.on.bell.ca with Internet Mail Service ( 5.5.2653.19) id <3T1HFDP1>; Fri, 26 Jul 2002 22:18:48 -0400
Message-ID: <7A733663D06C92499EF88AC4FD660EE302084665@TOROONDC755.on.bell.ca>
From: "Turk, Doughan A." <doughan.turk@bell.ca>
To: idr@merit.edu
Subject: RE: Need your opinion of this.
Date: Fri, 26 Jul 2002 22:18:48 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
X-WSS-ID: 115CDC83309054-01-01
Content-Type: text/plain;  charset=us-ascii
Content-Transfer-Encoding: 7bit
Sender: owner-idr@merit.edu
Precedence: bulk

The draft does not suggest a way of verifying reverse path or validating
source address. My comment was just an answer to Sri's questions. BGP is the
only routing protocol I can think of with the capabilities to do what the
draft suggests. More information about the draft can be found at
http://www.ietf.org/internet-drafts/draft-turk-ertb-00.txt

DT

-----Original Message-----
From: Justin Fletcher [mailto:jfletcher@proficient.net] 
Sent: Friday, July 26, 2002 9:11 PM
To: Turk, Doughan A.
Cc: idr@merit.edu
Subject: RE: Need your opinion of this.


On Fri, 2002-07-26 at 18:24, Turk, Doughan A. wrote:
> I am using BGP to help stopping DoS and DDoS traffic. BGP can send 
> updates quickly and route-maps can translate communities to stop 
> traffic destined to a certain destination. TCP is not the issue here. 
> Attackers usually relay on the fact that most routers out there do 
> destination based routing without checking the validity of the source 
> address in case it is spoofed.
> 
> DT
> 

Is BGP the place to do this?  Note this section from RFC1812:

5.3.8 Source Address Validation
   
   A router SHOULD IMPLEMENT the ability to filter traffic based on a
   comparison of the source address of a packet and the forwarding table
   for a logical interface on which the packet was received.  If this
   filtering is enabled, the router MUST silently discard a packet if
   the interface on which the packet was received is not the interface
   on which a packet would be forwarded to reach the address contained
   in the source address.  In simpler terms, if a router wouldn't route
   a packet containing this address through a particular interface, it
   shouldn't believe the address if it appears as a source address in a
   packet read from this interface.

--
Justin Fletcher
Proficient Networks, Inc.





Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id VAA26984 for <idr-archive@nic.merit.edu>; Fri, 26 Jul 2002 21:33:54 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id B891C9134B; Fri, 26 Jul 2002 21:33:34 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 83B2791238; Fri, 26 Jul 2002 21:33:34 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id D221E9134B for <idr@trapdoor.merit.edu>; Fri, 26 Jul 2002 21:31:40 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id B39F85DDED; Fri, 26 Jul 2002 21:31:40 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from earthquake.proficient.net (unknown [65.209.247.5]) by segue.merit.edu (Postfix) with ESMTP id 5FC235DDAF for <idr@merit.edu>; Fri, 26 Jul 2002 21:31:40 -0400 (EDT)
Received: from riga ([10.0.0.25]) by earthquake.proficient.net with Microsoft SMTPSVC(5.0.2195.4905); Fri, 26 Jul 2002 18:31:35 -0700
Subject: RE: Need your opinion of this.
From: Justin Fletcher <jfletcher@proficient.net>
To: "Turk, Doughan A." <doughan.turk@bell.ca>
Cc: idr@merit.edu
In-Reply-To:  <7A733663D06C92499EF88AC4FD660EE302084663@TOROONDC755.on.bell.ca>
References:  <7A733663D06C92499EF88AC4FD660EE302084663@TOROONDC755.on.bell.ca>
Content-Type: text/plain
Content-Transfer-Encoding: 7bit
X-Mailer: Ximian Evolution 1.0.3 (1.0.3-4) 
Date: 26 Jul 2002 18:10:55 -0700
Message-Id: <1027732256.7532.36.camel@riga>
Mime-Version: 1.0
X-OriginalArrivalTime: 27 Jul 2002 01:31:35.0081 (UTC) FILETIME=[58399190:01C2350D]
Sender: owner-idr@merit.edu
Precedence: bulk

On Fri, 2002-07-26 at 18:24, Turk, Doughan A. wrote:
> I am using BGP to help stopping DoS and DDoS traffic. BGP can send updates
> quickly and route-maps can translate communities to stop traffic destined to
> a certain destination. TCP is not the issue here. Attackers usually relay on
> the fact that most routers out there do destination based routing without
> checking the validity of the source address in case it is spoofed.
> 
> DT
> 

Is BGP the place to do this?  Note this section from RFC1812:

5.3.8 Source Address Validation
   
   A router SHOULD IMPLEMENT the ability to filter traffic based on a
   comparison of the source address of a packet and the forwarding table
   for a logical interface on which the packet was received.  If this
   filtering is enabled, the router MUST silently discard a packet if
   the interface on which the packet was received is not the interface
   on which a packet would be forwarded to reach the address contained
   in the source address.  In simpler terms, if a router wouldn't route
   a packet containing this address through a particular interface, it
   shouldn't believe the address if it appears as a source address in a
   packet read from this interface.

--
Justin Fletcher
Proficient Networks, Inc.



Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id VAA26669 for <idr-archive@nic.merit.edu>; Fri, 26 Jul 2002 21:24:52 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id F355491217; Fri, 26 Jul 2002 21:24:24 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id B71F59134B; Fri, 26 Jul 2002 21:24:23 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 4960591217 for <idr@trapdoor.merit.edu>; Fri, 26 Jul 2002 21:24:22 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 392A35DE4F; Fri, 26 Jul 2002 21:24:22 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from dm3cnd.bell.ca (dm3cnd.bell.ca [206.47.0.146]) by segue.merit.edu (Postfix) with SMTP id E9CEB5DDED for <idr@merit.edu>; Fri, 26 Jul 2002 21:24:21 -0400 (EDT)
Received: from 206.47.0.153 by dm3cnd.bell.ca with ESMTP (Tumbleweed MMS SMTP Relay (MMS v4.7)); Fri, 26 Jul 2002 21:24:21 -0400
X-Server-Uuid: b85f21a3-cfd1-11d3-8401-00104bf46ab7
Received: from mta1.on.bell.ca (MTA1 [198.235.102.150]) by mta1.on.bell.ca with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13) id G3BQRSXJ; Fri, 26 Jul 2002 21:24:21 -0400
Received: FROM toroondc754.on.bell.ca BY mta1.on.bell.ca ; Fri Jul 26 21:24:20 2002 -0400
Received: by toroondc754.on.bell.ca with Internet Mail Service ( 5.5.2653.19) id <3T1HFDAW>; Fri, 26 Jul 2002 21:24:20 -0400
Message-ID: <7A733663D06C92499EF88AC4FD660EE302084663@TOROONDC755.on.bell.ca>
From: "Turk, Doughan A." <doughan.turk@bell.ca>
To: PamSri <pamsri01@yahoo.com>
Cc: idr@merit.edu
Subject: RE: Need your opinion of this.
Date: Fri, 26 Jul 2002 21:24:20 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
X-WSS-ID: 115F29CF295146-01-01
Content-Type: text/plain;  charset=us-ascii
Content-Transfer-Encoding: 7bit
Sender: owner-idr@merit.edu
Precedence: bulk

I am using BGP to help stopping DoS and DDoS traffic. BGP can send updates
quickly and route-maps can translate communities to stop traffic destined to
a certain destination. TCP is not the issue here. Attackers usually relay on
the fact that most routers out there do destination based routing without
checking the validity of the source address in case it is spoofed.

DT

-----Original Message-----
From: PamSri [mailto:pamsri01@yahoo.com] 
Sent: Friday, July 26, 2002 9:08 PM
To: Turk, Doughan A.
Subject: RE: Need your opinion of this.


Hi Turk ,
         Just started to read through your doc. I have
a question :
BGP is an application from the perspective of TCP. Why
do you see the need for provding such security at the application level than
TCP ? Do you think the security features in TCP as inadequate ? It would be
really a good idea to provide security at the underlying layers than at this
layer. 

sri

--- "Turk, Doughan A." <doughan.turk@bell.ca> wrote:
> Benjamin,
>  
> BGP will be used to alter next-hop address for a
> particular destination
> address (representing a customer under attack) on
> selected Routers. Instead
> of using BGP to announce a different next-hop for
> the destination address
> everywhere in your network. we can use communities
> to make it selective.
> once that is achieved, your next hop can be a null
> interface or a Tunnel
> interface where the traffic is then pulled to the
> sinkhole router.
>  
> Based on Attack signature you could do various
> packet lever filtering ( with
> hardware as a constrain),  simply rate limit the
> flow or send it out through
> a shared media (Ethernet) for sniffing after you
> pull it to a sinkhole
> router.
>  
> As for overheads on routers in the network, there is
> hardly any since you're
> only altering next-hop on border routers or edge
> routers. MPLS tunnel are
> what I used in my lab. as for the sinkhole router,
> one can invest in a high
> performance line card. The sinkhole router should be
> powerful enough to
> handle whatever filtering or policing you deem
> required to massage the flow.
> 
>  
> Scalability is high in my opinion. the route maps on
> the borders can be a
> part of your template. announcing the destination
> address with the special
> community would have to be done on only one router
> (sinkhole is a good
> idea).
>  
> As for where to place the route maps or policy
> statements, IMHO, this should
> be applied anywhere a large attack can enter from.
> (I.e Transit links or to
> Peers with large number of subscribers) I wouldn't
> apply it on a customer
> interface since a customer is in my reach and can be contacted 
> immediately in case of an attack initiated by him. Another place
> to apply it on is at
> peering interfaces where an access-list or policing
> statement might bring
> down interface performance. at this point traffic
> can be pulled in the
> sinkhole router for the policies to be applied.
>  
> This technique can be used in high volume attacks
> where source based
> filtering is not possible because of spoofing or
> high number of source IP
> addresses. In many cases ISP's are forced to kill
> traffic destined to a
> single /32 to relief the rest of the customer
> network from suffering from
> the high volume of traffic. I would also use this in
> cases of attacks
> targeting infrastructure.
>  
> As for the last question, yes I have built this in a
> large lab and got the
> result I expected.
>  
> Hope this answers your questions.
>  
> Regards
> DT
>  
> 
> -----Original Message-----
> From: Abarbanel, Benjamin [mailto:Benjamin.Abarbanel@Marconi.com]
> Sent: Friday, July 26, 2002 4:55 PM
> To: 'Turk, Doughan A.'; idr@merit.edu
> Subject: RE: Need your opinion of this.
> 
> 
> Doughan:
>   Let me see if I understand the fundamental point
> of your draft. You want
> BGP via communities to be the big MAGNET that
> catches all the denial of
> service attack routes and shoves them through some
> sink hole router that
> will keep statistics and knowledge of the source of
> the attack and be able
> to report it to the operator?
>  
> Did I summarize it properly?
>  
> My first reaction, is what kind of overhead will
> these techniques impose on
> the routers and the sink hole one? 
>  
> Will it be scallable? 
>  
> Can this be done only on border routers, Router
> Reflectors? To minimize the
> processing load needed?
>  
> What real life cases have you seen where this
> mechanism was needed?
>  
> Have you prototyped this idea and gotten some
> empirical data?
>  
> Ben
> 
> -----Original Message-----
> From: Turk, Doughan A. [mailto:doughan.turk@bell.ca]
> Sent: Friday, July 26, 2002 4:09 PM
> To: idr@merit.edu
> Cc: Turk, Doughan A.
> Subject: Need your opinion of this.
> 
> 
> Folks, 
>  
> I would like to get some discussion started on a
> draft I wrote. It was
> suggested to me to have the draft worked on under
> this group. I appreciate
> any constructive comments and pointers. In summary,
> the draft could be
> classified as an operational one. It is an
> enhancement of a technique that
> already exists but relying more on BGP as a protocol
> to help in network
> security issues.
>  
> A copy of the draft could be found at
>
http://www.ietf.org/internet-drafts/draft-turk-ertb-00.txt
>
<http://www.ietf.org/internet-drafts/draft-turk-ertb-00.txt>
> 
>  
> Regards
> Doughan Turk
>  
>   _____  
> 
> 
>  <http://www.bell.ca/> Bell Canada Logo
> 
> Doughan Turk, M.Eng., CCIE#7274
> Bell Canada, IP Engineering
> Flr 3, 100 Wynford Dr. 
> Toronto M3C 1K4
> 
> Voice: (416) 353-0368
> Fax: (416) 445-9893
> Pager: (416) 685-0590
>  <http://www.bell.ca/> Bell Canada
> 
>  Cisco Certified Internetwork Expert
> <http://www3.sympatico.ca/doughan/ccie_large.gif> 
> 
>          "To Engineer and Integrate world class
>                               IP Services and
> Technologies for BCE" 	
>  
> 
> 


__________________________________________________
Do You Yahoo!?
Yahoo! Health - Feel better, live better
http://health.yahoo.com




Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id UAA24703 for <idr-archive@nic.merit.edu>; Fri, 26 Jul 2002 20:28:06 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 2941891345; Fri, 26 Jul 2002 20:27:36 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id E877B9134A; Fri, 26 Jul 2002 20:27:35 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 52EB791345 for <idr@trapdoor.merit.edu>; Fri, 26 Jul 2002 20:27:34 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 2E7475DF3B; Fri, 26 Jul 2002 20:27:34 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from dm3cn8.bell.ca (dm3cn8.bell.ca [206.47.0.145]) by segue.merit.edu (Postfix) with SMTP id D9FDE5DF3A for <idr@merit.edu>; Fri, 26 Jul 2002 20:27:33 -0400 (EDT)
Received: from 206.47.0.153 by dm3cn8.bell.ca with ESMTP (Tumbleweed MMS SMTP Relay (MMS v4.7)); Fri, 26 Jul 2002 20:27:33 -0400
X-Server-Uuid: b85f21a3-cfd1-11d3-8401-00104bf46ab7
Received: from mta1.on.bell.ca (MTA1 [198.235.102.150]) by mta1.on.bell.ca with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13) id G3BQRSK6; Fri, 26 Jul 2002 20:27:32 -0400
Received: FROM toroondc754.on.bell.ca BY mta1.on.bell.ca ; Fri Jul 26 20:27:32 2002 -0400
Received: by toroondc754.on.bell.ca with Internet Mail Service ( 5.5.2653.19) id <3T1HFC44>; Fri, 26 Jul 2002 20:27:32 -0400
Message-ID: <7A733663D06C92499EF88AC4FD660EE302084661@TOROONDC755.on.bell.ca>
From: "Turk, Doughan A." <doughan.turk@bell.ca>
To: idr@merit.edu
Subject: RE: Need your opinion of this.
Date: Fri, 26 Jul 2002 20:27:31 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
X-WSS-ID: 115F377F11684-01-01
Content-Type: multipart/alternative;  boundary="----_=_NextPart_001_01C23504.65669C40"
Sender: owner-idr@merit.edu
Precedence: bulk

This message is in MIME format. Since your mail reader does not understand
this format, some or all of this message may not be legible.

------_=_NextPart_001_01C23504.65669C40
Content-Type: text/plain; 
 charset=us-ascii
Content-Transfer-Encoding: 7bit

Benjamin,
 
BGP will be used to alter next-hop address for a particular destination
address (representing a customer under attack) on selected Routers. Instead
of using BGP to announce a different next-hop for the destination address
everywhere in your network. we can use communities to make it selective.
once that is achieved, your next hop can be a null interface or a Tunnel
interface where the traffic is then pulled to the sinkhole router. 
 
Based on Attack signature you could do various packet lever filtering ( with
hardware as a constrain),  simply rate limit the flow or send it out through
a shared media (Ethernet) for sniffing after you pull it to a sinkhole
router.
 
As for overheads on routers in the network, there is hardly any since you're
only altering next-hop on border routers or edge routers. MPLS tunnel are
what I used in my lab. as for the sinkhole router, one can invest in a high
performance line card. The sinkhole router should be powerful enough to
handle whatever filtering or policing you deem required to massage the flow.

 
Scalability is high in my opinion. the route maps on the borders can be a
part of your template. announcing the destination address with the special
community would have to be done on only one router (sinkhole is a good
idea).
 
As for where to place the route maps or policy statements, IMHO, this should
be applied anywhere a large attack can enter from. (I.e Transit links or to
Peers with large number of subscribers) I wouldn't apply it on a customer
interface since a customer is in my reach and can be contacted immediately
in case of an attack initiated by him. Another place to apply it on is at
peering interfaces where an access-list or policing statement might bring
down interface performance. at this point traffic can be pulled in the
sinkhole router for the policies to be applied.
 
This technique can be used in high volume attacks where source based
filtering is not possible because of spoofing or high number of source IP
addresses. In many cases ISP's are forced to kill traffic destined to a
single /32 to relief the rest of the customer network from suffering from
the high volume of traffic. I would also use this in cases of attacks
targeting infrastructure.
 
As for the last question, yes I have built this in a large lab and got the
result I expected.
 
Hope this answers your questions.
 
Regards
DT
 

-----Original Message-----
From: Abarbanel, Benjamin [mailto:Benjamin.Abarbanel@Marconi.com] 
Sent: Friday, July 26, 2002 4:55 PM
To: 'Turk, Doughan A.'; idr@merit.edu
Subject: RE: Need your opinion of this.


Doughan:
  Let me see if I understand the fundamental point of your draft. You want
BGP via communities to be the big MAGNET that catches all the denial of
service attack routes and shoves them through some sink hole router that
will keep statistics and knowledge of the source of the attack and be able
to report it to the operator?
 
Did I summarize it properly?
 
My first reaction, is what kind of overhead will these techniques impose on
the routers and the sink hole one? 
 
Will it be scallable? 
 
Can this be done only on border routers, Router Reflectors? To minimize the
processing load needed?
 
What real life cases have you seen where this mechanism was needed?
 
Have you prototyped this idea and gotten some empirical data?
 
Ben

-----Original Message-----
From: Turk, Doughan A. [mailto:doughan.turk@bell.ca]
Sent: Friday, July 26, 2002 4:09 PM
To: idr@merit.edu
Cc: Turk, Doughan A.
Subject: Need your opinion of this.


Folks, 
 
I would like to get some discussion started on a draft I wrote. It was
suggested to me to have the draft worked on under this group. I appreciate
any constructive comments and pointers. In summary, the draft could be
classified as an operational one. It is an enhancement of a technique that
already exists but relying more on BGP as a protocol to help in network
security issues.
 
A copy of the draft could be found at
http://www.ietf.org/internet-drafts/draft-turk-ertb-00.txt
<http://www.ietf.org/internet-drafts/draft-turk-ertb-00.txt> 
 
Regards
Doughan Turk
 
  _____  


 <http://www.bell.ca/> Bell Canada Logo

Doughan Turk, M.Eng., CCIE#7274
Bell Canada, IP Engineering
Flr 3, 100 Wynford Dr. 
Toronto M3C 1K4

Voice: (416) 353-0368
Fax: (416) 445-9893
Pager: (416) 685-0590
 <http://www.bell.ca/> Bell Canada

 Cisco Certified Internetwork Expert
<http://www3.sympatico.ca/doughan/ccie_large.gif> 

         "To Engineer and Integrate world class
                              IP Services and Technologies for BCE" 	
 


------_=_NextPart_001_01C23504.65669C40
Content-Type: text/html; 
 charset=us-ascii
Content-Transfer-Encoding: 7bit

<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
<HTML><HEAD>
<META HTTP-EQUIV="Content-Type" CONTENT="text/html; charset=us-ascii">
<TITLE>Message</TITLE>

<META content="MSHTML 6.00.2716.2200" name=GENERATOR></HEAD>
<BODY>
<DIV><SPAN class=957321621-26072002><FONT face=Arial color=#0000ff 
size=2>Benjamin,</FONT></SPAN></DIV>
<DIV><SPAN class=957321621-26072002><FONT face=Arial color=#0000ff 
size=2></FONT></SPAN>&nbsp;</DIV>
<DIV><SPAN class=957321621-26072002><FONT face=Arial color=#0000ff size=2>BGP 
will be used to alter next-hop address for a particular destination address 
(representing a customer under attack)&nbsp;on selected Routers. Instead of 
using BGP to announce a different next-hop for the destination address 
everywhere in your network. we can use communities to make it selective. once 
that is achieved, your next hop can be a null interface or a Tunnel interface 
where the traffic is then pulled to the sinkhole router. </FONT></SPAN></DIV>
<DIV><SPAN class=957321621-26072002><FONT face=Arial color=#0000ff 
size=2></FONT></SPAN>&nbsp;</DIV>
<DIV><SPAN class=957321621-26072002><FONT face=Arial color=#0000ff size=2>Based 
on Attack signature you could do various packet lever filtering ( with hardware 
as a constrain),&nbsp; simply rate limit the flow or send it out through a 
shared media (Ethernet) for sniffing after you pull it to a sinkhole 
router.</FONT></SPAN></DIV>
<DIV><SPAN class=957321621-26072002><FONT face=Arial color=#0000ff 
size=2></FONT></SPAN>&nbsp;</DIV>
<DIV><SPAN class=957321621-26072002><FONT face=Arial color=#0000ff size=2>As for 
overheads on routers in the network, there is hardly any since you're only 
altering next-hop on border routers or edge routers. MPLS tunnel are what I used 
in my lab. as for the sinkhole router, one can invest in a high performance line 
card. The sinkhole router should be powerful enough to handle whatever filtering 
or policing you deem required to massage the flow. </FONT></SPAN></DIV>
<DIV><SPAN class=957321621-26072002><FONT face=Arial color=#0000ff 
size=2></FONT></SPAN>&nbsp;</DIV>
<DIV><SPAN class=957321621-26072002><FONT face=Arial color=#0000ff 
size=2>Scalability is high in my opinion. the route maps on the borders can be a 
part of your template. announcing the destination address with the special 
community would have to be done on only one router (sinkhole is a good 
idea).</FONT></SPAN></DIV>
<DIV><SPAN class=957321621-26072002><FONT face=Arial color=#0000ff 
size=2></FONT></SPAN>&nbsp;</DIV>
<DIV><SPAN class=957321621-26072002><FONT face=Arial color=#0000ff size=2>As for 
where to place the route maps or policy statements, IMHO, this should be applied 
anywhere a large attack can enter from. (I.e Transit links or to Peers with 
large number of subscribers) I wouldn't apply it on a customer interface since a 
customer is in my reach and can be contacted immediately in case of an attack 
initiated by him. Another place to apply it on is at peering interfaces where an 
access-list or policing statement might bring down interface performance. at 
this point traffic can be pulled in the sinkhole router for the policies to be 
applied.</FONT></SPAN></DIV>
<DIV><SPAN class=957321621-26072002><FONT face=Arial color=#0000ff 
size=2></FONT></SPAN>&nbsp;</DIV>
<DIV><SPAN class=957321621-26072002><FONT face=Arial color=#0000ff size=2>This 
technique can be used in high volume attacks where source based filtering is not 
possible because of spoofing or high number of source IP addresses. In many 
cases ISP's are forced to kill traffic destined to a single /32 to relief the 
rest of the customer network from suffering from the high volume of traffic. I 
would also use this in cases of attacks targeting 
infrastructure.</FONT></SPAN></DIV>
<DIV><SPAN class=957321621-26072002><FONT face=Arial color=#0000ff 
size=2></FONT></SPAN>&nbsp;</DIV>
<DIV><SPAN class=957321621-26072002><FONT face=Arial color=#0000ff size=2>As for 
the last question, yes I have built this in a large lab and got the result I 
expected.</FONT></SPAN></DIV>
<DIV><SPAN class=957321621-26072002><FONT face=Arial color=#0000ff 
size=2></FONT></SPAN>&nbsp;</DIV>
<DIV><SPAN class=957321621-26072002><FONT face=Arial color=#0000ff size=2>Hope 
this answers your questions.</FONT></SPAN></DIV>
<DIV><SPAN class=957321621-26072002><FONT face=Arial color=#0000ff 
size=2></FONT></SPAN>&nbsp;</DIV>
<DIV><SPAN class=957321621-26072002><FONT face=Arial color=#0000ff 
size=2>Regards</FONT></SPAN></DIV>
<DIV><SPAN class=957321621-26072002><FONT face=Arial color=#0000ff 
size=2>DT</FONT></SPAN></DIV>
<DIV><FONT face=Arial color=#0000ff size=2></FONT>&nbsp;</DIV>
<BLOCKQUOTE dir=ltr style="MARGIN-RIGHT: 0px">
  <DIV></DIV>
  <DIV class=OutlookMessageHeader lang=en-us dir=ltr align=left><FONT 
  face=Tahoma size=2>-----Original Message-----<BR><B>From:</B> Abarbanel, 
  Benjamin [mailto:Benjamin.Abarbanel@Marconi.com] <BR><B>Sent:</B> Friday, July 
  26, 2002 4:55 PM<BR><B>To:</B> 'Turk, Doughan A.'; 
  idr@merit.edu<BR><B>Subject:</B> RE: Need your opinion of 
  this.<BR><BR></FONT></DIV>
  <DIV><FONT face=Arial color=#0000ff size=2><SPAN 
  class=283324920-26072002>Doughan:</SPAN></FONT></DIV>
  <DIV><FONT face=Arial color=#0000ff size=2><SPAN 
  class=283324920-26072002>&nbsp; Let me see if I understand the fundamental 
  point of your draft. You want BGP via communities to be the big MAGNET that 
  catches all the denial of service attack routes and shoves them through some 
  sink hole router that will keep statistics and knowledge of the source of the 
  attack and be able to report it to the operator?</SPAN></FONT></DIV>
  <DIV>&nbsp;</DIV>
  <DIV><FONT face=Arial color=#0000ff size=2><SPAN class=283324920-26072002>Did 
  I summarize it properly?</SPAN></FONT></DIV>
  <DIV><FONT face=Arial color=#0000ff size=2><SPAN 
  class=283324920-26072002></SPAN></FONT>&nbsp;</DIV>
  <DIV><FONT face=Arial color=#0000ff size=2><SPAN class=283324920-26072002>My 
  first reaction, is what kind of overhead will these techniques impose on the 
  routers and the sink hole one? </SPAN></FONT></DIV>
  <DIV><FONT face=Arial color=#0000ff size=2><SPAN 
  class=283324920-26072002></SPAN></FONT>&nbsp;</DIV>
  <DIV><FONT face=Arial color=#0000ff size=2><SPAN class=283324920-26072002>Will 
  it be scallable? </SPAN></FONT></DIV>
  <DIV><FONT face=Arial color=#0000ff size=2><SPAN 
  class=283324920-26072002></SPAN></FONT>&nbsp;</DIV>
  <DIV><FONT face=Arial color=#0000ff size=2><SPAN class=283324920-26072002>Can 
  this be done only on border routers, Router Reflectors? To minimize the 
  processing load needed?</SPAN></FONT></DIV>
  <DIV><FONT face=Arial color=#0000ff size=2><SPAN 
  class=283324920-26072002></SPAN></FONT>&nbsp;</DIV>
  <DIV><FONT face=Arial color=#0000ff size=2><SPAN class=283324920-26072002>What 
  real life cases have you seen where this mechanism was 
  needed?</SPAN></FONT></DIV>
  <DIV><FONT face=Arial color=#0000ff size=2><SPAN 
  class=283324920-26072002></SPAN></FONT>&nbsp;</DIV>
  <DIV><FONT face=Arial color=#0000ff size=2><SPAN class=283324920-26072002>Have 
  you prototyped this idea and gotten some empirical data?</SPAN></FONT></DIV>
  <DIV><FONT face=Arial color=#0000ff size=2><SPAN 
  class=283324920-26072002></SPAN></FONT>&nbsp;</DIV>
  <DIV><FONT face=Arial color=#0000ff size=2><SPAN 
  class=283324920-26072002>Ben</SPAN></FONT></DIV>
  <BLOCKQUOTE 
  style="PADDING-LEFT: 5px; MARGIN-LEFT: 5px; BORDER-LEFT: #0000ff 2px solid; MARGIN-RIGHT: 0px">
    <DIV class=OutlookMessageHeader dir=ltr align=left><FONT face=Tahoma 
    size=2>-----Original Message-----<BR><B>From:</B> Turk, Doughan A. 
    [mailto:doughan.turk@bell.ca]<BR><B>Sent:</B> Friday, July 26, 2002 4:09 
    PM<BR><B>To:</B> idr@merit.edu<BR><B>Cc:</B> Turk, Doughan 
    A.<BR><B>Subject:</B> Need your opinion of this.<BR><BR></DIV></FONT>
    <DIV><FONT face=Arial size=2><SPAN class=247400120-26072002>Folks, 
    </SPAN></FONT></DIV>
    <DIV><FONT face=Arial size=2><SPAN 
    class=247400120-26072002></SPAN></FONT>&nbsp;</DIV>
    <DIV><FONT face=Arial size=2><SPAN class=247400120-26072002>I would like to 
    get some discussion started on a draft I wrote. It was suggested to me to 
    have the draft worked on under this group. I appreciate any constructive 
    comments and pointers. In summary, the draft could be classified as an 
    operational one. It is an enhancement of a technique that already exists but 
    relying more on BGP as a protocol to help in network security 
    issues.</SPAN></FONT></DIV>
    <DIV><FONT face=Arial size=2><SPAN 
    class=247400120-26072002></SPAN></FONT>&nbsp;</DIV>
    <DIV><FONT face=Arial size=2><SPAN class=247400120-26072002>A copy of the 
    draft could be found at <A 
    href="http://www.ietf.org/internet-drafts/draft-turk-ertb-00.txt">http://www.ietf.org/internet-drafts/draft-turk-ertb-00.txt</A></SPAN></FONT></DIV>
    <DIV><FONT face=Arial size=2><SPAN 
    class=247400120-26072002></SPAN></FONT>&nbsp;</DIV>
    <DIV><FONT face=Arial size=2><SPAN 
    class=247400120-26072002>Regards</SPAN></FONT></DIV>
    <DIV><FONT face=Arial size=2><SPAN class=247400120-26072002>Doughan 
    Turk</SPAN></FONT></DIV>
    <DIV>&nbsp;</DIV>
    <DIV class=Section1>
    <DIV class=MsoNormal style="TEXT-ALIGN: center" align=center>
    <HR align=center width="100%" color=#cccccc noShade SIZE=1>
    </DIV>
    <TABLE class=MsoNormalTable 
    style="BORDER-COLLAPSE: collapse; mso-padding-alt: 0in 5.4pt 0in 5.4pt" 
    cellSpacing=0 cellPadding=0 border=0>
      <TBODY>
      <TR style="mso-yfti-irow: 0">
        <TD 
        style="PADDING-RIGHT: 1.5pt; PADDING-LEFT: 1.5pt; PADDING-BOTTOM: 1.5pt; WIDTH: 93.75pt; PADDING-TOP: 1.5pt" 
        vAlign=top width=125>
          <P class=MsoNormal><SPAN 
          style="FONT-SIZE: 8.5pt; COLOR: #55549a; FONT-FAMILY: Arial"><A 
          href="http://www.bell.ca/"><SPAN 
          style="TEXT-DECORATION: none; text-underline: none"><IMG 
          id=_x0000_i1026 height=46 alt="Bell Canada Logo" 
          src="http://www3.sympatico.ca/doughan/small_bell_logo.gif" width=102 
          border=0 NOSEND="1"></SPAN></A></SPAN></P></TD>
        <TD 
        style="PADDING-RIGHT: 1.5pt; PADDING-LEFT: 1.5pt; PADDING-BOTTOM: 1.5pt; WIDTH: 135pt; PADDING-TOP: 1.5pt" 
        vAlign=top width=180>
          <P class=MsoNormal><SPAN 
          style="FONT-SIZE: 8.5pt; COLOR: #55549a; FONT-FAMILY: Arial">Doughan 
          Turk, M.Eng., CCIE#7274<BR>Bell Canada, IP Engineering<BR>Flr 3, 100 
          Wynford Dr. <BR>Toronto M3C 1K4</SPAN></P></TD>
        <TD 
        style="PADDING-RIGHT: 1.5pt; PADDING-LEFT: 1.5pt; PADDING-BOTTOM: 1.5pt; PADDING-TOP: 1.5pt" 
        vAlign=top noWrap>
          <P class=MsoNormal><SPAN 
          style="FONT-SIZE: 8.5pt; COLOR: #55549a; FONT-FAMILY: Arial">Voice: 
          (416) 353-0368<BR>Fax: (416) 445-9893<BR>Pager: (416) 685-0590<BR><A 
          href="http://www.bell.ca/"><FONT color=#ffb900><SPAN 
          style="COLOR: #ffb900">Bell 
        Canada</SPAN></FONT></A></SPAN></FONT></P></TD>
        <TD 
        style="PADDING-RIGHT: 1.5pt; PADDING-LEFT: 1.5pt; PADDING-BOTTOM: 1.5pt; WIDTH: 52.15pt; PADDING-TOP: 1.5pt" 
        vAlign=top width=70>
          <P class=MsoNormal style="TEXT-ALIGN: right" align=right><SPAN 
          style="FONT-SIZE: 8.5pt; COLOR: #55549a; FONT-FAMILY: Arial"><IMG 
          id=_x0000_i1027 height=60 alt="Cisco Certified Internetwork Expert" 
          src="http://www3.sympatico.ca/doughan/ccie_large.gif" width=61 
          border=0 NOSEND="1"></SPAN></P></TD></TR>
      <TR vAlign=top>
        <TD colSpan=3><B><I><FONT face=Georgia color=#55549a 
          size=1>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; &nbsp;&nbsp;&nbsp;</FONT><FONT 
          style="FONT-SIZE: 8.5pt" face=Georgia color=#55549a>"To Engineer and 
          Integrate world 
          class<BR>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; 
          &nbsp;&nbsp;&nbsp;IP Services and Technologies for BCE" 
        </FONT></I></B></TD></TR></TBODY></TABLE></DIV>
    <DIV>&nbsp;</DIV></BLOCKQUOTE></BLOCKQUOTE></BODY></HTML>

------_=_NextPart_001_01C23504.65669C40--



Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id QAA17930 for <idr-archive@nic.merit.edu>; Fri, 26 Jul 2002 16:55:29 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id F378B91349; Fri, 26 Jul 2002 16:55:01 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id C2A2B9134A; Fri, 26 Jul 2002 16:55:00 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id C357291349 for <idr@trapdoor.merit.edu>; Fri, 26 Jul 2002 16:54:59 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id B0DF25DF37; Fri, 26 Jul 2002 16:54:59 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from mailgate.pit.comms.marconi.com (mailgate.pit.comms.marconi.com [169.144.68.6]) by segue.merit.edu (Postfix) with ESMTP id 6B7C15DE42 for <idr@merit.edu>; Fri, 26 Jul 2002 16:54:59 -0400 (EDT)
Received: from mailman.pit.comms.marconi.com (mailman.pit.comms.marconi.com [169.144.2.12]) by mailgate.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id QAA29873; Fri, 26 Jul 2002 16:54:56 -0400 (EDT)
Received: from whq-msgrtr-01.pit.comms.marconi.com (whq-msgrtr-01.pit.comms.marconi.com [169.144.2.221]) by mailman.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id QAA07183; Fri, 26 Jul 2002 16:54:58 -0400 (EDT)
Received: by whq-msgrtr-01.pit.comms.marconi.com with Internet Mail Service (5.5.2650.21) id <3W99DVKQ>; Fri, 26 Jul 2002 16:54:57 -0400
Message-ID: <39469E08BD83D411A3D900204840EC558227DB@vie-msgusr-01.dc.fore.com>
From: "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
To: "'Turk, Doughan A.'" <doughan.turk@bell.ca>, idr@merit.edu
Subject: RE: Need your opinion of this.
Date: Fri, 26 Jul 2002 16:54:56 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2650.21)
Content-Type: multipart/alternative; boundary="----_=_NextPart_001_01C234E6.B2B1E2C0"
Sender: owner-idr@merit.edu
Precedence: bulk

This message is in MIME format. Since your mail reader does not understand
this format, some or all of this message may not be legible.

------_=_NextPart_001_01C234E6.B2B1E2C0
Content-Type: text/plain;
	charset="iso-8859-1"

Doughan:
  Let me see if I understand the fundamental point of your draft. You want
BGP via communities to be the big MAGNET that catches all the denial of
service attack routes and shoves them through some sink hole router that
will keep statistics and knowledge of the source of the attack and be able
to report it to the operator?
 
Did I summarize it properly?
 
My first reaction, is what kind of overhead will these techniques impose on
the routers and the sink hole one? 
 
Will it be scallable? 
 
Can this be done only on border routers, Router Reflectors? To minimize the
processing load needed?
 
What real life cases have you seen where this mechanism was needed?
 
Have you prototyped this idea and gotten some empirical data?
 
Ben

-----Original Message-----
From: Turk, Doughan A. [mailto:doughan.turk@bell.ca]
Sent: Friday, July 26, 2002 4:09 PM
To: idr@merit.edu
Cc: Turk, Doughan A.
Subject: Need your opinion of this.


Folks, 
 
I would like to get some discussion started on a draft I wrote. It was
suggested to me to have the draft worked on under this group. I appreciate
any constructive comments and pointers. In summary, the draft could be
classified as an operational one. It is an enhancement of a technique that
already exists but relying more on BGP as a protocol to help in network
security issues.
 
A copy of the draft could be found at
http://www.ietf.org/internet-drafts/draft-turk-ertb-00.txt
<http://www.ietf.org/internet-drafts/draft-turk-ertb-00.txt> 
 
Regards
Doughan Turk
 
   _____  


 <http://www.bell.ca/> Bell Canada Logo

Doughan Turk, M.Eng., CCIE#7274
Bell Canada, IP Engineering
Flr 3, 100 Wynford Dr. 
Toronto M3C 1K4

Voice: (416) 353-0368
Fax: (416) 445-9893
Pager: (416) 685-0590
 <http://www.bell.ca/> Bell Canada

 Cisco Certified Internetwork Expert
<http://www3.sympatico.ca/doughan/ccie_large.gif> 

         "To Engineer and Integrate world class
                              IP Services and Technologies for BCE" 	
 


------_=_NextPart_001_01C234E6.B2B1E2C0
Content-Type: text/html;
	charset="iso-8859-1"

<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
<HTML><HEAD>
<META HTTP-EQUIV="Content-Type" CONTENT="text/html; charset=iso-8859-1">
<TITLE>Message</TITLE>

<META content="MSHTML 5.00.2920.0" name=GENERATOR></HEAD>
<BODY>
<DIV><FONT color=#0000ff face=Arial size=2><SPAN 
class=283324920-26072002>Doughan:</SPAN></FONT></DIV>
<DIV><FONT color=#0000ff face=Arial size=2><SPAN class=283324920-26072002>&nbsp; 
Let me see if I understand the fundamental point of your draft. You want BGP via 
communities to be the big MAGNET that catches all the denial of service attack 
routes and shoves them through some sink hole router that will keep statistics 
and knowledge of the source of the attack and be able to report it to the 
operator?</SPAN></FONT></DIV>
<DIV>&nbsp;</DIV>
<DIV><FONT color=#0000ff face=Arial size=2><SPAN class=283324920-26072002>Did I 
summarize it properly?</SPAN></FONT></DIV>
<DIV><FONT color=#0000ff face=Arial size=2><SPAN 
class=283324920-26072002></SPAN></FONT>&nbsp;</DIV>
<DIV><FONT color=#0000ff face=Arial size=2><SPAN class=283324920-26072002>My 
first reaction, is what kind of overhead will these techniques impose on the 
routers and the sink hole one? </SPAN></FONT></DIV>
<DIV><FONT color=#0000ff face=Arial size=2><SPAN 
class=283324920-26072002></SPAN></FONT>&nbsp;</DIV>
<DIV><FONT color=#0000ff face=Arial size=2><SPAN class=283324920-26072002>Will 
it be scallable? </SPAN></FONT></DIV>
<DIV><FONT color=#0000ff face=Arial size=2><SPAN 
class=283324920-26072002></SPAN></FONT>&nbsp;</DIV>
<DIV><FONT color=#0000ff face=Arial size=2><SPAN class=283324920-26072002>Can 
this be done only on border routers, Router Reflectors? To minimize the 
processing load needed?</SPAN></FONT></DIV>
<DIV><FONT color=#0000ff face=Arial size=2><SPAN 
class=283324920-26072002></SPAN></FONT>&nbsp;</DIV>
<DIV><FONT color=#0000ff face=Arial size=2><SPAN class=283324920-26072002>What 
real life cases have you seen where this mechanism was 
needed?</SPAN></FONT></DIV>
<DIV><FONT color=#0000ff face=Arial size=2><SPAN 
class=283324920-26072002></SPAN></FONT>&nbsp;</DIV>
<DIV><FONT color=#0000ff face=Arial size=2><SPAN class=283324920-26072002>Have 
you prototyped this idea and gotten some empirical data?</SPAN></FONT></DIV>
<DIV><FONT color=#0000ff face=Arial size=2><SPAN 
class=283324920-26072002></SPAN></FONT>&nbsp;</DIV>
<DIV><FONT color=#0000ff face=Arial size=2><SPAN 
class=283324920-26072002>Ben</SPAN></FONT></DIV>
<BLOCKQUOTE 
style="BORDER-LEFT: #0000ff 2px solid; MARGIN-LEFT: 5px; MARGIN-RIGHT: 0px; PADDING-LEFT: 5px">
  <DIV align=left class=OutlookMessageHeader dir=ltr><FONT face=Tahoma 
  size=2>-----Original Message-----<BR><B>From:</B> Turk, Doughan A. 
  [mailto:doughan.turk@bell.ca]<BR><B>Sent:</B> Friday, July 26, 2002 4:09 
  PM<BR><B>To:</B> idr@merit.edu<BR><B>Cc:</B> Turk, Doughan 
  A.<BR><B>Subject:</B> Need your opinion of this.<BR><BR></DIV></FONT>
  <DIV><FONT face=Arial size=2><SPAN class=247400120-26072002>Folks, 
  </SPAN></FONT></DIV>
  <DIV><FONT face=Arial size=2><SPAN 
  class=247400120-26072002></SPAN></FONT>&nbsp;</DIV>
  <DIV><FONT face=Arial size=2><SPAN class=247400120-26072002>I would like to 
  get some discussion started on a draft I wrote. It was suggested to me to have 
  the draft worked on under this group. I appreciate any constructive comments 
  and pointers. In summary, the draft could be classified as an operational one. 
  It is an enhancement of a technique that already exists but relying more on 
  BGP as a protocol to help in network security issues.</SPAN></FONT></DIV>
  <DIV><FONT face=Arial size=2><SPAN 
  class=247400120-26072002></SPAN></FONT>&nbsp;</DIV>
  <DIV><FONT face=Arial size=2><SPAN class=247400120-26072002>A copy of the 
  draft could be found at <A 
  href="http://www.ietf.org/internet-drafts/draft-turk-ertb-00.txt">http://www.ietf.org/internet-drafts/draft-turk-ertb-00.txt</A></SPAN></FONT></DIV>
  <DIV><FONT face=Arial size=2><SPAN 
  class=247400120-26072002></SPAN></FONT>&nbsp;</DIV>
  <DIV><FONT face=Arial size=2><SPAN 
  class=247400120-26072002>Regards</SPAN></FONT></DIV>
  <DIV><FONT face=Arial size=2><SPAN class=247400120-26072002>Doughan 
  Turk</SPAN></FONT></DIV>
  <DIV>&nbsp;</DIV>
  <DIV class=Section1>
  <DIV align=center class=MsoNormal style="TEXT-ALIGN: center">
  <HR align=center color=#cccccc noShade SIZE=1 width="100%">
  </DIV>
  <TABLE border=0 cellPadding=0 cellSpacing=0 class=MsoNormalTable 
  style="BORDER-COLLAPSE: collapse; mso-padding-alt: 0in 5.4pt 0in 5.4pt">
    <TBODY>
    <TR style="mso-yfti-irow: 0">
      <TD 
      style="PADDING-BOTTOM: 1.5pt; PADDING-LEFT: 1.5pt; PADDING-RIGHT: 1.5pt; PADDING-TOP: 1.5pt; WIDTH: 93.75pt" 
      vAlign=top width=125>
        <P class=MsoNormal><SPAN 
        style="COLOR: #55549a; FONT-FAMILY: Arial; FONT-SIZE: 8.5pt"><A 
        href="http://www.bell.ca/"><SPAN 
        style="TEXT-DECORATION: none; text-underline: none"><IMG 
        alt="Bell Canada Logo" border=0 height=46 id=_x0000_i1026 
        src="http://www3.sympatico.ca/doughan/small_bell_logo.gif" width=102 
        NOSEND="1"></SPAN></A></SPAN></P></TD>
      <TD 
      style="PADDING-BOTTOM: 1.5pt; PADDING-LEFT: 1.5pt; PADDING-RIGHT: 1.5pt; PADDING-TOP: 1.5pt; WIDTH: 135pt" 
      vAlign=top width=180>
        <P class=MsoNormal><SPAN 
        style="COLOR: #55549a; FONT-FAMILY: Arial; FONT-SIZE: 8.5pt">Doughan 
        Turk, M.Eng., CCIE#7274<BR>Bell Canada, IP Engineering<BR>Flr 3, 100 
        Wynford Dr. <BR>Toronto M3C 1K4</SPAN></P></TD>
      <TD noWrap 
      style="PADDING-BOTTOM: 1.5pt; PADDING-LEFT: 1.5pt; PADDING-RIGHT: 1.5pt; PADDING-TOP: 1.5pt" 
      vAlign=top>
        <P class=MsoNormal><SPAN 
        style="COLOR: #55549a; FONT-FAMILY: Arial; FONT-SIZE: 8.5pt">Voice: 
        (416) 353-0368<BR>Fax: (416) 445-9893<BR>Pager: (416) 685-0590<BR><A 
        href="http://www.bell.ca/"><FONT color=#ffb900><SPAN 
        style="COLOR: #ffb900">Bell 
Canada</SPAN></FONT></A></SPAN></FONT></P></TD>
      <TD 
      style="PADDING-BOTTOM: 1.5pt; PADDING-LEFT: 1.5pt; PADDING-RIGHT: 1.5pt; PADDING-TOP: 1.5pt; WIDTH: 52.15pt" 
      vAlign=top width=70>
        <P align=right class=MsoNormal style="TEXT-ALIGN: right"><SPAN 
        style="COLOR: #55549a; FONT-FAMILY: Arial; FONT-SIZE: 8.5pt"><IMG 
        alt="Cisco Certified Internetwork Expert" border=0 height=60 
        id=_x0000_i1027 src="http://www3.sympatico.ca/doughan/ccie_large.gif" 
        width=61 NOSEND="1"></SPAN></P></TD></TR>
    <TR vAlign=top>
      <TD colSpan=3><B><I><FONT color=#55549a face=Georgia 
        size=1>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; &nbsp;&nbsp;&nbsp;</FONT><FONT 
        color=#55549a face=Georgia style="FONT-SIZE: 8.5pt">"To Engineer and 
        Integrate world 
        class<BR>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; 
        &nbsp;&nbsp;&nbsp;IP Services and Technologies for BCE" 
      </FONT></I></B></TD></TR></TBODY></TABLE></DIV>
  <DIV>&nbsp;</DIV></BLOCKQUOTE></BODY></HTML>

------_=_NextPart_001_01C234E6.B2B1E2C0--


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id QAA16451 for <idr-archive@nic.merit.edu>; Fri, 26 Jul 2002 16:09:55 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id B719E91347; Fri, 26 Jul 2002 16:09:26 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 821CF91348; Fri, 26 Jul 2002 16:09:26 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 2B8C991347 for <idr@trapdoor.merit.edu>; Fri, 26 Jul 2002 16:09:25 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 113E15DE42; Fri, 26 Jul 2002 16:09:25 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from dm3cnd.bell.ca (dm3cnd.bell.ca [206.47.0.146]) by segue.merit.edu (Postfix) with SMTP id C79035DDC9 for <idr@merit.edu>; Fri, 26 Jul 2002 16:09:24 -0400 (EDT)
Received: from 206.47.0.152 by dm3cnd.bell.ca with ESMTP (Tumbleweed MMS SMTP Relay (MMS v4.7)); Fri, 26 Jul 2002 16:09:24 -0400
X-Server-Uuid: b85f21a3-cfd1-11d3-8401-00104bf46ab7
Received: from mta2.on.bell.ca (MTA2 [198.235.102.149]) by mta2.on.bell.ca with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13) id GMA09ARG; Fri, 26 Jul 2002 16:09:24 -0400
Received: FROM toroondc754.on.bell.ca BY mta2.on.bell.ca ; Fri Jul 26 16:09:23 2002 -0400
Received: by toroondc754.on.bell.ca with Internet Mail Service ( 5.5.2653.19) id <3T1H17T8>; Fri, 26 Jul 2002 16:09:23 -0400
Message-ID: <7A733663D06C92499EF88AC4FD660EE302084641@TOROONDC755.on.bell.ca>
From: "Turk, Doughan A." <doughan.turk@bell.ca>
To: idr@merit.edu
Cc: "Turk, Doughan A." <doughan.turk@bell.ca>
Subject: Need your opinion of this.
Date: Fri, 26 Jul 2002 16:09:19 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
X-WSS-ID: 115F73FE134713-01-01
Content-Type: multipart/alternative;  boundary="----_=_NextPart_001_01C234E0.53791CC0"
Sender: owner-idr@merit.edu
Precedence: bulk

This message is in MIME format. Since your mail reader does not understand
this format, some or all of this message may not be legible.

------_=_NextPart_001_01C234E0.53791CC0
Content-Type: text/plain; 
 charset=us-ascii
Content-Transfer-Encoding: 7bit

Folks, 
 
I would like to get some discussion started on a draft I wrote. It was
suggested to me to have the draft worked on under this group. I appreciate
any constructive comments and pointers. In summary, the draft could be
classified as an operational one. It is an enhancement of a technique that
already exists but relying more on BGP as a protocol to help in network
security issues.
 
A copy of the draft could be found at
http://www.ietf.org/internet-drafts/draft-turk-ertb-00.txt
<http://www.ietf.org/internet-drafts/draft-turk-ertb-00.txt> 
 
Regards
Doughan Turk
 
  _____  


 <http://www.bell.ca/> Bell Canada Logo

Doughan Turk, M.Eng., CCIE#7274
Bell Canada, IP Engineering
Flr 3, 100 Wynford Dr. 
Toronto M3C 1K4

Voice: (416) 353-0368
Fax: (416) 445-9893
Pager: (416) 685-0590
 <http://www.bell.ca/> Bell Canada

 Cisco Certified Internetwork Expert
<http://www3.sympatico.ca/doughan/ccie_large.gif> 

         "To Engineer and Integrate world class
                              IP Services and Technologies for BCE" 	
 

------_=_NextPart_001_01C234E0.53791CC0
Content-Type: text/html; 
 charset=us-ascii
Content-Transfer-Encoding: quoted-printable

<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
<HTML><HEAD>
<META HTTP-EQUIV=3D"Content-Type" CONTENT=3D"text/html; =
charset=3Dus-ascii">
<TITLE>Message</TITLE>

<META content=3D"MSHTML 6.00.2716.2200" name=3DGENERATOR></HEAD>
<BODY>
<DIV><FONT face=3DArial size=3D2><SPAN =
class=3D247400120-26072002>Folks,=20
</SPAN></FONT></DIV>
<DIV><FONT face=3DArial size=3D2><SPAN=20
class=3D247400120-26072002></SPAN></FONT>&nbsp;</DIV>
<DIV><FONT face=3DArial size=3D2><SPAN class=3D247400120-26072002>I =
would like to get=20
some discussion started on a draft I wrote. It was suggested to me to =
have the=20
draft worked on under this group. I appreciate any constructive =
comments and=20
pointers. In summary, the draft could be classified as an operational =
one. It is=20
an enhancement of a technique that already exists but relying more on =
BGP as a=20
protocol to help in network security issues.</SPAN></FONT></DIV>
<DIV><FONT face=3DArial size=3D2><SPAN=20
class=3D247400120-26072002></SPAN></FONT>&nbsp;</DIV>
<DIV><FONT face=3DArial size=3D2><SPAN class=3D247400120-26072002>A =
copy of the draft=20
could be found at <A=20
href=3D"http://www.ietf.org/internet-drafts/draft-turk-ertb-00.txt">http=
://www.ietf.org/internet-drafts/draft-turk-ertb-00.txt</A></SPAN></FONT>=
</DIV>
<DIV><FONT face=3DArial size=3D2><SPAN=20
class=3D247400120-26072002></SPAN></FONT>&nbsp;</DIV>
<DIV><FONT face=3DArial size=3D2><SPAN=20
class=3D247400120-26072002>Regards</SPAN></FONT></DIV>
<DIV><FONT face=3DArial size=3D2><SPAN =
class=3D247400120-26072002>Doughan=20
Turk</SPAN></FONT></DIV>
<DIV>&nbsp;</DIV>
<DIV class=3DSection1>
<DIV class=3DMsoNormal style=3D"TEXT-ALIGN: center" align=3Dcenter>
<HR align=3Dcenter width=3D"100%" color=3D#cccccc noShade SIZE=3D1>
</DIV>
<TABLE class=3DMsoNormalTable=20
style=3D"BORDER-COLLAPSE: collapse; mso-padding-alt: 0in 5.4pt 0in =
5.4pt"=20
cellSpacing=3D0 cellPadding=3D0 border=3D0>
  <TBODY>
  <TR style=3D"mso-yfti-irow: 0">
    <TD=20
    style=3D"PADDING-RIGHT: 1.5pt; PADDING-LEFT: 1.5pt; PADDING-BOTTOM: =
1.5pt; WIDTH: 93.75pt; PADDING-TOP: 1.5pt"=20
    vAlign=3Dtop width=3D125>
      <P class=3DMsoNormal><SPAN=20
      style=3D"FONT-SIZE: 8.5pt; COLOR: #55549a; FONT-FAMILY: Arial"><A =

      href=3D"http://www.bell.ca/"><SPAN=20
      style=3D"TEXT-DECORATION: none; text-underline: none"><IMG =
id=3D_x0000_i1026=20
      height=3D46 alt=3D"Bell Canada Logo"=20
      src=3D"http://www3.sympatico.ca/doughan/small_bell_logo.gif" =
width=3D102=20
      border=3D0></SPAN></A></SPAN></P></TD>
    <TD=20
    style=3D"PADDING-RIGHT: 1.5pt; PADDING-LEFT: 1.5pt; PADDING-BOTTOM: =
1.5pt; WIDTH: 135pt; PADDING-TOP: 1.5pt"=20
    vAlign=3Dtop width=3D180>
      <P class=3DMsoNormal><SPAN=20
      style=3D"FONT-SIZE: 8.5pt; COLOR: #55549a; FONT-FAMILY: =
Arial">Doughan Turk,=20
      M.Eng., CCIE#7274<BR>Bell Canada, IP Engineering<BR>Flr 3, 100 =
Wynford Dr.=20
      <BR>Toronto M3C 1K4</SPAN></P></TD>
    <TD=20
    style=3D"PADDING-RIGHT: 1.5pt; PADDING-LEFT: 1.5pt; PADDING-BOTTOM: =
1.5pt; PADDING-TOP: 1.5pt"=20
    vAlign=3Dtop noWrap>
      <P class=3DMsoNormal><SPAN=20
      style=3D"FONT-SIZE: 8.5pt; COLOR: #55549a; FONT-FAMILY: =
Arial">Voice: (416)=20
      353-0368<BR>Fax: (416) 445-9893<BR>Pager: (416) 685-0590<BR><A=20
      href=3D"http://www.bell.ca/"><FONT color=3D#ffb900><SPAN=20
      style=3D"COLOR: #ffb900">Bell =
Canada</SPAN></FONT></A></SPAN></FONT></P></TD>
    <TD=20
    style=3D"PADDING-RIGHT: 1.5pt; PADDING-LEFT: 1.5pt; PADDING-BOTTOM: =
1.5pt; WIDTH: 52.15pt; PADDING-TOP: 1.5pt"=20
    vAlign=3Dtop width=3D70>
      <P class=3DMsoNormal style=3D"TEXT-ALIGN: right" =
align=3Dright><SPAN=20
      style=3D"FONT-SIZE: 8.5pt; COLOR: #55549a; FONT-FAMILY: =
Arial"><IMG=20
      id=3D_x0000_i1027 height=3D60 alt=3D"Cisco Certified Internetwork =
Expert"=20
      src=3D"http://www3.sympatico.ca/doughan/ccie_large.gif" =
width=3D61=20
      border=3D0></SPAN></P></TD></TR>
  <TR vAlign=3Dtop>
    <TD colSpan=3D3><B><I><FONT face=3DGeorgia color=3D#55549a=20
      size=3D1>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; =
&nbsp;&nbsp;&nbsp;</FONT><FONT=20
      style=3D"FONT-SIZE: 8.5pt" face=3DGeorgia color=3D#55549a>"To =
Engineer and=20
      Integrate world=20
      =
class<BR>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nb=
sp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nb=
sp;&nbsp;&nbsp;&nbsp;=20
      &nbsp;&nbsp;&nbsp;IP Services and Technologies for BCE"=20
  </FONT></I></B></TD></TR></TBODY></TABLE></DIV>
<DIV>&nbsp;</DIV></BODY></HTML>

------_=_NextPart_001_01C234E0.53791CC0--



Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id OAA12315 for <idr-archive@nic.merit.edu>; Fri, 26 Jul 2002 14:01:39 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id D08DD91346; Fri, 26 Jul 2002 14:00:26 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 573A191348; Fri, 26 Jul 2002 14:00:26 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 8CDEA91346 for <idr@trapdoor.merit.edu>; Fri, 26 Jul 2002 14:00:24 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 16FAA5DE16; Fri, 26 Jul 2002 14:00:22 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from presque.djinesys.com (dns.nexthop.com [64.211.218.216]) by segue.merit.edu (Postfix) with ESMTP id 4E4EB5DD8C for <idr@merit.edu>; Fri, 26 Jul 2002 14:00:21 -0400 (EDT)
Received: (from root@localhost) by presque.djinesys.com (8.11.3/8.11.1) id g6QI0Iv09717; Fri, 26 Jul 2002 14:00:18 -0400 (EDT) (envelope-from jhaas@jhaas.nexthop.com)
Received: from jhaas.nexthop.com (jhaas.nexthop.com [64.211.218.31]) by presque.djinesys.com (8.11.3/8.11.1) with ESMTP id g6QI0E109709; Fri, 26 Jul 2002 14:00:14 -0400 (EDT) (envelope-from jhaas@jhaas.nexthop.com)
Received: (from jhaas@localhost) by jhaas.nexthop.com (8.11.3nb1/8.11.3) id g6QI0Ea09163; Fri, 26 Jul 2002 14:00:14 -0400 (EDT)
Date: Fri, 26 Jul 2002 14:00:14 -0400
From: Jeffrey Haas <jhaas@nexthop.com>
To: "Natale, Jonathan" <JNatale@celoxnetworks.com>
Cc: "'Sprzaczkowski, Krzysztof'" <Krzysztof.Sprzaczkowski@intel.com>, idr@merit.edu
Subject: Re: "third party" next-hop
Message-ID: <20020726140014.J7985@nexthop.com>
References: <1117F7D44159934FB116E36F4ABF221B020AF8BC@celox-ma1-ems1.celoxnetworks.com>
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
User-Agent: Mutt/1.2.5i
In-Reply-To: <1117F7D44159934FB116E36F4ABF221B020AF8BC@celox-ma1-ems1.celoxnetworks.com>; from JNatale@celoxnetworks.com on Fri, Jul 26, 2002 at 12:49:04PM -0400
X-Virus-Scanned: by AMaViS perl-11
Sender: owner-idr@merit.edu
Precedence: bulk

On Fri, Jul 26, 2002 at 12:49:04PM -0400, Natale, Jonathan wrote:
> Of course you could have another shared subnet that you are not peering
> across--I don't think this case was ever considered/implemented.

This is fine so long as you don't receive a third-party nexthop
for an subnets that you both have configured, but aren't actually
the same subnet.

(Think separate rfc 1918 subnets that are similarly numbered.)

If you are concerned about receiving third party announcements,
force the nexthop to be first party to your peering session.  That
will result in sub-optimal forwarding, but it should still work.

-- 
Jeff Haas 
NextHop Technologies


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id NAA12209 for <idr-archive@nic.merit.edu>; Fri, 26 Jul 2002 13:58:41 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id DEA4391344; Fri, 26 Jul 2002 13:58:13 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id AFD6191345; Fri, 26 Jul 2002 13:58:13 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id A455391344 for <idr@trapdoor.merit.edu>; Fri, 26 Jul 2002 13:58:12 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 878A55DE16; Fri, 26 Jul 2002 13:58:12 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from presque.djinesys.com (dns.nexthop.com [64.211.218.216]) by segue.merit.edu (Postfix) with ESMTP id 07CEF5DD8C for <idr@merit.edu>; Fri, 26 Jul 2002 13:58:12 -0400 (EDT)
Received: (from root@localhost) by presque.djinesys.com (8.11.3/8.11.1) id g6QHw9j09561; Fri, 26 Jul 2002 13:58:09 -0400 (EDT) (envelope-from jhaas@jhaas.nexthop.com)
Received: from jhaas.nexthop.com (jhaas.nexthop.com [64.211.218.31]) by presque.djinesys.com (8.11.3/8.11.1) with ESMTP id g6QHw4109546; Fri, 26 Jul 2002 13:58:04 -0400 (EDT) (envelope-from jhaas@jhaas.nexthop.com)
Received: (from jhaas@localhost) by jhaas.nexthop.com (8.11.3nb1/8.11.3) id g6QHw3M09146; Fri, 26 Jul 2002 13:58:03 -0400 (EDT)
Date: Fri, 26 Jul 2002 13:58:03 -0400
From: Jeffrey Haas <jhaas@nexthop.com>
To: "Sprzaczkowski, Krzysztof" <Krzysztof.Sprzaczkowski@intel.com>
Cc: idr@merit.edu
Subject: Re: "third party" next-hop
Message-ID: <20020726135803.I7985@nexthop.com>
References: <413FBB0BA5AED1119122000083234B1A02B41E6E@alpha.igk.intel.com>
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
User-Agent: Mutt/1.2.5i
In-Reply-To: <413FBB0BA5AED1119122000083234B1A02B41E6E@alpha.igk.intel.com>; from Krzysztof.Sprzaczkowski@intel.com on Fri, Jul 26, 2002 at 06:31:21PM +0200
X-Virus-Scanned: by AMaViS perl-11
Sender: owner-idr@merit.edu
Precedence: bulk

On Fri, Jul 26, 2002 at 06:31:21PM +0200, Sprzaczkowski, Krzysztof wrote:
> According to this, should we compare next-hop
> address subnet with peer X address subnet (not with interface's address
> subnet where a given peer is established)?

One could know by configuration what the subnet of the remote side is
for interface/nexthop comparisons.

> Krzysztof Sprzaczkowski

-- 
Jeff Haas 
NextHop Technologies


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id NAA11661 for <idr-archive@nic.merit.edu>; Fri, 26 Jul 2002 13:41:41 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 5ABE99133F; Fri, 26 Jul 2002 13:40:06 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 1FFC691348; Fri, 26 Jul 2002 13:40:06 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 0F99F9133F for <idr@trapdoor.merit.edu>; Fri, 26 Jul 2002 13:40:03 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id EA49B5DEA8; Fri, 26 Jul 2002 13:40:02 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from sj-msg-core-2.cisco.com (sj-msg-core-2.cisco.com [171.69.24.11]) by segue.merit.edu (Postfix) with ESMTP id A10D75DD8C for <idr@merit.edu>; Fri, 26 Jul 2002 13:40:02 -0400 (EDT)
Received: from achandra-u10.cisco.com (achandra-u10.cisco.com [128.107.162.129]) by sj-msg-core-2.cisco.com (8.12.2/8.12.2) with ESMTP id g6QHe1hI026089; Fri, 26 Jul 2002 10:40:01 -0700 (PDT)
Received: (achandra@localhost) by achandra-u10.cisco.com (8.8.8-Cisco List Logging/CISCO.WS.1.2) id KAA07748; Fri, 26 Jul 2002 10:40:01 -0700 (PDT)
Date: Fri, 26 Jul 2002 10:40:01 -0700
From: Chandrashekhar Appanna <achandra@cisco.com>
To: Pedro Roque Marques <roque@juniper.net>
Cc: idr@merit.edu
Subject: Re: inform (was: IDR WG minutes IETF 54)
Message-ID: <20020726104001.A7742@achandra-u10.cisco.com>
References: <p05111a00b95d0a37ad7b@[133.93.77.124]> <200207190147.g6J1la176882@roque-bsd.juniper.net>
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
User-Agent: Mutt/1.2.5.1i
In-Reply-To: <200207190147.g6J1la176882@roque-bsd.juniper.net>; from roque@juniper.net on Thu, Jul 18, 2002 at 06:47:36PM -0700
Sender: owner-idr@merit.edu
Precedence: bulk

In addition I think it needs to be discussed how one can
mandate that actions should not be automated/taken on rcv'ing
an INFORM - I think it cannot be mandated. And a compliance 
check will also not be possible since not all implementations 
are open source :)

-
Chandra A.

On Thu, Jul 18, 2002 at 06:47:36PM -0700, Pedro Roque Marques wrote:
> David Ward writes:
> 
> > INFORM message I. Gargi Nalawade presenter draft-nalawade-bgp-inform-00.txt 1.
> > Goals a. no way to tell remote
> > peer that there has been an error or event w/o use of NOTIFICATION
> > i. this resets peering session b. no change to existing
> > implementations c. limit sharing of messages between peers
> > 2. Encoding overview a. Packet format b. Event codes c. Data types
> > 3. Rx'ing the INFORM a. log it b. inform the administrator
> > (implementation specific) Question: How to use w/ NOTIFICATION?
> > Answer: Use it to inform that you are going to reset. Use
> > immediately before the NOTIFICATION but, now can add
> > PDU/ATTR/whatever to help debug.
> 
> > Question: Will this cause a change to the FSM?  Answser: No
> 
> > Questons: What is "too many routes option?" How to define?  Answer:
> > It is an implementation specific configuration
> 
> > Question: Jeff Pickering Should a peer take any specific action upon
> > reception of the INFORM?  The authors should explicitly prohibit any
> > automated action
> 
> > Statement: Andrew Lange Move forward draft
> 
> > Outcome: Sue Hares Accepted as WG item pending taking it to list
> 
> I object to making this a WG document.
> 
> I believe this mechanism is unnecessary and that no sufficient
> justification has been presented to add this to the spec. There should
> be a very compeling reason to be making changes so basic such as
> adding new messages.
> 
> A couple of comments on the details above:
> 
> > Question: How to use w/ NOTIFICATION?
> > Answer: Use it to inform that you are going to reset. Use
> > immediately before the NOTIFICATION but, now can add
> > PDU/ATTR/whatever to help debug.
> 
> How can that be of extra value given that notification allows you to
> carry variable lenght data ? Adding this message is just going to
> cause a debugging mess with all the deployed systems out there that do
> not implement it.
> 
> > Question: Jeff Pickering Should a peer take any specific action upon
> > reception of the INFORM?
> 
> Yes, reset the session, at least in the case that has been vastly
> discussed in the list which is receicept of invalid nlri. Furtunatly
> no code needs to be written to achieve that since it is the default
> action on a receipt of an invalid message code.
> 
>   Pedro.


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id MAA09936 for <idr-archive@nic.merit.edu>; Fri, 26 Jul 2002 12:49:27 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 24D6D91339; Fri, 26 Jul 2002 12:49:12 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id E42FE9133A; Fri, 26 Jul 2002 12:49:11 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 9593D91339 for <idr@trapdoor.merit.edu>; Fri, 26 Jul 2002 12:49:10 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 75AEF5DE14; Fri, 26 Jul 2002 12:49:10 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from celox-ma1-ems1.celoxnetworks.com (celox-ma1-imap1.celoxnetworks.com [12.40.60.233]) by segue.merit.edu (Postfix) with ESMTP id 87A705DDAF for <idr@merit.edu>; Fri, 26 Jul 2002 12:49:05 -0400 (EDT)
Received: by celox-ma1-ems1.celoxnetworks.com with Internet Mail Service (5.5.2653.19) id <3T6K1WM1>; Fri, 26 Jul 2002 12:49:05 -0400
Message-ID: <1117F7D44159934FB116E36F4ABF221B020AF8BC@celox-ma1-ems1.celoxnetworks.com>
From: "Natale, Jonathan" <JNatale@celoxnetworks.com>
To: "'Sprzaczkowski, Krzysztof'" <Krzysztof.Sprzaczkowski@intel.com>, idr@merit.edu
Subject: RE: "third party" next-hop 
Date: Fri, 26 Jul 2002 12:49:04 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
Content-Type: text/plain
Sender: owner-idr@merit.edu
Precedence: bulk

I think it boils down to:
   IF (nexthop is on subnet of interface used to reach peer) OR (ebgp
multi-hop enabled) OR (IBGP) THEN
    Accept it
ELSE
    Toss it

Older IOS (less than 12.0.?) did accept any next-hop.

Of course you could have another shared subnet that you are not peering
across--I don't think this case was ever considered/implemented.  So you
have a point (there are many things like this in the RFC).  I don't think
implementing this to the letter would be practical (and it would be
difficult to write, if possible at all).

-$0.02


-----Original Message-----
From: Sprzaczkowski, Krzysztof [mailto:Krzysztof.Sprzaczkowski@intel.com] 
Sent: Friday, July 26, 2002 12:31 PM
To: idr@merit.edu
Subject: "third party" next-hop 

I have a question about 'third party' next-hop in the context of IPv6 routes
sending over IPv4 session. Does 'third party' next-hop make sense in such
situation? Both documents draft-ietf-idr-bgp4-17 (chapter 5.1.3 NEXT_HOP)
and RFC 2858 (chapter 2 Multiprotocol Reachable NLRI - MP_REACH_NLRI) say
that for 'third party' next-hop peer X should share a common subnet with
received next-hop address. According to this, should we compare next-hop
address subnet with peer X address subnet (not with interface's address
subnet where a given peer is established)?

Krzysztof Sprzaczkowski



Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id MAA09495 for <idr-archive@nic.merit.edu>; Fri, 26 Jul 2002 12:35:23 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id DC80091336; Fri, 26 Jul 2002 12:35:03 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id A39E291339; Fri, 26 Jul 2002 12:35:03 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 83B9D91336 for <idr@trapdoor.merit.edu>; Fri, 26 Jul 2002 12:35:02 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 644255DE16; Fri, 26 Jul 2002 12:35:02 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from mail1.hd.intel.com (hdfdns01.hd.intel.com [192.52.58.10]) by segue.merit.edu (Postfix) with ESMTP id 1CAFF5DDAF for <idr@merit.edu>; Fri, 26 Jul 2002 12:35:02 -0400 (EDT)
Received: from fmsmsxvs040.fm.intel.com (fmsmsxvs040.fm.intel.com [132.233.42.124]) by mail1.hd.intel.com (8.11.6/8.11.6/d: solo.mc,v 1.42 2002/05/23 22:21:11 root Exp $) with SMTP id g6QGZ1I09651 for <idr@merit.edu>; Fri, 26 Jul 2002 16:35:01 GMT
Received: from alpha.igk.intel.com ([172.28.168.68]) by fmsmsxvs040.fm.intel.com (NAVGW 2.5.2.11) with SMTP id M2002072609341022522 for <idr@merit.edu>; Fri, 26 Jul 2002 09:34:10 -0700
Received: by alpha.igk.intel.com with Internet Mail Service (5.5.2653.19) id <PD56RK1N>; Fri, 26 Jul 2002 18:31:22 +0200
Message-ID: <413FBB0BA5AED1119122000083234B1A02B41E6E@alpha.igk.intel.com>
From: "Sprzaczkowski, Krzysztof" <Krzysztof.Sprzaczkowski@intel.com>
To: idr@merit.edu
Subject: "third party" next-hop 
Date: Fri, 26 Jul 2002 18:31:21 +0200
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
Content-Type: text/plain; charset="iso-8859-2"
Sender: owner-idr@merit.edu
Precedence: bulk

I have a question about 'third party' next-hop in the context of IPv6 routes
sending over IPv4 session. Does 'third party' next-hop make sense in such
situation? Both documents draft-ietf-idr-bgp4-17 (chapter 5.1.3 NEXT_HOP)
and RFC 2858 (chapter 2 Multiprotocol Reachable NLRI - MP_REACH_NLRI) say
that for 'third party' next-hop peer X should share a common subnet with
received next-hop address. According to this, should we compare next-hop
address subnet with peer X address subnet (not with interface's address
subnet where a given peer is established)?

Krzysztof Sprzaczkowski




Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id EAA23030 for <idr-archive@nic.merit.edu>; Fri, 26 Jul 2002 04:15:43 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 63DFB9124C; Fri, 26 Jul 2002 04:15:08 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id E88B89131D; Fri, 26 Jul 2002 04:15:07 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 92BE89124C for <idr@trapdoor.merit.edu>; Fri, 26 Jul 2002 04:15:05 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id DD0505DE25; Fri, 26 Jul 2002 04:15:05 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from ep_mailout1 (unknown [203.254.224.24]) by segue.merit.edu (Postfix) with ESMTP id 5658F5DD95 for <idr@merit.edu>; Fri, 26 Jul 2002 04:15:04 -0400 (EDT)
Received: from custom-daemon.mailout1.samsung.com by mailout1.samsung.com (iPlanet Messaging Server 5.1 (built Sep  5 2001)) id <0GZU00A01KCZ2Y@mailout1.samsung.com> for idr@merit.edu; Fri, 26 Jul 2002 17:17:23 +0900 (KST)
Received: from ep_mmp2 (localhost [127.0.0.1]) by mailout1.samsung.com (iPlanet Messaging Server 5.1 (built Sep  5 2001)) with ESMTP id <0GZU00A28KCZ1K@mailout1.samsung.com> for idr@merit.edu; Fri, 26 Jul 2002 17:17:23 +0900 (KST)
Received: from Manav ([107.108.3.180]) by mmp2.samsung.com (iPlanet Messaging Server 5.1 (built Sep  5 2001)) with ESMTPA id <0GZU0062TKD57H@mmp2.samsung.com> for idr@merit.edu; Fri, 26 Jul 2002 17:17:31 +0900 (KST)
Date: Fri, 26 Jul 2002 13:40:40 +0530
From: Manav Bhatia <manav@samsung.com>
Subject: Re: CEASE problems
To: "Pusz, MateuszX" <MateuszX.Pusz@intel.com>, idr@merit.edu
Reply-To: Manav Bhatia <manav@samsung.com>
Message-id: <009201c2347b$ef958310$b4036c6b@sisodomain.com>
MIME-version: 1.0
X-MIMEOLE: Produced By Microsoft MimeOLE V6.00.2600.0000
X-Mailer: Microsoft Outlook Express 6.00.2600.0000
Content-type: text/plain; charset=iso-8859-2
Content-transfer-encoding: 7BIT
X-Priority: 3
X-MSMail-priority: Normal
References: <413FBB0BA5AED1119122000083234B1A030A99FA@alpha.igk.intel.com>
Sender: owner-idr@merit.edu
Precedence: bulk

Hi Pusz,
If a BGP speaker decides to administratively reset the peering with a
neighbor, then the speaker should send a NOTIFICATION message with the
Error Code Cease, and the Error Subcode "Administratively Reset".

Regards,
Manav
----- Original Message -----
From: "Pusz, MateuszX" <MateuszX.Pusz@intel.com>
To: <idr@merit.edu>
Sent: Friday, July 26, 2002 12:37 PM
Subject: CEASE problems


| Hello
|
| I have one problem with CEASE message. 'draft-ietf-idr-bgp4-17' says in
6.7:
|    In absence of any fatal errors (that are indicated in this section),
|    a BGP peer may choose at any given time to close its BGP connection
|    by sending the NOTIFICATION message with Error Code Cease. However,
|    the Cease NOTIFICATION message must not be used when a fatal error
|    indicated by this section does exist.
|
| but in section 8-Established state:
|    In response to a stop event initiated by an operator:
|       - release all resources (including deleting all routes),
|       - set ConnectRetryCnt to zero (0),
|       - set connect retry timer to zero (0), and
|       - transition to the Idle.
|
| so how shuold it be? Should BGP router in such a situation send that
| notification message or not?
|
| Please help me
|
| Thoughts from IDR community welcome.
|
| Best regards
|
|
| Mateusz Pusz



Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id DAA21279 for <idr-archive@nic.merit.edu>; Fri, 26 Jul 2002 03:19:44 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id E85F19124A; Fri, 26 Jul 2002 03:19:11 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id B00659124B; Fri, 26 Jul 2002 03:19:11 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 6D7459124A for <idr@trapdoor.merit.edu>; Fri, 26 Jul 2002 03:17:23 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 4D33A5DDC5; Fri, 26 Jul 2002 03:17:23 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from ep_mailout1 (unknown [203.254.224.24]) by segue.merit.edu (Postfix) with ESMTP id DFBE85DD8C for <idr@merit.edu>; Fri, 26 Jul 2002 03:17:22 -0400 (EDT)
Received: from custom-daemon.mailout1.samsung.com by mailout1.samsung.com (iPlanet Messaging Server 5.1 (built Sep  5 2001)) id <0GZU00701HOTHS@mailout1.samsung.com> for idr@merit.edu; Fri, 26 Jul 2002 16:19:41 +0900 (KST)
Received: from ep_mmp2 (localhost [127.0.0.1]) by mailout1.samsung.com (iPlanet Messaging Server 5.1 (built Sep  5 2001)) with ESMTP id <0GZU00755HOSDO@mailout1.samsung.com> for idr@merit.edu; Fri, 26 Jul 2002 16:19:41 +0900 (KST)
Received: from Manav ([107.108.3.180]) by mmp2.samsung.com (iPlanet Messaging Server 5.1 (built Sep  5 2001)) with ESMTPA id <0GZU004DLHOZ48@mmp2.samsung.com> for idr@merit.edu; Fri, 26 Jul 2002 16:19:48 +0900 (KST)
Date: Fri, 26 Jul 2002 12:42:58 +0530
From: Manav Bhatia <manav@samsung.com>
Subject: draft-ietf-idr-bgp4-18.txt
To: idr@merit.edu
Reply-To: Manav Bhatia <manav@samsung.com>
Message-id: <006e01c23473$dfc9e870$b4036c6b@sisodomain.com>
MIME-version: 1.0
X-MIMEOLE: Produced By Microsoft MimeOLE V6.00.2600.0000
X-Mailer: Microsoft Outlook Express 6.00.2600.0000
Content-type: text/plain; charset=iso-8859-1
Content-transfer-encoding: 7BIT
X-Priority: 3
X-MSMail-priority: Normal
Sender: owner-idr@merit.edu
Precedence: bulk

sec 6.5 of the latest draft states - " If  system does not receive
successive KEEPALIVE and/or UPDATE and/or NOTIFICATION messages within the
period specified in the Hold Time field of the OPEN message, then the
NOTIFICATION message with   Hold Timer Expired Error Code must be sent and
the BGP connection closed"

IMHO we must update the draft to also include other BGP messages (Route
Refresh, Dynamic Capability, INFORM) which when received by the local
system act as surrogate for the KEEPALIVE messages it would expect from the
remote end. The local system upon receiving such messages must restart its
Hold Timer

Similarly the draft should mention that each time the local system sends
such BGP messages it should restart its KeepAlive Timer, unless the
negotiated HoldTime value is zero.

These changes can be reflected in the BGP draft or can be done in the
drafts introducing such BGP messages.

Regards,
Manav

----
"When you are courting a nice girl an hour seems like a second. When you
sit on a red-hot cinder a second seems like an hour. That's relativity."

-Albert Einstein, on relativity




Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id DAA20999 for <idr-archive@nic.merit.edu>; Fri, 26 Jul 2002 03:11:59 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id E3DC991247; Fri, 26 Jul 2002 03:11:30 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 8E9AC9124A; Fri, 26 Jul 2002 03:11:29 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 881E691247 for <idr@trapdoor.merit.edu>; Fri, 26 Jul 2002 03:11:26 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id D0D865DE36; Fri, 26 Jul 2002 03:11:25 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from mail1.hd.intel.com (hdfdns01.hd.intel.com [192.52.58.10]) by segue.merit.edu (Postfix) with ESMTP id 0FD9E5DD8C for <idr@merit.edu>; Fri, 26 Jul 2002 03:11:25 -0400 (EDT)
Received: from fmsmsxvs040.fm.intel.com (fmsmsxvs040.fm.intel.com [132.233.42.124]) by mail1.hd.intel.com (8.11.6/8.11.6/d: solo.mc,v 1.42 2002/05/23 22:21:11 root Exp $) with SMTP id g6Q7BLq04109 for <idr@merit.edu>; Fri, 26 Jul 2002 07:11:21 GMT
Received: from alpha.igk.intel.com ([172.28.168.68]) by fmsmsxvs040.fm.intel.com (NAVGW 2.5.2.11) with SMTP id M2002072600103117967 for <idr@merit.edu>; Fri, 26 Jul 2002 00:10:31 -0700
Received: by alpha.igk.intel.com with Internet Mail Service (5.5.2653.19) id <PD56R24Z>; Fri, 26 Jul 2002 09:07:43 +0200
Message-ID: <413FBB0BA5AED1119122000083234B1A030A99FA@alpha.igk.intel.com>
From: "Pusz, MateuszX" <MateuszX.Pusz@intel.com>
To: "'idr@merit.edu'" <idr@merit.edu>
Subject: CEASE problems
Date: Fri, 26 Jul 2002 09:07:42 +0200
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
Content-Type: text/plain; charset="iso-8859-2"
Sender: owner-idr@merit.edu
Precedence: bulk

Hello

I have one problem with CEASE message. 'draft-ietf-idr-bgp4-17' says in 6.7:
   In absence of any fatal errors (that are indicated in this section),
   a BGP peer may choose at any given time to close its BGP connection
   by sending the NOTIFICATION message with Error Code Cease. However,
   the Cease NOTIFICATION message must not be used when a fatal error
   indicated by this section does exist.

but in section 8-Established state:
   In response to a stop event initiated by an operator:
      - release all resources (including deleting all routes),
      - set ConnectRetryCnt to zero (0),
      - set connect retry timer to zero (0), and
      - transition to the Idle.

so how shuold it be? Should BGP router in such a situation send that
notification message or not?

Please help me

Thoughts from IDR community welcome.

Best regards


Mateusz Pusz


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id MAA22878 for <idr-archive@nic.merit.edu>; Thu, 25 Jul 2002 12:54:36 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 5C65B9131C; Thu, 25 Jul 2002 12:54:13 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 29A3B9131D; Thu, 25 Jul 2002 12:54:13 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 0C9209131C for <idr@trapdoor.merit.edu>; Thu, 25 Jul 2002 12:54:11 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id DAE715DF1E; Thu, 25 Jul 2002 12:54:11 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from presque.djinesys.com (dns.nexthop.com [64.211.218.216]) by segue.merit.edu (Postfix) with ESMTP id 5947E5DEA5 for <idr@merit.edu>; Thu, 25 Jul 2002 12:54:11 -0400 (EDT)
Received: (from root@localhost) by presque.djinesys.com (8.11.3/8.11.1) id g6PGrxC65520; Thu, 25 Jul 2002 12:53:59 -0400 (EDT) (envelope-from jhaas@jhaas.nexthop.com)
Received: from jhaas.nexthop.com (jhaas.nexthop.com [64.211.218.31]) by presque.djinesys.com (8.11.3/8.11.1) with ESMTP id g6PGru165512; Thu, 25 Jul 2002 12:53:56 -0400 (EDT) (envelope-from jhaas@jhaas.nexthop.com)
Received: (from jhaas@localhost) by jhaas.nexthop.com (8.11.3nb1/8.11.3) id g6PGru604165; Thu, 25 Jul 2002 12:53:56 -0400 (EDT)
Date: Thu, 25 Jul 2002 12:53:56 -0400
From: Jeffrey Haas <jhaas@nexthop.com>
To: srihari@procket.com, tappan@cisco.com, yakov@juniper.net
Cc: idr@merit.edu
Subject: Thought on non-transitive extended communities
Message-ID: <20020725125355.H3597@nexthop.com>
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
User-Agent: Mutt/1.2.5i
X-Virus-Scanned: by AMaViS perl-11
Sender: owner-idr@merit.edu
Precedence: bulk

It occurs to me that, for non-transitive extended communities,
we may want to discard them on INGRESS if the extended community
path attribute is marked with the PARTIAL bit.  This can mean
that we have received something that should never had made it to
us in the first place.

Admittedly, this also would hit the case where a given border
router in your AS doesn't understand it, marks it as partial
and sends to an internal peer that does understand it.

-- 
Jeff Haas 
NextHop Technologies


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id LAA19836 for <idr-archive@nic.merit.edu>; Thu, 25 Jul 2002 11:19:38 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id A65529131A; Thu, 25 Jul 2002 11:19:19 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 6D6C39131B; Thu, 25 Jul 2002 11:19:19 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 69FDF9131A for <idr@trapdoor.merit.edu>; Thu, 25 Jul 2002 11:19:18 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 56D175DF1E; Thu, 25 Jul 2002 11:19:18 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from mailgate.pit.comms.marconi.com (mailgate.pit.comms.marconi.com [169.144.68.6]) by segue.merit.edu (Postfix) with ESMTP id C87F45DEE1 for <idr@merit.edu>; Thu, 25 Jul 2002 11:19:17 -0400 (EDT)
Received: from mailman.pit.comms.marconi.com (mailman.pit.comms.marconi.com [169.144.2.12]) by mailgate.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id LAA11030; Thu, 25 Jul 2002 11:19:15 -0400 (EDT)
Received: from whq-msgrtr-01.pit.comms.marconi.com (whq-msgrtr-01.pit.comms.marconi.com [169.144.2.221]) by mailman.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id LAA17536; Thu, 25 Jul 2002 11:19:15 -0400 (EDT)
Received: by whq-msgrtr-01.pit.comms.marconi.com with Internet Mail Service (5.5.2650.21) id <3W99BZC3>; Thu, 25 Jul 2002 11:19:14 -0400
Message-ID: <39469E08BD83D411A3D900204840EC558227D9@vie-msgusr-01.dc.fore.com>
From: "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
To: "'Susan Hares'" <skh@nexthop.com>, "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
Cc: Kunihiro Ishiguro <kunihiro@zebra.org>, BGP mailing list <idr@merit.edu>
Subject: RE: TCP state machine doubts
Date: Thu, 25 Jul 2002 11:19:11 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2650.21)
Content-Type: multipart/alternative; boundary="----_=_NextPart_001_01C233EE.A0D609D0"
Sender: owner-idr@merit.edu
Precedence: bulk

This message is in MIME format. Since your mail reader does not understand
this format, some or all of this message may not be legible.

------_=_NextPart_001_01C233EE.A0D609D0
Content-Type: text/plain;
	charset="iso-8859-1"

Sue: 
 
 

   General Comments:
  


   2.      Do you have any assumptions on Multi-thread processing impacts to
the FSM?



If not, can you mention any operational assumptions?


The multi-thread processing issues would focus around the choice between 1
or 2 FSMs.   The state machine's event 22 and the call collision allows you
to do both.



No assumptions on the multi-threaded processing impacts to the FSM were
made. 

A two state versus 1 state model was allowed.  The single state model is
indicated in the places were TCP and con
  

Event 22 is merely a an open collision case. I think there is more to it
than just that. There are various phases that BGP goes through that one
might want to use multiple threads. The inter-locking or inter-working
between the threads could affect the state transitions. For example, Say you
are in Establish state and something happens to the session and it is
disconnected. In a single threaded environment it is easy, first come, first
served. In a multi-threaded case with thread priorities and interruptions,
it might get tricky at times. Some assumptions or expectations by the state
machine on actions it should take with regards to semaphore locks (RIB
Table, etc.), thread prioritization, processing, pre-emption and so forth
might be helpfull. Maybe these actions should only be suggestions not
required ways for implementations.

    I hope I have not opened a can of worms here?
 
 
Regards,
Ben


------_=_NextPart_001_01C233EE.A0D609D0
Content-Type: text/html;
	charset="iso-8859-1"

<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
<HTML><HEAD>
<META HTTP-EQUIV="Content-Type" CONTENT="text/html; charset=iso-8859-1">


<META content="MSHTML 5.00.2920.0" name=GENERATOR></HEAD>
<BODY>
<DIV><FONT color=#0000ff face=Arial size=2><SPAN 
class=475025414-25072002>Sue:</SPAN></FONT><FONT color=#0000ff face=Arial 
size=2><SPAN class=475025414-25072002> </SPAN></FONT></DIV>
<DIV><FONT color=#0000ff face=Arial size=2><SPAN 
class=475025414-25072002></SPAN></FONT>&nbsp;</DIV>
<DIV><FONT color=#0000ff face=Arial size=2><SPAN 
class=475025414-25072002>&nbsp;</SPAN></FONT></DIV>
<BLOCKQUOTE 
style="BORDER-LEFT: #0000ff 2px solid; MARGIN-LEFT: 5px; MARGIN-RIGHT: 0px; PADDING-LEFT: 5px">
  <DIV align=left class=OutlookMessageHeader dir=ltr><FONT color=#0000ff 
  face=Arial size=2><SPAN class=475025414-25072002>&nbsp; 
  &nbsp;</SPAN></FONT><FONT face="Courier New, Courier">General 
  Comments:<BR>&nbsp;</FONT> </DIV>
  <DL><FONT size=4><FONT color=#0000ff><FONT face=Arial>
    <DD></FONT><FONT size=2><SPAN class=475025414-25072002>&nbsp; 
    &nbsp;</SPAN></FONT></FONT></FONT><FONT face="Times New Roman, Times" 
    size=4>2.<X-TAB>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;</X-TAB></FONT><FONT 
    face="Courier New, Courier">Do you have any assumptions on Multi-thread 
    processing impacts to the FSM?<BR><BR></DD></DL>If not, can you mention any 
  operational assumptions?<BR>
  <DL>
    <DD>The multi-thread processing issues would focus around the choice between 
    1 or 2 FSMs.&nbsp;&nbsp; The state machine's event 22 and the call collision 
    allows you to do both.<BR><BR>
    <DD>No assumptions on the multi-threaded processing impacts to the FSM were 
    made. 
    <DD>A two state versus 1 state model was allowed.&nbsp; The single state 
    model is indicated in the places were TCP and con<BR></FONT><FONT 
    size=2><FONT color=#0000ff><FONT face=Arial><SPAN 
    class=475025414-25072002>&nbsp;</SPAN></FONT></FONT></FONT>
    <DD><FONT size=2><FONT color=#0000ff><FONT face=Arial><SPAN 
    class=475025414-25072002><FONT color=#000000><FONT size=3><FONT 
    face="Courier New">Event 22 is merely a an open collision case. I think 
    there is more to it than just that. There are various phases that BGP goes 
    through that&nbsp;one might want to use multiple threads. The inter-locking 
    or inter-working between the threads could affect the state transitions. For 
    example, Say you are in Establish state and something happens to the session 
    and it is disconnected. In a single threaded environment it is easy, first 
    come, first served. In a multi-threaded case with thread priorities and 
    interruptions, it might get tricky at times. Some assumptions or 
    expectations by the state machine on actions it should take with regards to 
    semaphore locks (RIB Table, etc.), thread prioritization, processing, 
    pre-emption and so forth might be helpfull. Maybe these actions should only 
    be suggestions not required ways&nbsp;for 
    implementations.</FONT></FONT></FONT></SPAN></FONT></FONT></FONT></DD></DL>
  <DIV><FONT face="Courier New"><SPAN 
  class=475025414-25072002>&nbsp;&nbsp;&nbsp; I hope I have not opened a can of 
  worms here?</SPAN></FONT></DIV>
  <DIV><FONT face="Courier New"><SPAN 
  class=475025414-25072002></SPAN></FONT>&nbsp;</DIV>
  <DIV><FONT face="Courier New"><SPAN 
  class=475025414-25072002></SPAN></FONT>&nbsp;</DIV>
  <DIV><FONT face="Courier New"><SPAN 
  class=475025414-25072002>Regards,</SPAN></FONT></DIV>
  <DIV><FONT face="Courier New"><SPAN 
  class=475025414-25072002>Ben</SPAN></FONT></DIV></BLOCKQUOTE></BODY></HTML>

------_=_NextPart_001_01C233EE.A0D609D0--


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id PAA11685 for <idr-archive@nic.merit.edu>; Wed, 24 Jul 2002 15:41:21 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id A0E6C91274; Wed, 24 Jul 2002 15:41:00 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 687869127A; Wed, 24 Jul 2002 15:41:00 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 4825191274 for <idr@trapdoor.merit.edu>; Wed, 24 Jul 2002 15:40:59 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 24EB75DEED; Wed, 24 Jul 2002 15:40:59 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from demiurge.exodus.net (demiurge.exodus.net [216.32.171.82]) by segue.merit.edu (Postfix) with ESMTP id 89D9B5DE60 for <idr@merit.edu>; Wed, 24 Jul 2002 15:40:58 -0400 (EDT)
Received: (from andrewl@localhost) by demiurge.exodus.net (8.9.3+Sun/8.9.3) id MAA25986; Wed, 24 Jul 2002 12:38:09 -0700 (PDT)
Date: Wed, 24 Jul 2002 12:38:08 -0700
From: andrewl@xix-w.bengi.exodus.net
To: Jeffrey Haas <jhaas@nexthop.com>
Cc: andrewl@exodus.net, idr@merit.edu
Subject: Re: BGP Extended Communities
Message-ID: <20020724123808.B25824@demiurge.exodus.net>
References: <20020716221414.B1256@demiurge.exodus.net> <20020724121803.D14987@nexthop.com>
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
User-Agent: Mutt/1.2.5i
In-Reply-To: <20020724121803.D14987@nexthop.com>; from jhaas@nexthop.com on Wed, Jul 24, 2002 at 12:18:03PM -0400
Sender: owner-idr@merit.edu
Precedence: bulk

Jeff,

Thanks for the input.  At the meeting in Yokohama, a number of other
people expressed your same concerns about deployment of the existing
2547-oriented extended communities draft.  The consensus was for me to
write up the proposal as a seperate BGP attribute.  I hope to get a 
draft out in the next couple weeks for the idr community to beat on.

Andrew

On Wed, Jul 24, 2002 at 12:18:03PM -0400, Jeffrey Haas wrote:
> Delivered-To: idr-outgoing@trapdoor.merit.edu
> Delivered-To: idr@trapdoor.merit.edu
> Delivered-To: idr@merit.edu
> Date: Wed, 24 Jul 2002 12:18:03 -0400
> From: Jeffrey Haas <jhaas@nexthop.com>
> To: andrewl@exodus.net
> Cc: idr@merit.edu, andrewl@xix-w.bengi.exodus.net
> Subject: Re: BGP Extended Communities
> User-Agent: Mutt/1.2.5i
> In-Reply-To: <20020716221414.B1256@demiurge.exodus.net>; from andrewl@exodus.net on Tue, Jul 16, 2002 at 10:14:14PM -0700
> X-Virus-Scanned: by AMaViS perl-11
> Precedence: bulk
> X-OriginalArrivalTime: 24 Jul 2002 16:19:49.0182 (UTC) FILETIME=[EEBEC5E0:01C2332D]
> 
> Andrew,
> 
> I've given this some time for others to respond.  Since I'm
> mostly hearing resounding silence, here's my take on the issue:
> 
> On Tue, Jul 16, 2002 at 10:14:14PM -0700, andrewl@exodus.net wrote:
> > I was going over the draft-ietf-idr-bgp-ext-communities-05.txt.  I
> > think we can make significant enhancements to the functionality,
> > efficency and flexibility of this draft.  If we step back and look
> > at defining a more generic solution which can address both
> > current and future needs we will be better served.
> 
> IMO:
> 1. The authors *could* have done a much better job at providing
>    a generic coloring mechnism for the routes.
> 2. Extended communities were originally deployed to solve a
>    specific problem - RFC 2547 vpns.  These don't deal with IPv6
>    either.
> 3. This extension is already widely deployed for the above.
> 
> While it would be nice to go back and redo things, IMO this is
> too widely deployed by too many big router vendors to make this
> change.
> 
> > - Support for IPv6.  If we are going to move to IPv6 at some point,
> > we should include support for it in the protocol extentions we define.
> > - More efficient packing of values in the data field.  A good example
> > of where this enhancement would be applicable is in the
> > draft-ietf-ptomaine-bgp-redistribution-00.txt document.
> > - Cleaner support for a broad range of future data field structures,
> > and interpretations.
> > - Support for locally defined community structures (types).
> > - Support for locally defined community sub-types.
> 
> What I would recommend is that IDR spend a little time enumerating
> what properties a generic route coloring mechanism should have.
> (The above is a good start.)
> Even if this has nothing done with it over the short term, it should
> be there for anyone who decides to do the "next greatest thing".
> 
> The thing that makes me queasy is that we can very easily go
> beyond TLVs and sub-TLVs into a mess that looks like BER.
> For all their deficiencies, (extended) communities are easy to
> parse in your code.  BER isn't.
> 
> 
> -- 
> Jeff Haas 
> NextHop Technologies


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id MAA04944 for <idr-archive@nic.merit.edu>; Wed, 24 Jul 2002 12:18:53 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id A254D912D2; Wed, 24 Jul 2002 12:18:23 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 88EE591282; Wed, 24 Jul 2002 12:18:11 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id E50989130D for <idr@trapdoor.merit.edu>; Wed, 24 Jul 2002 12:18:09 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id C92895DDA4; Wed, 24 Jul 2002 12:18:09 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from presque.djinesys.com (dns.nexthop.com [64.211.218.216]) by segue.merit.edu (Postfix) with ESMTP id 3FBCF5DE66 for <idr@merit.edu>; Wed, 24 Jul 2002 12:18:09 -0400 (EDT)
Received: (from root@localhost) by presque.djinesys.com (8.11.3/8.11.1) id g6OGI7521464; Wed, 24 Jul 2002 12:18:07 -0400 (EDT) (envelope-from jhaas@jhaas.nexthop.com)
Received: from jhaas.nexthop.com (jhaas.nexthop.com [64.211.218.31]) by presque.djinesys.com (8.11.3/8.11.1) with ESMTP id g6OGI3121457; Wed, 24 Jul 2002 12:18:03 -0400 (EDT) (envelope-from jhaas@jhaas.nexthop.com)
Received: (from jhaas@localhost) by jhaas.nexthop.com (8.11.3nb1/8.11.3) id g6OGI3D20742; Wed, 24 Jul 2002 12:18:03 -0400 (EDT)
Date: Wed, 24 Jul 2002 12:18:03 -0400
From: Jeffrey Haas <jhaas@nexthop.com>
To: andrewl@exodus.net
Cc: idr@merit.edu, andrewl@xix-w.bengi.exodus.net
Subject: Re: BGP Extended Communities
Message-ID: <20020724121803.D14987@nexthop.com>
References: <20020716221414.B1256@demiurge.exodus.net>
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
User-Agent: Mutt/1.2.5i
In-Reply-To: <20020716221414.B1256@demiurge.exodus.net>; from andrewl@exodus.net on Tue, Jul 16, 2002 at 10:14:14PM -0700
X-Virus-Scanned: by AMaViS perl-11
Sender: owner-idr@merit.edu
Precedence: bulk

Andrew,

I've given this some time for others to respond.  Since I'm
mostly hearing resounding silence, here's my take on the issue:

On Tue, Jul 16, 2002 at 10:14:14PM -0700, andrewl@exodus.net wrote:
> I was going over the draft-ietf-idr-bgp-ext-communities-05.txt.  I
> think we can make significant enhancements to the functionality,
> efficency and flexibility of this draft.  If we step back and look
> at defining a more generic solution which can address both
> current and future needs we will be better served.

IMO:
1. The authors *could* have done a much better job at providing
   a generic coloring mechnism for the routes.
2. Extended communities were originally deployed to solve a
   specific problem - RFC 2547 vpns.  These don't deal with IPv6
   either.
3. This extension is already widely deployed for the above.

While it would be nice to go back and redo things, IMO this is
too widely deployed by too many big router vendors to make this
change.

> - Support for IPv6.  If we are going to move to IPv6 at some point,
> we should include support for it in the protocol extentions we define.
> - More efficient packing of values in the data field.  A good example
> of where this enhancement would be applicable is in the
> draft-ietf-ptomaine-bgp-redistribution-00.txt document.
> - Cleaner support for a broad range of future data field structures,
> and interpretations.
> - Support for locally defined community structures (types).
> - Support for locally defined community sub-types.

What I would recommend is that IDR spend a little time enumerating
what properties a generic route coloring mechanism should have.
(The above is a good start.)
Even if this has nothing done with it over the short term, it should
be there for anyone who decides to do the "next greatest thing".

The thing that makes me queasy is that we can very easily go
beyond TLVs and sub-TLVs into a mess that looks like BER.
For all their deficiencies, (extended) communities are easy to
parse in your code.  BER isn't.


-- 
Jeff Haas 
NextHop Technologies


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id MAA04598 for <idr-archive@nic.merit.edu>; Wed, 24 Jul 2002 12:09:00 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 2EFC091270; Wed, 24 Jul 2002 12:08:37 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 03F1191282; Wed, 24 Jul 2002 12:08:36 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id DCA3B91270 for <idr@trapdoor.merit.edu>; Wed, 24 Jul 2002 12:08:35 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id C8C985DDA4; Wed, 24 Jul 2002 12:08:35 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from presque.djinesys.com (dns.nexthop.com [64.211.218.216]) by segue.merit.edu (Postfix) with ESMTP id 4519A5DE66 for <idr@merit.edu>; Wed, 24 Jul 2002 12:08:35 -0400 (EDT)
Received: (from root@localhost) by presque.djinesys.com (8.11.3/8.11.1) id g6OG8Ue21091; Wed, 24 Jul 2002 12:08:30 -0400 (EDT) (envelope-from jhaas@jhaas.nexthop.com)
Received: from jhaas.nexthop.com (jhaas.nexthop.com [64.211.218.31]) by presque.djinesys.com (8.11.3/8.11.1) with ESMTP id g6OG8P121083; Wed, 24 Jul 2002 12:08:25 -0400 (EDT) (envelope-from jhaas@jhaas.nexthop.com)
Received: (from jhaas@localhost) by jhaas.nexthop.com (8.11.3nb1/8.11.3) id g6OG8OT20721; Wed, 24 Jul 2002 12:08:24 -0400 (EDT)
Date: Wed, 24 Jul 2002 12:08:24 -0400
From: Jeffrey Haas <jhaas@nexthop.com>
To: vrishab sikand <v_sikand@yahoo.com>
Cc: lidefeng <lidefeng@huawei.com>, idr@merit.edu
Subject: Re: consult :  What is the FTP User ID and Password
Message-ID: <20020724120824.C14987@nexthop.com>
References: <000f01c231f8$8c12fbe0$22436e0a@HUAWEI.COM> <20020723122431.79415.qmail@web12804.mail.yahoo.com>
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
User-Agent: Mutt/1.2.5i
In-Reply-To: <20020723122431.79415.qmail@web12804.mail.yahoo.com>; from v_sikand@yahoo.com on Tue, Jul 23, 2002 at 05:24:31AM -0700
X-Virus-Scanned: by AMaViS perl-11
Sender: owner-idr@merit.edu
Precedence: bulk

On Tue, Jul 23, 2002 at 05:24:31AM -0700, vrishab sikand wrote:
> The archive has not been acessable for quite some time. Either this should be fixed or the link removed from IDR home page.

http://www.merit.edu/mail.archives/idr/
ftp://ftp.merit.edu/mail.archives/idr

Admittedly, their archiver has a y2k issue... :-)

-- 
Jeff Haas 
NextHop Technologies


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id LAA03950 for <idr-archive@nic.merit.edu>; Wed, 24 Jul 2002 11:54:24 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 5C07F91240; Wed, 24 Jul 2002 11:53:45 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 29B7991267; Wed, 24 Jul 2002 11:53:45 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 1178291240 for <idr@trapdoor.merit.edu>; Wed, 24 Jul 2002 11:53:44 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 005D05DE66; Wed, 24 Jul 2002 11:53:43 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from presque.djinesys.com (dns.nexthop.com [64.211.218.216]) by segue.merit.edu (Postfix) with ESMTP id 9A67F5DDA4 for <idr@merit.edu>; Wed, 24 Jul 2002 11:53:43 -0400 (EDT)
Received: (from root@localhost) by presque.djinesys.com (8.11.3/8.11.1) id g6OFrKk20429; Wed, 24 Jul 2002 11:53:20 -0400 (EDT) (envelope-from jhaas@jhaas.nexthop.com)
Received: from jhaas.nexthop.com (jhaas.nexthop.com [64.211.218.31]) by presque.djinesys.com (8.11.3/8.11.1) with ESMTP id g6OFrH120422; Wed, 24 Jul 2002 11:53:17 -0400 (EDT) (envelope-from jhaas@jhaas.nexthop.com)
Received: (from jhaas@localhost) by jhaas.nexthop.com (8.11.3nb1/8.11.3) id g6OFrGB20670; Wed, 24 Jul 2002 11:53:16 -0400 (EDT)
Date: Wed, 24 Jul 2002 11:53:16 -0400
From: Jeffrey Haas <jhaas@nexthop.com>
To: andrewl@xix-w.bengi.exodus.net
Cc: Pedro Roque Marques <roque@juniper.net>, idr@merit.edu
Subject: Re: inform (was: IDR WG minutes IETF 54)
Message-ID: <20020724115316.B14987@nexthop.com>
References: <p05111a00b95d0a37ad7b@[133.93.77.124]> <200207190147.g6J1la176882@roque-bsd.juniper.net> <20020722073852.B15842@demiurge.exodus.net>
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
User-Agent: Mutt/1.2.5i
In-Reply-To: <20020722073852.B15842@demiurge.exodus.net>; from andrewl@xix-w.bengi.exodus.net on Mon, Jul 22, 2002 at 07:38:52AM -0700
X-Virus-Scanned: by AMaViS perl-11
Sender: owner-idr@merit.edu
Precedence: bulk

On Mon, Jul 22, 2002 at 07:38:52AM -0700, andrewl@xix-w.bengi.exodus.net wrote:
> As an operator, I think that there are a number of useful places for an 
> INFORM message, as a mecahanism to pass information between BGP speakers
> that does not result in a session reset.

Pedro has convinced me that INFORM is a bad idea.  However, I still
have great sympathies for the operational issues.

In this light, I propose the following:
The bgp-mib-v2 is still only set in wet concrete (and is due for
the collected comments to be published).

How about if we add a counter/trap on a per-peer basis that reflects
conditions where the implementation is specifically ignoring routes
rather than dropping the peering session?  This will have to be 
worded carefully.

This will at least allow the router being "victimized" by the bad routes
to bring it to an operators attention.  The pushback mechanism
is no longer an INFORM message, rather its back to operators
giving the other operators a call.

> Andrew

-- 
Jeff Haas 
NextHop Technologies


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id KAA02048 for <idr-archive@nic.merit.edu>; Wed, 24 Jul 2002 10:56:21 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 6253591205; Wed, 24 Jul 2002 10:55:43 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 3851F91300; Wed, 24 Jul 2002 10:55:43 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 15C0391205 for <idr@trapdoor.merit.edu>; Wed, 24 Jul 2002 10:55:42 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id ED6C45DDA9; Wed, 24 Jul 2002 10:55:41 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from presque.djinesys.com (dns.nexthop.com [64.211.218.216]) by segue.merit.edu (Postfix) with ESMTP id 2D4695DDA4 for <idr@merit.edu>; Wed, 24 Jul 2002 10:55:41 -0400 (EDT)
Received: (from root@localhost) by presque.djinesys.com (8.11.3/8.11.1) id g6OEtL118323; Wed, 24 Jul 2002 10:55:21 -0400 (EDT) (envelope-from jhaas@jhaas.nexthop.com)
Received: from jhaas.nexthop.com (jhaas.nexthop.com [64.211.218.31]) by presque.djinesys.com (8.11.3/8.11.1) with ESMTP id g6OEtI118314; Wed, 24 Jul 2002 10:55:18 -0400 (EDT) (envelope-from jhaas@jhaas.nexthop.com)
Received: (from jhaas@localhost) by jhaas.nexthop.com (8.11.3nb1/8.11.3) id g6OEtHm15180; Wed, 24 Jul 2002 10:55:17 -0400 (EDT)
Date: Wed, 24 Jul 2002 10:55:17 -0400
From: Jeffrey Haas <jhaas@nexthop.com>
To: andrewl@exodus.net
Cc: idr@merit.edu
Subject: Re: limit the No. of routes received from a neighbor
Message-ID: <20020724105517.A14987@nexthop.com>
References: <8812A03F65CDD511AE98006008F5E871025D1792@hermes.hyperchip.com> <20020723154306.V4742@demiurge.exodus.net>
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
User-Agent: Mutt/1.2.5i
In-Reply-To: <20020723154306.V4742@demiurge.exodus.net>; from andrewl@exodus.net on Tue, Jul 23, 2002 at 03:43:06PM -0700
X-Virus-Scanned: by AMaViS perl-11
Sender: owner-idr@merit.edu
Precedence: bulk

On Tue, Jul 23, 2002 at 03:43:06PM -0700, andrewl@exodus.net wrote:
> If max-prefix is exceeded, tear down the session.  In this case there is
> no way to know which routes you should listen to and which ones you 
> shouldn't.

One *could* make an argument that one could discard any more-specific
routes from your adj-ribs-in.  This will maintain *some* reachability
for that peer.  However, one might want to flag the remaining
routes as being required and if they go away, *then* tear down
the peering session.

This is a lot more work and probably not worth it. :-)

> Andrew

-- 
Jeff Haas 
NextHop Technologies


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id KAA01808 for <idr-archive@nic.merit.edu>; Wed, 24 Jul 2002 10:48:24 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 5DF719127A; Wed, 24 Jul 2002 10:48:04 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 2FC3F91205; Wed, 24 Jul 2002 10:48:04 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 0083F9130A for <idr@trapdoor.merit.edu>; Wed, 24 Jul 2002 10:47:51 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id D55A25DDA9; Wed, 24 Jul 2002 10:47:51 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from ep_mailout1 (unknown [203.254.224.24]) by segue.merit.edu (Postfix) with ESMTP id 7A0D65DDA4 for <idr@merit.edu>; Wed, 24 Jul 2002 10:47:51 -0400 (EDT)
Received: from custom-daemon.mailout1.samsung.com by mailout1.samsung.com (iPlanet Messaging Server 5.1 (built Sep  5 2001)) id <0GZR00K01D7JM9@mailout1.samsung.com> for idr@merit.edu; Wed, 24 Jul 2002 23:50:07 +0900 (KST)
Received: from ep_mmp2 (localhost [127.0.0.1]) by mailout1.samsung.com (iPlanet Messaging Server 5.1 (built Sep  5 2001)) with ESMTP id <0GZR00K1OD7IJ8@mailout1.samsung.com> for idr@merit.edu; Wed, 24 Jul 2002 23:50:07 +0900 (KST)
Received: from Manav ([107.108.3.180]) by mmp2.samsung.com (iPlanet Messaging Server 5.1 (built Sep  5 2001)) with ESMTPA id <0GZR00J7MD7NSQ@mmp2.samsung.com> for idr@merit.edu; Wed, 24 Jul 2002 23:50:13 +0900 (KST)
Date: Wed, 24 Jul 2002 20:13:30 +0530
From: Manav Bhatia <manav@samsung.com>
Subject: Re: My BGP Route Update Pacing Draft
To: "Tsillas, Jim" <jtsillas@tenornetworks.com>
Cc: idr@merit.edu
Reply-To: Manav Bhatia <manav@samsung.com>
Message-id: <02d101c23320$7bd4d130$b4036c6b@sisodomain.com>
MIME-version: 1.0
X-MIMEOLE: Produced By Microsoft MimeOLE V6.00.2600.0000
X-Mailer: Microsoft Outlook Express 6.00.2600.0000
Content-type: text/plain; charset=iso-8859-1
Content-transfer-encoding: 7BIT
X-Priority: 3
X-MSMail-priority: Normal
References: <6B190B34070BD411ACA000B0D0214E560165B828@newman.tenornet.com>
Sender: owner-idr@merit.edu
Precedence: bulk

Jim,
All that i'm trying to say is that we don't need to put in any extra
features into BGP (atleast not as of now!) to achieve what all you
mentioned down below. Encompassing a slow peer will be automatically taken
care of by TCP.

- Manav
|
| the real challenge for an implementation is to be both efficient
| in terms of building updates and policy processing but at the same
| time to be accomodating of slow peers. If all peers ran at the
| same speed, an implementation could theoretically build a single
| update and retransmit this update to all the peers (I'm ignoring
| alot of detail here). With the above (efficient approach) when a
| single peer slows down enough to make TCP flow control, all update
| sending must wait on that slow peer. The alternative approach is
| to independently update all peers at their given rate: this has
| the drawback that potentially for every peer the update must be
| rebuilt for sending. A hybrid approach could use some local buffering
| (above the TCP buffering) to accomodate slow peers to some extent
| and (most of the time) allowing reuse of that update buffer.
| Another complicating aspect is that peers can often be grouped
| so that their out-policy is reused - it is beneficial to reuse
| updates for these groups.
|
| Seems like a topic ripe for some modelling.
|
| -Jim.
|
| -----Original Message-----
| From: Dennis Ferguson [mailto:dennis@juniper.net]
| Sent: Tuesday, July 23, 2002 6:36 PM
| To: Manav Bhatia
| Cc: Mareline Sheldon; idr@merit.edu
| Subject: Re: My BGP Route Update Pacing Draft
|
|
| Hello,
|
| > If you have some UPDATEs to announce and if you are holding them back
| > because of the congestion you perceive in the network and then if you
| > recieve a WITHDRAW for one of those same prefixes then you have to
first
| > advertise the UPDATEs because it is invalid to announce the same
address
| > prefix in the WITHDRAWN routes and the NLRI field of an UPDATE message.
| You
| > thus end up flushing all the BGP UPDATEs you had piled up waiting for
the
| > congestion to clear out at your remote end.
|
| Why would you do this?  If you receive a withdrawal for a route for which
| you haven't sent an UPDATE yet, why wouldn't you forget about the UPDATE
| and just send the withdrawal (or, if you've never advertised the route
| to the peer, just send nothing at all)?  If a route is flapping and
| goes up and down 62 times while your peer has you flow controlled, would
| you really send the 62 flaps when the peer allows you to send again?  Why
| wouldn't you just send the final result?
|
| You need to do routing protocol implementations in a way which avoids
| unbounded state, and doing this means that you shouldn't be recording
| all route transitions to play back to a peer which isn't listening at
| the moment.  If the only state you keep is what you advertised to the
| peer before and what the state of the route is now, then the amount
| of state you need to keep remains proportional to the number of routes
| you have and you only need to advertise the current state (if it doesn't
| match what the peer already knows, otherwise you send nothing) when
| the peer starts taking updates again.
|
| > The better solution is to let
| > TCP take care of the congestion internally as it has always been
|
| I don't think this has much to do with TCP, TCP flow control works
| the same way.
|
| Dennis Ferguson
|



Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id KAA01584 for <idr-archive@nic.merit.edu>; Wed, 24 Jul 2002 10:42:51 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id F0639912FC; Wed, 24 Jul 2002 10:42:21 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id C1D0391300; Wed, 24 Jul 2002 10:42:20 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 77277912FC for <idr@trapdoor.merit.edu>; Wed, 24 Jul 2002 10:42:19 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 59AA95DE54; Wed, 24 Jul 2002 10:42:19 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from tenornetworks.com (rtu.tenornetworks.com [63.77.213.2]) by segue.merit.edu (Postfix) with ESMTP id DBBE95DDB8 for <idr@merit.edu>; Wed, 24 Jul 2002 10:42:18 -0400 (EDT)
Received: from tenornet.com (newman [192.168.0.185]) by tenornetworks.com (Switch-2.1.3/Switch-2.1.0) with ESMTP id g6OEgGd19153; Wed, 24 Jul 2002 10:42:17 -0400 (EDT)
Received: from 192.168.0.185 by tenornet.com; WED, 24 Jul 2002 10:37:10 -0700
Received: by newman.tenornet.com with Internet Mail Service (5.5.2650.21) id <PDANKQZ6>; Wed, 24 Jul 2002 10:37:10 -0400
Message-ID: <6B190B34070BD411ACA000B0D0214E560165B828@newman.tenornet.com>
From: "Tsillas, Jim" <jtsillas@tenornetworks.com>
To: Manav Bhatia <manav@samsung.com>
Cc: Mareline Sheldon <marelines@yahoo.com>, idr@merit.edu
Subject: RE: My BGP Route Update Pacing Draft 
Date: Wed, 24 Jul 2002 10:37:03 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2650.21)
Content-Type: text/plain; charset="iso-8859-1"
Sender: owner-idr@merit.edu
Precedence: bulk

Manav,

the real challenge for an implementation is to be both efficient
in terms of building updates and policy processing but at the same
time to be accomodating of slow peers. If all peers ran at the
same speed, an implementation could theoretically build a single
update and retransmit this update to all the peers (I'm ignoring
alot of detail here). With the above (efficient approach) when a
single peer slows down enough to make TCP flow control, all update
sending must wait on that slow peer. The alternative approach is
to independently update all peers at their given rate: this has
the drawback that potentially for every peer the update must be
rebuilt for sending. A hybrid approach could use some local buffering
(above the TCP buffering) to accomodate slow peers to some extent
and (most of the time) allowing reuse of that update buffer.
Another complicating aspect is that peers can often be grouped
so that their out-policy is reused - it is beneficial to reuse
updates for these groups.

Seems like a topic ripe for some modelling.

-Jim.

-----Original Message-----
From: Dennis Ferguson [mailto:dennis@juniper.net]
Sent: Tuesday, July 23, 2002 6:36 PM
To: Manav Bhatia
Cc: Mareline Sheldon; idr@merit.edu
Subject: Re: My BGP Route Update Pacing Draft 


Hello,

> If you have some UPDATEs to announce and if you are holding them back
> because of the congestion you perceive in the network and then if you
> recieve a WITHDRAW for one of those same prefixes then you have to first
> advertise the UPDATEs because it is invalid to announce the same address
> prefix in the WITHDRAWN routes and the NLRI field of an UPDATE message.
You
> thus end up flushing all the BGP UPDATEs you had piled up waiting for the
> congestion to clear out at your remote end.

Why would you do this?  If you receive a withdrawal for a route for which
you haven't sent an UPDATE yet, why wouldn't you forget about the UPDATE
and just send the withdrawal (or, if you've never advertised the route
to the peer, just send nothing at all)?  If a route is flapping and
goes up and down 62 times while your peer has you flow controlled, would
you really send the 62 flaps when the peer allows you to send again?  Why
wouldn't you just send the final result?

You need to do routing protocol implementations in a way which avoids
unbounded state, and doing this means that you shouldn't be recording
all route transitions to play back to a peer which isn't listening at
the moment.  If the only state you keep is what you advertised to the
peer before and what the state of the route is now, then the amount
of state you need to keep remains proportional to the number of routes
you have and you only need to advertise the current state (if it doesn't
match what the peer already knows, otherwise you send nothing) when
the peer starts taking updates again.

> The better solution is to let
> TCP take care of the congestion internally as it has always been

I don't think this has much to do with TCP, TCP flow control works
the same way.

Dennis Ferguson



Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id KAA01153 for <idr-archive@nic.merit.edu>; Wed, 24 Jul 2002 10:29:18 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 13BCE912E3; Wed, 24 Jul 2002 10:28:55 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id D9406912FC; Wed, 24 Jul 2002 10:28:54 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id DDD2A912E3 for <idr@trapdoor.merit.edu>; Wed, 24 Jul 2002 10:28:53 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id C58BA5DE78; Wed, 24 Jul 2002 10:28:53 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from mailgate.pit.comms.marconi.com (mailgate.pit.comms.marconi.com [169.144.68.6]) by segue.merit.edu (Postfix) with ESMTP id 508D95DE54 for <idr@merit.edu>; Wed, 24 Jul 2002 10:28:53 -0400 (EDT)
Received: from mailman.pit.comms.marconi.com (mailman.pit.comms.marconi.com [169.144.2.12]) by mailgate.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id KAA00890; Wed, 24 Jul 2002 10:28:51 -0400 (EDT)
Received: from whq-msgrtr-01.pit.comms.marconi.com (whq-msgrtr-01.pit.comms.marconi.com [169.144.2.221]) by mailman.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id KAA21203; Wed, 24 Jul 2002 10:28:52 -0400 (EDT)
Received: by whq-msgrtr-01.pit.comms.marconi.com with Internet Mail Service (5.5.2650.21) id <3W99ACHM>; Wed, 24 Jul 2002 10:28:51 -0400
Message-ID: <39469E08BD83D411A3D900204840EC558227D4@vie-msgusr-01.dc.fore.com>
From: "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
To: "'Manav Bhatia'" <manav@samsung.com>, Dennis Ferguson <dennis@juniper.net>
Cc: idr@merit.edu
Subject: RE: My BGP Route Update Pacing Draft
Date: Wed, 24 Jul 2002 10:28:50 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2650.21)
Content-Type: text/plain; charset="iso-8859-1"
Sender: owner-idr@merit.edu
Precedence: bulk

I will agree the issue with keeping updates current might become stale when
flow control is activated, because each implementation might be different
and
thus may or may not clear up the updates before they leave the box. So the 
effect of historical replay will take place at the destination router. Even
if its physically wrong it will adjust and resolve the state to the current
levels. The problem comes in when the flow control is constantly engaged and
the sending and receiving routers start to drift out of phase from reality.
That is when we will see forwarding problems, due to inaccurate routes.

I will say again, it is the responsiblity of the sending router to send the
best and most current route information available to it, even if it means 
it has to go through some session queue and cleanup the updates, which I
am sure most implementations dont do. As for the TCP queue it is as good as
gone, but as other's have said that is only 8k bytes.

Ben

> -----Original Message-----
> From: Manav Bhatia [mailto:manav@samsung.com]
> Sent: Wednesday, July 24, 2002 12:07 AM
> To: Dennis Ferguson
> Cc: idr@merit.edu
> Subject: Re: My BGP Route Update Pacing Draft
> 
> 
> Hi Dennis,
> 
> | Why would you do this?  If you receive a withdrawal for a 
> route for which
> | you haven't sent an UPDATE yet, why wouldn't you forget 
> about the UPDATE
> | and just send the withdrawal (or, if you've never 
> advertised the route
> | to the peer, just send nothing at all)?  If a route is flapping and
> 
> Depends on whether your BGP UPDATE packets waiting to be 
> announced are in
> some transit queue above the TCP or lying below it. If you 
> have already
> given the UPDATE to the TCP then there isnt much that you can do upon
> receiving the WITHDRAW. If its the former case (above TCP) 
> then i have yet
> to see any implementation which scans through all the pending 
> BGP UPDATEs
> it has in its transmit queue upon receiving the withdraw to 
> remove the NLRI
> if it exists in the withdraw also OR scanning through all the WITHDRAW
> messages when it recieves any new NLRI to advertise.
> 
> Manav
> 


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id KAA00988 for <idr-archive@nic.merit.edu>; Wed, 24 Jul 2002 10:23:24 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id B5E4E912E1; Wed, 24 Jul 2002 10:23:04 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 85480912E3; Wed, 24 Jul 2002 10:23:04 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 7FE60912E1 for <idr@trapdoor.merit.edu>; Wed, 24 Jul 2002 10:23:03 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 692E35DE54; Wed, 24 Jul 2002 10:23:03 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from mailgate.pit.comms.marconi.com (mailgate.pit.comms.marconi.com [169.144.68.6]) by segue.merit.edu (Postfix) with ESMTP id E5BB45DDB8 for <idr@merit.edu>; Wed, 24 Jul 2002 10:23:02 -0400 (EDT)
Received: from mailman.pit.comms.marconi.com (mailman.pit.comms.marconi.com [169.144.2.12]) by mailgate.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id KAA00241; Wed, 24 Jul 2002 10:23:00 -0400 (EDT)
Received: from whq-msgrtr-01.pit.comms.marconi.com (whq-msgrtr-01.pit.comms.marconi.com [169.144.2.221]) by mailman.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id KAA19358; Wed, 24 Jul 2002 10:23:01 -0400 (EDT)
Received: by whq-msgrtr-01.pit.comms.marconi.com with Internet Mail Service (5.5.2650.21) id <3W99ABV8>; Wed, 24 Jul 2002 10:23:00 -0400
Message-ID: <39469E08BD83D411A3D900204840EC558227D3@vie-msgusr-01.dc.fore.com>
From: "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
To: "'vrishab sikand'" <v_sikand@yahoo.com>, Manav Bhatia <manav@samsung.com>, Dennis Ferguson <dennis@juniper.net>
Cc: idr@merit.edu
Subject: RE: My BGP Route Update Pacing Draft
Date: Wed, 24 Jul 2002 10:23:00 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2650.21)
Content-Type: multipart/alternative; boundary="----_=_NextPart_001_01C2331D.9D12D340"
Sender: owner-idr@merit.edu
Precedence: bulk

This message is in MIME format. Since your mail reader does not understand
this format, some or all of this message may not be legible.

------_=_NextPart_001_01C2331D.9D12D340
Content-Type: text/plain;
	charset="iso-8859-1"

Its 16K per session correct. If I have 500 sessions that is 8 million bytes.
 
Ben 

-----Original Message-----
From: vrishab sikand [mailto:v_sikand@yahoo.com]
Sent: Wednesday, July 24, 2002 12:59 AM
To: Manav Bhatia; Dennis Ferguson
Cc: idr@merit.edu
Subject: Re: My BGP Route Update Pacing Draft




  Manav Bhatia <manav@samsung.com> wrote: 


Hi Dennis,

| Why would you do this? If you receive a withdrawal for a route for which
| you haven't sent an UPDATE yet, why wouldn't you forget about the UPDATE
| and just send the withdrawal (or, if you've never advertised the route
| to the peer, just send nothing at all)? If a route is flapping and

Depends on whether your BGP UPDATE packets waiting to be announced are in
some transit queue above the TCP or lying below it. If you have already
given the UPDATE to the TCP then there isnt much that you can do upon
receiving the WITHDRAW. If its the former case (above TCP) then i have yet
to see any implementation which scans through all the pending BGP UPDATEs
it has in its transmit queue upon receiving the withdraw to remove the NLRI
if it exists in the withdraw also OR scanning through all the WITHDRAW
messages when it recieves any new NLRI to advertise.

Such  implemenations exist. and it does not require scanning any update
list. The route entry itself can have enough information in it to indicate
to which neighors it has been sent and to which neighors it has been queued
to be sent to. Off course once it is given to TCP it is as good as sent but
thats not a long list as TCP buffers are not VERY large @16K in most
implemenatations.

Manav





   _____  

Do You Yahoo!?
Yahoo! Health <http://health.yahoo.com/>  - Feel better, live better


------_=_NextPart_001_01C2331D.9D12D340
Content-Type: text/html;
	charset="iso-8859-1"

<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
<HTML><HEAD>
<META HTTP-EQUIV="Content-Type" CONTENT="text/html; charset=iso-8859-1">


<META content="MSHTML 5.00.2920.0" name=GENERATOR></HEAD>
<BODY>
<DIV><FONT color=#0000ff face=Arial size=2><SPAN class=302432314-24072002>Its 
16K per session correct. If I have 500 sessions that is&nbsp;8 million 
bytes.</SPAN></FONT></DIV>
<DIV><FONT color=#0000ff face=Arial size=2><SPAN 
class=302432314-24072002></SPAN></FONT>&nbsp;</DIV>
<DIV><FONT color=#0000ff face=Arial size=2><SPAN 
class=302432314-24072002>Ben&nbsp;</SPAN></FONT></DIV>
<BLOCKQUOTE 
style="BORDER-LEFT: #0000ff 2px solid; MARGIN-LEFT: 5px; PADDING-LEFT: 5px">
  <DIV align=left class=OutlookMessageHeader dir=ltr><FONT face=Tahoma 
  size=2>-----Original Message-----<BR><B>From:</B> vrishab sikand 
  [mailto:v_sikand@yahoo.com]<BR><B>Sent:</B> Wednesday, July 24, 2002 12:59 
  AM<BR><B>To:</B> Manav Bhatia; Dennis Ferguson<BR><B>Cc:</B> 
  idr@merit.edu<BR><B>Subject:</B> Re: My BGP Route Update Pacing 
  Draft<BR><BR></DIV></FONT>
  <P>
  <P>&nbsp; <B><I>Manav Bhatia &lt;manav@samsung.com&gt;</I></B> wrote: 
  <BLOCKQUOTE 
  style="BORDER-LEFT: #1010ff 2px solid; MARGIN-LEFT: 5px; PADDING-LEFT: 5px">
    <P>Hi Dennis,<BR><BR>| Why would you do this? If you receive a withdrawal 
    for a route for which<BR>| you haven't sent an UPDATE yet, why wouldn't you 
    forget about the UPDATE<BR>| and just send the withdrawal (or, if you've 
    never advertised the route<BR>| to the peer, just send nothing at all)? If a 
    route is flapping and<BR><BR>Depends on whether your BGP UPDATE packets 
    waiting to be announced are in<BR>some transit queue above the TCP or lying 
    below it. If you have already<BR>given the UPDATE to the TCP then there isnt 
    much that you can do upon<BR>receiving the WITHDRAW. If its the former case 
    (above TCP) then i have yet<BR>to see any implementation which scans through 
    all the pending BGP UPDATEs<BR>it has in its transmit queue upon receiving 
    the withdraw to remove the NLRI<BR>if it exists in the withdraw also OR 
    scanning through all the WITHDRAW<BR>messages when it recieves any new NLRI 
    to advertise.</P>
    <P>Such&nbsp; implemenations exist. and it does not require scanning any 
    update list. The route entry itself can have enough information in it to 
    indicate to which neighors it has been sent and to which neighors it has 
    been queued to be sent to. Off course once it is given to TCP it is as good 
    as sent but thats not a long list as TCP buffers are not VERY large @16K in 
    most implemenatations.<BR><BR>Manav<BR></P></BLOCKQUOTE>
  <P><BR>
  <HR SIZE=1>
  <B>Do You Yahoo!?</B><BR><A href="http://health.yahoo.com/">Yahoo! Health</A> 
  - Feel better, live better</BLOCKQUOTE></BODY></HTML>

------_=_NextPart_001_01C2331D.9D12D340--


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id KAA00828 for <idr-archive@nic.merit.edu>; Wed, 24 Jul 2002 10:18:17 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 88C82912F5; Wed, 24 Jul 2002 10:17:13 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 2934D91303; Wed, 24 Jul 2002 10:16:25 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id CE9BE912E1 for <idr@trapdoor.merit.edu>; Wed, 24 Jul 2002 10:16:22 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id AA66B5DF0A; Wed, 24 Jul 2002 10:16:22 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from mailgate.pit.comms.marconi.com (mailgate.pit.comms.marconi.com [169.144.68.6]) by segue.merit.edu (Postfix) with ESMTP id 18F2B5DE78 for <idr@merit.edu>; Wed, 24 Jul 2002 10:16:22 -0400 (EDT)
Received: from mailman.pit.comms.marconi.com (mailman.pit.comms.marconi.com [169.144.2.12]) by mailgate.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id KAA29483; Wed, 24 Jul 2002 10:16:19 -0400 (EDT)
Received: from whq-msgrtr-01.pit.comms.marconi.com (whq-msgrtr-01.pit.comms.marconi.com [169.144.2.221]) by mailman.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id KAA17751; Wed, 24 Jul 2002 10:16:21 -0400 (EDT)
Received: by whq-msgrtr-01.pit.comms.marconi.com with Internet Mail Service (5.5.2650.21) id <3W99ABJK>; Wed, 24 Jul 2002 10:16:20 -0400
Message-ID: <39469E08BD83D411A3D900204840EC558227D1@vie-msgusr-01.dc.fore.com>
From: "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
To: "'Dennis Ferguson'" <dennis@juniper.net>, Manav Bhatia <manav@samsung.com>
Cc: Mareline Sheldon <marelines@yahoo.com>, idr@merit.edu
Subject: RE: My BGP Route Update Pacing Draft 
Date: Wed, 24 Jul 2002 10:16:18 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2650.21)
Content-Type: text/plain; charset="iso-8859-1"
Sender: owner-idr@merit.edu
Precedence: bulk

Dennis:
  I agree we should not send updates that are flow controlled that have 
new changes imposed on them by new events in the network. Only send the 
current state of all routes. It is the responsibility of the sending
router to report current and accurate information to its peers.

As we can see flow control conditions are a messy subject to deal with 
and does often ripple back to the application to take new (current) 
actions.

Ben

> -----Original Message-----
> From: Dennis Ferguson [mailto:dennis@juniper.net]
> Sent: Tuesday, July 23, 2002 6:36 PM
> To: Manav Bhatia
> Cc: Mareline Sheldon; idr@merit.edu
> Subject: Re: My BGP Route Update Pacing Draft 
> 
> 
> Hello,
> 
> > If you have some UPDATEs to announce and if you are holding 
> them back
> > because of the congestion you perceive in the network and 
> then if you
> > recieve a WITHDRAW for one of those same prefixes then you 
> have to first
> > advertise the UPDATEs because it is invalid to announce the 
> same address
> > prefix in the WITHDRAWN routes and the NLRI field of an 
> UPDATE message. You
> > thus end up flushing all the BGP UPDATEs you had piled up 
> waiting for the
> > congestion to clear out at your remote end.
> 
> Why would you do this?  If you receive a withdrawal for a 
> route for which
> you haven't sent an UPDATE yet, why wouldn't you forget about 
> the UPDATE
> and just send the withdrawal (or, if you've never advertised the route
> to the peer, just send nothing at all)?  If a route is flapping and
> goes up and down 62 times while your peer has you flow 
> controlled, would
> you really send the 62 flaps when the peer allows you to send 
> again?  Why
> wouldn't you just send the final result?
> 
> You need to do routing protocol implementations in a way which avoids
> unbounded state, and doing this means that you shouldn't be recording
> all route transitions to play back to a peer which isn't listening at
> the moment.  If the only state you keep is what you advertised to the
> peer before and what the state of the route is now, then the amount
> of state you need to keep remains proportional to the number of routes
> you have and you only need to advertise the current state (if 
> it doesn't
> match what the peer already knows, otherwise you send nothing) when
> the peer starts taking updates again.
> 
> > The better solution is to let
> > TCP take care of the congestion internally as it has always been
> 
> I don't think this has much to do with TCP, TCP flow control works
> the same way.
> 
> Dennis Ferguson
> 


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id KAA00428 for <idr-archive@nic.merit.edu>; Wed, 24 Jul 2002 10:06:35 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id B6DC3912DE; Wed, 24 Jul 2002 10:06:04 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 7C111912E0; Wed, 24 Jul 2002 10:06:04 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 72675912DE for <idr@trapdoor.merit.edu>; Wed, 24 Jul 2002 10:06:03 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 55D5D5DE78; Wed, 24 Jul 2002 10:06:03 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from mailgate.pit.comms.marconi.com (mailgate.pit.comms.marconi.com [169.144.68.6]) by segue.merit.edu (Postfix) with ESMTP id D478B5DDB8 for <idr@merit.edu>; Wed, 24 Jul 2002 10:06:02 -0400 (EDT)
Received: from mailman.pit.comms.marconi.com (mailman.pit.comms.marconi.com [169.144.2.12]) by mailgate.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id KAA28377; Wed, 24 Jul 2002 10:06:00 -0400 (EDT)
Received: from whq-msgrtr-01.pit.comms.marconi.com (whq-msgrtr-01.pit.comms.marconi.com [169.144.2.221]) by mailman.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id KAA14901; Wed, 24 Jul 2002 10:06:01 -0400 (EDT)
Received: by whq-msgrtr-01.pit.comms.marconi.com with Internet Mail Service (5.5.2650.21) id <3W99AAPW>; Wed, 24 Jul 2002 10:05:59 -0400
Message-ID: <39469E08BD83D411A3D900204840EC558227CF@vie-msgusr-01.dc.fore.com>
From: "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
To: "'Dennis Ferguson'" <dennis@juniper.net>, "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
Cc: "'idr@merit.edu'" <idr@merit.edu>
Subject: RE: FW: My BGP Route Update Pacing Draft 
Date: Wed, 24 Jul 2002 10:05:56 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2650.21)
Content-Type: text/plain; charset="iso-8859-1"
Sender: owner-idr@merit.edu
Precedence: bulk

Dennis:
  I respect your knowledge and have read your books. You have said a
mouthfull
here. I have no ground to respond so I wont. Thanks for the long
explanation.
I hope you are right and I am wrong and we are not loosing a good idea.
Anyway
I have my thoughts and doubts about things, but that is a matter of opinion.

Thanks again,
Ben

> -----Original Message-----
> From: Dennis Ferguson [mailto:dennis@juniper.net]
> Sent: Tuesday, July 23, 2002 5:42 PM
> To: Abarbanel, Benjamin
> Cc: 'idr@merit.edu'
> Subject: Re: FW: My BGP Route Update Pacing Draft 
> 
> 
> Hello,
> 
> > > 
> > > Why not relly on the transport. Flow control is after all
> > > traditionally the function of the transport. What makes 
> you think it
> > > needs to be reinvented... and how is your version of flow control
> > > supperior to the current one ?
> > >
> [...]
> > The other end that is being flow controlled does not know why 
> > or by how much he
> > will be flow controlled off. Remember on a long session with 
> > many hops it takes significant time to communicate the flow 
> > control condition by ACKing or not ACKing messages. For me 
> > this is not a direct but indirect mechanism.
> > In the interim, the pipe will be full of data.
> 
> I think you are confused about this.  TCP flow control does 
> not work by
> withholding ACKs. TCP always ACKs messages that are received, and flow
> control is instead implemented by directly telling the other end
> not to send more.  The sender is informed of this condition as soon as
> it occurs, and will be informed that the condition has cleared in the
> time it takes for a packet to transit the path between the 
> peers (which
> is why it is probably good to have a TCP window of outstanding data
> at the receiver that it can process until the sender gets the
> message to send more, and noting that the receiver is in full control
> of just how much or little data that is).  It doesn't get much better
> than that.
> 
> Now a sender who has had all outstanding packets ACK'd but which
> is flow controlled off knows precisely why this has occurred; it
> has been sending updates faster than the receiver was willing to
> process them.  There is no ambiguity, this is the only way this
> situation could occur.
> 
> Therefore, out of everything you said above, the only thing I can
> see which is perfectly accurate is that the flow-controlled sender
> won't know how long it will be flow-controlled off.  This will depend
> on what other things the receiving router has to do which it finds
> more important than processing BGP messages from this particular
> peer.  It is hard to see how the receiver could know how long that
> is going to take until it actually does it, however, so it seems
> that telling the sender this would require the receiver to be able
> to predict the future (the sender knows all about the past already
> so there isn't much use having the receiver tell the sender about
> that).  I can't dispute that both prescient routers and 
> fortune tellers
> might be useful, but I'm not sure that either is a practical
> possibility in real life.
> 
> While I doubt the implementability of the future-prediction you
> seem to be looking for here, however, I think there is another
> more fundamental issue which says your approach is incorrect.  That
> is, TCP flow control already puts the congested receiver in full
> control of its load.  No one can send data faster than the congested
> receiver is willing to process it.  The congested receiver directly
> controls how much data is outstanding when flow control is asserted.
> The sender can't do anything to the receiver that the receiver doesn't
> explicitly permit.  With TCP flow control the congested receiver
> has full control of its resources.  If there is any problem at all
> here, then, it is that the sender has no control at all over the
> receiver's resources and hence may be inconvenienced by a congested
> receiver which chooses to take its time responding.  And, in fact,
> if we're talking about Capability messages in particular, it is of
> no advantage in general at all to the busy receiver to get this
> message any earlier than updates or anything else; it doesn't 
> even know
> the message is coming until it sees it.  Only the uncongested sender
> knows there is an outstanding capability message.
> 
> So if there is a problem here at all it is a problem for the
> idle sender, which might have to wait for a response to a Capability
> message, rather than the busy receiver.  Yet your proposed solution
> for this is to have the busy receiver do yet more work, continually
> predicting the future and sending Inform messages about it to all of
> its (possibly many) peers, on the off chance that a sender which
> might someday have a Capability message to send, and which has CPU
> cycles to burn, isn't inconvenienced.  Since it is the guy who has
> CPU cycles available which has the problem in the first place it seems
> utterly backwards to make the guy who has no CPU cycles free do more
> work to fix it.  The guy with the CPU cycles should be the one to fix
> this, i.e. the sender should spend his free cycles doing whatever it
> takes to wait until the congested guy gets around to responding.  Then
> everything works fine with no protocol changes.
> 
> > > In BGP a receiver can choose to accept a anyrate of updates 
> > it wants.
> > 
> > Yes, it can by reading or not reading the TCP socket queue. 
> > Which gets us
> > back to the original point I mentioned earlier. TCP FLOW 
> > Control is not
> > perfect.
> 
> Your original point seems to have been based on a faulty understanding
> of how TCP flow control works.
> 
> > > Please do offer more to substantiate this other than your 
> claim that
> > > the problem exists.
> > >
> >  
> > By the fact that the pacing control is more direct and implicit to a
> > particular
> > traffic type (in this case update messages). The pacing mechanism is
> > deterministic, whereas TCP flow control is not. You should 
> talk to people
> > who build IPC mechanisms to replace TCP about this issue. 
> 
> Or the guys that built deterministic FDDI or Token Ring or Token Bus
> or ATM LANs to replace non-deterministic Ethernets.  We have networks
> where non-deterministic stuff just happens despite our best efforts to
> avoid it, so you might be better off to write code which deals with
> that rather than depending on some other box to deterministically
> predict the future so you aren't inconvenienced.  While you 
> are correct
> that this would be easier for your box if other routers could predict
> their future for you with 100% reliability, I don't think this is
> likely to be possible in any case.
> 
> Dennis Ferguson
> 


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id KAA00378 for <idr-archive@nic.merit.edu>; Wed, 24 Jul 2002 10:05:02 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id B9563912DA; Wed, 24 Jul 2002 10:04:23 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 84A2D912DE; Wed, 24 Jul 2002 10:04:23 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id F13F4912DA for <idr@trapdoor.merit.edu>; Wed, 24 Jul 2002 10:04:21 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id D75C55DE81; Wed, 24 Jul 2002 10:04:21 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from web14106.mail.yahoo.com (web14106.mail.yahoo.com [216.136.172.136]) by segue.merit.edu (Postfix) with SMTP id 57DBA5DE78 for <idr@merit.edu>; Wed, 24 Jul 2002 10:04:21 -0400 (EDT)
Message-ID: <20020724140420.22990.qmail@web14106.mail.yahoo.com>
Received: from [47.234.0.51] by web14106.mail.yahoo.com via HTTP; Wed, 24 Jul 2002 07:04:20 PDT
Date: Wed, 24 Jul 2002 07:04:20 -0700 (PDT)
From: PamSri <pamsri01@yahoo.com>
Subject: Re:
To: Manav Bhatia <manav@samsung.com>, Benjamin.Abarbanel@Marconi.com, idr@merit.edu
In-Reply-To: <00af01c232c5$a53d4f80$b4036c6b@sisodomain.com>
MIME-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Sender: owner-idr@merit.edu
Precedence: bulk

Hi Manav,
        Per peer prefix limit has a direct impact on
the BGP RIBs. Regarding what happens when the limit is
hit is not what we're discussing. I dont think there
is a shortcoming/flaw in the protocol that could lead
to this situation but is merely a case of the
limitation of the router/system/implementation. If
reasonable peers and the routes handled by them are
configured (based on what the system/router can
handle)then it will not lead to the situation
described by Ben.

Regards,

sri   

--- Manav Bhatia <manav@samsung.com> wrote:
> Padma,
> As the name suggests peer prefix limit is per peer
> basis and is not
> applicable to the BGP RIB in whole. The peer can get
> congested even if it
> receives routes < prefix limit from multiple peers.
> What peer prefix limit
> will merely do is to tear down the session with the
> offending peer if the
> nos. of prefixes we receive from it goes beyond our
> default/configured
> acceptable value.
> 
> configuring and using the peer prefix limit is *not*
> at all a solution for
> peer congestion.
> 
> Manav
> 
> ----- Original Message -----
> From: "Padmaja kotamarthiC" <pamsri01@yahoo.com>
> To: <manav@samsung.com>;
> <Benjamin.Abarbanel@Marconi.com>; <idr@merit.edu>
> Sent: Tuesday, July 23, 2002 8:05 PM
> 
> 
> | Hi Manav,
> |         If your peer is congested for whatever
> reason
> | the simple solution is to: set the peer prefix
> limit
> | to the appropriate value to avoid the problem.
> |
> | sri
> |
> | Hi Ben,
> | I have some thoughts on the draft.
> |
> | The crux of your draft is that if a peer gets so
> | congested that it is not
> | able to send any UPDATE/KEEPALIVE message to the
> | remote uncongested peer
> | then the session can break down because of the
> Hold
> | Timer expiring and it
> | can be prevented by sending some pacing
> information
> | which will inform the
> | uncongested peer to limit its route advertisement
> | rate. The PACING
> | information will too be carried in the TCP
> packets, so
> | if the TCP is
> | implementing flow control and is not sending
> packets
> | to avoid congestion
> | then how will the PACING information be sent? And
> if
> | TCP can send the
> | PACING information then it can as well send the
> | KEEPALIVE packet which will
> | keep the HoldTimer from expiring at the other end.
> |
> | Secondly, if you are aware that your router is old
> and
> | might come in a
> | situation when it might not be able to handle the
> full
> | feed of BGP Updates
> | from the remote end then it can negotiate a
> HoldTime
> | of zero and keep the
> | session running even when it is in the congested
> | state.
> |
> | Thirdly, the very idea of artificially impeding
> | reachability information
> | (or sending of BGP UPDATE messages) doesn't really
> | appeal to me. I was
> | initially against the idea of using
> | MinRouteAdvertisementInternal etc
> | timers in my implementation arguing that a router
> | should pack off any
> | reachability information it has as soon as it
> comes to
> | know of it. But this
> | created a problem when i got say some 80K routes.
> I
> | would then be sending
> | 80K packets to my other peers! Finally , better
> sense
> | prevailed on me and i
> | started using the timers for efficiently packing
> my
> | NLRIs in the UPDATE
> | messages. This is one point where i hold my
> | reachability information from
> | my peers & I discourage it doing any other time.
> IMHO
> | i feel we should let
> | TCP take care of withholding any data it wants
> with
> | itself till the
> | congestion state rides by rather than us
> interfering
> | with it.
> |
> | What happens say when an uncongested router
> receives
> | pacing information
> | telling it to reduce the advertisement rate by
> some
> | 70%. In that case there
> | will be many UPDATEs lying with it remaining to be
> yet
> | announced to the
> | remote peer which is experiencing congestion. Now
> what
> | if the uncongested
> | network receives withdrawals (both implicit and
> | explicit) for certain
> | prefixes which have already been announced or
> remain
> | yet to be announced?
> | You open up a can of worms now !!
> |
> | You have to now flush all your output buffers
> | regardless of the congestion
> | state your remote peer lies in!
> |
> | Thus you end up doing what you always wanted to
> avoid.
> |
> | I think a cleaner and a much simpler approach
> would be
> | to dynamically
> | negotiate the value of the HoldTimer during the
> | session. If you feel you
> | are getting congested then increase the value and
> once
> | the red flags are
> | all down you can revert back to your original
> value or
> | decrease it back
> | again.
> |
> | Regards,
> | Manav
> |
> |
> |
> |
> |
> |
> |
> |
> |
> |
> |
> | ----- Original Message -----
> | From: "Abarbanel, Benjamin"
> | <Benjamin.Abarbanel@Marconi.com>
> | To: <idr@merit.edu>
> | Sent: Friday, July 19, 2002 1:01 AM
> | Subject: My BGP Route Update Pacing Draft
> |
> |
> | | Hi all:
> | |
> | |    Our recent discussions on this list and my
> recent
> | work
> | |  experiences have led me to write this draft and
> | offer it
> | |  to the IETF community. I would appreciate any
> | comments
> | |  anyone has to make.
> | |
> | |  Thanks in advance,
> | |  Ben
> | |
> | |
> | |
> | |
> |
> |
> |
> |
> |
> | __________________________________________________
> | Do You Yahoo!?
> | Yahoo! Health - Feel better, live better
> | http://health.yahoo.com
> 


__________________________________________________
Do You Yahoo!?
Yahoo! Health - Feel better, live better
http://health.yahoo.com


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id AAA12306 for <idr-archive@nic.merit.edu>; Wed, 24 Jul 2002 00:59:46 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 8A7BC912CE; Wed, 24 Jul 2002 00:59:18 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id EF1A2912CF; Wed, 24 Jul 2002 00:59:16 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id A67F8912CE for <idr@trapdoor.merit.edu>; Wed, 24 Jul 2002 00:59:15 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 8C1165DE7B; Wed, 24 Jul 2002 00:59:15 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from web12804.mail.yahoo.com (web12804.mail.yahoo.com [216.136.174.39]) by segue.merit.edu (Postfix) with SMTP id 0B11C5DDBE for <idr@merit.edu>; Wed, 24 Jul 2002 00:59:15 -0400 (EDT)
Message-ID: <20020724045914.8368.qmail@web12804.mail.yahoo.com>
Received: from [141.154.39.203] by web12804.mail.yahoo.com via HTTP; Tue, 23 Jul 2002 21:59:14 PDT
Date: Tue, 23 Jul 2002 21:59:14 -0700 (PDT)
From: vrishab sikand <v_sikand@yahoo.com>
Subject: Re: My BGP Route Update Pacing Draft
To: Manav Bhatia <manav@samsung.com>, Dennis Ferguson <dennis@juniper.net>
Cc: idr@merit.edu
In-Reply-To: <00e901c232c7$9765ff40$b4036c6b@sisodomain.com>
MIME-Version: 1.0
Content-Type: multipart/alternative; boundary="0-1930548566-1027486754=:6384"
Sender: owner-idr@merit.edu
Precedence: bulk

--0-1930548566-1027486754=:6384
Content-Type: text/plain; charset=us-ascii


 
  Manav Bhatia <manav@samsung.com> wrote: 
Hi Dennis,

| Why would you do this? If you receive a withdrawal for a route for which
| you haven't sent an UPDATE yet, why wouldn't you forget about the UPDATE
| and just send the withdrawal (or, if you've never advertised the route
| to the peer, just send nothing at all)? If a route is flapping and

Depends on whether your BGP UPDATE packets waiting to be announced are in
some transit queue above the TCP or lying below it. If you have already
given the UPDATE to the TCP then there isnt much that you can do upon
receiving the WITHDRAW. If its the former case (above TCP) then i have yet
to see any implementation which scans through all the pending BGP UPDATEs
it has in its transmit queue upon receiving the withdraw to remove the NLRI
if it exists in the withdraw also OR scanning through all the WITHDRAW
messages when it recieves any new NLRI to advertise.

Such  implemenations exist. and it does not require scanning any update list. The route entry itself can have enough information in it to indicate to which neighors it has been sent and to which neighors it has been queued to be sent to. Off course once it is given to TCP it is as good as sent but thats not a long list as TCP buffers are not VERY large @16K in most implemenatations.

Manav




---------------------------------
Do You Yahoo!?
Yahoo! Health - Feel better, live better
--0-1930548566-1027486754=:6384
Content-Type: text/html; charset=us-ascii

<P> 
<P>&nbsp; <B><I>Manav Bhatia &lt;manav@samsung.com&gt;</I></B> wrote: 
<BLOCKQUOTE style="PADDING-LEFT: 5px; MARGIN-LEFT: 5px; BORDER-LEFT: #1010ff 2px solid">
<P>Hi Dennis,<BR><BR>| Why would you do this? If you receive a withdrawal for a route for which<BR>| you haven't sent an UPDATE yet, why wouldn't you forget about the UPDATE<BR>| and just send the withdrawal (or, if you've never advertised the route<BR>| to the peer, just send nothing at all)? If a route is flapping and<BR><BR>Depends on whether your BGP UPDATE packets waiting to be announced are in<BR>some transit queue above the TCP or lying below it. If you have already<BR>given the UPDATE to the TCP then there isnt much that you can do upon<BR>receiving the WITHDRAW. If its the former case (above TCP) then i have yet<BR>to see any implementation which scans through all the pending BGP UPDATEs<BR>it has in its transmit queue upon receiving the withdraw to remove the NLRI<BR>if it exists in the withdraw also OR scanning through all the WITHDRAW<BR>messages when it recieves any new NLRI to advertise.</P>
<P>Such&nbsp; implemenations exist. and it does not require scanning any update list. The route entry itself can have enough information in it to indicate to which neighors it has been sent and to which neighors it has been queued to be sent to. Off course once it is given to TCP it is as good as sent but thats not a long list as TCP buffers are not VERY large @16K in most implemenatations.<BR><BR>Manav<BR></P></BLOCKQUOTE><p><br><hr size=1><b>Do You Yahoo!?</b><br>
<a href="http://health.yahoo.com/">Yahoo! Health</a> - Feel better, live better
--0-1930548566-1027486754=:6384--


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id AAA10771 for <idr-archive@nic.merit.edu>; Wed, 24 Jul 2002 00:12:00 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 9742891274; Wed, 24 Jul 2002 00:11:33 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 60D30912CE; Wed, 24 Jul 2002 00:11:33 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 1E02E91274 for <idr@trapdoor.merit.edu>; Wed, 24 Jul 2002 00:11:32 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id F00975DEEE; Wed, 24 Jul 2002 00:11:32 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from ep_mailout1 (unknown [203.254.224.24]) by segue.merit.edu (Postfix) with ESMTP id 99B625DEDE for <idr@merit.edu>; Wed, 24 Jul 2002 00:11:31 -0400 (EDT)
Received: from custom-daemon.mailout1.samsung.com by mailout1.samsung.com (iPlanet Messaging Server 5.1 (built Sep  5 2001)) id <0GZQ00J01JQYQ4@mailout1.samsung.com> for idr@merit.edu; Wed, 24 Jul 2002 13:13:46 +0900 (KST)
Received: from ep_mmp2 (localhost [127.0.0.1]) by mailout1.samsung.com (iPlanet Messaging Server 5.1 (built Sep  5 2001)) with ESMTP id <0GZQ00JA6JQXBG@mailout1.samsung.com> for idr@merit.edu; Wed, 24 Jul 2002 13:13:46 +0900 (KST)
Received: from Manav ([107.108.3.180]) by mmp2.samsung.com (iPlanet Messaging Server 5.1 (built Sep  5 2001)) with ESMTPA id <0GZQ000C5JR1DF@mmp2.samsung.com> for idr@merit.edu; Wed, 24 Jul 2002 13:13:52 +0900 (KST)
Date: Wed, 24 Jul 2002 09:37:11 +0530
From: Manav Bhatia <manav@samsung.com>
Subject: Re: My BGP Route Update Pacing Draft
To: Dennis Ferguson <dennis@juniper.net>
Cc: idr@merit.edu
Reply-To: Manav Bhatia <manav@samsung.com>
Message-id: <00e901c232c7$9765ff40$b4036c6b@sisodomain.com>
MIME-version: 1.0
X-MIMEOLE: Produced By Microsoft MimeOLE V6.00.2600.0000
X-Mailer: Microsoft Outlook Express 6.00.2600.0000
Content-type: text/plain; charset=iso-8859-1
Content-transfer-encoding: 7BIT
X-Priority: 3
X-MSMail-priority: Normal
References: <200207232235.g6NMZhm58016@merlot.juniper.net>
Sender: owner-idr@merit.edu
Precedence: bulk

Hi Dennis,

| Why would you do this?  If you receive a withdrawal for a route for which
| you haven't sent an UPDATE yet, why wouldn't you forget about the UPDATE
| and just send the withdrawal (or, if you've never advertised the route
| to the peer, just send nothing at all)?  If a route is flapping and

Depends on whether your BGP UPDATE packets waiting to be announced are in
some transit queue above the TCP or lying below it. If you have already
given the UPDATE to the TCP then there isnt much that you can do upon
receiving the WITHDRAW. If its the former case (above TCP) then i have yet
to see any implementation which scans through all the pending BGP UPDATEs
it has in its transmit queue upon receiving the withdraw to remove the NLRI
if it exists in the withdraw also OR scanning through all the WITHDRAW
messages when it recieves any new NLRI to advertise.

Manav



Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id XAA10249 for <idr-archive@nic.merit.edu>; Tue, 23 Jul 2002 23:58:18 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 2139391273; Tue, 23 Jul 2002 23:57:41 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id D8E65912CE; Tue, 23 Jul 2002 23:57:40 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 85C9591273 for <idr@trapdoor.merit.edu>; Tue, 23 Jul 2002 23:57:39 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 63B605DDF0; Tue, 23 Jul 2002 23:57:39 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from ep_mailout1 (unknown [203.254.224.24]) by segue.merit.edu (Postfix) with ESMTP id 0AE6F5DDBE for <idr@merit.edu>; Tue, 23 Jul 2002 23:57:39 -0400 (EDT)
Received: from custom-daemon.mailout1.samsung.com by mailout1.samsung.com (iPlanet Messaging Server 5.1 (built Sep  5 2001)) id <0GZQ00J01J3T4M@mailout1.samsung.com> for idr@merit.edu; Wed, 24 Jul 2002 12:59:53 +0900 (KST)
Received: from ep_mmp2 (localhost [127.0.0.1]) by mailout1.samsung.com (iPlanet Messaging Server 5.1 (built Sep  5 2001)) with ESMTP id <0GZQ00I8VJ3TQ6@mailout1.samsung.com> for idr@merit.edu; Wed, 24 Jul 2002 12:59:53 +0900 (KST)
Received: from Manav ([107.108.3.180]) by mmp2.samsung.com (iPlanet Messaging Server 5.1 (built Sep  5 2001)) with ESMTPA id <0GZQ0009AJ3U1C@mmp2.samsung.com> for idr@merit.edu; Wed, 24 Jul 2002 12:59:56 +0900 (KST)
Date: Wed, 24 Jul 2002 09:23:16 +0530
From: Manav Bhatia <manav@samsung.com>
Subject: Re:
To: Padmaja kotamarthiC <pamsri01@yahoo.com>, Benjamin.Abarbanel@Marconi.com, idr@merit.edu
Reply-To: Manav Bhatia <manav@samsung.com>
Message-id: <00af01c232c5$a53d4f80$b4036c6b@sisodomain.com>
MIME-version: 1.0
X-MIMEOLE: Produced By Microsoft MimeOLE V6.00.2600.0000
X-Mailer: Microsoft Outlook Express 6.00.2600.0000
Content-type: text/plain; charset=iso-8859-1
Content-transfer-encoding: 7BIT
X-Priority: 3
X-MSMail-priority: Normal
References: <20020723143524.45648.qmail@web14106.mail.yahoo.com>
Sender: owner-idr@merit.edu
Precedence: bulk

Padma,
As the name suggests peer prefix limit is per peer basis and is not
applicable to the BGP RIB in whole. The peer can get congested even if it
receives routes < prefix limit from multiple peers. What peer prefix limit
will merely do is to tear down the session with the offending peer if the
nos. of prefixes we receive from it goes beyond our default/configured
acceptable value.

configuring and using the peer prefix limit is *not* at all a solution for
peer congestion.

Manav

----- Original Message -----
From: "Padmaja kotamarthiC" <pamsri01@yahoo.com>
To: <manav@samsung.com>; <Benjamin.Abarbanel@Marconi.com>; <idr@merit.edu>
Sent: Tuesday, July 23, 2002 8:05 PM


| Hi Manav,
|         If your peer is congested for whatever reason
| the simple solution is to: set the peer prefix limit
| to the appropriate value to avoid the problem.
|
| sri
|
| Hi Ben,
| I have some thoughts on the draft.
|
| The crux of your draft is that if a peer gets so
| congested that it is not
| able to send any UPDATE/KEEPALIVE message to the
| remote uncongested peer
| then the session can break down because of the Hold
| Timer expiring and it
| can be prevented by sending some pacing information
| which will inform the
| uncongested peer to limit its route advertisement
| rate. The PACING
| information will too be carried in the TCP packets, so
| if the TCP is
| implementing flow control and is not sending packets
| to avoid congestion
| then how will the PACING information be sent? And if
| TCP can send the
| PACING information then it can as well send the
| KEEPALIVE packet which will
| keep the HoldTimer from expiring at the other end.
|
| Secondly, if you are aware that your router is old and
| might come in a
| situation when it might not be able to handle the full
| feed of BGP Updates
| from the remote end then it can negotiate a HoldTime
| of zero and keep the
| session running even when it is in the congested
| state.
|
| Thirdly, the very idea of artificially impeding
| reachability information
| (or sending of BGP UPDATE messages) doesn't really
| appeal to me. I was
| initially against the idea of using
| MinRouteAdvertisementInternal etc
| timers in my implementation arguing that a router
| should pack off any
| reachability information it has as soon as it comes to
| know of it. But this
| created a problem when i got say some 80K routes. I
| would then be sending
| 80K packets to my other peers! Finally , better sense
| prevailed on me and i
| started using the timers for efficiently packing my
| NLRIs in the UPDATE
| messages. This is one point where i hold my
| reachability information from
| my peers & I discourage it doing any other time. IMHO
| i feel we should let
| TCP take care of withholding any data it wants with
| itself till the
| congestion state rides by rather than us interfering
| with it.
|
| What happens say when an uncongested router receives
| pacing information
| telling it to reduce the advertisement rate by some
| 70%. In that case there
| will be many UPDATEs lying with it remaining to be yet
| announced to the
| remote peer which is experiencing congestion. Now what
| if the uncongested
| network receives withdrawals (both implicit and
| explicit) for certain
| prefixes which have already been announced or remain
| yet to be announced?
| You open up a can of worms now !!
|
| You have to now flush all your output buffers
| regardless of the congestion
| state your remote peer lies in!
|
| Thus you end up doing what you always wanted to avoid.
|
| I think a cleaner and a much simpler approach would be
| to dynamically
| negotiate the value of the HoldTimer during the
| session. If you feel you
| are getting congested then increase the value and once
| the red flags are
| all down you can revert back to your original value or
| decrease it back
| again.
|
| Regards,
| Manav
|
|
|
|
|
|
|
|
|
|
|
| ----- Original Message -----
| From: "Abarbanel, Benjamin"
| <Benjamin.Abarbanel@Marconi.com>
| To: <idr@merit.edu>
| Sent: Friday, July 19, 2002 1:01 AM
| Subject: My BGP Route Update Pacing Draft
|
|
| | Hi all:
| |
| |    Our recent discussions on this list and my recent
| work
| |  experiences have led me to write this draft and
| offer it
| |  to the IETF community. I would appreciate any
| comments
| |  anyone has to make.
| |
| |  Thanks in advance,
| |  Ben
| |
| |
| |
| |
|
|
|
|
|
| __________________________________________________
| Do You Yahoo!?
| Yahoo! Health - Feel better, live better
| http://health.yahoo.com



Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id XAA09948 for <idr-archive@nic.merit.edu>; Tue, 23 Jul 2002 23:49:45 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id A4ED591271; Tue, 23 Jul 2002 23:49:09 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 708DC91272; Tue, 23 Jul 2002 23:49:09 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 2FBE091271 for <idr@trapdoor.merit.edu>; Tue, 23 Jul 2002 23:49:08 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 0ED785DDF0; Tue, 23 Jul 2002 23:49:08 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from ep_mailout1 (unknown [203.254.224.24]) by segue.merit.edu (Postfix) with ESMTP id ACDAF5DDBE for <idr@merit.edu>; Tue, 23 Jul 2002 23:49:07 -0400 (EDT)
Received: from custom-daemon.mailout1.samsung.com by mailout1.samsung.com (iPlanet Messaging Server 5.1 (built Sep  5 2001)) id <0GZQ00I01IPLVV@mailout1.samsung.com> for idr@merit.edu; Wed, 24 Jul 2002 12:51:21 +0900 (KST)
Received: from ep_mmp2 (localhost [127.0.0.1]) by mailout1.samsung.com (iPlanet Messaging Server 5.1 (built Sep  5 2001)) with ESMTP id <0GZQ00I3JIPKPI@mailout1.samsung.com> for idr@merit.edu; Wed, 24 Jul 2002 12:51:21 +0900 (KST)
Received: from Manav ([107.108.3.180]) by mmp2.samsung.com (iPlanet Messaging Server 5.1 (built Sep  5 2001)) with ESMTPA id <0GZQ0000JIPPDF@mmp2.samsung.com> for idr@merit.edu; Wed, 24 Jul 2002 12:51:27 +0900 (KST)
Date: Wed, 24 Jul 2002 09:14:47 +0530
From: Manav Bhatia <manav@samsung.com>
Subject: Re: My BGP Route Update Pacing Draft
To: "M. ELK" <elkou141061@hotmail.com>
Cc: idr@merit.edu
Reply-To: Manav Bhatia <manav@samsung.com>
Message-id: <008f01c232c4$75d83e40$b4036c6b@sisodomain.com>
MIME-version: 1.0
X-MIMEOLE: Produced By Microsoft MimeOLE V6.00.2600.0000
X-Mailer: Microsoft Outlook Express 6.00.2600.0000
Content-type: text/plain; charset=iso-8859-1
Content-transfer-encoding: 7BIT
X-Priority: 3
X-MSMail-priority: Normal
References: <F19Zs7fJlzg6MsDt01200020f78@hotmail.com>
Sender: owner-idr@merit.edu
Precedence: bulk

Elk,

| for case 1 :
|
| Could the graceful restart mechanism be a solution to this pblm .
|
| Say R1 is peering with N peer . R1 is under stress
|
| R1 select to close 1..m  TCP session with peer 1 to m , deal with the
remaining  session (m+1 ...N) and clear his congestion (now R1 have
|
| enough HP ) . R1 initiae a session to peer 1 ...m with restart bit set
and f bit set  .

Few comments follow:

- How will R1 know in advance the time it may take to come out of the
congested state? Because R1 has to advertise some "Restart time" as the
worst case time during the startup. And since the Restart Time < HoldTime
carried in the OPEN message you are not left with much time to clear your
congestion !
- On what basis does R1 choose peers 1 .. m to close its TCP session with?
Why not m+1 .. N OR m+1 .. p where p<=N for that matter ?
- Closing the session with 1 .. m because R1 cant take all that load isnt
really a bright idea. Moreover what if the load doesnt go away and the
congestion doesnt clear with time?

|  Here we assume that R1 could sort his stress within a restart time
value.

on what basis? do you have any statistics/reports/studies which state that?

Manav





Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id XAA08977 for <idr-archive@nic.merit.edu>; Tue, 23 Jul 2002 23:18:35 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 20D689126E; Tue, 23 Jul 2002 23:16:59 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id DA8AD9126F; Tue, 23 Jul 2002 23:16:58 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 95A4B9126E for <idr@trapdoor.merit.edu>; Tue, 23 Jul 2002 23:16:57 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 659965DE5B; Tue, 23 Jul 2002 23:16:57 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from ep_mailout1 (unknown [203.254.224.24]) by segue.merit.edu (Postfix) with ESMTP id 402375DDBE for <idr@merit.edu>; Tue, 23 Jul 2002 23:16:55 -0400 (EDT)
Received: from custom-daemon.mailout1.samsung.com by mailout1.samsung.com (iPlanet Messaging Server 5.1 (built Sep  5 2001)) id <0GZQ00H01H7WPS@mailout1.samsung.com> for idr@merit.edu; Wed, 24 Jul 2002 12:19:08 +0900 (KST)
Received: from ep_mmp2 (localhost [127.0.0.1]) by mailout1.samsung.com (iPlanet Messaging Server 5.1 (built Sep  5 2001)) with ESMTP id <0GZQ00H7ZH7WAB@mailout1.samsung.com> for idr@merit.edu; Wed, 24 Jul 2002 12:19:08 +0900 (KST)
Received: from Manav ([107.108.3.180]) by mmp2.samsung.com (iPlanet Messaging Server 5.1 (built Sep  5 2001)) with ESMTPA id <0GZQ00M34H808B@mmp2.samsung.com> for idr@merit.edu; Wed, 24 Jul 2002 12:19:14 +0900 (KST)
Date: Wed, 24 Jul 2002 08:42:34 +0530
From: Manav Bhatia <manav@samsung.com>
Subject: Re: My BGP Route Update Pacing Draft
To: "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>, idr@merit.edu
Reply-To: Manav Bhatia <manav@samsung.com>
Message-id: <006d01c232bf$f5f4aaf0$b4036c6b@sisodomain.com>
MIME-version: 1.0
X-MIMEOLE: Produced By Microsoft MimeOLE V6.00.2600.0000
X-Mailer: Microsoft Outlook Express 6.00.2600.0000
Content-type: text/plain; charset=iso-8859-1
Content-transfer-encoding: 7BIT
X-Priority: 3
X-MSMail-priority: Normal
References: <39469E08BD83D411A3D900204840EC558227C3@vie-msgusr-01.dc.fore.com>
Sender: owner-idr@merit.edu
Precedence: bulk

Ben,

| > I think a cleaner and a much simpler approach would be to dynamically
| > negotiate the value of the HoldTimer during the session. If
| > you feel you
| > are getting congested then increase the value and once the
| > red flags are
| > all down you can revert back to your original value or
| > decrease it back
| > again.
| >
| Changing the hold timer does not reduce the congestion imposed on the
| current
| router by its peers. It only extends the time it takes for the session to
| disconnect.

which is exactly what i wanted to say .. you mentioned that the uncongested
peer is unable to receive KEEPALIVEs and UPDATEs from the remote end and it
brings down the session .. "The mistake is made by the uncongested peer
since it does not see any Keepalive/Update messages before its Hold timer
expires and drops the session thereby dropping all its associated routes."

I was merely suggesting an alternative to avoid this "mistake" by
dynamically suggesting a way to increase the time it takes to disconnect
the session.

Manav








Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id SAA00147 for <idr-archive@nic.merit.edu>; Tue, 23 Jul 2002 18:46:45 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 39047912CC; Tue, 23 Jul 2002 18:45:59 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id EA475912CF; Tue, 23 Jul 2002 18:45:58 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id E11DB912CC for <idr@trapdoor.merit.edu>; Tue, 23 Jul 2002 18:45:56 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id C3B035DDC4; Tue, 23 Jul 2002 18:45:56 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from demiurge.exodus.net (demiurge.exodus.net [216.32.171.82]) by segue.merit.edu (Postfix) with ESMTP id 63B705DD8D for <idr@merit.edu>; Tue, 23 Jul 2002 18:45:56 -0400 (EDT)
Received: (from andrewl@localhost) by demiurge.exodus.net (8.9.3+Sun/8.9.3) id PAA09418; Tue, 23 Jul 2002 15:43:06 -0700 (PDT)
Date: Tue, 23 Jul 2002 15:43:06 -0700
From: andrewl@exodus.net
To: Tian Fang <tfang@hyperchip.com>
Cc: idr@merit.edu
Subject: Re: limit the No. of routes received from a neighbor
Message-ID: <20020723154306.V4742@demiurge.exodus.net>
References: <8812A03F65CDD511AE98006008F5E871025D1792@hermes.hyperchip.com>
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
User-Agent: Mutt/1.2.5i
In-Reply-To: <8812A03F65CDD511AE98006008F5E871025D1792@hermes.hyperchip.com>; from tfang@hyperchip.com on Tue, Jul 23, 2002 at 06:11:47PM -0400
Sender: owner-idr@merit.edu
Precedence: bulk

If max-prefix is exceeded, tear down the session.  In this case there is
no way to know which routes you should listen to and which ones you 
shouldn't.  Tearing down the session will force human intervention who
can either fix the problem, if max-prefix was exceeded accidently, or
request that the limit be raised if max-prefix was triggered by normal
growth.

Andrew

On Tue, Jul 23, 2002 at 06:11:47PM -0400, Tian Fang wrote:
> Delivered-To: idr-outgoing@trapdoor.merit.edu
> Delivered-To: idr@trapdoor.merit.edu
> Delivered-To: idr@merit.edu
> From: Tian Fang <tfang@hyperchip.com>
> To: idr@merit.edu
> Subject: limit the No. of routes received from a neighbor
> Date: Tue, 23 Jul 2002 18:11:47 -0400
> X-Mailer: Internet Mail Service (5.5.2653.19)
> Precedence: bulk
> X-OriginalArrivalTime: 23 Jul 2002 22:14:42.0869 (UTC) FILETIME=[585BCE50:01C23296]
> 
> Hi all,
> 
> When the No. of routes received from a neighbor exceeds the configured
> limit, is it better to tear down the session or drop the routes received
> later on? If dropping routes is a option, is there any preference that we
> can use to choose the routes to be dropped? Thx.
> 
> Regards,
> Tian Fang



Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id SAA29855 for <idr-archive@nic.merit.edu>; Tue, 23 Jul 2002 18:36:26 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 35E0F912CB; Tue, 23 Jul 2002 18:35:47 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 03A44912CC; Tue, 23 Jul 2002 18:35:46 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 05736912CB for <idr@trapdoor.merit.edu>; Tue, 23 Jul 2002 18:35:45 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id CE0A45DDEC; Tue, 23 Jul 2002 18:35:45 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from merlot.juniper.net (natint.juniper.net [207.17.136.129]) by segue.merit.edu (Postfix) with ESMTP id 7964C5DD8D for <idr@merit.edu>; Tue, 23 Jul 2002 18:35:45 -0400 (EDT)
Received: from juniper.net (wawa.juniper.net [172.17.20.55]) by merlot.juniper.net (8.11.3/8.11.3) with ESMTP id g6NMZhm58016; Tue, 23 Jul 2002 15:35:43 -0700 (PDT) (envelope-from dennis@juniper.net)
Message-Id: <200207232235.g6NMZhm58016@merlot.juniper.net>
X-Mailer: exmh version 2.0.2 2/24/98
To: Manav Bhatia <manav@samsung.com>
Cc: Mareline Sheldon <marelines@yahoo.com>, idr@merit.edu
Subject: Re: My BGP Route Update Pacing Draft 
In-reply-to: Your message of "Tue, 23 Jul 2002 17:31:55 +0530." <008601c23240$bec54410$b4036c6b@sisodomain.com> 
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Date: Tue, 23 Jul 2002 15:35:42 -0700
From: Dennis Ferguson <dennis@juniper.net>
Sender: owner-idr@merit.edu
Precedence: bulk

Hello,

> If you have some UPDATEs to announce and if you are holding them back
> because of the congestion you perceive in the network and then if you
> recieve a WITHDRAW for one of those same prefixes then you have to first
> advertise the UPDATEs because it is invalid to announce the same address
> prefix in the WITHDRAWN routes and the NLRI field of an UPDATE message. You
> thus end up flushing all the BGP UPDATEs you had piled up waiting for the
> congestion to clear out at your remote end.

Why would you do this?  If you receive a withdrawal for a route for which
you haven't sent an UPDATE yet, why wouldn't you forget about the UPDATE
and just send the withdrawal (or, if you've never advertised the route
to the peer, just send nothing at all)?  If a route is flapping and
goes up and down 62 times while your peer has you flow controlled, would
you really send the 62 flaps when the peer allows you to send again?  Why
wouldn't you just send the final result?

You need to do routing protocol implementations in a way which avoids
unbounded state, and doing this means that you shouldn't be recording
all route transitions to play back to a peer which isn't listening at
the moment.  If the only state you keep is what you advertised to the
peer before and what the state of the route is now, then the amount
of state you need to keep remains proportional to the number of routes
you have and you only need to advertise the current state (if it doesn't
match what the peer already knows, otherwise you send nothing) when
the peer starts taking updates again.

> The better solution is to let
> TCP take care of the congestion internally as it has always been

I don't think this has much to do with TCP, TCP flow control works
the same way.

Dennis Ferguson



Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id SAA29125 for <idr-archive@nic.merit.edu>; Tue, 23 Jul 2002 18:13:41 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 5EAF9912C9; Tue, 23 Jul 2002 18:13:13 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 2FD83912CA; Tue, 23 Jul 2002 18:13:13 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id E9906912C9 for <idr@trapdoor.merit.edu>; Tue, 23 Jul 2002 18:13:11 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id C7EC95DDF6; Tue, 23 Jul 2002 18:13:11 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from mail1.hyperchip.com (mail1.hyperchip.com [216.94.112.3]) by segue.merit.edu (Postfix) with ESMTP id 979105DD8D for <idr@merit.edu>; Tue, 23 Jul 2002 18:13:11 -0400 (EDT)
Received: from hermes.hyperchip.com ([10.0.0.12] helo=hermes.hypernet.com) by mail1.hyperchip.com with esmtp (Exim 3.22 #1) id 17X7u2-0005aj-00 for idr@merit.edu; Tue, 23 Jul 2002 18:13:10 -0400
Received: by hermes.hyperchip.com with Internet Mail Service (5.5.2653.19) id <3V7M71KL>; Tue, 23 Jul 2002 18:11:47 -0400
Message-ID: <8812A03F65CDD511AE98006008F5E871025D1792@hermes.hyperchip.com>
From: Tian Fang <tfang@hyperchip.com>
To: idr@merit.edu
Subject: limit the No. of routes received from a neighbor
Date: Tue, 23 Jul 2002 18:11:47 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
Content-Type: multipart/alternative; boundary="----_=_NextPart_001_01C23295.EF95C100"
Sender: owner-idr@merit.edu
Precedence: bulk

This message is in MIME format. Since your mail reader does not understand
this format, some or all of this message may not be legible.

------_=_NextPart_001_01C23295.EF95C100
Content-Type: text/plain;
	charset="iso-8859-1"

Hi all,

When the No. of routes received from a neighbor exceeds the configured
limit, is it better to tear down the session or drop the routes received
later on? If dropping routes is a option, is there any preference that we
can use to choose the routes to be dropped? Thx.

Regards,
Tian Fang

------_=_NextPart_001_01C23295.EF95C100
Content-Type: text/html;
	charset="iso-8859-1"
Content-Transfer-Encoding: quoted-printable

<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 3.2//EN">
<HTML>
<HEAD>
<META HTTP-EQUIV=3D"Content-Type" CONTENT=3D"text/html; =
charset=3Diso-8859-1">
<META NAME=3D"Generator" CONTENT=3D"MS Exchange Server version =
5.5.2653.12">
<TITLE>limit the No. of routes received from a neighbor</TITLE>
</HEAD>
<BODY>

<P><FONT SIZE=3D2>Hi all,</FONT>
</P>

<P><FONT SIZE=3D2>When the No. of routes received from a neighbor =
exceeds the configured limit, is it better to tear down the session or =
drop the routes received later on? If dropping routes is a option, is =
there any preference that we can use to choose the routes to be =
dropped? Thx.</FONT></P>

<P><FONT SIZE=3D2>Regards,</FONT>
<BR><FONT SIZE=3D2>Tian Fang</FONT>
</P>

</BODY>
</HTML>
------_=_NextPart_001_01C23295.EF95C100--


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id SAA28914 for <idr-archive@nic.merit.edu>; Tue, 23 Jul 2002 18:06:59 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 2C5BD912C6; Tue, 23 Jul 2002 18:06:44 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 0169B912C7; Tue, 23 Jul 2002 18:06:43 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 2BB92912C6 for <idr@trapdoor.merit.edu>; Tue, 23 Jul 2002 18:06:41 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 14EB35DDB2; Tue, 23 Jul 2002 18:06:41 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from celox-ma1-ems1.celoxnetworks.com (celox-ma1-imap1.celoxnetworks.com [12.40.60.233]) by segue.merit.edu (Postfix) with ESMTP id C5DD35DD8D for <idr@merit.edu>; Tue, 23 Jul 2002 18:06:40 -0400 (EDT)
Received: by celox-ma1-ems1.celoxnetworks.com with Internet Mail Service (5.5.2653.19) id <3T6K1340>; Tue, 23 Jul 2002 18:06:40 -0400
Message-ID: <1117F7D44159934FB116E36F4ABF221B020AF8AF@celox-ma1-ems1.celoxnetworks.com>
From: "Natale, Jonathan" <JNatale@celoxnetworks.com>
To: "'Dennis Ferguson'" <dennis@juniper.net>, Manav Bhatia <manav@samsung.com>
Cc: "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>, idr@merit.edu
Subject: RE: My BGP Route Update Pacing Draft 
Date: Tue, 23 Jul 2002 18:06:37 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
Content-Type: text/plain
Sender: owner-idr@merit.edu
Precedence: bulk

TCP URG ???
2nd session ???

-----Original Message-----
From: Dennis Ferguson [mailto:dennis@juniper.net] 
Sent: Tuesday, July 23, 2002 5:48 PM
To: Manav Bhatia
Cc: Abarbanel, Benjamin; idr@merit.edu
Subject: Re: My BGP Route Update Pacing Draft 

Hello,

> The crux of your draft is that if a peer gets so congested that it is not
> able to send any UPDATE/KEEPALIVE message to the remote uncongested peer
> then the session can break down because of the Hold Timer expiring and it
> can be prevented by sending some pacing information which will inform the
> uncongested peer to limit its route advertisement rate.

Note that, right now, it is impossible to send BGP updates to a peer any
faster than the peer is willing to process them.  Period.  TCP flow control
ensures this.  It hence isn't possible for update senders to put any more
load on a peer than that peer itself chooses to accept.  If a BGP peer
wants to ensure that its CPU is never more that 50% busy it can do
that already by not spending more than 50% of its CPU processing BGP
updates,
there is no way its neighbours can make it work harder than it wants to.
And the peer is free to stop processing update messages whenever it needs
to, like when it needs to send KEEPALIVEs, so a peer which voluntarily keeps
processing incoming updates which it should have been sending KEEPALIVEs is
just broken and needs to be fixed.

I hence don't think this draft has anything to do with fixing busy
BGP speakers, they are already in full control of their load.  The
draft seems to in fact have something to do with the inconvenience
an idle remote peer might suffer if the busy guy takes a long time
to respond to, say, a Capability message, but since the guys who are
inconvenienced are the guys who have available CPU cycles it seems to me
to be appropriate for those guys to spend the cycles dealing with this
themselves.

Dennis Ferguson


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id RAA28540 for <idr-archive@nic.merit.edu>; Tue, 23 Jul 2002 17:56:39 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 62CC2912C5; Tue, 23 Jul 2002 17:56:16 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 2E17E912C6; Tue, 23 Jul 2002 17:56:16 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id B9274912C5 for <idr@trapdoor.merit.edu>; Tue, 23 Jul 2002 17:56:14 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 9AD4F5DDB2; Tue, 23 Jul 2002 17:56:14 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from web14105.mail.yahoo.com (web14105.mail.yahoo.com [216.136.172.135]) by segue.merit.edu (Postfix) with SMTP id BAEA25DD8D for <idr@merit.edu>; Tue, 23 Jul 2002 17:56:13 -0400 (EDT)
Message-ID: <20020723215612.65350.qmail@web14105.mail.yahoo.com>
Received: from [47.234.0.51] by web14105.mail.yahoo.com via HTTP; Tue, 23 Jul 2002 14:56:12 PDT
Date: Tue, 23 Jul 2002 14:56:12 -0700 (PDT)
From: Padmaja kotamarthiC <pamsri01@yahoo.com>
Subject: Re: FW: My BGP Route Update Pacing Draft 
To: Dennis Ferguson <dennis@juniper.net>, "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
Cc: "'idr@merit.edu'" <idr@merit.edu>
In-Reply-To: <200207232142.g6NLgRm53816@merlot.juniper.net>
MIME-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Sender: owner-idr@merit.edu
Precedence: bulk

Hi ,
    Here's a transcript of my (offline) discussion
with Ben.

sri 

Ben wrote :
> OK,

> Ben

> -----Original Message-----
> From: Padmaja kotamarthiC
[mailto:pamsri01@yahoo.com]
> Sent: Tuesday, July 23, 2002 12:48 PM
> To: Abarbanel, Benjamin
> Subject: RE: Route Update Pacing
> > > Hi Ben ,
> In a *practical* situation with all the
> knobs/configurations provided by the protocol i am
not
> sure if this situation can be encountered. If in a
> *practical* situation this situation is encountered
i
> would consider the onus is on the system/router than
> the protocol. 
> > Like i said for the hypothetical case you mentiond
> limiting the number of routes/prefix per peer based
on
> the system/router specs should sufficiently handle
the
> congestion if not the onus is on the system and not
> the protocol. 
> > sri
> > --- "Abarbanel, Benjamin"
> <Benjamin.Abarbanel@Marconi.com> wrote:
> > 300 peers with 100k routes each stressing the
router
> > continuously by
> > adding and withdrawing routes.
> > > > Although this is hypothetical, the router
should be
> > able to handle
> > it without session drops or data loss. And a
> > reasonable convergence
> > time under 1 minute.
> > > > Ben
> > > > > -----Original Message-----
> > > From: Padmaja kotamarthiC
> > [mailto:pamsri01@yahoo.com]
> > > Sent: Tuesday, July 23, 2002 12:24 PM
> > > To: Abarbanel, Benjamin
> > > Subject: RE: Route Update Pacing
> > > > > > > > > Hi Ben ,
> > > The thing to understand is what conditions
> > > this situation arises. How many peers and routes
> > are
> > > being processed when this situation (congestion)
> > is
> > > encountered by you? I am just trying to
understand
> > if
> > > there's a:  system/router bottleneck or protocol
> > > issue.
> > > > > > sri
> > > > > > --- "Abarbanel, Benjamin"
> > > <Benjamin.Abarbanel@Marconi.com> wrote:
> > > > > > > > > > > > > -----Original Message-----
> > > > > From: Padmaja kotamarthiC
> > > > [mailto:pamsri01@yahoo.com]
> > > > > Sent: Tuesday, July 23, 2002 11:38 AM
> > > > > To: Abarbanel, Benjamin; manav@samsung.com
> > > > > Cc: idr@merit.edu
> > > > > Subject: Route Update Pacing
> > > > > > > > > > > > > > > Hi Ben ,
> > > > > What is the affect of this solution on
> > > > > convergence ? Your solution seems to be
> > similar to
> > > > > minimumrouteadvertisementinterval with a
> > > > modification
> > > > > that this timer comes into picture only
during
> > > > > congestion rather than all times. If we can
> > modify
> > > > the
> > > > > role of minimumrouteadvertisementinterval by
> > > > > implementing as stated above it can solve
the
> > > > issue
> > > > > too rather than introducing new messages.
> > > > > > > > > > sri
> > > > > > > > > Sri:
> > > > The minimumrouteadvertisementinterval for IBGP
> > > > routes works 
> > > > on a preconfigured timeout value. This value
> > does
> > > > not change 
> > > > dynamically to load/stress conditions
therefore
> > it
> > > > is suboptimal
> > > > to handle the congestion problem. There has
been
> > a
> > > > study done
> > > > that shows its partner timer
> > > > MinRouteAdvertisementInternal for EBGP
> > > > routes to take to long and cause convergence
> > > > problems.
> > > > > > > > check this out:
> > > > > > > >
> > >
> >
>
http://www.ripe.net/ripe/meetings/archive/ripe-42/presentation
> > s/ripe42-eof-b
> > > gp/sld038.html
> > > > > > Ben




__________________________________________________
Do You Yahoo!?
Yahoo! Health - Feel better, live better
http://health.yahoo.com


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id RAA28269 for <idr-archive@nic.merit.edu>; Tue, 23 Jul 2002 17:49:22 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id A2D67912C4; Tue, 23 Jul 2002 17:48:50 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 724F991243; Tue, 23 Jul 2002 17:48:50 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id C5ED5912C4 for <idr@trapdoor.merit.edu>; Tue, 23 Jul 2002 17:48:27 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id A40745DE92; Tue, 23 Jul 2002 17:48:27 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from merlot.juniper.net (natint.juniper.net [207.17.136.129]) by segue.merit.edu (Postfix) with ESMTP id 469A65DEA4 for <idr@merit.edu>; Tue, 23 Jul 2002 17:48:27 -0400 (EDT)
Received: from juniper.net (wawa.juniper.net [172.17.20.55]) by merlot.juniper.net (8.11.3/8.11.3) with ESMTP id g6NLmNm54421; Tue, 23 Jul 2002 14:48:23 -0700 (PDT) (envelope-from dennis@juniper.net)
Message-Id: <200207232148.g6NLmNm54421@merlot.juniper.net>
X-Mailer: exmh version 2.0.2 2/24/98
To: Manav Bhatia <manav@samsung.com>
Cc: "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>, idr@merit.edu
Subject: Re: My BGP Route Update Pacing Draft 
In-reply-to: Your message of "Tue, 23 Jul 2002 09:42:10 +0530." <00f001c231ff$1ec120b0$b4036c6b@sisodomain.com> 
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Date: Tue, 23 Jul 2002 14:48:23 -0700
From: Dennis Ferguson <dennis@juniper.net>
Sender: owner-idr@merit.edu
Precedence: bulk

Hello,

> The crux of your draft is that if a peer gets so congested that it is not
> able to send any UPDATE/KEEPALIVE message to the remote uncongested peer
> then the session can break down because of the Hold Timer expiring and it
> can be prevented by sending some pacing information which will inform the
> uncongested peer to limit its route advertisement rate.

Note that, right now, it is impossible to send BGP updates to a peer any
faster than the peer is willing to process them.  Period.  TCP flow control
ensures this.  It hence isn't possible for update senders to put any more
load on a peer than that peer itself chooses to accept.  If a BGP peer
wants to ensure that its CPU is never more that 50% busy it can do
that already by not spending more than 50% of its CPU processing BGP updates,
there is no way its neighbours can make it work harder than it wants to.
And the peer is free to stop processing update messages whenever it needs
to, like when it needs to send KEEPALIVEs, so a peer which voluntarily keeps
processing incoming updates which it should have been sending KEEPALIVEs is
just broken and needs to be fixed.

I hence don't think this draft has anything to do with fixing busy
BGP speakers, they are already in full control of their load.  The
draft seems to in fact have something to do with the inconvenience
an idle remote peer might suffer if the busy guy takes a long time
to respond to, say, a Capability message, but since the guys who are
inconvenienced are the guys who have available CPU cycles it seems to me
to be appropriate for those guys to spend the cycles dealing with this
themselves.

Dennis Ferguson



Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id RAA28086 for <idr-archive@nic.merit.edu>; Tue, 23 Jul 2002 17:42:55 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 2DB0D91268; Tue, 23 Jul 2002 17:42:31 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id EF7CF912C4; Tue, 23 Jul 2002 17:42:30 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id DD5DB91268 for <idr@trapdoor.merit.edu>; Tue, 23 Jul 2002 17:42:29 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id C47BC5DDB2; Tue, 23 Jul 2002 17:42:29 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from merlot.juniper.net (natint.juniper.net [207.17.136.129]) by segue.merit.edu (Postfix) with ESMTP id 3F52F5DD8D for <idr@merit.edu>; Tue, 23 Jul 2002 17:42:29 -0400 (EDT)
Received: from juniper.net (wawa.juniper.net [172.17.20.55]) by merlot.juniper.net (8.11.3/8.11.3) with ESMTP id g6NLgRm53816; Tue, 23 Jul 2002 14:42:27 -0700 (PDT) (envelope-from dennis@juniper.net)
Message-Id: <200207232142.g6NLgRm53816@merlot.juniper.net>
X-Mailer: exmh version 2.0.2 2/24/98
X-Exmh-Isig-CompType: repl
X-Exmh-Isig-Folder: lists/idr
To: "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
Cc: "'idr@merit.edu'" <idr@merit.edu>
Subject: Re: FW: My BGP Route Update Pacing Draft 
In-reply-to: Your message of "Thu, 18 Jul 2002 18:14:33 EDT." <39469E08BD83D411A3D900204840EC558227BA@vie-msgusr-01.dc.fore.com>
Mime-Version: 1.0
Content-Type: text/plain
Date: Tue, 23 Jul 2002 14:42:27 -0700
From: Dennis Ferguson <dennis@juniper.net>
Sender: owner-idr@merit.edu
Precedence: bulk

Hello,

> > 
> > Why not relly on the transport. Flow control is after all
> > traditionally the function of the transport. What makes you think it
> > needs to be reinvented... and how is your version of flow control
> > supperior to the current one ?
> >
[...]
> The other end that is being flow controlled does not know why 
> or by how much he
> will be flow controlled off. Remember on a long session with 
> many hops it takes significant time to communicate the flow 
> control condition by ACKing or not ACKing messages. For me 
> this is not a direct but indirect mechanism.
> In the interim, the pipe will be full of data.

I think you are confused about this.  TCP flow control does not work by
withholding ACKs. TCP always ACKs messages that are received, and flow
control is instead implemented by directly telling the other end
not to send more.  The sender is informed of this condition as soon as
it occurs, and will be informed that the condition has cleared in the
time it takes for a packet to transit the path between the peers (which
is why it is probably good to have a TCP window of outstanding data
at the receiver that it can process until the sender gets the
message to send more, and noting that the receiver is in full control
of just how much or little data that is).  It doesn't get much better
than that.

Now a sender who has had all outstanding packets ACK'd but which
is flow controlled off knows precisely why this has occurred; it
has been sending updates faster than the receiver was willing to
process them.  There is no ambiguity, this is the only way this
situation could occur.

Therefore, out of everything you said above, the only thing I can
see which is perfectly accurate is that the flow-controlled sender
won't know how long it will be flow-controlled off.  This will depend
on what other things the receiving router has to do which it finds
more important than processing BGP messages from this particular
peer.  It is hard to see how the receiver could know how long that
is going to take until it actually does it, however, so it seems
that telling the sender this would require the receiver to be able
to predict the future (the sender knows all about the past already
so there isn't much use having the receiver tell the sender about
that).  I can't dispute that both prescient routers and fortune tellers
might be useful, but I'm not sure that either is a practical
possibility in real life.

While I doubt the implementability of the future-prediction you
seem to be looking for here, however, I think there is another
more fundamental issue which says your approach is incorrect.  That
is, TCP flow control already puts the congested receiver in full
control of its load.  No one can send data faster than the congested
receiver is willing to process it.  The congested receiver directly
controls how much data is outstanding when flow control is asserted.
The sender can't do anything to the receiver that the receiver doesn't
explicitly permit.  With TCP flow control the congested receiver
has full control of its resources.  If there is any problem at all
here, then, it is that the sender has no control at all over the
receiver's resources and hence may be inconvenienced by a congested
receiver which chooses to take its time responding.  And, in fact,
if we're talking about Capability messages in particular, it is of
no advantage in general at all to the busy receiver to get this
message any earlier than updates or anything else; it doesn't even know
the message is coming until it sees it.  Only the uncongested sender
knows there is an outstanding capability message.

So if there is a problem here at all it is a problem for the
idle sender, which might have to wait for a response to a Capability
message, rather than the busy receiver.  Yet your proposed solution
for this is to have the busy receiver do yet more work, continually
predicting the future and sending Inform messages about it to all of
its (possibly many) peers, on the off chance that a sender which
might someday have a Capability message to send, and which has CPU
cycles to burn, isn't inconvenienced.  Since it is the guy who has
CPU cycles available which has the problem in the first place it seems
utterly backwards to make the guy who has no CPU cycles free do more
work to fix it.  The guy with the CPU cycles should be the one to fix
this, i.e. the sender should spend his free cycles doing whatever it
takes to wait until the congested guy gets around to responding.  Then
everything works fine with no protocol changes.

> > In BGP a receiver can choose to accept a anyrate of updates 
> it wants.
> 
> Yes, it can by reading or not reading the TCP socket queue. 
> Which gets us
> back to the original point I mentioned earlier. TCP FLOW 
> Control is not
> perfect.

Your original point seems to have been based on a faulty understanding
of how TCP flow control works.

> > Please do offer more to substantiate this other than your claim that
> > the problem exists.
> >
>  
> By the fact that the pacing control is more direct and implicit to a
> particular
> traffic type (in this case update messages). The pacing mechanism is
> deterministic, whereas TCP flow control is not. You should talk to people
> who build IPC mechanisms to replace TCP about this issue. 

Or the guys that built deterministic FDDI or Token Ring or Token Bus
or ATM LANs to replace non-deterministic Ethernets.  We have networks
where non-deterministic stuff just happens despite our best efforts to
avoid it, so you might be better off to write code which deals with
that rather than depending on some other box to deterministically
predict the future so you aren't inconvenienced.  While you are correct
that this would be easier for your box if other routers could predict
their future for you with 100% reliability, I don't think this is
likely to be possible in any case.

Dennis Ferguson


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id LAA15163 for <idr-archive@nic.merit.edu>; Tue, 23 Jul 2002 11:03:46 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 4095091264; Tue, 23 Jul 2002 11:03:27 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id C54199130A; Tue, 23 Jul 2002 11:03:15 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 02E2891306 for <idr@trapdoor.merit.edu>; Tue, 23 Jul 2002 11:02:23 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id D5D1E5DDF8; Tue, 23 Jul 2002 11:02:23 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from mailgate.pit.comms.marconi.com (mailgate.pit.comms.marconi.com [169.144.68.6]) by segue.merit.edu (Postfix) with ESMTP id 535D35DDF7 for <idr@merit.edu>; Tue, 23 Jul 2002 11:02:23 -0400 (EDT)
Received: from mailman.pit.comms.marconi.com (mailman.pit.comms.marconi.com [169.144.2.12]) by mailgate.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id LAA05227; Tue, 23 Jul 2002 11:02:21 -0400 (EDT)
Received: from whq-msgrtr-01.pit.comms.marconi.com (whq-msgrtr-01.pit.comms.marconi.com [169.144.2.221]) by mailman.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id LAA25049; Tue, 23 Jul 2002 11:02:21 -0400 (EDT)
Received: by whq-msgrtr-01.pit.comms.marconi.com with Internet Mail Service (5.5.2650.21) id <3W989SVT>; Tue, 23 Jul 2002 11:02:16 -0400
Message-ID: <39469E08BD83D411A3D900204840EC558227C6@vie-msgusr-01.dc.fore.com>
From: "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
To: "'M. ELK'" <elkou141061@hotmail.com>, manav@samsung.com, marelines@yahoo.com
Cc: idr@merit.edu
Subject: RE: My BGP Route Update Pacing Draft
Date: Tue, 23 Jul 2002 11:02:14 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2650.21)
Content-Type: multipart/alternative; boundary="----_=_NextPart_001_01C23259.EE331B00"
Sender: owner-idr@merit.edu
Precedence: bulk

This message is in MIME format. Since your mail reader does not understand
this format, some or all of this message may not be legible.

------_=_NextPart_001_01C23259.EE331B00
Content-Type: text/plain;
	charset="iso-8859-1"

Brgds:

   for case 1 : 

Could the graceful restart mechanism be a solution to this pblm . 

Say R1 is peering with N peer . R1 is under stress 

R1 select to close 1..m  TCP session with peer 1 to m , deal with the
remaining  session (m+1 ...N) and clear his congestion (now R1 have 

enough HP ) . R1 initiae a session to peer 1 ...m with restart bit set and f
bit set  .  

Closing down a peer session just because you cannot handle its load is a bad
idea.  And then using some

gracefull session restart mechanism to control the load. I dont see how that
will work efficiently and keep routes up to date?

 Here we assume that R1 could sort his stress within a restart time value.

  What if the stress does not go away after a time period?

Ben


   _____  

Chat with friends online, try MSN Messenger: Click
<http://g.msn.com/1HM1ENXX/c144??PS=47575> Here



------_=_NextPart_001_01C23259.EE331B00
Content-Type: text/html;
	charset="iso-8859-1"

<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
<HTML><HEAD>
<META HTTP-EQUIV="Content-Type" CONTENT="text/html; charset=iso-8859-1">


<META content="MSHTML 5.00.2920.0" name=GENERATOR></HEAD>
<BODY>
<DIV><FONT color=#0000ff face=Arial size=2><SPAN 
class=111275814-23072002>Brgds:</SPAN></FONT></DIV>
<BLOCKQUOTE 
style="BORDER-LEFT: #0000ff 2px solid; MARGIN-LEFT: 5px; PADDING-LEFT: 5px">
  <DIV align=left class=OutlookMessageHeader dir=ltr><FONT color=#0000ff 
  face=Arial size=2><SPAN class=111275814-23072002>&nbsp; 
  &nbsp;</SPAN></FONT>for case 1 : </DIV>
  <DIV>
  <DIV>
  <P>Could&nbsp;the graceful restart&nbsp;mechanism be a solution to this pblm . 
  </P>
  <P>Say R1 is peering with N peer . R1 is under stress </P>
  <P>R1 select to close&nbsp;1..m&nbsp; TCP session with peer 1 to m&nbsp;, deal 
  with the remaining &nbsp;session (m+1 ...N)&nbsp;and&nbsp;clear his congestion 
  (now R1 have </P>
  <P>enough HP ) . R1 initiae a session to peer&nbsp;1 ...m with restart bit set 
  and f bit set&nbsp; .&nbsp;<FONT color=#0000ff face=Arial size=2><SPAN 
  class=111275814-23072002>&nbsp;</SPAN></FONT><FONT color=#0000ff face=Arial 
  size=2><SPAN class=111275814-23072002></P>
  <P><FONT color=#0000ff face=Arial size=2><SPAN 
  class=111275814-23072002>Closing down a peer session just because you cannot 
  handle its load is a bad idea.&nbsp; And then using some</SPAN></FONT></P>
  <P><FONT color=#0000ff face=Arial size=2><SPAN 
  class=111275814-23072002>gracefull session restart mechanism to control the 
  load. I dont see how that will work efficiently and keep routes up to 
  date?</SPAN></FONT></P></SPAN></FONT>
  <P>&nbsp;Here we assume that R1 could sort his stress within a restart time 
  value.</P>
  <P><FONT color=#0000ff face=Arial size=2><SPAN 
  class=111275814-23072002>&nbsp;&nbsp;What if the stress does not go away after 
  a time period?</SPAN></FONT></P>
  <P><FONT color=#0000ff face=Arial size=2><SPAN 
  class=111275814-23072002>Ben</SPAN></FONT></P></DIV></DIV><BR clear=all>
  <HR>
  Chat with friends online, try MSN Messenger: <A 
  href="http://g.msn.com/1HM1ENXX/c144??PS=47575">Click 
Here</A><BR></BLOCKQUOTE></BODY></HTML>

------_=_NextPart_001_01C23259.EE331B00--


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id KAA14662 for <idr-archive@nic.merit.edu>; Tue, 23 Jul 2002 10:49:31 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id F0DFE91303; Tue, 23 Jul 2002 10:49:10 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id EE09C91306; Tue, 23 Jul 2002 10:49:07 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 3C78F91303 for <idr@trapdoor.merit.edu>; Tue, 23 Jul 2002 10:47:43 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 2168D5DE90; Tue, 23 Jul 2002 10:47:43 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from mailgate.pit.comms.marconi.com (mailgate.pit.comms.marconi.com [169.144.68.6]) by segue.merit.edu (Postfix) with ESMTP id BE64C5DDF8 for <idr@merit.edu>; Tue, 23 Jul 2002 10:47:42 -0400 (EDT)
Received: from mailman.pit.comms.marconi.com (mailman.pit.comms.marconi.com [169.144.2.12]) by mailgate.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id KAA04138; Tue, 23 Jul 2002 10:47:37 -0400 (EDT)
Received: from whq-msgrtr-01.pit.comms.marconi.com (whq-msgrtr-01.pit.comms.marconi.com [169.144.2.221]) by mailman.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id KAA22288; Tue, 23 Jul 2002 10:47:38 -0400 (EDT)
Received: by whq-msgrtr-01.pit.comms.marconi.com with Internet Mail Service (5.5.2650.21) id <3W989R9Z>; Tue, 23 Jul 2002 10:47:38 -0400
Message-ID: <39469E08BD83D411A3D900204840EC558227C5@vie-msgusr-01.dc.fore.com>
From: "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
To: "'Padmaja kotamarthiC'" <pamsri01@yahoo.com>, manav@samsung.com, "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>, idr@merit.edu
Subject: RE: 
Date: Tue, 23 Jul 2002 10:47:36 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2650.21)
Content-Type: text/plain; charset="iso-8859-1"
Sender: owner-idr@merit.edu
Precedence: bulk

Sri:
  How would you handle many adds and withdrawls of same routes. Would that
not cause
the effect of alot more update messages?

Say we get into route oscillations related problems, would we see alot of
updates in a given period of time?

Ben

> -----Original Message-----
> From: Padmaja kotamarthiC [mailto:pamsri01@yahoo.com]
> Sent: Tuesday, July 23, 2002 10:35 AM
> To: manav@samsung.com; Benjamin.Abarbanel@Marconi.com; idr@merit.edu
> Subject: 
> 
> 
> Hi Manav,
>         If your peer is congested for whatever reason
> the simple solution is to: set the peer prefix limit
> to the appropriate value to avoid the problem. 
> 
> sri
> 
> Hi Ben,
> I have some thoughts on the draft.
> 
> The crux of your draft is that if a peer gets so
> congested that it is not
> able to send any UPDATE/KEEPALIVE message to the
> remote uncongested peer
> then the session can break down because of the Hold
> Timer expiring and it
> can be prevented by sending some pacing information
> which will inform the
> uncongested peer to limit its route advertisement
> rate. The PACING
> information will too be carried in the TCP packets, so
> if the TCP is
> implementing flow control and is not sending packets
> to avoid congestion
> then how will the PACING information be sent? And if 
> TCP can send the
> PACING information then it can as well send the
> KEEPALIVE packet which will
> keep the HoldTimer from expiring at the other end.
> 
> Secondly, if you are aware that your router is old and
> might come in a
> situation when it might not be able to handle the full
> feed of BGP Updates
> from the remote end then it can negotiate a HoldTime
> of zero and keep the
> session running even when it is in the congested
> state.
> 
> Thirdly, the very idea of artificially impeding
> reachability information
> (or sending of BGP UPDATE messages) doesn't really
> appeal to me. I was
> initially against the idea of using
> MinRouteAdvertisementInternal etc
> timers in my implementation arguing that a router
> should pack off any
> reachability information it has as soon as it comes to
> know of it. But this
> created a problem when i got say some 80K routes. I
> would then be sending
> 80K packets to my other peers! Finally , better sense
> prevailed on me and i
> started using the timers for efficiently packing my
> NLRIs in the UPDATE
> messages. This is one point where i hold my
> reachability information from
> my peers & I discourage it doing any other time. IMHO
> i feel we should let
> TCP take care of withholding any data it wants with
> itself till the
> congestion state rides by rather than us interfering
> with it.
> 
> What happens say when an uncongested router receives
> pacing information
> telling it to reduce the advertisement rate by some
> 70%. In that case there
> will be many UPDATEs lying with it remaining to be yet
> announced to the
> remote peer which is experiencing congestion. Now what
> if the uncongested
> network receives withdrawals (both implicit and
> explicit) for certain
> prefixes which have already been announced or remain
> yet to be announced?
> You open up a can of worms now !!
> 
> You have to now flush all your output buffers
> regardless of the congestion
> state your remote peer lies in!
> 
> Thus you end up doing what you always wanted to avoid.
> 
> I think a cleaner and a much simpler approach would be
> to dynamically
> negotiate the value of the HoldTimer during the
> session. If you feel you
> are getting congested then increase the value and once
> the red flags are
> all down you can revert back to your original value or
> decrease it back
> again.
> 
> Regards,
> Manav
> 
> 
> 
> 
> 
> 
> 
> 
> 
> 
> 
> ----- Original Message -----
> From: "Abarbanel, Benjamin"
> <Benjamin.Abarbanel@Marconi.com>
> To: <idr@merit.edu>
> Sent: Friday, July 19, 2002 1:01 AM
> Subject: My BGP Route Update Pacing Draft
> 
> 
> | Hi all:
> |
> |    Our recent discussions on this list and my recent
> work
> |  experiences have led me to write this draft and
> offer it
> |  to the IETF community. I would appreciate any
> comments
> |  anyone has to make.
> |
> |  Thanks in advance,
> |  Ben
> |
> |
> |
> |
> 
> 
> 
> 
> 
> __________________________________________________
> Do You Yahoo!?
> Yahoo! Health - Feel better, live better
> http://health.yahoo.com
> 


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id KAA14553 for <idr-archive@nic.merit.edu>; Tue, 23 Jul 2002 10:45:51 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 124B491300; Tue, 23 Jul 2002 10:45:00 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id D39C491303; Tue, 23 Jul 2002 10:44:59 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id EA90491300 for <idr@trapdoor.merit.edu>; Tue, 23 Jul 2002 10:44:58 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id D96475DEC8; Tue, 23 Jul 2002 10:44:58 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from mailgate.pit.comms.marconi.com (mailgate.pit.comms.marconi.com [169.144.68.6]) by segue.merit.edu (Postfix) with ESMTP id 3EB535DF99 for <idr@merit.edu>; Tue, 23 Jul 2002 10:44:54 -0400 (EDT)
Received: from mailman.pit.comms.marconi.com (mailman.pit.comms.marconi.com [169.144.2.12]) by mailgate.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id KAA03950; Tue, 23 Jul 2002 10:44:52 -0400 (EDT)
Received: from whq-msgrtr-01.pit.comms.marconi.com (whq-msgrtr-01.pit.comms.marconi.com [169.144.2.221]) by mailman.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id KAA21897; Tue, 23 Jul 2002 10:44:53 -0400 (EDT)
Received: by whq-msgrtr-01.pit.comms.marconi.com with Internet Mail Service (5.5.2650.21) id <3W989R6P>; Tue, 23 Jul 2002 10:44:52 -0400
Message-ID: <39469E08BD83D411A3D900204840EC558227C4@vie-msgusr-01.dc.fore.com>
From: "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
To: "'Manav Bhatia'" <manav@samsung.com>, Mareline Sheldon <marelines@yahoo.com>
Cc: idr@merit.edu
Subject: RE: My BGP Route Update Pacing Draft
Date: Tue, 23 Jul 2002 10:44:50 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2650.21)
Content-Type: text/plain; charset="iso-8859-1"
Sender: owner-idr@merit.edu
Precedence: bulk

Manav:

Response below.

> -----Original Message-----
> From: Manav Bhatia [mailto:manav@samsung.com]
> Sent: Tuesday, July 23, 2002 8:02 AM
> To: Mareline Sheldon
> Cc: idr@merit.edu
> Subject: Re: My BGP Route Update Pacing Draft
> 
> 
> Mareline,
> 
> | | What happens say when an uncongested router receives 
> pacing information
> | | telling it to reduce the advertisement rate by some 70%. 
> In that case
> there
> | | will be many UPDATEs lying with it remaining to be yet 
> announced to the
> | | remote peer which is experiencing congestion. Now what if the
> uncongested
> | | network receives withdrawals (both implicit and explicit) 
> for certain
> | | prefixes which have already been announced or remain yet to be
> announced?
> | | You open up a can of worms now !!
> | |
> | | You have to now flush all your output buffers regardless of the
> congestion
> | | state your remote peer lies in!
> |
> | what exactly do you mean by this?
> 
> If you have some UPDATEs to announce and if you are holding them back
> because of the congestion you perceive in the network and then if you
> recieve a WITHDRAW for one of those same prefixes then you 
> have to first
> advertise the UPDATEs because it is invalid to announce the 
> same address
> prefix in the WITHDRAWN routes and the NLRI field of an 
> UPDATE message. You
> thus end up flushing all the BGP UPDATEs you had piled up 
> waiting for the
> congestion to clear out at your remote end. The better 
> solution is to let
> TCP take care of the congestion internally as it has always 
> been OR if you
> want to dynamically adjust to the congestion then you may 
> re-negotiate the
> HoldTimer using the Dynamic Capabilities.
> 
> I hope i make myself clear.
> 
Manav:

Whether you use the pacing method or TCP flow control the updates will be
roadblocked in the source router and the end result will be the same. 
Remember TCP in the source router will only hold a window full of packets
and the rest of the UPDATE routes will be left in the RIB-OUT area of BGP.
Unless you
have implementation that moves 100,000+ routes into some transmit queue that
sits
above TCP. In which case, I doubt you would update this queue of update
messages
when new withdrawls come in affecting those routes.

So if your implementation leaves it in the RIB-out, the new Route withdrawls
coming in the source router could cause updates that have not made it to the
destination router to be affected. 
 

So there is no difference between the two mechanism in this case.

Ben


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id KAA14512 for <idr-archive@nic.merit.edu>; Tue, 23 Jul 2002 10:44:35 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 2700F91263; Tue, 23 Jul 2002 10:44:14 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id E8BDE91300; Tue, 23 Jul 2002 10:44:13 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id F111691263 for <idr@trapdoor.merit.edu>; Tue, 23 Jul 2002 10:44:12 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id D9AA55DEC8; Tue, 23 Jul 2002 10:44:12 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from mailgate.pit.comms.marconi.com (mailgate.pit.comms.marconi.com [169.144.68.6]) by segue.merit.edu (Postfix) with ESMTP id D154E5DDF8 for <idr@merit.edu>; Tue, 23 Jul 2002 10:44:07 -0400 (EDT)
Received: from mailman.pit.comms.marconi.com (mailman.pit.comms.marconi.com [169.144.2.12]) by mailgate.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id KAA03882; Tue, 23 Jul 2002 10:43:55 -0400 (EDT)
Received: from whq-msgrtr-01.pit.comms.marconi.com (whq-msgrtr-01.pit.comms.marconi.com [169.144.2.221]) by mailman.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id KAA21788; Tue, 23 Jul 2002 10:43:56 -0400 (EDT)
Received: by whq-msgrtr-01.pit.comms.marconi.com with Internet Mail Service (5.5.2650.21) id <3W989R5X>; Tue, 23 Jul 2002 10:43:55 -0400
Message-ID: <39469E08BD83D411A3D900204840EC558227C3@vie-msgusr-01.dc.fore.com>
From: "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
To: "'Manav Bhatia'" <manav@samsung.com>, "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>, idr@merit.edu
Subject: RE: My BGP Route Update Pacing Draft
Date: Tue, 23 Jul 2002 10:43:55 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2650.21)
Content-Type: text/plain; charset="iso-8859-1"
Sender: owner-idr@merit.edu
Precedence: bulk

Manav:

> -----Original Message-----
> From: Manav Bhatia [mailto:manav@samsung.com]
> Sent: Tuesday, July 23, 2002 12:12 AM
> To: Abarbanel, Benjamin; idr@merit.edu
> Subject: Re: My BGP Route Update Pacing Draft
> 
> 
> Hi Ben,
> I have some thoughts on the draft.
> 
> The crux of your draft is that if a peer gets so congested 
> that it is not
> able to send any UPDATE/KEEPALIVE message to the remote 
> uncongested peer
> then the session can break down because of the Hold Timer 
> expiring and it
> can be prevented by sending some pacing information which 
> will inform the
> uncongested peer to limit its route advertisement rate. The PACING
> information will too be carried in the TCP packets, so if the TCP is
> implementing flow control and is not sending packets to avoid 
> congestion
> then how will the PACING information be sent? And if  TCP can send the
> PACING information then it can as well send the KEEPALIVE 
> packet which will
> keep the HoldTimer from expiring at the other end.
> 
The assumption is that pacing is keeping the congestion low enough by
throttling the rate at which the source peer is transmitting to the local
peer to be
able to send more pacing messages back to the source router. When the router
is totally congested, its too late and even pacing messages will fail to
reach back to the source router. You would say you can achieve the same
control of the incoming stream by artificially limiting the rate at which
you
read the BGP TCP session socket. The only difference between the two
mechanism is
one is conveyed end to end between the BGP peers (via pacing) the other is
done
indirectly via the TCP end points being controlled via the socket reads. I
think
the Pacing method allows one router to send more detail information about
its 
congestion state then a simple flow on/off condition that is provided via
TCP.
The pacing mechanism is more deterministic than the simple TCP flow control
mechanism
which consumes alot more memory resources when flow control is activated.
 
Maybe now we dont need such fancy capabilities, but tomorrow is another day.


> Secondly, if you are aware that your router is old and might come in a
> situation when it might not be able to handle the full feed 
> of BGP Updates
> from the remote end then it can negotiate a HoldTime of zero 
> and keep the
> session running even when it is in the congested state.
>

If you have a zero hold timer, you would have extended period of time where
routes are not current with topological states. The reason we use the hold 
timer is in the case that the remote peer is not talking without a session
disconnect.
This time period is the most a given timer will tolerate before recomputing
alternate
paths for that router's routes. I dont believe that setting this timer off
will
help an old congested peer. It will make the topology highly unstable and
cause
possible black holes via the unresponsive router.
 
> Thirdly, the very idea of artificially impeding reachability 
> information
> (or sending of BGP UPDATE messages) doesn't really appeal to me. I was
> initially against the idea of using MinRouteAdvertisementInternal etc
> timers in my implementation arguing that a router should pack off any
> reachability information it has as soon as it comes to know 
> of it. But this
> created a problem when i got say some 80K routes. I would 
> then be sending
> 80K packets to my other peers! Finally , better sense 
> prevailed on me and i
> started using the timers for efficiently packing my NLRIs in 
> the UPDATE
> messages. This is one point where i hold my reachability 
> information from
> my peers & I discourage it doing any other time. IMHO i feel 
> we should let
> TCP take care of withholding any data it wants with itself till the
> congestion state rides by rather than us interfering with it.
> 
> What happens say when an uncongested router receives pacing 
> information
> telling it to reduce the advertisement rate by some 70%. In 
> that case there
> will be many UPDATEs lying with it remaining to be yet 
> announced to the
> remote peer which is experiencing congestion. Now what if the 
> uncongested
> network receives withdrawals (both implicit and explicit) for certain
> prefixes which have already been announced or remain yet to 
> be announced?
> You open up a can of worms now !!
> 
> You have to now flush all your output buffers regardless of 
> the congestion
> state your remote peer lies in!
> 
> Thus you end up doing what you always wanted to avoid.
> 
> I think a cleaner and a much simpler approach would be to dynamically
> negotiate the value of the HoldTimer during the session. If 
> you feel you
> are getting congested then increase the value and once the 
> red flags are
> all down you can revert back to your original value or 
> decrease it back
> again.
> 
Changing the hold timer does not reduce the congestion imposed on the
current
router by its peers. It only extends the time it takes for the session to
disconnect.

My draft was not attempting to treat session disconnects as the main issue
but reduce congestion level on a given router.  


Ben

P.S. Since there is no interest in this draft I have withdrawn it from IETF.

> 
> 
> 
> 
> 
> ----- Original Message -----
> From: "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
> To: <idr@merit.edu>
> Sent: Friday, July 19, 2002 1:01 AM
> Subject: My BGP Route Update Pacing Draft
> 
> 
> | Hi all:
> |
> |    Our recent discussions on this list and my recent work
> |  experiences have led me to write this draft and offer it
> |  to the IETF community. I would appreciate any comments
> |  anyone has to make.
> |
> |  Thanks in advance,
> |  Ben
> |
> |
> |
> |
> 


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id KAA12915 for <idr-archive@nic.merit.edu>; Tue, 23 Jul 2002 10:00:22 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 50F17912FC; Tue, 23 Jul 2002 09:59:43 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 1A293912FD; Tue, 23 Jul 2002 09:59:43 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id CA19C912FC for <idr@trapdoor.merit.edu>; Tue, 23 Jul 2002 09:59:41 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 9F3775DF9E; Tue, 23 Jul 2002 09:59:41 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from hotmail.com (f19.pav1.hotmail.com [64.4.31.19]) by segue.merit.edu (Postfix) with ESMTP id 427D85DFA3 for <idr@merit.edu>; Tue, 23 Jul 2002 09:59:41 -0400 (EDT)
Received: from mail pickup service by hotmail.com with Microsoft SMTPSVC; Tue, 23 Jul 2002 06:59:37 -0700
Received: from 57.250.229.136 by pv1fd.pav1.hotmail.msn.com with HTTP; Tue, 23 Jul 2002 13:59:37 GMT
X-Originating-IP: [57.250.229.136]
From: "M. ELK" <elkou141061@hotmail.com>
To: manav@samsung.com, marelines@yahoo.com
Cc: idr@merit.edu
Subject: Re: My BGP Route Update Pacing Draft
Date: Tue, 23 Jul 2002 13:59:37 +0000
Mime-Version: 1.0
Content-Type: text/html
Message-ID: <F19Zs7fJlzg6MsDt01200020f78@hotmail.com>
X-OriginalArrivalTime: 23 Jul 2002 13:59:37.0550 (UTC) FILETIME=[2E9BD6E0:01C23251]
Sender: owner-idr@merit.edu
Precedence: bulk

<html><div style='background-color:'><DIV>
<P><BR>Manav / Marelines </P>
<P>May be it will be clear if we define what pbm this draft is trying to</P>
<P>solve . </P>
<P>1) R1 could be&nbsp;under stress occasionaly </P>
<P>or </P>
<P>2)R1 have a low HP (relative to his location in the netw ) and R1 will be </P>
<P>in this situation very often .</P>
<P>&nbsp;</P>
<P>i guess solving&nbsp;case 2 is not appropriate&nbsp;by a protocol change ,also we</P>
<P>are pushing the purden of&nbsp; R1 weakness to his peers . </P>
<P>&nbsp;</P>
<P>for case 1 : </P>
<P>Could&nbsp;the graceful restart&nbsp;mechanism be a solution to this pblm . </P>
<P>Say R1 is peering with N peer . R1 is under stress </P>
<P>R1 select to close&nbsp;1..m&nbsp; TCP session with peer 1 to m&nbsp;, deal with the remaining &nbsp;session (m+1 ...N)&nbsp;and&nbsp;clear his congestion (now R1 have </P>
<P>enough HP ) . R1 initiae a session to peer&nbsp;1 ...m with restart bit set and f bit set&nbsp; . </P>
<P>&nbsp;Here we assume that R1 could sort his stress within a restart time value.</P>
<P>Brgds </P>
<P>&nbsp;</P></DIV></div><br clear=all><hr>Chat with friends online, try MSN Messenger: <a href='http://g.msn.com/1HM1ENXX/c144??PS=47575'>Click Here</a><br></html>


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id IAA09493 for <idr-archive@nic.merit.edu>; Tue, 23 Jul 2002 08:25:12 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 077FD912F8; Tue, 23 Jul 2002 08:24:35 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id CAEF3912F9; Tue, 23 Jul 2002 08:24:34 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id A0CC1912F8 for <idr@trapdoor.merit.edu>; Tue, 23 Jul 2002 08:24:33 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 9014D5DDB9; Tue, 23 Jul 2002 08:24:33 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from web12804.mail.yahoo.com (web12804.mail.yahoo.com [216.136.174.39]) by segue.merit.edu (Postfix) with SMTP id 065675DD9D for <idr@merit.edu>; Tue, 23 Jul 2002 08:24:33 -0400 (EDT)
Message-ID: <20020723122431.79415.qmail@web12804.mail.yahoo.com>
Received: from [141.154.39.203] by web12804.mail.yahoo.com via HTTP; Tue, 23 Jul 2002 05:24:31 PDT
Date: Tue, 23 Jul 2002 05:24:31 -0700 (PDT)
From: vrishab sikand <v_sikand@yahoo.com>
Subject: Re: consult :  What is the FTP User ID and Password
To: lidefeng <lidefeng@huawei.com>, idr@merit.edu
In-Reply-To: <000f01c231f8$8c12fbe0$22436e0a@HUAWEI.COM>
MIME-Version: 1.0
Content-Type: multipart/alternative; boundary="0-1183507891-1027427071=:79155"
Sender: owner-idr@merit.edu
Precedence: bulk

--0-1183507891-1027427071=:79155
Content-Type: text/plain; charset=us-ascii


The archive has not been acessable for quite some time. Either this should be fixed or the link removed from IDR home page.
  lidefeng <lidefeng@huawei.com> wrote: 
----- Original Message ----- 
From: "lidefeng" 

To: 
Sent: Tuesday, July 23, 2002 11:22 AM
Subject: consult : What is the FTP User ID and Password


> Dear Sir,
> 
> In the Mailing List of Work Group, listed following FTP directory for us to get the relevant archives:
> 
> Archive: ftp://ftp.merit.edu/mail.archives/idr
> 
> But I don't know the FTP User ID and the Paddword,so I can't get the document. Could you please tell me ?
> Thank you very much!
> 
> Li Defeng
> 


---------------------------------
Do You Yahoo!?
Yahoo! Health - Feel better, live better
--0-1183507891-1027427071=:79155
Content-Type: text/html; charset=us-ascii

<P>The archive has not been acessable for quite some time. Either this should be fixed or the link removed from IDR home page.
<P>&nbsp; <B><I>lidefeng &lt;lidefeng@huawei.com&gt;</I></B> wrote: 
<BLOCKQUOTE style="PADDING-LEFT: 5px; MARGIN-LEFT: 5px; BORDER-LEFT: #1010ff 2px solid"><BR>----- Original Message ----- <BR>From: "lidefeng" <LIDEFENG@HUAWEI.COM><BR>To: <IDR@MERIT.EDU><BR>Sent: Tuesday, July 23, 2002 11:22 AM<BR>Subject: consult : What is the FTP User ID and Password<BR><BR><BR>&gt; Dear Sir,<BR>&gt; <BR>&gt; In the Mailing List of Work Group, listed following FTP directory for us to get the relevant archives:<BR>&gt; <BR>&gt; Archive: ftp://ftp.merit.edu/mail.archives/idr<BR>&gt; <BR>&gt; But I don't know the FTP User ID and the Paddword,so I can't get the document. Could you please tell me ?<BR>&gt; Thank you very much!<BR>&gt; <BR>&gt; Li Defeng<BR>&gt; </BLOCKQUOTE><p><br><hr size=1><b>Do You Yahoo!?</b><br>
<a href="http://health.yahoo.com/">Yahoo! Health</a> - Feel better, live better
--0-1183507891-1027427071=:79155--


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id IAA08907 for <idr-archive@nic.merit.edu>; Tue, 23 Jul 2002 08:06:48 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id F32CE912F5; Tue, 23 Jul 2002 08:06:21 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id BA612912F6; Tue, 23 Jul 2002 08:06:20 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 861A1912F5 for <idr@trapdoor.merit.edu>; Tue, 23 Jul 2002 08:06:19 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 700F85DDB9; Tue, 23 Jul 2002 08:06:19 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from ep_mailout1 (unknown [203.254.224.24]) by segue.merit.edu (Postfix) with ESMTP id 182DB5DD9D for <idr@merit.edu>; Tue, 23 Jul 2002 08:06:19 -0400 (EDT)
Received: from custom-daemon.mailout1.samsung.com by mailout1.samsung.com (iPlanet Messaging Server 5.1 (built Sep  5 2001)) id <0GZP00501B226S@mailout1.samsung.com> for idr@merit.edu; Tue, 23 Jul 2002 21:08:26 +0900 (KST)
Received: from ep_mmp2 (localhost [127.0.0.1]) by mailout1.samsung.com (iPlanet Messaging Server 5.1 (built Sep  5 2001)) with ESMTP id <0GZP00556B2204@mailout1.samsung.com> for idr@merit.edu; Tue, 23 Jul 2002 21:08:26 +0900 (KST)
Received: from Manav ([107.108.3.180]) by mmp2.samsung.com (iPlanet Messaging Server 5.1 (built Sep  5 2001)) with ESMTPA id <0GZP00G3UB28OV@mmp2.samsung.com> for idr@merit.edu; Tue, 23 Jul 2002 21:08:34 +0900 (KST)
Date: Tue, 23 Jul 2002 17:31:55 +0530
From: Manav Bhatia <manav@samsung.com>
Subject: Re: My BGP Route Update Pacing Draft
To: Mareline Sheldon <marelines@yahoo.com>
Cc: idr@merit.edu
Reply-To: Manav Bhatia <manav@samsung.com>
Message-id: <008601c23240$bec54410$b4036c6b@sisodomain.com>
MIME-version: 1.0
X-MIMEOLE: Produced By Microsoft MimeOLE V6.00.2600.0000
X-Mailer: Microsoft Outlook Express 6.00.2600.0000
Content-type: text/plain; charset=iso-8859-1
Content-transfer-encoding: 7BIT
X-Priority: 3
X-MSMail-priority: Normal
References: <20020723115205.63839.qmail@web20310.mail.yahoo.com>
Sender: owner-idr@merit.edu
Precedence: bulk

Mareline,

| | What happens say when an uncongested router receives pacing information
| | telling it to reduce the advertisement rate by some 70%. In that case
there
| | will be many UPDATEs lying with it remaining to be yet announced to the
| | remote peer which is experiencing congestion. Now what if the
uncongested
| | network receives withdrawals (both implicit and explicit) for certain
| | prefixes which have already been announced or remain yet to be
announced?
| | You open up a can of worms now !!
| |
| | You have to now flush all your output buffers regardless of the
congestion
| | state your remote peer lies in!
|
| what exactly do you mean by this?

If you have some UPDATEs to announce and if you are holding them back
because of the congestion you perceive in the network and then if you
recieve a WITHDRAW for one of those same prefixes then you have to first
advertise the UPDATEs because it is invalid to announce the same address
prefix in the WITHDRAWN routes and the NLRI field of an UPDATE message. You
thus end up flushing all the BGP UPDATEs you had piled up waiting for the
congestion to clear out at your remote end. The better solution is to let
TCP take care of the congestion internally as it has always been OR if you
want to dynamically adjust to the congestion then you may re-negotiate the
HoldTimer using the Dynamic Capabilities.

I hope i make myself clear.

Regards,
Manav




Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id HAA08439 for <idr-archive@nic.merit.edu>; Tue, 23 Jul 2002 07:52:36 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id ECA8B912C2; Tue, 23 Jul 2002 07:52:09 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id A5670912F4; Tue, 23 Jul 2002 07:52:08 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 99B1F912C2 for <idr@trapdoor.merit.edu>; Tue, 23 Jul 2002 07:52:07 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 862BD5DDE7; Tue, 23 Jul 2002 07:52:07 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from web20310.mail.yahoo.com (web20310.mail.yahoo.com [216.136.226.91]) by segue.merit.edu (Postfix) with SMTP id CE3E75DD9D for <idr@merit.edu>; Tue, 23 Jul 2002 07:52:06 -0400 (EDT)
Message-ID: <20020723115205.63839.qmail@web20310.mail.yahoo.com>
Received: from [203.200.20.226] by web20310.mail.yahoo.com via HTTP; Tue, 23 Jul 2002 04:52:05 PDT
Date: Tue, 23 Jul 2002 04:52:05 -0700 (PDT)
From: Mareline Sheldon <marelines@yahoo.com>
Subject: Re: My BGP Route Update Pacing Draft
To: manav@samsung.com, idr@merit.edu
MIME-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Sender: owner-idr@merit.edu
Precedence: bulk

Hi Manav,
| 
| What happens say when an uncongested router receives pacing information
| telling it to reduce the advertisement rate by some 70%. In that case there
| will be many UPDATEs lying with it remaining to be yet announced to the
| remote peer which is experiencing congestion. Now what if the uncongested
| network receives withdrawals (both implicit and explicit) for certain
| prefixes which have already been announced or remain yet to be announced?
| You open up a can of worms now !!
| 
| You have to now flush all your output buffers regardless of the congestion
| state your remote peer lies in!

what exactly do you mean by this?

Regards,
mareline s.


__________________________________________________
Do You Yahoo!?
Yahoo! Health - Feel better, live better
http://health.yahoo.com


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id AAA23788 for <idr-archive@nic.merit.edu>; Tue, 23 Jul 2002 00:25:59 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id B85AA91311; Tue, 23 Jul 2002 00:21:26 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id DD80D91318; Tue, 23 Jul 2002 00:21:25 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 2795991301 for <idr@trapdoor.merit.edu>; Tue, 23 Jul 2002 00:16:30 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 12A885DD8E; Tue, 23 Jul 2002 00:16:30 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from ep_mailout1 (unknown [203.254.224.24]) by segue.merit.edu (Postfix) with ESMTP id 965095DDF9 for <idr@merit.edu>; Tue, 23 Jul 2002 00:16:29 -0400 (EDT)
Received: from custom-daemon.mailout1.samsung.com by mailout1.samsung.com (iPlanet Messaging Server 5.1 (built Sep  5 2001)) id <0GZO00901PB37N@mailout1.samsung.com> for idr@merit.edu; Tue, 23 Jul 2002 13:18:39 +0900 (KST)
Received: from ep_mmp2 (localhost [127.0.0.1]) by mailout1.samsung.com (iPlanet Messaging Server 5.1 (built Sep  5 2001)) with ESMTP id <0GZO008B8PB3S1@mailout1.samsung.com> for idr@merit.edu; Tue, 23 Jul 2002 13:18:39 +0900 (KST)
Received: from Manav ([107.108.3.180]) by mmp2.samsung.com (iPlanet Messaging Server 5.1 (built Sep  5 2001)) with ESMTPA id <0GZO00259PB8EW@mmp2.samsung.com> for idr@merit.edu; Tue, 23 Jul 2002 13:18:47 +0900 (KST)
Date: Tue, 23 Jul 2002 09:42:10 +0530
From: Manav Bhatia <manav@samsung.com>
Subject: Re: My BGP Route Update Pacing Draft
To: "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>, idr@merit.edu
Reply-To: Manav Bhatia <manav@samsung.com>
Message-id: <00f001c231ff$1ec120b0$b4036c6b@sisodomain.com>
MIME-version: 1.0
X-MIMEOLE: Produced By Microsoft MimeOLE V6.00.2600.0000
X-Mailer: Microsoft Outlook Express 6.00.2600.0000
Content-type: text/plain; charset=iso-8859-1
Content-transfer-encoding: 7BIT
X-Priority: 3
X-MSMail-priority: Normal
References: <39469E08BD83D411A3D900204840EC558227B4@vie-msgusr-01.dc.fore.com>
Sender: owner-idr@merit.edu
Precedence: bulk

Hi Ben,
I have some thoughts on the draft.

The crux of your draft is that if a peer gets so congested that it is not
able to send any UPDATE/KEEPALIVE message to the remote uncongested peer
then the session can break down because of the Hold Timer expiring and it
can be prevented by sending some pacing information which will inform the
uncongested peer to limit its route advertisement rate. The PACING
information will too be carried in the TCP packets, so if the TCP is
implementing flow control and is not sending packets to avoid congestion
then how will the PACING information be sent? And if  TCP can send the
PACING information then it can as well send the KEEPALIVE packet which will
keep the HoldTimer from expiring at the other end.

Secondly, if you are aware that your router is old and might come in a
situation when it might not be able to handle the full feed of BGP Updates
from the remote end then it can negotiate a HoldTime of zero and keep the
session running even when it is in the congested state.

Thirdly, the very idea of artificially impeding reachability information
(or sending of BGP UPDATE messages) doesn't really appeal to me. I was
initially against the idea of using MinRouteAdvertisementInternal etc
timers in my implementation arguing that a router should pack off any
reachability information it has as soon as it comes to know of it. But this
created a problem when i got say some 80K routes. I would then be sending
80K packets to my other peers! Finally , better sense prevailed on me and i
started using the timers for efficiently packing my NLRIs in the UPDATE
messages. This is one point where i hold my reachability information from
my peers & I discourage it doing any other time. IMHO i feel we should let
TCP take care of withholding any data it wants with itself till the
congestion state rides by rather than us interfering with it.

What happens say when an uncongested router receives pacing information
telling it to reduce the advertisement rate by some 70%. In that case there
will be many UPDATEs lying with it remaining to be yet announced to the
remote peer which is experiencing congestion. Now what if the uncongested
network receives withdrawals (both implicit and explicit) for certain
prefixes which have already been announced or remain yet to be announced?
You open up a can of worms now !!

You have to now flush all your output buffers regardless of the congestion
state your remote peer lies in!

Thus you end up doing what you always wanted to avoid.

I think a cleaner and a much simpler approach would be to dynamically
negotiate the value of the HoldTimer during the session. If you feel you
are getting congested then increase the value and once the red flags are
all down you can revert back to your original value or decrease it back
again.

Regards,
Manav











----- Original Message -----
From: "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
To: <idr@merit.edu>
Sent: Friday, July 19, 2002 1:01 AM
Subject: My BGP Route Update Pacing Draft


| Hi all:
|
|    Our recent discussions on this list and my recent work
|  experiences have led me to write this draft and offer it
|  to the IETF community. I would appreciate any comments
|  anyone has to make.
|
|  Thanks in advance,
|  Ben
|
|
|
|



Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id XAA21756 for <idr-archive@nic.merit.edu>; Mon, 22 Jul 2002 23:28:37 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 7653F9129C; Mon, 22 Jul 2002 23:28:15 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 3FC289129D; Mon, 22 Jul 2002 23:28:15 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id F139C9129C for <idr@trapdoor.merit.edu>; Mon, 22 Jul 2002 23:28:13 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id CC3675DE8C; Mon, 22 Jul 2002 23:28:13 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from mta0 (unknown [61.144.161.2]) by segue.merit.edu (Postfix) with ESMTP id 817E75DE8A for <idr@merit.edu>; Mon, 22 Jul 2002 23:28:13 -0400 (EDT)
Received: from lidefeng (localhost [127.0.0.1]) by mta0.huawei.com (iPlanet Messaging Server 5.2 HotFix 0.7 (built Jun 26 2002)) with ESMTPA id <0GZO00DKPMW2GW@mta0.huawei.com> for idr@merit.edu; Tue, 23 Jul 2002 11:26:27 +0800 (CST)
Date: Tue, 23 Jul 2002 11:25:08 +0800
From: lidefeng <lidefeng@huawei.com>
Subject: Re: consult :  What is the FTP User ID and Password
To: idr@merit.edu
Message-id: <000f01c231f8$8c12fbe0$22436e0a@HUAWEI.COM>
MIME-version: 1.0
X-MIMEOLE: Produced By Microsoft MimeOLE V6.00.2600.0000
X-Mailer: Microsoft Outlook Express 6.00.2600.0000
Content-type: text/plain; charset=iso-8859-2
Content-transfer-encoding: 7BIT
X-Priority: 3
X-MSMail-priority: Normal
References: <000701c231f8$1c4db3e0$22436e0a@HUAWEI.COM>
Sender: owner-idr@merit.edu
Precedence: bulk

----- Original Message ----- 
From: "lidefeng" <lidefeng@huawei.com>
To: <idr@merit.edu>
Sent: Tuesday, July 23, 2002 11:22 AM
Subject: consult : What is the FTP User ID and Password


> Dear Sir,
> 
>     In the Mailing List of  Work Group, listed following FTP directory for us to get the relevant archives:
> 
> Archive: ftp://ftp.merit.edu/mail.archives/idr
> 
> But I don't know the FTP User ID and the Paddword,so I can't get the document. Could you please tell me ?
> Thank you very much!
> 
>                 Li Defeng
> 


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id XAA21667 for <idr-archive@nic.merit.edu>; Mon, 22 Jul 2002 23:25:39 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 5CCF091299; Mon, 22 Jul 2002 23:25:12 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 284499129C; Mon, 22 Jul 2002 23:25:12 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id E3E0D91299 for <idr@trapdoor.merit.edu>; Mon, 22 Jul 2002 23:25:10 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id CA76B5DE8A; Mon, 22 Jul 2002 23:25:10 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from mta0 (unknown [61.144.161.2]) by segue.merit.edu (Postfix) with ESMTP id 4F53B5DD8E for <idr@merit.edu>; Mon, 22 Jul 2002 23:25:10 -0400 (EDT)
Received: from lidefeng (localhost [127.0.0.1]) by mta0.huawei.com (iPlanet Messaging Server 5.2 HotFix 0.7 (built Jun 26 2002)) with ESMTPA id <0GZO00D8XMQVNP@mta0.huawei.com> for idr@merit.edu; Tue, 23 Jul 2002 11:23:20 +0800 (CST)
Date: Tue, 23 Jul 2002 11:22:00 +0800
From: lidefeng <lidefeng@huawei.com>
Subject: consult :  What is the FTP User ID and Password
To: idr@merit.edu
Message-id: <000701c231f8$1c4db3e0$22436e0a@HUAWEI.COM>
MIME-version: 1.0
X-MIMEOLE: Produced By Microsoft MimeOLE V6.00.2600.0000
X-Mailer: Microsoft Outlook Express 6.00.2600.0000
Content-type: text/plain; charset=iso-8859-2
Content-transfer-encoding: 7BIT
X-Priority: 3
X-MSMail-priority: Normal
Sender: owner-idr@merit.edu
Precedence: bulk

Dear Sir,

    In the Mailing List of  Work Group, listed following FTP directory for us to get the relevant archives:

Archive: ftp://ftp.merit.edu/mail.archives/idr

But I don't know the FTP User ID and the Paddword,so I can't get the document. Could you please tell me ?
Thank you very much!

                Li Defeng



Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id VAA17310 for <idr-archive@nic.merit.edu>; Mon, 22 Jul 2002 21:15:45 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id C68EC91294; Mon, 22 Jul 2002 21:15:07 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 2C0F091295; Mon, 22 Jul 2002 21:15:06 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 894D091294 for <idr@trapdoor.merit.edu>; Mon, 22 Jul 2002 21:15:04 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id D82915DE88; Mon, 22 Jul 2002 21:15:03 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from psg.com (psg.com [147.28.0.62]) by segue.merit.edu (Postfix) with ESMTP id 48C495DD8E for <idr@merit.edu>; Mon, 22 Jul 2002 21:15:03 -0400 (EDT)
Received: from localhost ([127.0.0.1] helo=127.0.0.1) by psg.com with esmtp (Exim 3.36 #1) id 17WoGT-0004ym-00; Mon, 22 Jul 2002 18:15:01 -0700
Date: Mon, 22 Jul 2002 18:13:42 -0700
From: Alex Zinin <zinin@psg.com>
X-Mailer: The Bat! (v1.51) Personal
Reply-To: Alex Zinin <zinin@psg.com>
X-Priority: 3 (Normal)
Message-ID: <6924373577.20020722181342@psg.com>
To: David Ward <dward@cisco.com>
Cc: idr@merit.edu
Subject: Re: IDR WG minutes IETF 54
In-Reply-To: <p05111a00b95d0a37ad7b@[133\.93\.77\.124]>
References: <p05111a00b95d0a37ad7b@[133.93.77.124]>
MIME-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Transfer-Encoding: 7bit
Sender: owner-idr@merit.edu
Precedence: bulk

David,

  Please see some corrections below:

> II. Alvaro Retana presenter:
...
> QUESTION: Will the number of oscillating routes increase in the future?
> ANSWER: Potentially

> QUESTION (Alex Zinin): What about detection mechanism?
> ANSWER: YTBD

I think it was probably someone else. I asked the presenters
to please summarize the discussion on the list, explaining
what issues have been raised, how they have been addressed and
what the remaining ones are.

>         draft-walton-bgp-add-paths-00.txt
...
>         4. problems in 3107
>                 a. how to send multipath and encoding doesn't match text
>                 b. proposal is to change 3107 encoding to use new proposal
>                 NOTE: PPVPN owns 3107

Add please:

Alex: If we have a problem in 3107, maybe we should fix it
Sue : 3107 is not in IDR


> III. Sue Hares presenter
...
>         4. How to proceed?
>                 a. vote for adoption

I would change "vote" here to something like "show of hands to get the
feeling of the room" :)

>                         i. no wilfong
>                         ii. some walton
>                         iii. few Chen

Might be useful to note the approximate number of hands
for ii and iii. If I recall correctly it was not more than
half a dozen for each.

Add here:

Sue : Need to take this to the list, as there's no clear
      consensus.

Thanks.
Alex




Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id SAA12440 for <idr-archive@nic.merit.edu>; Mon, 22 Jul 2002 18:46:45 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 942B39122E; Mon, 22 Jul 2002 18:45:38 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 6200A91231; Mon, 22 Jul 2002 18:45:38 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 417F09122E for <idr@trapdoor.merit.edu>; Mon, 22 Jul 2002 18:45:37 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 124C95DDA7; Mon, 22 Jul 2002 18:45:37 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from prattle.redback.com (prattle.redback.com [155.53.12.9]) by segue.merit.edu (Postfix) with ESMTP id 4B49F5DD99 for <idr@merit.edu>; Mon, 22 Jul 2002 18:45:35 -0400 (EDT)
Received: from popserv2.redback.com (popserv2.redback.com [155.53.12.59]) by prattle.redback.com (Postfix) with ESMTP id E11DD18D3F0; Mon, 22 Jul 2002 15:45:33 -0700 (PDT)
Received: from redback.com (fall.redback.com [155.53.36.220]) by popserv2.redback.com (Postfix) with ESMTP id 967C7979C1; Mon, 22 Jul 2002 15:45:33 -0700 (PDT)
To: David Ward <dward@cisco.com>
Cc: idr@merit.edu, enke@redback.com
Subject: Re: IDR WG minutes IETF 54 
In-Reply-To: Message from David Ward <dward@cisco.com>  of "Thu, 18 Jul 2002 19:45:20 CDT." <p05111a00b95d0a37ad7b@[133.93.77.124]> 
Date: Mon, 22 Jul 2002 15:45:33 -0700
From: Enke Chen <enke@redback.com>
Message-Id: <20020722224533.967C7979C1@popserv2.redback.com>
Sender: owner-idr@merit.edu
Precedence: bulk

Hi, David:
Some comments are in-lined.

> Message-Id: <p05111a00b95d0a37ad7b@[133.93.77.124]>
> Date: Thu, 18 Jul 2002 19:45:20 -0500
> To: idr@merit.edu
> From: David Ward <dward@cisco.com>
> Subject: IDR WG minutes IETF 54
> Cc: dward@cisco.com
> Content-Type: text/plain; charset="us-ascii" ; format="flowed"
> Sender: owner-idr@merit.edu
> Precedence: bulk
> 
> NOTES: IDR WG meeting IETF 54
> Scribe: David Ward
> 
> Agenda
> 	Route Oscillation
> 	Inform message type
> 	Extended Communities
> 	Document Status
> 
> Route Oscillation
> I.	Anindya Basu Lucent presenter
> 	draft-wilfong-ibgp-oscillations-00.txt
> 
> 	1. Brief Overview of Problem (Route Reflection example)
> 	2. Overview of Transient Route Oscillations
> 		a. proposal converges to stable situation
> 			i. potential problem in scenario where there 
> could be an
> 			non-deterministic outcome
> 	3. Solution
> 		a. send xtra info
> 		b. if same aspath length, same loc_pref, diff AS: 
> send both prefixes
> 	4. Correctness
> 		a. always converges
> 		b. loop free
> 		c. works for multi level RR and confeds
> 	5. Scaling
> 		a. don't need to always send xtra routes
> 		b. only advert when detect oscillations
> 			i. propose using low-pass filter to weed out 
> restarts and detect
> 			high frequency osc.
> 	6. Issues seen in other drafts
> 		a. Walton
> 			i. doesn't solve transient and not all 
> persistent oscillations

> 		b. Chen
> 			i. believes all clients must be fully meshed

Actually only clients at the same level need to be fully meshed to
guaratee route selection consistency. (We will make this clearer in the
next revision.)

Is this a real issue considering that is how many networks are
configured?

> 
> QUESTION:
> 	How to detect oscillation?
> ANSWER:
> 	Add a counter to each entry in FIB and a timer and use 
> low-pass filter to determine if osc.
> NOTE from author: If want to scale must fine some measure to detect oscillation
> 
> 
> II. Alvaro Retana presenter:
> 	draft-walton-bgp-route-oscillation-stop-00.txt
> 
> 
> 	1. Brief Overview of Problem
> 		a. Type 1: single level RR/Confed
> 			i. solved in RFC 2796, 3065
> 		b. Type 2: multitier/hierarchical RR/Config
> 			i. must send multiple paths
> 			ii. always-compare-med considered unacceptable solution
> 		c. Oscillation stats from empirical analysis (2001 data)
> 			112086	 total routes
> 			9011	 candidates for oscillation (~10%)
> 			899           actually oscillating		(< 1%)
> 	2. Solution
> 		a. If RR or ASBR of confed
> 			i. send best path
> 			ii. send 'neighbor AS best path'
> 	3. Comparison
> 		a. Walton or Wilfong - requires oscillation detection

> 		b. Chen - not multi-level

Actually the mechanism does work with multi-level route reflection.
I did not finish this part before taking the vaction. This portion
will be revised in the next version.

> 	4. Next Steps
> 		a. determine oscillation detection mechanism - may be 
> implementation specific
> QUESTION: Will the number of oscillating routes increase in the future?
> ANSWER: Potentially
> 
> QUESTION (Alex Zinin): What about detection mechanism?
> ANSWER: YTBD
> 
> 	draft-walton-bgp-add-paths-00.txt
> 	1. This extension is required for oscillation solution
> 		a. could be adopted as an additional solution above 
> and beyond oscillation
> 	2. Proposal
> 		a. change NLRI encoding in 2858
> 			i. add bestpath bit - represents that path 
> has been selected by speaker
> 			and put into FIB
> 		b. same change to 3107 for MPLs
> 			i. additional clarifications necessary
> 		c. add new capability advertisement
> 			i. multipath capability
> 			NOTE: must not use encoding in 3107
> 	3. Soon to be released new proposal
> 		a. remove multiprotocol linkage
> 		b. change encoding to flags (1 octet) in all specs:
> 			i. 1771, 2858, 3107
> 		c. can also solve route server functionality in 
> addition to oscillation
> 	4. problems in 3107
> 		a. how to send multipath and encoding doesn't match text
> 		b. proposal is to change 3107 encoding to use new proposal
> 		NOTE: PPVPN owns 3107
> 
> III. Sue Hares presenter
> 	1. Mixed Issues w/ draft 17 -> 18 and FSM seen on list in 
> dicussion of osc. drafts
> 		a. tie breaking rules
> 			i. e.g. cluster_id
> 	2. Oscillation drafts comparison and issues
> 		a. always-compare-med not adequate sol'n
> 			i. solution today is to restrict topology
> 			i. too difficult to coordinate MED values 
> across multiple domains (asp)
> 	3. Presentation of Enke Chen's proposal
> 	draft-chen-route-oscillation-avoid-00.txt
> 		a. restrictions
> 			i. solves one level RR/Confed hierarchy
> 			ii. requires full mesh
> 		b. solutions
> 			i. announe best path rx'ed from clients
> 			ii. best path from all
> 			iii. calc IBEST and EBEST
> 			iv. announce according to rules in draft
> Statement: Naiming Sheng: Enke Chen's proposal has fewer protocol and 
> implementation changes
> 	4. How to proceed?
> 		a. vote for adoption
> 			i. no wilfong
> 			ii. some walton
> 			iii. few Chen
> Statement: Alvaro Retana
> 	What is problem we are trying to solve?
> 	Enke's doesn't solve any problem
> 	Wilfong's solves problems that are not known to exist
> 	There is a real problem as there are multilevel topologies deployed
> Statement: David Ward
> 	We know that Enke's proposal does not solve the problem in 
> the protocol specification
> 		and his proposed solution is already covered in 
> exisiting RFCs. Therefore, it should
> 		be removed from consideration.

Do not follow the comments by Alvaro and David on "does not solve
any/the problem". The essence of draft-chen-route-oscillation-avoid-00.txt
is to suggest:

    o advertising the best external path by a non-RR to IBGP (as
      in RFC 1771).

    o revising the RR behavior to remove advertisemnet dependence
      on IBGP updates.

The combination does solve the problem of route oscillation.

Admittedly the draft has some overlap with the "oscillation-reduce"
drafts, and we will consolidate the drafts in the next revision.

-- Enke


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id RAA10244 for <idr-archive@nic.merit.edu>; Mon, 22 Jul 2002 17:37:31 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 982B79120D; Mon, 22 Jul 2002 17:37:06 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 61A209121C; Mon, 22 Jul 2002 17:37:06 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 1A9429120D for <idr@trapdoor.merit.edu>; Mon, 22 Jul 2002 17:37:05 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id ED6785DE06; Mon, 22 Jul 2002 17:37:04 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from demiurge.exodus.net (demiurge.exodus.net [216.32.171.82]) by segue.merit.edu (Postfix) with ESMTP id 6BDEA5DDF6 for <idr@merit.edu>; Mon, 22 Jul 2002 17:37:04 -0400 (EDT)
Received: (from andrewl@localhost) by demiurge.exodus.net (8.9.3+Sun/8.9.3) id OAA19286; Mon, 22 Jul 2002 14:34:15 -0700 (PDT)
Date: Mon, 22 Jul 2002 14:34:15 -0700
From: andrewl@xix-w.bengi.exodus.net
To: "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
Cc: "'idr@merit.edu'" <idr@merit.edu>
Subject: Re: My BGP Route Update Pacing Draft
Message-ID: <20020722143415.G18094@demiurge.exodus.net>
References: <39469E08BD83D411A3D900204840EC558227B4@vie-msgusr-01.dc.fore.com>
Mime-Version: 1.0
Content-Type: text/plain; charset=iso-8859-1
Content-Disposition: inline
Content-Transfer-Encoding: 8bit
User-Agent: Mutt/1.2.5i
In-Reply-To: <39469E08BD83D411A3D900204840EC558227B4@vie-msgusr-01.dc.fore.com>; from Benjamin.Abarbanel@Marconi.com on Thu, Jul 18, 2002 at 03:31:54PM -0400
Sender: owner-idr@merit.edu
Precedence: bulk

Ben,

One thing we discussed at the meeting in Yokohama was that we MUST NOT
associate protocol actions with the INFORM message.  It is Informational
ONLY.  An INFORM may be followed by a protocol action (such as a NOTIFY),
but, in and of itself, it is not to be used to trigger protocol actions.

Andrew

On Thu, Jul 18, 2002 at 03:31:54PM -0400, Abarbanel, Benjamin wrote:
> Delivered-To: idr-outgoing@trapdoor.merit.edu
> Delivered-To: idr@trapdoor.merit.edu
> Delivered-To: idr@merit.edu
> From: "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
> To: "'idr@merit.edu'" <idr@merit.edu>
> Subject: My BGP Route Update Pacing Draft
> Date: Thu, 18 Jul 2002 15:31:54 -0400
> X-Mailer: Internet Mail Service (5.5.2650.21)
> Precedence: bulk
> X-OriginalArrivalTime: 18 Jul 2002 19:34:50.0014 (UTC) FILETIME=[2E818BE0:01C22E92]
> 
> Hi all:
> 
>    Our recent discussions on this list and my recent work 
>  experiences have led me to write this draft and offer it 
>  to the IETF community. I would appreciate any comments 
>  anyone has to make.
>  
>  Thanks in advance,
>  Ben
> 
>   
> 

> 
> 
> 
> 
> Network Working Group                                     Ben Abarbanel
> Internet Draft                                            Marconi Communicatons
> Expiration Date: December 2002                          
> 
> 
>           				
> 					BGP Route Update Pacing
> 
>                     draft-abarbanel-bgp-route-update-pacing-00.txt
> 
> 
> 1. Status of this Memo
> 
>    This document is an Internet-Draft and is in full conformance with
>    all provisions of Section 10 of RFC2026 except that the right to
>    produce derivative works is not granted.
> 
>    Internet-Drafts are working documents of the Internet Engineering
>    Task Force (IETF), its areas, and its working groups.  Note that
>    other groups may also distribute working documents as Internet-
>    Drafts.
> 
>    Internet-Drafts are draft documents valid for a maximum of six months
>    and may be updated, replaced, or obsoleted by other documents at any
>    time.  It is inappropriate to use Internet-Drafts as reference
>    material or to cite them other than as ``work in progress.''
> 
>    The list of current Internet-Drafts can be accessed at
>    http://www.ietf.org/ietf/1id-abstracts.txt
> 
>    The list of Internet-Draft Shadow Directories can be accessed at
>    http://www.ietf.org/shadow.html.
> 
> 
> 2. Abstract
> 
> This document defines a mechanism for controlling or limiting the rate at 
> which BGP update messages are sent from one peer to the next when a BGP peer 
> experiences internal congestion. With the introduction of new dynamic BGP 
> protocol capabilities [CAP] message or other none BGP session destructive 
> messages, it is necessary to limit the rate at which the BGP update messages 
> are sent without affecting the entire BGP session and without relying on the 
> transport (TCP) layer to do so as a normal reaction to data congestion of the 
> TCP session across the network.
> 
> 3. Specification of Requirements
> 
> The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT", "SHOULD", 
> "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this document are to be 
> interpreted as described in RFC 2119 [RFC2119].
> 
>  
> 
> 
> Internet Draft      draft-abarbanel-bgp-route-update-pacing-00.txt    [Page 2]
> 
>  
> 
> 4. Introduction
> 
> This document defines a mechanism for controlling or limiting the rate at 
> which BGP update messages are sent from one peer to the next when a BGP peer 
> experiences internal congestion. With the introduction of new dynamic BGP 
> protocol capabilities [CAP] message or other none BGP session destructive 
> messages, it is necessary to limit the rate at which the BGP update messages 
> are sent without affecting the entire BGP session and without relying on the 
> transport (TCP) layer to do so as a normal reaction to data congestion of the 
> TCP session across the network.
> 
> When a router enters a state where either its CPU utilization is maximized 
> (reaches close to 100%) or its memory is nearly depleted (less than 10% of 
> memory left), it cannot handle new or heavy streams of updates from its peers 
> and at times unable to send messages to its peer in a timely manner (within 
> 30 seconds). As a consequence it cannot keep current with the topological 
> state. In some scenarios, a non-congested peer might want to negotiate new 
> capabilities with a congested peer. The congested peer is so degraded that 
> its TCP session goes into significant flow control off conditions and is 
> unable to see the new BGP messages. Peer to peer communication is severely 
> hampered and as a result the uncongested peer will take corrective action 
> when its hold timer expires and drops the session. The uncongested peer 
> computes alternate routing paths that are suboptimal in distance or attribute 
> and thus affect the forwarding decisions of all routers in this network. It 
> is possible that the congested peer's routing (control) plane is badly 
> degraded but its forwarding plane is at normal working level. The mistake is 
> made by the uncongested peer since it does not see any Keepalive/Update 
> messages before its Hold timer expires and drops the session thereby dropping 
> all its associated routes. After routes are dropped, network instability 
> occurs and suboptimal paths are used by the remaining peers.
> 
> The MinRouteAdvertisementInternal and MinAsOriginationInterval timers are 
> inadequate since they are mostly implemented on a peer session basis and 
> studies have shown when they are used they severely degrade route convergence 
> time. The problem with statically defined timers (initialized during system 
> load or session establishment phase) is that they do not adjust to peer 
> internal dynamically changing congestion conditions. The problem with the 
> congested peer using TCP flow control to reduce its congestion condition, is 
> that it completely stops all incoming session traffic and thus preventing any 
> messages of high priority nature from being seen.
> 
> TCP Out of Band Data or Urgent Message is one way to bypass the TCP flow 
> control condition and allow high priority BGP messages to get to the 
> congested peer, assuming it is able to read the session socket queue. This 
> solution has its drawbacks as described in [UNIX-NET] chapter 21, p. 568.  
> 
> 
> 
> 
> 
> 
> 
> 
> Internet Draft      draft-abarbanel-bgp-route-update-pacing-00.txt    [Page 3]
> 
>  
> 
> This document presents a mechanism by which the BGP session of a congested 
> router need not be degraded to the level that communication is broken with 
> its peers. By using a peer to peer pacing mechanism which allows one peer to 
> rate limit the number of update messages per second received from another 
> peer, it can avoid the severe congestion conditions and process these 
> messages at a manageable level. In addition, the uncongested peer has the 
> knowledge to send high priority messages ahead of or in place of the normal 
> high volume of update messages.
> 
> Usually, topological disturbances are spiky in nature and once they subside, 
> the network returns to its optimum path oriented level. Whatever caused the 
> network to become unstable, such as routers handling too much data in a small 
> period of time or routers loosing their sessions or their links going up and 
> down, occurs in most live network on an infrequent basis. By using the pacing 
> mechanism as outlined in this draft, a BGP peer can prevent the serious 
> congestion long before it is in trouble and thus ride through topological 
> disturbances and still regain its stability without causing its sessions to 
> drop or its routes/paths to be discarded and recomputed.
> 
> The assumption in this spec is that the source of most or all of the internal 
> BGP router congestion is due to the heavy reception of update messages from 
> neighboring peers containing large number of routes.
> 
> 
> 5. BGP Update Pacing Mechanism
> 
> The BGP Update Message Pacing Mechanism is used to slow down the rate at 
> which a peer sends update messages to another. This extension to the BGP 
> protocol is simplified to use session non-destructive messages such as INFORM 
> as described in [INFORM]. The pacing is performed dynamically upon congestion 
> detection and subsidence and thus needs to use the new [INFORM] message that 
> will not infringe on the underlying BGP protocol or all its semantics and 
> rules.
>  
> 
> 5.1 BGP Update Message Pacing (PACE) Dynamic Capability
> 
> The BGP Update Message Pacing capability is dynamically negotiated with all 
> BGP speakers as type code=PACE (where, PACE=TBD) per TLV structures as 
> defined in [BGP-CAP] of the OPEN message or anytime after session is 
> established using the Dynamic Capability Message as defined [DYN-CAP] 
> specification. All those peers that accept the PACE capability will be 
> expected to support the new INFORM message as defined in [INFORM] 
> specification to carry the pacing TLV structure. 
> 
> 
> 
> 
> 
> 
> 
> 
> 
> Internet Draft      draft-abarbanel-bgp-route-update-pacing-00.txt    [Page 4]
> 
>  
> 
> All PACE capable routers will provide a configuration option to their 
> operator to enable the BGP Update Message Pacing mechanism on a per peer 
> basis. 
> 
> Any peer wishing to withdraw the PACE capability can do so dynamically using 
> the Dynamic Capability message as outlined in [DYN-CAP] specification. Once 
> withdrawn, affected peering session will remain intact but will not benefit 
> from the performance improvements offered by the pacing mechanism.
> 
> 
> 5.2 Use of INFORM Message
> 
> The INFORM message as described in [INFORM] is used to carry the rate 
> limiting (pacing TLV) control structure to neighboring peers. This 
> information informs the peer that the current BGP router has entered a 
> congestion state and it is to rate limit its transmission to the level 
> specified. 
> 
> The INFORM message contains the following PACE TLV structure:
>  
>    Type = Pacing Information, type=TBD 
>    Length = 2
>    1st Byte of Value = Cmd as shown below 
>    2nd Byte of Value = Level as shown below
> 
>  Cmd       Description 
> -------    -----------
>   11       Request to Pace update messages
>   12       ACK Response to Pace Update messages
>   13       NACK Response to Pace Update Messages
> 
> A. Request to Pace Update Messages Level Indication (code=11)
> 
> The rate is divided into sub levels in term of categories (Gray, Yellow, 
> Orange, Red, and Green) to denote the level of pacing which is directly 
> related to the level of congestion experienced within the congested peer. The 
> colors are used as simple handles used for flagging the severity of the 
> condition. The colors are also used as indicators for display by the NMS to 
> identify the rate limiting levels from any neighboring peer to the operator. 
> Associated Level Management MIBs are defined in section 6. 
> 
> The actual pacing control is done via the sub levels within these colors. 
>   
>     Level     Description
>    ------     -----------
>     Gray      (Entering minor congestion state)
>          1       Reduce update message traffic by 10% your normal rate.
>          2       Reduce update message traffic by 20% your normal rate.
>          3       Reduce update message traffic by 30% your normal rate.
> 
> 
> 
>                                                     
> Internet Draft      draft-abarbanel-bgp-route-update-pacing-00.txt    [Page 5]
> 
> 
> 
>       Yellow     (Entering medium congestion state)
>          4       Reduce update message traffic by 40% your normal rate. 
>          5       Reduce update message traffic by 50% your normal rate.
>          6       Reduce update message traffic by 60% your normal rate.
> 
>       Orange     (Entering major congestion state)
>          7       Reduce update message traffic by 70% your normal rate. 
>          8       Reduce update message traffic by 80% your normal rate.
>          9       Reduce update message traffic by 90% your normal rate
> 
>         Red      (Entering critical congestion state)
>         10       Complete cessation of all update message traffic (flow off). 
>                  At this Level the receiving peer might decide to bypass the 
>                  Congested Router and pick another less optimal router for the
>                  affected Routes.
> 
>     Green     (Exiting any congestion state)
>          0       Restore update message traffic to your normal level (flow on).
>               Routes redirected can be resumed to the original peer.
> 
> The receiving peer will send an INFORM message with an ACK or NACK Indicating 
> it has received and understood the pacing request and will either comply with 
> the it or forbid it. If the congested peer receives a NACK, it should remove 
> that peer from its list of PACE capable peers but still maintaining its 
> session without the use of Pacing.  If the NACKing peer decides at a future 
> date to re-enable the Pacing with the local peer, it will renegotiate the 
> PACE capability with the local peer at that time.
> 
> B. Response Message Error Indication:
> 
>     Level     Description
>    ------     -----------
> 0	 ACK indication. The peer will comply with the 
>      		     Pacing level request.
> 1	 NACK indication. The peer is unable to perform Pacing or comply  
>         with the pacing level requested.
> 
> 
> 5.3 INFORM Message Response Timer
>    
> When a congested peer sends an INFORM message with a "Request to Pace Update 
> Messages (code=11)" to all peers that support the pacing feature, it will 
> also start an associated INFORM Message Response Timer. Any peer that does 
> not respond within a 30 second timeout period with either an ACK/NACK INFORM 
> Response message, its associated session will be dropped. This is done to 
> clear any PACE capable peers that are also congested to the point where their 
> communication with the local congested peer is severed.
> 
> 
>  
> 
> 
> 
> Internet Draft      draft-abarbanel-bgp-route-update-pacing-00.txt    [Page 6]
> 
>  
> 
> 5.4 Congestion Detection Within the Router
> 
> There are at least two ways congestion could be detected and measured by a 
> BGP router.
> 
> - A high CPU utilization condition 
> 
> - A Lack of available memory to accept incoming BGP messages or inability 
>   to get memory to successfully complete BGP processing. 
> 
> 5.4.1 CPU Utilization Based Level Computation
> 
> It is recommended that when the congested router detects its inability to 
> perform route calculations or accept new BGP session messages at a normal 
> rate meaning the current router's CPU utilization is more than 49%, it should 
> inform all its peers of a rate limiting level and slow them down accordingly. 
> 
> The recommended way for computing the Level, based on percent CPU 
> Utilization, is done using the following:
> 
>    Level = (((%CPU Utilization – 50) x 2) + 5) / 10).
>    Where, % CPU Utilization is a whole integer number. Use Integer math  
>    here to remove any fractional value.
> 
>   Note: If computed Level is negative, go to Green and set Level = 0. If   
>          Level = 0 send INFORM message with pacing Level = 0, implying return 
>          to normal (100%) update rate.
> 
> Level implies, whatever number of messages you transmitted per second 
> before, transmit (100 – (Level x 10))% of that now. 
> 
>   e.g. If you transmitted 100 messages/second before. A level 2 (20%) 
>        reduction will cause you to transmit 80 messages/second now.
>           Assumption is that these messages have an average size (1500 bytes).
> 
> 
> 5.4.2 Memory Allocation Based Level Computation
> 
> It is recommended that when the congested router detects its inability to 
> perform route calculations or accept new BGP session messages at a normal 
> rate meaning the current router's memory allocation is more than 69%, it 
> should inform all its peers of a rate limiting level and slow them down 
> accordingly. 
> 
> The recommended way for computing the Level, based on percent Memory 
> Allocation, is done using the following:
> 
>    Level = (((%Memory Allocated – 70) x 2) + 5) / 10).
>     Where, % Memory Allocated is a whole integer number. Use Integer math  
>     here to remove any fractional value.
> 
> 
> 
> Internet Draft      draft-abarbanel-bgp-route-update-pacing-00.txt    [Page 7]
> 
> 
> 
>    Note: If computed Level is negative, go to Green sub-group and set 
>          Level = 0. If Level = 0 send INFORM message with pacing Level = 0, 
>          implying return to normal (100%) update rate.
>  
> 
> 5.5 INFORM Message Throttling
> 
> Once the congested peer receives acknowledgement from another peer, it will 
> send a modification INFORM message with a new Level to that peer after the 
> computed pacing Level changes by at least 1 value. This will amount to no 
> more than one INFORM modification message every 5 seconds. This is done to 
> debounce any spiky bursts of INFORM messages to all PACE negotiated peers 
> each time the computed pacing Level changes. Depending on vendor 
> implementations, the internal utilization levels could change at the 
> Microsecond or Millisecond rate.
>   
> 
> 6. Implementation Specific Mechanisms
> 
> The Memory Allocation and CPU Utilization Level detection algorithms 
> discussed in section 5.4.1 and 5.4.2 are suggested ways one can implement 
> these solutions. However, each vendor can implement a unique Memory 
> Allocation and CPU Utilization Level detection algorithms that best suits 
> his/her needs and will not negatively impact the overall BGP Route Update 
> Pacing mechanism described in this spec. Any issues relating to internal 
> implementation algorithms are outside the scope of this document.
> 
>  
> 7. Level Indicators Management MIBs
> 
>    TBD
> 
> 
> 8. Security Considerations
> 
>    This extension to BGP does not change the underlying security issues.
> 
> 
> 9. References
>    
>    [BGP-CAP] Chandra, R., Scudder, J., "Capabilities Advertisement with 
>              BGP-4", draft-ietf-idr-rfc2842bis-02.txt
> 
>        [DYN-CAP]         Chen E., Sangli S.,  "Dynamic Capability for BGP-4", 
>             draft-ietf-idr-dynamic-cap-02.txt, October 2002.
> 
>    [INFORM] Nalawade G., Scudder J., "BGPv4 INFORM Message",
>             draft-nalawade-bgp-inform-00.txt, December 2002
> 
>    [BGP-4]  Rekhter, Y., and T. Li, "A Border Gateway Protocol 4 
>                               (BGP-4)", RFC 1771, March 1995.
> 
> 
> Internet Draft      draft-abarbanel-bgp-route-update-pacing-00.txt    [Page 8]
> 
> 
> 
>    [BGP-4-DRAFT] Rekhter, Y. and T. Li (editors), "A Border Gateway Protocol 4
>                           (BGP-4)", Internet Draft draft-ietf-idr-bgp4-18.txt, 
>                            January 2002.
> 
>    [UNIX-NET]  Stevens, R., "UNIX Network Programming, Vol 1", Second Edition
>                             1998 Prentice Hall, Inc.
>    
> 
> 10. Author Information
> 
>    Ben Abarbanel
>    Marconi Communications
>    1595 Spring Hill Road, 5th Floor
>    Vienna, VA 22182
>    Email: benjamin.abarbanel@marconi.com
> 
> 



Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id OAA03946 for <idr-archive@nic.merit.edu>; Mon, 22 Jul 2002 14:28:39 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 8AC1191288; Mon, 22 Jul 2002 14:28:19 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 5437191289; Mon, 22 Jul 2002 14:28:19 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 5A95391288 for <idr@trapdoor.merit.edu>; Mon, 22 Jul 2002 14:28:18 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 4A1165DDC4; Mon, 22 Jul 2002 14:28:18 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from prattle.redback.com (prattle.redback.com [155.53.12.9]) by segue.merit.edu (Postfix) with ESMTP id 04A6A5DDB6 for <idr@merit.edu>; Mon, 22 Jul 2002 14:28:18 -0400 (EDT)
Received: from popserv3.redback.com (popserv3.redback.com [155.53.12.64]) by prattle.redback.com (Postfix) with ESMTP id 6B500262802; Mon, 22 Jul 2002 11:28:17 -0700 (PDT)
Received: from redback.com (fall.redback.com [155.53.36.220]) by popserv3.redback.com (Postfix) with ESMTP id A49157E6C1; Mon, 22 Jul 2002 11:28:11 -0700 (PDT)
To: "Ankur Goyal" <Ankur@coronanetworks.com>
Cc: idr@merit.edu, enke@redback.com
Subject: Re: RFC-2842 Encoding BGP Capabilities 
In-Reply-To: Message from "Ankur Goyal" <Ankur@coronanetworks.com>  of "Thu, 18 Jul 2002 15:20:57 PDT." <C61C9973831E2949A9AA57F3B0318464E31C6F@exch-srv.CoronaNetworks.com> 
Date: Mon, 22 Jul 2002 11:28:11 -0700
From: Enke Chen <enke@redback.com>
Message-Id: <20020722182811.A49157E6C1@popserv3.redback.com>
Sender: owner-idr@merit.edu
Precedence: bulk

Hi, Ankur:

> Subject: RFC-2842 Encoding  BGP Capabilities
> Date: Thu, 18 Jul 2002 15:20:57 -0700
> Message-ID: <C61C9973831E2949A9AA57F3B0318464E31C6F@exch-srv.CoronaNetworks.com>
> From: "Ankur Goyal" <Ankur@coronanetworks.com>
> To: <idr@merit.edu>
> Sender: owner-idr@merit.edu
> Precedence: bulk
> 
> Hi,
> 
> Section 2 of RFC 2842 states that:
> 
>   This is an Optional Parameter that is used by a BGP speaker to convey
>   to its BGP peer the list of capabilities supported by the speaker.
> 
>   The parameter contains one or more triples <Capability Code,
>   Capability Length, Capability Value>
>   ...
>   <snip>
>   ...
>    A particular capability, as identified by its Capability Code, may
>    occur more than once within the Optional Parameter.
> 
> I wonder which or both of the following encoding schemes are valid for =
> advertising supported capabilities to a peer.
> 
> OP# - is the Optional Parameter number in the OPEN message
> C   - represents the optional parameter type Capability
> A   - represents the optional parameter for Authentication
> c#  - represents individual Capabilities in the form code+length+value
> 
> (1) OP1-C:c1.c2.c3.c1 + OP2-A
> (2) OP1-C:c1 + OP2-C:c2 + OP3-C:c3 + OP4-C:c1 + OP5-A
> 
> any clarification on this will be greatly appreciated.

I am not aware of the use of the "Authentication" field. Other than that,
both encoding types (1) and (2) are in use, although (1) is clearly more
more compact than (2).

-- Enke



Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id OAA03259 for <idr-archive@nic.merit.edu>; Mon, 22 Jul 2002 14:05:54 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 1B6339127F; Mon, 22 Jul 2002 14:05:33 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id BA33A91287; Mon, 22 Jul 2002 14:05:28 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 4336C9127F for <idr@trapdoor.merit.edu>; Mon, 22 Jul 2002 14:05:27 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 310475DF9E; Mon, 22 Jul 2002 14:05:27 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from prattle.redback.com (prattle.redback.com [155.53.12.9]) by segue.merit.edu (Postfix) with ESMTP id F17525DDB6 for <idr@merit.edu>; Mon, 22 Jul 2002 14:05:26 -0400 (EDT)
Received: from popserv3.redback.com (popserv3.redback.com [155.53.12.64]) by prattle.redback.com (Postfix) with ESMTP id 560A340625E; Mon, 22 Jul 2002 11:05:20 -0700 (PDT)
Received: from redback.com (fall.redback.com [155.53.36.220]) by popserv3.redback.com (Postfix) with ESMTP id E69757E6C1; Mon, 22 Jul 2002 11:05:17 -0700 (PDT)
To: "Stephen Waters" <Stephen.Waters@riverstonenet.com>
Cc: idr@merit.edu, enke@redback.com
Subject: Re: Cooperative Route Filtering Capability 
In-Reply-To: Message from "Stephen Waters" <Stephen.Waters@riverstonenet.com>  of "Sat, 20 Jul 2002 09:58:36 BST." <D2ECDCD8A528BF43A01EECCBEB191DD115ECEE@rs-eu-exc1.rs.riverstonenet.com> 
Date: Mon, 22 Jul 2002 11:05:15 -0700
From: Enke Chen <enke@redback.com>
Message-Id: <20020722180518.E69757E6C1@popserv3.redback.com>
Sender: owner-idr@merit.edu
Precedence: bulk

Hi, Steve:

> Subject: Cooperative Route Filtering Capability 
> Date: Sat, 20 Jul 2002 09:58:36 +0100
> Message-ID: <D2ECDCD8A528BF43A01EECCBEB191DD115ECEE@rs-eu-exc1.rs.riverstonenet.com>
> From: "Stephen Waters" <Stephen.Waters@riverstonenet.com>
> To: <idr@merit.edu>
> Sender: owner-idr@merit.edu
> Precedence: bulk
> 
> 
> What is the correct interpretation on filtering if I receive a =
> REMOVE_ALL/IMMEDIATE?  I delete all the received filters for that =
> AFI/SAFI, but is the default filter action still DENY?

REMOVE_ALL would remove all the entries of a particular ORF type.
As a result the filter no longer exists. As there is no filter to
be applied, the action should be PERMIT instead of DENY.

This is different from having implicit DENY for an existing filter.

-- Enke

> 
> 
> (my view is that the default action should be PERMIT at all times - or =
> at least settable).
> Regards, Steve.=20



Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id KAA26661 for <idr-archive@nic.merit.edu>; Mon, 22 Jul 2002 10:42:40 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 6D0249125A; Mon, 22 Jul 2002 10:42:13 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 369289125B; Mon, 22 Jul 2002 10:42:13 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 34B929125A for <idr@trapdoor.merit.edu>; Mon, 22 Jul 2002 10:42:12 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 110475DF93; Mon, 22 Jul 2002 10:42:12 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from demiurge.exodus.net (unknown [216.32.171.82]) by segue.merit.edu (Postfix) with ESMTP id 666E05DE6E for <idr@merit.edu>; Mon, 22 Jul 2002 10:42:11 -0400 (EDT)
Received: (from andrewl@localhost) by demiurge.exodus.net (8.9.3+Sun/8.9.3) id HAA13639; Mon, 22 Jul 2002 07:38:52 -0700 (PDT)
Date: Mon, 22 Jul 2002 07:38:52 -0700
From: andrewl@xix-w.bengi.exodus.net
To: Pedro Roque Marques <roque@juniper.net>
Cc: idr@merit.edu
Subject: Re: inform (was: IDR WG minutes IETF 54)
Message-ID: <20020722073852.B15842@demiurge.exodus.net>
References: <p05111a00b95d0a37ad7b@[133.93.77.124]> <200207190147.g6J1la176882@roque-bsd.juniper.net>
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
User-Agent: Mutt/1.2.5i
In-Reply-To: <200207190147.g6J1la176882@roque-bsd.juniper.net>; from roque@juniper.net on Thu, Jul 18, 2002 at 06:47:36PM -0700
Sender: owner-idr@merit.edu
Precedence: bulk

Pedro,

As an operator, I think that there are a number of useful places for an 
INFORM message, as a mecahanism to pass information between BGP speakers
that does not result in a session reset.  Two that come to mind are:

1) Max-prefix.  Currently we only have the option of a hard max-prefix
limit.  With an INFORM we could create a hard and a soft limit.  When
a peer hits the soft limit, among other things, it sends them an INFORM
telling them so.  The peer could then look at the issue, and if the
route growth is legitimate, call us to ask that the limit be increased.
This would limit unnecessary peering session resets that do occur due
to normal route growth.

2) As-confed seq/set propogation.  Although, I do think the default
behavior upon receipt of a type 3/4 from a non-confed peer should be
a session reset, the fact remains that doing so is not a current option
due to the installed base (at least not as a default option).  Under 
the current behavior of existing code, routes with this inapproprate
information are ignored.  With an INFORM, we can tell a peer that
route x is being ignored because it contains an as-confed set/seq.
At least that way they know there is a problem, and there exists
an automated mechanism above and beyond trying to track down their
NOC.

I'm sure others can think of additional uses.  So long as we specify
that no protocol action is to be taken upon receipt of INFORMs, I believe
that the utility justifys implementing the functionality.

Andrew

On Thu, Jul 18, 2002 at 06:47:36PM -0700, Pedro Roque Marques wrote:
> Delivered-To: idr-outgoing@trapdoor.merit.edu
> Delivered-To: idr@trapdoor.merit.edu
> Delivered-To: idr@merit.edu
> Date: Thu, 18 Jul 2002 18:47:36 -0700 (PDT)
> From: Pedro Roque Marques <roque@juniper.net>
> To: idr@merit.edu
> Subject: inform (was: IDR WG minutes IETF 54)
> In-Reply-To: <p05111a00b95d0a37ad7b@[133.93.77.124]>
> X-Mailer: VM 6.34 under 19.16 "Lille" XEmacs Lucid
> Precedence: bulk
> X-OriginalArrivalTime: 19 Jul 2002 01:49:37.0068 (UTC) FILETIME=[89D5AEC0:01C22EC6]
> 
> David Ward writes:
> 
> > INFORM message I. Gargi Nalawade presenter draft-nalawade-bgp-inform-00.txt 1.
> > Goals a. no way to tell remote
> > peer that there has been an error or event w/o use of NOTIFICATION
> > i. this resets peering session b. no change to existing
> > implementations c. limit sharing of messages between peers
> > 2. Encoding overview a. Packet format b. Event codes c. Data types
> > 3. Rx'ing the INFORM a. log it b. inform the administrator
> > (implementation specific) Question: How to use w/ NOTIFICATION?
> > Answer: Use it to inform that you are going to reset. Use
> > immediately before the NOTIFICATION but, now can add
> > PDU/ATTR/whatever to help debug.
> 
> > Question: Will this cause a change to the FSM?  Answser: No
> 
> > Questons: What is "too many routes option?" How to define?  Answer:
> > It is an implementation specific configuration
> 
> > Question: Jeff Pickering Should a peer take any specific action upon
> > reception of the INFORM?  The authors should explicitly prohibit any
> > automated action
> 
> > Statement: Andrew Lange Move forward draft
> 
> > Outcome: Sue Hares Accepted as WG item pending taking it to list
> 
> I object to making this a WG document.
> 
> I believe this mechanism is unnecessary and that no sufficient
> justification has been presented to add this to the spec. There should
> be a very compeling reason to be making changes so basic such as
> adding new messages.
> 
> A couple of comments on the details above:
> 
> > Question: How to use w/ NOTIFICATION?
> > Answer: Use it to inform that you are going to reset. Use
> > immediately before the NOTIFICATION but, now can add
> > PDU/ATTR/whatever to help debug.
> 
> How can that be of extra value given that notification allows you to
> carry variable lenght data ? Adding this message is just going to
> cause a debugging mess with all the deployed systems out there that do
> not implement it.
> 
> > Question: Jeff Pickering Should a peer take any specific action upon
> > reception of the INFORM?
> 
> Yes, reset the session, at least in the case that has been vastly
> discussed in the list which is receicept of invalid nlri. Furtunatly
> no code needs to be written to achieve that since it is the default
> action on a receipt of an invalid message code.
> 
>   Pedro.


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id EAA07611 for <idr-archive@nic.merit.edu>; Sat, 20 Jul 2002 04:59:07 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 7DA719120A; Sat, 20 Jul 2002 04:58:43 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 4156491211; Sat, 20 Jul 2002 04:58:43 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id EA5A69120A for <idr@trapdoor.merit.edu>; Sat, 20 Jul 2002 04:58:41 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id C28735DE2B; Sat, 20 Jul 2002 04:58:41 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from rs-eu-exc1.rs.riverstonenet.com (unknown [193.128.15.220]) by segue.merit.edu (Postfix) with ESMTP id 53E725DD93 for <idr@merit.edu>; Sat, 20 Jul 2002 04:58:41 -0400 (EDT)
content-class: urn:content-classes:message
MIME-Version: 1.0
Content-Type: text/plain; charset="iso-8859-1"
Subject: Cooperative Route Filtering Capability 
X-MimeOLE: Produced By Microsoft Exchange V6.0.5762.3
Date: Sat, 20 Jul 2002 09:58:36 +0100
Message-ID: <D2ECDCD8A528BF43A01EECCBEB191DD115ECEE@rs-eu-exc1.rs.riverstonenet.com>
X-MS-Has-Attach: 
X-MS-TNEF-Correlator: 
Thread-Topic: Cooperative Route Filtering Capability 
Thread-Index: AcIvzFGr1Y8TPJu8EdaEpADAT0MeLQ==
From: "Stephen Waters" <Stephen.Waters@riverstonenet.com>
To: <idr@merit.edu>
Sender: owner-idr@merit.edu
Precedence: bulk
Content-Transfer-Encoding: 8bit
X-MIME-Autoconverted: from quoted-printable to 8bit by nic.merit.edu id EAA07611

What is the correct interpretation on filtering if I receive a REMOVE_ALL/IMMEDIATE?  I delete all the received filters for that AFI/SAFI, but is the default filter action still DENY?


(my view is that the default action should be PERMIT at all times - or at least settable).
Regards, Steve. 


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id NAA07945 for <idr-archive@nic.merit.edu>; Fri, 19 Jul 2002 13:40:35 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 2F242912E9; Fri, 19 Jul 2002 13:39:19 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id B992B912EC; Fri, 19 Jul 2002 13:39:18 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 7BD01912E9 for <idr@trapdoor.merit.edu>; Fri, 19 Jul 2002 13:39:12 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 66E365DED6; Fri, 19 Jul 2002 13:39:12 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from merlot.juniper.net (natint.juniper.net [207.17.136.129]) by segue.merit.edu (Postfix) with ESMTP id D53A65DDBF for <idr@merit.edu>; Fri, 19 Jul 2002 13:39:11 -0400 (EDT)
Received: from roque-bsd.juniper.net (roque-bsd.juniper.net [172.17.12.183]) by merlot.juniper.net (8.11.3/8.11.3) with ESMTP id g6JHdAm94365; Fri, 19 Jul 2002 10:39:10 -0700 (PDT) (envelope-from roque@juniper.net)
Received: (from roque@localhost) by roque-bsd.juniper.net (8.11.6/8.9.3) id g6JHdA878269; Fri, 19 Jul 2002 10:39:10 -0700 (PDT) (envelope-from roque)
Date: Fri, 19 Jul 2002 10:39:10 -0700 (PDT)
Message-Id: <200207191739.g6JHdA878269@roque-bsd.juniper.net>
From: Pedro Roque Marques <roque@juniper.net>
MIME-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Transfer-Encoding: 7bit
To: andrey_movshin@agilent.com
Cc: idr@merit.edu
Subject: RFC 2545 (Link-Local)
In-Reply-To: <0D9185CE635BD511ACA50090277A6FCF0230C7A5@axcs18.cos.agilent.com>
References: <0D9185CE635BD511ACA50090277A6FCF0230C7A5@axcs18.cos.agilent.com>
X-Mailer: VM 6.34 under 19.16 "Lille" XEmacs Lucid
Sender: owner-idr@merit.edu
Precedence: bulk

andrey movshin writes:

> Dear Sirs/Madam,

> I'm working with BGP protocol for IPv6 routing and I'm a bit
> confused with sect.3. Constructing the Next Hop field of RFC 2545.

> Could you be so kind to clarify me the following:

> The RFC 2545 "Use of BGP-4 Multiprotocol Extensions for IPv6
> Inter-Domain Routing" says:

> "The link-local address shall be included in the Next Hop field if
> and only if the BGP speaker shares a common subnet with the entity
> identified by the global IPv6 address carried in the Network Address
> of Next Hop field and the peer the route is being advertised to.

You can look at it in another way...
whenever 2 BGP speakers share a link, and the advertising router uses
the global address on the link as a nexthop address it should also
include a link local address.

> In all other cases a BGP speaker shall advertise to its peer in the
> Network Address field only the global IPv6 address of the next hop
> (the value of the Length of Network Address of Next Hop field shall
> be set to 16)".

> If I have a configuration
> 
>              Router A (AS1)            Router B (AS1)               Router C (AS2)
>              3FFE:0:0:12::2  ------>   3FFE:0:0:22::1 --------->    3FFE:0:0:22::2                 
>                                        3FFE:0:0:12::1  __               
> 						                     |
> 						                     V
>                                            Router D (AS3)
>                                            3FFE:0:0:12::3
> 
> where router A and router B belong to AS1, router C belongs to AS2
> and router D belongs to AS3.

> Router A sends UPDATE to Router B with MP_REACH_NLRI containing
> NEXT_HOP 3FFE:0:0:12::2. I expect that router C should receive
> UPDATE with MP_REACH_NLRI containing NEXT_HOP 3FFE:0:0:22:1 (only
> Global) and router D should receive UPDATE with MP_REACH_NLRI
> containing NEXT_HOP 3FFE:0:0:12::21 (Global) and Link-Local. Am I
> correct?

It seems to me that in all your examples the nexthop address
advertised is the link address... if that is true the link-local
should be included always... but i'm probably not understanding your
diagram.

  Pedro.


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id JAA29706 for <idr-archive@nic.merit.edu>; Fri, 19 Jul 2002 09:33:09 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 8C52A9123E; Fri, 19 Jul 2002 09:32:50 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 2D54491240; Fri, 19 Jul 2002 09:32:50 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 2D2E59123E for <idr@trapdoor.merit.edu>; Fri, 19 Jul 2002 09:32:48 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 104D65DE64; Fri, 19 Jul 2002 09:32:48 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from xover.netplane.com (unknown [198.62.10.2]) by segue.merit.edu (Postfix) with ESMTP id 81B945DDA4 for <idr@merit.edu>; Fri, 19 Jul 2002 09:32:47 -0400 (EDT)
Received: by XOVER.dedham.mindspeed.com with Internet Mail Service (5.5.2653.19) id <NYGKSNCV>; Fri, 19 Jul 2002 09:32:37 -0400
Message-ID: <E7E13AAF2F3ED41197C100508BD6A32868847D@india_exch.hyderabad.mindspeed.com>
From: "Khanna, Vinay" <vinayk@netplane.com>
To: "'John G. Scudder'" <jgs@cisco.com>
Cc: "'idr@merit.edu'" <idr@merit.edu>
Subject: RE: "BGP graceful restart" clarifications
Date: Fri, 19 Jul 2002 09:35:03 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
Content-Type: text/plain; charset="iso-8859-1"
Sender: owner-idr@merit.edu
Precedence: bulk

At 2:16 AM -0400 7/19/02, Khanna, Vinay wrote:
>  >2) Does this draft address only planned BGP outages?
>
>It may apply to any outage.
>[Vinay]: Agreed. But in unplanned outages, ONLY the restart mechanism for
>the Restarting Speaker is applicable. The Receiving Speaker shall NOT
retain
>the routes received from the Restarting Speaker on detecting a connection
>closure; since the Forwarding State bit is 0. Thoughts?

No, the FS bit can be set, it depends on the restarting speaker.  The 
receiving speaker just takes action as given in the spec according to 
the FS bit, no additional cleverness is required. :-)
[Vinay]: Got it. Last para in section 6.1 misled me to believe so:)

>  >3) What happens when both the speakers advertise themselves as
'receiving
>>speakers' (by setting the BGP Restart State bit to FALSE)? Should usual
BGP
>>procedures as specified in draft-17 be followed in this scenario?
>
>They should both follow the procedures for "receiving speaker".
>[Vinay]: On revisiting the draft, it makes sense. The Receiving Speaker
>shall either flush the peer routes (draft-17 behaviour) OR shall mark them
>as stale routes (graceful restart behaviour) depending on the value of the
>Forwarding State bit. Thoughts?

Right.

--John


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id JAA29231 for <idr-archive@nic.merit.edu>; Fri, 19 Jul 2002 09:18:02 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 6166A9123E; Fri, 19 Jul 2002 09:17:49 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 2B4299126B; Fri, 19 Jul 2002 09:17:49 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id B8DA59123E for <idr@trapdoor.merit.edu>; Fri, 19 Jul 2002 09:16:23 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 99AB55DE64; Fri, 19 Jul 2002 09:16:23 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from cisco.com (uzura.cisco.com [64.102.17.77]) by segue.merit.edu (Postfix) with ESMTP id 2AB955DDA4 for <idr@merit.edu>; Fri, 19 Jul 2002 09:16:23 -0400 (EDT)
Received: from ruwhite-u10.cisco.com (ruwhite-u10.cisco.com [64.102.48.251]) by cisco.com (8.8.8/2.6/Cisco List Logging/8.8.8) with ESMTP id JAA26833; Fri, 19 Jul 2002 09:16:21 -0400 (EDT)
Date: Fri, 19 Jul 2002 09:16:21 -0400 (EDT)
From: Russ White <ruwhite@cisco.com>
Reply-To: Russ White <riw@cisco.com>
To: Pedro Roque Marques <roque@juniper.net>
Cc: idr@merit.edu
Subject: Re: inform (was: IDR WG minutes IETF 54)
In-Reply-To: <200207190147.g6J1la176882@roque-bsd.juniper.net>
Message-ID: <Pine.GSO.4.21.0207190911460.6970-100000@ruwhite-u10.cisco.com>
MIME-Version: 1.0
Content-Type: TEXT/PLAIN; charset=US-ASCII
Sender: owner-idr@merit.edu
Precedence: bulk

I think the inform message originally came from a desire to find
ways of _not_ resetting the session on simple, can be fixed
inline errors. It would provide you, as an implementor, and the
operators, based on your implementation, the option of doing
something _other_ than a session reset. What is done with the
information is clearly up to the operator and implementor....
Simple logging seems like a valuable enough thing to start with.

But simply adding the information should not be a big deal either
way.

:-)

Russ

On Thu, 18 Jul 2002, Pedro Roque Marques wrote:

> David Ward writes:
> 
> > INFORM message I. Gargi Nalawade presenter draft-nalawade-bgp-inform-00.txt 1.
> > Goals a. no way to tell remote
> > peer that there has been an error or event w/o use of NOTIFICATION
> > i. this resets peering session b. no change to existing
> > implementations c. limit sharing of messages between peers
> > 2. Encoding overview a. Packet format b. Event codes c. Data types
> > 3. Rx'ing the INFORM a. log it b. inform the administrator
> > (implementation specific) Question: How to use w/ NOTIFICATION?
> > Answer: Use it to inform that you are going to reset. Use
> > immediately before the NOTIFICATION but, now can add
> > PDU/ATTR/whatever to help debug.
> 
> > Question: Will this cause a change to the FSM?  Answser: No
> 
> > Questons: What is "too many routes option?" How to define?  Answer:
> > It is an implementation specific configuration
> 
> > Question: Jeff Pickering Should a peer take any specific action upon
> > reception of the INFORM?  The authors should explicitly prohibit any
> > automated action
> 
> > Statement: Andrew Lange Move forward draft
> 
> > Outcome: Sue Hares Accepted as WG item pending taking it to list
> 
> I object to making this a WG document.
> 
> I believe this mechanism is unnecessary and that no sufficient
> justification has been presented to add this to the spec. There should
> be a very compeling reason to be making changes so basic such as
> adding new messages.
> 
> A couple of comments on the details above:
> 
> > Question: How to use w/ NOTIFICATION?
> > Answer: Use it to inform that you are going to reset. Use
> > immediately before the NOTIFICATION but, now can add
> > PDU/ATTR/whatever to help debug.
> 
> How can that be of extra value given that notification allows you to
> carry variable lenght data ? Adding this message is just going to
> cause a debugging mess with all the deployed systems out there that do
> not implement it.
> 
> > Question: Jeff Pickering Should a peer take any specific action upon
> > reception of the INFORM?
> 
> Yes, reset the session, at least in the case that has been vastly
> discussed in the list which is receicept of invalid nlri. Furtunatly
> no code needs to be written to achieve that since it is the default
> action on a receipt of an invalid message code.
> 
>   Pedro.
> 

__________________________________
riw@cisco.com CCIE <>< Grace Alone




Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id JAA28752 for <idr-archive@nic.merit.edu>; Fri, 19 Jul 2002 09:03:09 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 249E691240; Fri, 19 Jul 2002 09:00:31 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id DA42491267; Fri, 19 Jul 2002 09:00:30 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 3069191240 for <idr@trapdoor.merit.edu>; Fri, 19 Jul 2002 09:00:29 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 6AD095DDA4; Fri, 19 Jul 2002 09:00:26 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from msgbas2.cos.agilent.com (msgbas2x.cos.agilent.com [192.25.240.37]) by segue.merit.edu (Postfix) with ESMTP id F3B485DEF5 for <idr@merit.edu>; Fri, 19 Jul 2002 09:00:25 -0400 (EDT)
Received: from msgrel1t.cos.agilent.com (msgrel1t.cos.agilent.com [130.29.152.157]) by msgbas2.cos.agilent.com (Postfix) with ESMTP id D51C11E64 for <idr@merit.edu>; Fri, 19 Jul 2002 07:00:23 -0600 (MDT)
Received: from axcsbh6.cos.agilent.com (axcsbh6.cos.agilent.com [130.29.152.171]) by msgrel1t.cos.agilent.com (Postfix) with SMTP id EB7E96A7 for <idr@merit.edu>; Fri, 19 Jul 2002 07:00:20 -0600 (MDT)
Received: from 130.29.152.171 by axcsbh6.cos.agilent.com (InterScan E-Mail VirusWall NT); Fri, 19 Jul 2002 07:00:08 -0600
Received: by axcsbh6.cos.agilent.com with Internet Mail Service (5.5.2653.19) id <3443CAJF>; Fri, 19 Jul 2002 07:00:08 -0600
Message-ID: <0D9185CE635BD511ACA50090277A6FCF0230C7A5@axcs18.cos.agilent.com>
From: andrey_movshin@agilent.com
To: idr@merit.edu
Subject:  RFC 2545 (Link-Local)
Date: Fri, 19 Jul 2002 07:00:06 -0600
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
Content-Type: text/plain; charset="iso-8859-1"
Sender: owner-idr@merit.edu
Precedence: bulk

Dear Sirs/Madam,

I'm working with BGP protocol for IPv6 routing and I'm a bit confused with sect.3. Constructing the Next Hop field of RFC 2545.

Could you be so kind to clarify me the following:

The RFC 2545 "Use of BGP-4 Multiprotocol Extensions for IPv6 Inter-Domain Routing" says:

"The link-local address shall be included in the Next Hop field if and
only if the BGP speaker shares a common subnet with the entity identified by the global IPv6 address carried in the Network Address
of Next Hop field and the peer the route is being advertised to.

In all other cases a BGP speaker shall advertise to its peer in the  Network Address field only the global IPv6 address of the next hop   (the value of the Length of Network Address of Next Hop field shall be set to 16)".

If I have a configuration

             Router A (AS1)            Router B (AS1)               Router C (AS2)
             3FFE:0:0:12::2  ------>   3FFE:0:0:22::1 --------->    3FFE:0:0:22::2                 
                                       3FFE:0:0:12::1  __               
						                     |
						                     V
                                           Router D (AS3)
                                           3FFE:0:0:12::3

where router A and router B belong to AS1, router C belongs to AS2 and router D belongs to AS3.

Router A sends UPDATE to Router B with MP_REACH_NLRI containing NEXT_HOP 3FFE:0:0:12::2. I expect that router C should receive UPDATE with MP_REACH_NLRI containing NEXT_HOP 3FFE:0:0:22:1 (only Global) and router D should receive UPDATE with MP_REACH_NLRI containing NEXT_HOP  3FFE:0:0:12::21 (Global) and Link-Local. Am I correct?                


Sincerely,


Andrey Movshin
Software Development Engineer
Qosnetics Product Operation
Agilent Technologies Inc.
75 Rochester Avenue, Unit 1
Portsmouth, NH 03801
andrey_movshin@agilent.com
603-559-5769




Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id IAA27156 for <idr-archive@nic.merit.edu>; Fri, 19 Jul 2002 08:13:29 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id F200191240; Fri, 19 Jul 2002 08:13:01 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id BFB0B9126B; Fri, 19 Jul 2002 08:13:00 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id B39EF91240 for <idr@trapdoor.merit.edu>; Fri, 19 Jul 2002 08:12:59 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 9E5C15DE9E; Fri, 19 Jul 2002 08:12:59 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from cisco.com (router.cisco.com [171.69.182.20]) by segue.merit.edu (Postfix) with ESMTP id D2C325DDA4 for <idr@merit.edu>; Fri, 19 Jul 2002 08:12:58 -0400 (EDT)
Received: from [192.168.42.10] (ssh-sjc-1.cisco.com [171.68.225.134]) by cisco.com (8.8.8/2.6/Cisco List Logging/8.8.8) with ESMTP id IAA09040; Fri, 19 Jul 2002 08:12:16 -0400 (EDT)
Mime-Version: 1.0
X-Sender: jgs@router
Message-Id: <p05111a1cb95db16797dc@[192.168.42.10]>
In-Reply-To:  <E7E13AAF2F3ED41197C100508BD6A328688475@india_exch.hyderabad.mindspeed.com >
References:  <E7E13AAF2F3ED41197C100508BD6A328688475@india_exch.hyderabad.mindspeed.com >
Date: Fri, 19 Jul 2002 08:11:23 -0400
To: "Khanna, Vinay" <vinayk@netplane.com>
From: "John G. Scudder" <jgs@cisco.com>
Subject: RE: "BGP graceful restart" clarifications
Cc: "'srihari@procket.com'" <srihari@procket.com>, "'yakov@juniper.net'" <yakov@juniper.net>, "'rex@procket.com'" <rex@procket.com>, "'enke@redback.com'" <enke@redback.com>, "'Alex Zinin'" <zinin@psg.com>, "Khanna, Vinay" <vinayk@netplane.com>, "'idr@merit.edu'" <idr@merit.edu>
Content-Type: text/plain; charset="us-ascii" ; format="flowed"
Sender: owner-idr@merit.edu
Precedence: bulk

At 2:16 AM -0400 7/19/02, Khanna, Vinay wrote:
>  >2) Does this draft address only planned BGP outages?
>
>It may apply to any outage.
>[Vinay]: Agreed. But in unplanned outages, ONLY the restart mechanism for
>the Restarting Speaker is applicable. The Receiving Speaker shall NOT retain
>the routes received from the Restarting Speaker on detecting a connection
>closure; since the Forwarding State bit is 0. Thoughts?

No, the FS bit can be set, it depends on the restarting speaker.  The 
receiving speaker just takes action as given in the spec according to 
the FS bit, no additional cleverness is required. :-)

>  >3) What happens when both the speakers advertise themselves as 'receiving
>>speakers' (by setting the BGP Restart State bit to FALSE)? Should usual BGP
>>procedures as specified in draft-17 be followed in this scenario?
>
>They should both follow the procedures for "receiving speaker".
>[Vinay]: On revisiting the draft, it makes sense. The Receiving Speaker
>shall either flush the peer routes (draft-17 behaviour) OR shall mark them
>as stale routes (graceful restart behaviour) depending on the value of the
>Forwarding State bit. Thoughts?

Right.

--John


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id FAA22313 for <idr-archive@nic.merit.edu>; Fri, 19 Jul 2002 05:48:39 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 2271791282; Fri, 19 Jul 2002 05:48:24 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id E2126912D2; Fri, 19 Jul 2002 05:48:23 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id AD92991282 for <idr@trapdoor.merit.edu>; Fri, 19 Jul 2002 05:48:22 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 97F445DDBF; Fri, 19 Jul 2002 05:48:22 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from xover.netplane.com (cnxt10002.conexant.com [198.62.10.2]) by segue.merit.edu (Postfix) with ESMTP id 1457D5DDA4 for <idr@merit.edu>; Fri, 19 Jul 2002 05:48:22 -0400 (EDT)
Received: by XOVER.dedham.mindspeed.com with Internet Mail Service (5.5.2653.19) id <NYGKSMWZ>; Fri, 19 Jul 2002 05:48:15 -0400
Message-ID: <E7E13AAF2F3ED41197C100508BD6A328688479@india_exch.hyderabad.mindspeed.com>
From: "Khanna, Vinay" <vinayk@netplane.com>
To: "'Stephen Waters'" <Stephen.Waters@riverstonenet.com>
Cc: idr@merit.edu, "Khanna, Vinay" <vinayk@netplane.com>
Subject: RE: "BGP graceful restart" clarifications
Date: Fri, 19 Jul 2002 05:50:42 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
Content-Type: text/plain; charset="iso-8859-1"
Sender: owner-idr@merit.edu
Precedence: bulk

Steve,

Thanks for the comments. Please find my responses inlined with [Vinay].

Regards.
Vinay:)

-----Original Message-----
From: Stephen Waters [mailto:Stephen.Waters@riverstonenet.com]
Sent: Friday, July 19, 2002 2:38 PM
To: Khanna, Vinay
Cc: idr@merit.edu
Subject: RE: "BGP graceful restart" clarifications


> [Vinay]: Agreed. But in unplanned outages, ONLY the restart 
> mechanism for
> the Restarting Speaker is applicable. The Receiving Speaker 
> shall NOT retain
> the routes received from the Restarting Speaker on detecting 
> a connection
> closure; since the Forwarding State bit is 0. Thoughts?
> 

The receiving speaker should take any failure of the peer of a 
signal to enter GR processing if the peer sent the GR option.
The restarting router then has 'restart-time' to reconnect to the 
receiving speaker before the stale routes on the the receiving speaker
are deleted.
[Vinay] Agreed.

Forwarding from the receiving speaker continues. 

Note :

"It is noted that the normal BGP procedures must be followed when the
   TCP session terminates due to the sending or receiving of a BGP
   NOTIFICATION message."
[Vinay] True.

So, the 'end' of the restarting peer was messy to some degree. Whether
the cpu was planned to bounce or not should not make any difference to 
the outcome -ideally.  
[Vinay]: I dont agree here. The Receiving Speaker behaviour is different in
these scenarios. During unplanned outages, the Restarting Speaker sets the
Forwarding State bit to 0. This forces the Receiving Speaker to delete the
stale routes after the session re-establishment. This will happen even if
the session is re-established within 'Restart Time'.
>From Section 6.2
   "Once the session is re-established, if the "Forwarding State" bit for
   a specific address family is not set in the newly received Graceful
   Restart Capability, or if a specific address family is not included
   in the newly received Graceful Restart Capability, or if the Graceful
   Restart Capability isn't received in the re-established session at
   all, then Receiving Speaker shall immediately remove all the stale
   routes from the peer that it is retaining for that address family."


Regards, Steve.


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id FAA21003 for <idr-archive@nic.merit.edu>; Fri, 19 Jul 2002 05:08:23 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 3A9B39123F; Fri, 19 Jul 2002 05:07:45 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 026339127B; Fri, 19 Jul 2002 05:07:44 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id BFA8D9123F for <idr@trapdoor.merit.edu>; Fri, 19 Jul 2002 05:07:43 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 9889F5DDB8; Fri, 19 Jul 2002 05:07:43 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from rs-eu-exc1.rs.riverstonenet.com (unknown [193.128.15.220]) by segue.merit.edu (Postfix) with ESMTP id 2F9685DDA4 for <idr@merit.edu>; Fri, 19 Jul 2002 05:07:43 -0400 (EDT)
content-class: urn:content-classes:message
MIME-Version: 1.0
Content-Type: text/plain; charset="iso-8859-1"
Subject: RE: "BGP graceful restart" clarifications
X-MimeOLE: Produced By Microsoft Exchange V6.0.5762.3
Date: Fri, 19 Jul 2002 10:07:38 +0100
Message-ID: <D2ECDCD8A528BF43A01EECCBEB191DD108F344@rs-eu-exc1.rs.riverstonenet.com>
X-MS-Has-Attach: 
X-MS-TNEF-Correlator: 
Thread-Topic: "BGP graceful restart" clarifications
Thread-Index: AcIu65zzyJVmpeQTRnKLP2VEpmATewAF5c8A
From: "Stephen Waters" <Stephen.Waters@riverstonenet.com>
To: "Khanna, Vinay" <vinayk@netplane.com>
Cc: <idr@merit.edu>
Sender: owner-idr@merit.edu
Precedence: bulk
Content-Transfer-Encoding: 8bit
X-MIME-Autoconverted: from quoted-printable to 8bit by nic.merit.edu id FAA21003

> [Vinay]: Agreed. But in unplanned outages, ONLY the restart 
> mechanism for
> the Restarting Speaker is applicable. The Receiving Speaker 
> shall NOT retain
> the routes received from the Restarting Speaker on detecting 
> a connection
> closure; since the Forwarding State bit is 0. Thoughts?
> 

The receiving speaker should take any failure of the peer of a 
signal to enter GR processing if the peer sent the GR option.
The restarting router then has 'restart-time' to reconnect to the 
receiving speaker before the stale routes on the the receiving speaker
are deleted.

Forwarding from the receiving speaker continues. 

Note :

"It is noted that the normal BGP procedures must be followed when the
   TCP session terminates due to the sending or receiving of a BGP
   NOTIFICATION message."

So, the 'end' of the restarting peer was messy to some degree. Whether
the cpu was planned to bounce or not should not make any difference to 
the outcome -ideally.  

Regards, Steve.


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id CAA15348 for <idr-archive@nic.merit.edu>; Fri, 19 Jul 2002 02:15:06 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 591EA9122C; Fri, 19 Jul 2002 02:14:37 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 2D0C191278; Fri, 19 Jul 2002 02:14:37 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id A75D29122C for <idr@trapdoor.merit.edu>; Fri, 19 Jul 2002 02:14:35 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 8E2165DED4; Fri, 19 Jul 2002 02:14:35 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from xover.netplane.com (cnxt10002.conexant.com [198.62.10.2]) by segue.merit.edu (Postfix) with ESMTP id ED30D5DDC2 for <idr@merit.edu>; Fri, 19 Jul 2002 02:14:34 -0400 (EDT)
Received: by XOVER.dedham.mindspeed.com with Internet Mail Service (5.5.2653.19) id <NYGKSMNG>; Fri, 19 Jul 2002 02:14:28 -0400
Message-ID: <E7E13AAF2F3ED41197C100508BD6A328688475@india_exch.hyderabad.mindspeed.com>
From: "Khanna, Vinay" <vinayk@netplane.com>
To: "'John G. Scudder'" <jgs@cisco.com>
Cc: "'srihari@procket.com'" <srihari@procket.com>, "'yakov@juniper.net'" <yakov@juniper.net>, "'rex@procket.com'" <rex@procket.com>, "'enke@redback.com'" <enke@redback.com>, "'Alex Zinin'" <zinin@psg.com>, "Khanna, Vinay" <vinayk@netplane.com>, "'idr@merit.edu'" <idr@merit.edu>
Subject: RE: "BGP graceful restart" clarifications
Date: Fri, 19 Jul 2002 02:16:23 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
Content-Type: text/plain; charset="iso-8859-1"
Sender: owner-idr@merit.edu
Precedence: bulk

John,

Thanks for your clarifications. Please find my responses inlined with
[Vinay].

Regards.
Vinay:)

-----Original Message-----
From: John G. Scudder [mailto:jgs@cisco.com]
Sent: Thursday, July 18, 2002 5:20 PM
To: Khanna, Vinay
Cc: 'srihari@procket.com'; 'yakov@juniper.net'; 'rex@procket.com';
'enke@redback.com'; 'Alex Zinin'; Khanna, Vinay
Subject: Re: "BGP graceful restart" clarifications


At 2:15 AM -0400 7/18/02, Khanna, Vinay wrote:
>Hello Everybody,
>
>I've some doubts with respect to the graceful restart mechanisms for BGP.
>
>1) Section 6.1 says
>"When the Restarting Speaker restarts, if possible it shall retain the
>forwarding state for the BGP routes in the Loc-RIB, and shall mark them as
>stale."
>This sentence gives an impression as if the Loc-RIB is retained in certain
>cases. Why is it so? Isn't it enough if ONLY forwarding state is preserved
>on the restarting speaker? Although the stale route marking mechanism is
>necessary on the receiving side (to avoid flapping), I dont feel it's
>necessity on the restarting side. It 's enough if the route-selection
>procedure is deferred on the restarting speaker to avoid unnecessary route
>advertisements as and when routes are received from receiving speakers. Am
I
>missing something? Please clarify.

As with any routing protocol spec, the spec describes behavior.  The 
implementation is up to you.  Is it possible to produce the described 
implementation using retained FIB information only?  I would say yes.
[Vinay]: That answers my concern.

(Many implementations don't physically implement distince Loc-RIB, 
Adj-RIB-In, etc, but they're still used for description in the base 
spec.)
[Vinay]: Section 3.2, draft-17 explicitely states
   "Although the conceptual model distinguishes between Adj-RIBs-In, Loc-
   RIB, and Adj-RIBs-Out, this neither implies nor requires that an
   implementation must maintain three separate copies of the routing
   information. The choice of implementation (for example, 3 copies of
   the information vs 1 copy with pointers) is not constrained by the
   protocol."

Shouldn't a similar statement be added in the GR draft?

>2) Does this draft address only planned BGP outages?

It may apply to any outage.
[Vinay]: Agreed. But in unplanned outages, ONLY the restart mechanism for
the Restarting Speaker is applicable. The Receiving Speaker shall NOT retain
the routes received from the Restarting Speaker on detecting a connection
closure; since the Forwarding State bit is 0. Thoughts?

>Can these mechanisms be
>applied to a subset of peers too?

Sure, although one might want to be judicious about exactly how one 
did such a mixed deployment.

>3) What happens when both the speakers advertise themselves as 'receiving
>speakers' (by setting the BGP Restart State bit to FALSE)? Should usual BGP
>procedures as specified in draft-17 be followed in this scenario?

They should both follow the procedures for "receiving speaker".
[Vinay]: On revisiting the draft, it makes sense. The Receiving Speaker
shall either flush the peer routes (draft-17 behaviour) OR shall mark them
as stale routes (graceful restart behaviour) depending on the value of the
Forwarding State bit. Thoughts?

>Thanks in advance,
>Vinay:)

Regards,

--John


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id VAA06565 for <idr-archive@nic.merit.edu>; Thu, 18 Jul 2002 21:48:32 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id EFAA891270; Thu, 18 Jul 2002 21:48:17 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 4325591273; Thu, 18 Jul 2002 21:48:04 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 696BA91271 for <idr@trapdoor.merit.edu>; Thu, 18 Jul 2002 21:47:39 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 3B6B85DE18; Thu, 18 Jul 2002 21:47:39 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from merlot.juniper.net (natint.juniper.net [207.17.136.129]) by segue.merit.edu (Postfix) with ESMTP id 96F295DDD0 for <idr@merit.edu>; Thu, 18 Jul 2002 21:47:36 -0400 (EDT)
Received: from roque-bsd.juniper.net (roque-bsd.juniper.net [172.17.12.183]) by merlot.juniper.net (8.11.3/8.11.3) with ESMTP id g6J1lam52550 for <idr@merit.edu>; Thu, 18 Jul 2002 18:47:36 -0700 (PDT) (envelope-from roque@juniper.net)
Received: (from roque@localhost) by roque-bsd.juniper.net (8.11.6/8.9.3) id g6J1la176882; Thu, 18 Jul 2002 18:47:36 -0700 (PDT) (envelope-from roque)
Date: Thu, 18 Jul 2002 18:47:36 -0700 (PDT)
Message-Id: <200207190147.g6J1la176882@roque-bsd.juniper.net>
From: Pedro Roque Marques <roque@juniper.net>
MIME-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Transfer-Encoding: 7bit
To: idr@merit.edu
Subject: inform (was: IDR WG minutes IETF 54)
In-Reply-To: <p05111a00b95d0a37ad7b@[133.93.77.124]>
References: <p05111a00b95d0a37ad7b@[133.93.77.124]>
X-Mailer: VM 6.34 under 19.16 "Lille" XEmacs Lucid
Sender: owner-idr@merit.edu
Precedence: bulk

David Ward writes:

> INFORM message I. Gargi Nalawade presenter draft-nalawade-bgp-inform-00.txt 1.
> Goals a. no way to tell remote
> peer that there has been an error or event w/o use of NOTIFICATION
> i. this resets peering session b. no change to existing
> implementations c. limit sharing of messages between peers
> 2. Encoding overview a. Packet format b. Event codes c. Data types
> 3. Rx'ing the INFORM a. log it b. inform the administrator
> (implementation specific) Question: How to use w/ NOTIFICATION?
> Answer: Use it to inform that you are going to reset. Use
> immediately before the NOTIFICATION but, now can add
> PDU/ATTR/whatever to help debug.

> Question: Will this cause a change to the FSM?  Answser: No

> Questons: What is "too many routes option?" How to define?  Answer:
> It is an implementation specific configuration

> Question: Jeff Pickering Should a peer take any specific action upon
> reception of the INFORM?  The authors should explicitly prohibit any
> automated action

> Statement: Andrew Lange Move forward draft

> Outcome: Sue Hares Accepted as WG item pending taking it to list

I object to making this a WG document.

I believe this mechanism is unnecessary and that no sufficient
justification has been presented to add this to the spec. There should
be a very compeling reason to be making changes so basic such as
adding new messages.

A couple of comments on the details above:

> Question: How to use w/ NOTIFICATION?
> Answer: Use it to inform that you are going to reset. Use
> immediately before the NOTIFICATION but, now can add
> PDU/ATTR/whatever to help debug.

How can that be of extra value given that notification allows you to
carry variable lenght data ? Adding this message is just going to
cause a debugging mess with all the deployed systems out there that do
not implement it.

> Question: Jeff Pickering Should a peer take any specific action upon
> reception of the INFORM?

Yes, reset the session, at least in the case that has been vastly
discussed in the list which is receicept of invalid nlri. Furtunatly
no code needs to be written to achieve that since it is the default
action on a receipt of an invalid message code.

  Pedro.


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id UAA04465 for <idr-archive@nic.merit.edu>; Thu, 18 Jul 2002 20:48:04 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 9B5709126B; Thu, 18 Jul 2002 20:45:32 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 66FE09126D; Thu, 18 Jul 2002 20:45:32 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id AAA519126B for <idr@trapdoor.merit.edu>; Thu, 18 Jul 2002 20:45:30 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 6A4B35DE4A; Thu, 18 Jul 2002 20:45:30 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from cisco.com (cfcentral.cisco.com [64.101.210.32]) by segue.merit.edu (Postfix) with ESMTP id E8C8E5DDF3 for <idr@merit.edu>; Thu, 18 Jul 2002 20:45:29 -0400 (EDT)
Received: from [133.93.77.124] (ssh-sjc-1.cisco.com [171.68.225.134]) by cisco.com (8.8.8/2.6/Cisco List Logging/8.8.8) with ESMTP id TAA16097; Thu, 18 Jul 2002 19:45:27 -0500 (CDT)
Mime-Version: 1.0
X-Sender: wardd@127.0.0.1 (Unverified)
Message-Id: <p05111a00b95d0a37ad7b@[133.93.77.124]>
Date: Thu, 18 Jul 2002 19:45:20 -0500
To: idr@merit.edu
From: David Ward <dward@cisco.com>
Subject: IDR WG minutes IETF 54
Cc: dward@cisco.com
Content-Type: text/plain; charset="us-ascii" ; format="flowed"
Sender: owner-idr@merit.edu
Precedence: bulk

NOTES: IDR WG meeting IETF 54
Scribe: David Ward

Agenda
	Route Oscillation
	Inform message type
	Extended Communities
	Document Status

Route Oscillation
I.	Anindya Basu Lucent presenter
	draft-wilfong-ibgp-oscillations-00.txt

	1. Brief Overview of Problem (Route Reflection example)
	2. Overview of Transient Route Oscillations
		a. proposal converges to stable situation
			i. potential problem in scenario where there 
could be an
			non-deterministic outcome
	3. Solution
		a. send xtra info
		b. if same aspath length, same loc_pref, diff AS: 
send both prefixes
	4. Correctness
		a. always converges
		b. loop free
		c. works for multi level RR and confeds
	5. Scaling
		a. don't need to always send xtra routes
		b. only advert when detect oscillations
			i. propose using low-pass filter to weed out 
restarts and detect
			high frequency osc.
	6. Issues seen in other drafts
		a. Walton
			i. doesn't solve transient and not all 
persistent oscillations
		b. Chen
			i. believes all clients must be fully meshed

QUESTION:
	How to detect oscillation?
ANSWER:
	Add a counter to each entry in FIB and a timer and use 
low-pass filter to determine if osc.
NOTE from author: If want to scale must fine some measure to detect oscillation


II. Alvaro Retana presenter:
	draft-walton-bgp-route-oscillation-stop-00.txt


	1. Brief Overview of Problem
		a. Type 1: single level RR/Confed
			i. solved in RFC 2796, 3065
		b. Type 2: multitier/hierarchical RR/Config
			i. must send multiple paths
			ii. always-compare-med considered unacceptable solution
		c. Oscillation stats from empirical analysis (2001 data)
			112086	 total routes
			9011	 candidates for oscillation (~10%)
			899           actually oscillating		(< 1%)
	2. Solution
		a. If RR or ASBR of confed
			i. send best path
			ii. send 'neighbor AS best path'
	3. Comparison
		a. Walton or Wilfong - requires oscillation detection
		b. Chen - not multi-level
	4. Next Steps
		a. determine oscillation detection mechanism - may be 
implementation specific
QUESTION: Will the number of oscillating routes increase in the future?
ANSWER: Potentially

QUESTION (Alex Zinin): What about detection mechanism?
ANSWER: YTBD

	draft-walton-bgp-add-paths-00.txt
	1. This extension is required for oscillation solution
		a. could be adopted as an additional solution above 
and beyond oscillation
	2. Proposal
		a. change NLRI encoding in 2858
			i. add bestpath bit - represents that path 
has been selected by speaker
			and put into FIB
		b. same change to 3107 for MPLs
			i. additional clarifications necessary
		c. add new capability advertisement
			i. multipath capability
			NOTE: must not use encoding in 3107
	3. Soon to be released new proposal
		a. remove multiprotocol linkage
		b. change encoding to flags (1 octet) in all specs:
			i. 1771, 2858, 3107
		c. can also solve route server functionality in 
addition to oscillation
	4. problems in 3107
		a. how to send multipath and encoding doesn't match text
		b. proposal is to change 3107 encoding to use new proposal
		NOTE: PPVPN owns 3107

III. Sue Hares presenter
	1. Mixed Issues w/ draft 17 -> 18 and FSM seen on list in 
dicussion of osc. drafts
		a. tie breaking rules
			i. e.g. cluster_id
	2. Oscillation drafts comparison and issues
		a. always-compare-med not adequate sol'n
			i. solution today is to restrict topology
			i. too difficult to coordinate MED values 
across multiple domains (asp)
	3. Presentation of Enke Chen's proposal
	draft-chen-route-oscillation-avoid-00.txt
		a. restrictions
			i. solves one level RR/Confed hierarchy
			ii. requires full mesh
		b. solutions
			i. announe best path rx'ed from clients
			ii. best path from all
			iii. calc IBEST and EBEST
			iv. announce according to rules in draft
Statement: Naiming Sheng: Enke Chen's proposal has fewer protocol and 
implementation changes
	4. How to proceed?
		a. vote for adoption
			i. no wilfong
			ii. some walton
			iii. few Chen
Statement: Alvaro Retana
	What is problem we are trying to solve?
	Enke's doesn't solve any problem
	Wilfong's solves problems that are not known to exist
	There is a real problem as there are multilevel topologies deployed
Statement: David Ward
	We know that Enke's proposal does not solve the problem in 
the protocol specification
		and his proposed solution is already covered in 
exisiting RFCs. Therefore, it should
		be removed from consideration.
Statement: Basu
	Must solve problem w/o toplogy restriction
	All possible topologies must be under consideration
	Protocol changes required
Statement: Robert Rasuk:
	Should separate add_path from oscillation solution as it 
solves other problems
Question: Sue Hares
	Should we solve the problem of only one level or multilevel 
topologies? Should we ask
	operators?
Question: Kireeti Kompella
	Is a survey of operators available? Where they forced to 
restrict topology?
Answer: Sue Hares
	In an informal survey Sue has found multilevel topologies 
deployed. Other operators
	restricted their topology but, didn't want to
Question: Amir
	If we are trying to solve a known problem why is there an issue?
Statement: David Ward
	We should move forward in working group today to adopt a 
draft or limit the numbers of
	drafts under consideration. IDR has identified and been 
working on this problem for
	over three years and still have no agreed upon solution 
leaving a large hole in the
	protocol specification. We should solve the problem that is 
known today and solve it
	fully and take on future problems as they arise. Only 
Walton's draft fits that requirement.
Statement: Basu
	Our draft does solve problems that do occur today and may 
occur. We cannot discount the
	ability to solve potential problems.
Statement: Alex Zinin
	Discussion must continue on list as there is no agreement. No 
working group action to be
	taken.

INFORM message
I. Gargi Nalawade presenter
draft-nalawade-bgp-inform-00.txt
	1. Goals
		a. no way to tell remote peer that there has been an 
error or event w/o use of
		NOTIFICATION
			i. this resets peering session
		b. no change to existing implementations
		c. limit sharing of messages between peers
	2. Encoding overview
		a. Packet format
		b. Event codes
		c. Data types
	3. Rx'ing the INFORM
		a. log it
		b. inform the administrator (implementation specific)
Question: How to use w/ NOTIFICATION?
Answer: Use it to inform that you are going to reset. Use immediately 
before the NOTIFICATION
but, now can add PDU/ATTR/whatever to help debug.

Question: Will this cause a change to the FSM?
Answser: No

Questons: What is "too many routes option?" How to define?
Answer: It is an implementation specific configuration

Question: Jeff Pickering
	Should a peer take any specific action upon reception of the INFORM?
	The authors should explicitly prohibit any automated action

Statement: Andrew Lange
	Move forward draft

Outcome: Sue Hares
	Accepted as WG item pending taking it to list


Extended Communities
1. Andrew Lange presenter
discussion on list is source of material
	1. Problem
		a. need to support IPV6
		b. could have more efficient packeing of values in data field
			i. see ptomain-bgp-redist
		c. clean for future data types
		d. support for locally defined community structures 
(types and subtypes)
	2. Improvements
		a. regular typed encoding over extended encoding
		b. value field becomes variable length
		c. type field and subtype field added
	3. Packet format
		a. see archive on list
		b. adds new base type
		c. type
		d. subtype
		e. locally defined types/subtypes
	4. Issues
		a. backward compatibility
		b. there is installed base (2547 deployments) of 
extended communities
Statement: Barry Friedman
	Make new attribute type
Question: Barry
	Does transitive mean across AS boundary?
Answer: Andrew
	Transitive means ti will cross boundary but, perhaps go no further
Queston: Geoff H.
	What is proposal to deal w/ changes in length? See historical 
issues w/ SNMP
	Recommend that the draft specifically states what to do.
		c. there are three ways to deal w/ backward compatibility
			i. there is a hack
				Only have new encoding beyond the 
first 4 codepoints (in ext.
				comm now)
				and then switch to new encoding
			ii. new attribute type
				Move forward existing ext comm work
				perhaps deprecate over time
Statement: Kireeti
	2 specs w/ new attr type is fine but, don't use word 
deprecate of orig ext. comm.
Statement: Gargi
	Don't use word deprecate as there are many installed bases

		d. Should we duplicate all ext comm into new 
attribute for IPV6?

Statement: Kireeti
	It wouuld be nice to do everything under new atttr but, have 
to create transition plan
	Need feedback from operators
Question: What happens if you have Route Target in old and new 
format? Perhaps dont' define in new attr
Answer: Andrew
	That is a transition and implementation issue but, we can add 
a statement on how to
	deal w/ duplicate information

Statement: Sue Hares
	WG action: Andrew to write new draft as new attribute and submit

Document status (Sue went too fast for me to get it all)
	1. draft 17 -> 18 based on FSM work
		a. skh wrote two new drafts
			i. statemt - state machine diagram
			ii. backoff - optional (in draft opt means optional)
Question: should there be a state machine and FSM table in draft?
Answer: Yes, there will be (concensus vote yes)
	2. V1 MIB working on security section
	3. V2 MIB going to LC
	4. various things in queue for RFC editor
	5. Various docs under discussion
	6. 2545bis
		a. MIB ready
		b. site local issues
		c. BGP ID issues
Statement: Alex Zinin
	What are consideratinos for site local addressing? Need to define:
		If needed
		How needed
		Where needed
	Alex requests WG to provide some clue on how to resolve issue in BGP.
Question: Sue Hares to Alex
	Can the IPV6 WG define site boundaries and scoping architecture?
	When will this be resolved?
Answer: Bill Fenner
	Perhaps soon. It would be good if IDR could help w/ 
architectural question if a site
	should be limited to within an AS, cross AS',etc

	7. Dynamic capability draft? Is it adequate?

Statement: Bill Fenner
	Cannot go to standards track w/ IPR statement as is. Must be 
removed to be standard

	8. Want implemenation report for Graceful Restart

	9. ORF to be discussed on mailing list

	10. Murphy's vulnerability draft to be IDR WG item

	11. Murphy's protection mechanisms to be RPSec WG item

Statement: Alex Zinin
Please help skh finish base spec. This needs to be completed for 
interoperable implementations.
Skh should not have to carry burden alone. Please help.



Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id SAA29934 for <idr-archive@nic.merit.edu>; Thu, 18 Jul 2002 18:23:44 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 4634F9126B; Thu, 18 Jul 2002 18:23:06 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 15E47912E4; Thu, 18 Jul 2002 18:23:06 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id B2E249126B for <idr@trapdoor.merit.edu>; Thu, 18 Jul 2002 18:23:04 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 9F1755DEF8; Thu, 18 Jul 2002 18:23:04 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from exch-srv.CoronaNetworks.com (unknown [205.219.34.104]) by segue.merit.edu (Postfix) with ESMTP id 510375DDF3 for <idr@merit.edu>; Thu, 18 Jul 2002 18:23:04 -0400 (EDT)
content-class: urn:content-classes:message
MIME-Version: 1.0
Content-Type: text/plain; charset="iso-8859-1"
Subject: RFC-2842 Encoding  BGP Capabilities
X-MimeOLE: Produced By Microsoft Exchange V6.0.5762.3
Date: Thu, 18 Jul 2002 15:20:57 -0700
Message-ID: <C61C9973831E2949A9AA57F3B0318464E31C6F@exch-srv.CoronaNetworks.com>
X-MS-Has-Attach: 
X-MS-TNEF-Correlator: 
Thread-Topic: RFC-2842 Encoding  BGP Capabilities
Thread-Index: AcIuqaKjkOZxQZp0Eda8nQABAx4zuQ==
From: "Ankur Goyal" <Ankur@coronanetworks.com>
To: <idr@merit.edu>
Sender: owner-idr@merit.edu
Precedence: bulk
Content-Transfer-Encoding: 8bit
X-MIME-Autoconverted: from quoted-printable to 8bit by nic.merit.edu id SAA29934

Hi,

Section 2 of RFC 2842 states that:

  This is an Optional Parameter that is used by a BGP speaker to convey
  to its BGP peer the list of capabilities supported by the speaker.

  The parameter contains one or more triples <Capability Code,
  Capability Length, Capability Value>
  ...
  <snip>
  ...
   A particular capability, as identified by its Capability Code, may
   occur more than once within the Optional Parameter.

I wonder which or both of the following encoding schemes are valid for advertising supported capabilities to a peer.

OP# - is the Optional Parameter number in the OPEN message
C   - represents the optional parameter type Capability
A   - represents the optional parameter for Authentication
c#  - represents individual Capabilities in the form code+length+value

(1) OP1-C:c1.c2.c3.c1 + OP2-A
(2) OP1-C:c1 + OP2-C:c2 + OP3-C:c3 + OP4-C:c1 + OP5-A


any clarification on this will be greatly appreciated.

Ankur





Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id SAA29655 for <idr-archive@nic.merit.edu>; Thu, 18 Jul 2002 18:15:19 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 90A9791269; Thu, 18 Jul 2002 18:14:38 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 5E58A912E4; Thu, 18 Jul 2002 18:14:38 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 2DC0F91269 for <idr@trapdoor.merit.edu>; Thu, 18 Jul 2002 18:14:37 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 0AAD45DDF3; Thu, 18 Jul 2002 18:14:37 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from mailgate.pit.comms.marconi.com (mailgate.pit.comms.marconi.com [169.144.68.6]) by segue.merit.edu (Postfix) with ESMTP id B1F095DDE1 for <idr@merit.edu>; Thu, 18 Jul 2002 18:14:36 -0400 (EDT)
Received: from mailman.pit.comms.marconi.com (mailman.pit.comms.marconi.com [169.144.2.12]) by mailgate.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id SAA20912 for <idr@merit.edu>; Thu, 18 Jul 2002 18:14:34 -0400 (EDT)
Received: from whq-msgrtr-01.pit.comms.marconi.com (whq-msgrtr-01.pit.comms.marconi.com [169.144.2.221]) by mailman.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id SAA16198 for <idr@merit.edu>; Thu, 18 Jul 2002 18:14:35 -0400 (EDT)
Received: by whq-msgrtr-01.pit.comms.marconi.com with Internet Mail Service (5.5.2650.21) id <3W985VT3>; Thu, 18 Jul 2002 18:14:34 -0400
Message-ID: <39469E08BD83D411A3D900204840EC558227BA@vie-msgusr-01.dc.fore.com>
From: "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
To: "'idr@merit.edu'" <idr@merit.edu>
Subject: FW: My BGP Route Update Pacing Draft
Date: Thu, 18 Jul 2002 18:14:33 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2650.21)
Content-Type: text/plain; charset="iso-8859-1"
Sender: owner-idr@merit.edu
Precedence: bulk

> -----Original Message-----
> From: Abarbanel, Benjamin 
> Sent: Thursday, July 18, 2002 6:10 PM
> To: 'Pedro Roque Marques'; Abarbanel, Benjamin
> Subject: RE: My BGP Route Update Pacing Draft
> 
> 
> Pedro:
> 
> > -----Original Message-----
> > From: Pedro Roque Marques [mailto:roque@juniper.net]
> > Sent: Thursday, July 18, 2002 5:28 PM
> > To: Abarbanel, Benjamin
> > Cc: 'idr@merit.edu'
> > Subject: My BGP Route Update Pacing Draft
> > 
> > 
> > Abarbanel, Benjamin writes:
> > 
> > 
> > Ben,
> > I'm my opinion your draft offers a much worse alternative 
> solution to
> > a problem that is already solved in a much more elegant way 
> using TCP
> > flow control mechanism.
> > 
>   I would agree only if you flow control only update packets. 
> But now we
>   introduce other packets like Capability Message, Inform 
> Message, etc.
>   they too will be flow controlled, right?
> 
>   If so, would it be a problem?
> 
> > To test the assertion above that the problem has been solved, take a
> > mature BGP implementation and congest the resources of receiver. You
> > will see that the sender will automatically throttle it's sending
> > proccess. Contrary to what you state, sessions are not dropped.
> > 
> > BGP when correctly implemented is a very resilient protocol 
> and it has
> > been widly deployed in very demanding environments.
> > 
> > I'm also including some comments inline...
> > 
> > > 2. Abstract
> > 
> > > This document defines a mechanism for controlling or limiting the
> > > rate at which BGP update messages are sent from one peer 
> to the next
> > > when a BGP peer experiences internal congestion. With the
> > > introduction of new dynamic BGP protocol capabilities 
> [CAP] message
> > > or other none BGP session destructive messages, it is necessary to
> > > limit the rate at which the BGP update messages are sent 
> > 
> > What difference does the introduction of capabilities do ?
> > 
>  Same as above.
> 
> > > without
> > > affecting the entire BGP session and without relying on the
> > > transport (TCP) layer
> > 
> > Why not relly on the transport. Flow control is after all
> > traditionally the function of the transport. What makes you think it
> > needs to be reinvented... and how is your version of flow control
> > supperior to the current one ?
> >
> My way allows the two peer to exchange more detail 
> information on the proper
> course of action during congestion, like how fast or how slow 
> to go. The 
> TCP flow control is a stop, start mechanism. Either the pipe 
> is open or closed.
> The other end that is being flow controlled does not know why 
> or by how much he
> will be flow controlled off. Remember on a long session with 
> many hops it takes significant time to communicate the flow 
> control condition by ACKing or not ACKing messages. For me 
> this is not a direct but indirect mechanism.
> In the interim, the pipe will be full of data.
> 
> With a pacing message you can add more intelligence to the 
> communication and convey directly whatever you want without 
> affecting the underlying TCP protocol.
>  
> > > to do so as a normal reaction to data
> > > congestion of the TCP session across the network.
> > 
> > [...]
> > 
> > > 4. Introduction
> > 
> > > This document defines a mechanism for controlling or limiting the
> > > rate at which BGP update messages are sent from one peer 
> to the next
> > > when a BGP peer experiences internal congestion. With the
> > > introduction of new dynamic BGP protocol capabilities 
> [CAP] message
> > > or other none BGP session destructive messages, it is necessary to
> > > limit the rate at which the BGP update messages are sent without
> > > affecting the entire BGP session and without relying on the
> > > transport (TCP) layer to do so as a normal reaction to data
> > > congestion of the TCP session across the network.
> > 
> > > When a router enters a state where either its CPU utilization is
> > > maximized (reaches close to 100%) or its memory is nearly depleted
> > > (less than 10% of memory left), it cannot handle new or heavy
> > > streams of updates from its peers and at times unable to send
> > > messages to its peer in a timely manner (within 30 seconds).
> > 
> > If a router cannot send a message for 30 seconds there is something
> > very wrong with it. It is not clear to me how your 
> mechanism prevents
> > that.
> 
> By starting the pacing early in the game you avoid the 
> implosion that might
> occur. Now, maybe you might say its an internal 
> implementation bug. I bet
> I can put the top vendor's routers under alot of update 
> stress and they will 
> implode. Especially with several hundred sessions.
> 
> 
> > 
> > > As a
> > > consequence it cannot keep current with the topological state. In
> > > some scenarios, a non-congested peer might want to negotiate new
> > > capabilities with a congested peer. The congested peer is so
> > > degraded that its TCP session goes into significant flow 
> control off
> > > conditions and is unable to see the new BGP messages. Peer to peer
> > > communication is severely hampered and as a result the uncongested
> > > peer will take corrective action when its hold timer expires and
> > > drops the session. The uncongested peer computes alternate routing
> > > paths that are suboptimal in distance or attribute and thus affect
> > > the forwarding decisions of all routers in this network. It is
> > > possible that the congested peer's routing (control) 
> plane is badly
> > > degraded but its forwarding plane is at normal working level. The
> > > mistake
> > 
> > What mistake ?
> 
> Recomputing the routes because control plane is not working 
> right. Yet, 
> forwarding plane is working perfect.
> 
> > 
> > > is made by the uncongested peer since it does not see any
> > > Keepalive/Update messages before its Hold timer expires and drops
> > > the session thereby dropping all its associated routes. 
> After routes
> > > are dropped, network instability occurs and suboptimal paths are
> > > used by the remaining peers.
> > 
> > The scenario that you present above is one where you 
> basically have a
> > peer that is totally broken. What does it matter ? A decent
> > implementation will choose to degrade it is service levels
> > gradually. It may achieve that with the current mechanisms.
> > 
> > > The MinRouteAdvertisementInternal and MinAsOriginationInterval
> > 
> > What does min advert interval have to do with the topic in
> > discussion... ? These control only the rate at which the sender may
> > attempt to generate updates, not the rate at which the 
> receiver needs
> > to accept it.
> >
> 
> True, but these timers cause the sender to slow down the rate 
> at which it
> sends the updates on an artificial basis with no regards to 
> current processing
> congestions or such. That is why I mentioned it, in case 
> someone would say
> we have these timers to control the sender why do we need 
> your solution.
>  
> > In BGP a receiver can choose to accept a anyrate of updates 
> it wants.
> >
> 
> Yes, it can by reading or not reading the TCP socket queue. 
> Which gets us
> back to the original point I mentioned earlier. TCP FLOW 
> Control is not
> perfect.
>  
> > 
> > > timers are inadequate since they are mostly implemented on a peer
> > > session basis and studies have shown when they are used they
> > > severely degrade route convergence time. The problem with 
> statically
> > > defined timers (initialized during system load or session
> > > establishment phase) is that they do not adjust to peer internal
> > > dynamically changing congestion conditions.
> > 
> > You are completly missing the point... 
> 
> Please explain what I missed. I found very little background 
> information on these
> timers.
> 
> I have a report done, that shows when the 
> MinRouteAdvertisemetInterval is used
> and set to 30 seconds it degrades the convergence time. When 
> set to 0 or 1, things work much better.
> 
> Check this out:
> http://www.ripe.net/ripe/meetings/archive/ripe-42/presentation
s/ripe42-eof-bgp/sld038.html

> > The problem with the
> > congested peer using TCP flow control to reduce its congestion
> > condition, is that it completely stops all incoming session traffic
> > and thus preventing any messages of high priority nature from being
> > seen.
> 
> What would be high priority in a BGP session... ?
> To my understanding there is no need to introduce priorities 
> in BGP messages.

I think by having dynamic messages that change the nature of the session
traffic, they need to have a sense of priority. You dont want to change
the type of NLRI (say from IPv4 to IPv6) and wait 10 minutes to see if it
happened. Since BGP is piggy backing so much other stuff in its update
messages
it only seems reasonable to put priority on these messages.

> 
> > This document presents a mechanism by which the BGP session of a
> > congested router need not be degraded to the level that
> > communication is broken with its peers.
> 
> Please do offer more to substantiate this other than your claim that
> the problem exists.
>
 
By the fact that the pacing control is more direct and implicit to a
particular
traffic type (in this case update messages). The pacing mechanism is
deterministic, whereas TCP flow control is not. You should talk to people
who build IPC mechanisms to replace TCP about this issue. 

whether the session breaks or badly degraded is an implementation specific
case. 
But I feel there is enough there to warrant the improvements.

The rest needs to be proving empiracly.


Ben


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id RAA28258 for <idr-archive@nic.merit.edu>; Thu, 18 Jul 2002 17:31:54 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 13B9C912E2; Thu, 18 Jul 2002 17:28:40 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id CCA4191243; Thu, 18 Jul 2002 17:28:39 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id F1099912E2 for <idr@trapdoor.merit.edu>; Thu, 18 Jul 2002 17:27:48 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id E08F15DDD0; Thu, 18 Jul 2002 17:27:48 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from merlot.juniper.net (natint.juniper.net [207.17.136.129]) by segue.merit.edu (Postfix) with ESMTP id 5AE5D5DDCC for <idr@merit.edu>; Thu, 18 Jul 2002 17:27:48 -0400 (EDT)
Received: from roque-bsd.juniper.net (roque-bsd.juniper.net [172.17.12.183]) by merlot.juniper.net (8.11.3/8.11.3) with ESMTP id g6ILRlm35551; Thu, 18 Jul 2002 14:27:47 -0700 (PDT) (envelope-from roque@juniper.net)
Received: (from roque@localhost) by roque-bsd.juniper.net (8.11.6/8.9.3) id g6ILRlg76449; Thu, 18 Jul 2002 14:27:47 -0700 (PDT) (envelope-from roque)
Date: Thu, 18 Jul 2002 14:27:47 -0700 (PDT)
Message-Id: <200207182127.g6ILRlg76449@roque-bsd.juniper.net>
From: Pedro Roque Marques <roque@juniper.net>
MIME-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Transfer-Encoding: 7bit
To: "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
Cc: "'idr@merit.edu'" <idr@merit.edu>
Subject: My BGP Route Update Pacing Draft
In-Reply-To: <39469E08BD83D411A3D900204840EC558227B4@vie-msgusr-01.dc.fore.com>
References: <39469E08BD83D411A3D900204840EC558227B4@vie-msgusr-01.dc.fore.com>
X-Mailer: VM 6.34 under 19.16 "Lille" XEmacs Lucid
Sender: owner-idr@merit.edu
Precedence: bulk

Abarbanel, Benjamin writes:


Ben,
I'm my opinion your draft offers a much worse alternative solution to
a problem that is already solved in a much more elegant way using TCP
flow control mechanism.

To test the assertion above that the problem has been solved, take a
mature BGP implementation and congest the resources of receiver. You
will see that the sender will automatically throttle it's sending
proccess. Contrary to what you state, sessions are not dropped.

BGP when correctly implemented is a very resilient protocol and it has
been widly deployed in very demanding environments.

I'm also including some comments inline...

> 2. Abstract

> This document defines a mechanism for controlling or limiting the
> rate at which BGP update messages are sent from one peer to the next
> when a BGP peer experiences internal congestion. With the
> introduction of new dynamic BGP protocol capabilities [CAP] message
> or other none BGP session destructive messages, it is necessary to
> limit the rate at which the BGP update messages are sent 

What difference does the introduction of capabilities do ?

> without
> affecting the entire BGP session and without relying on the
> transport (TCP) layer

Why not relly on the transport. Flow control is after all
traditionally the function of the transport. What makes you think it
needs to be reinvented... and how is your version of flow control
supperior to the current one ?

> to do so as a normal reaction to data
> congestion of the TCP session across the network.

[...]

> 4. Introduction

> This document defines a mechanism for controlling or limiting the
> rate at which BGP update messages are sent from one peer to the next
> when a BGP peer experiences internal congestion. With the
> introduction of new dynamic BGP protocol capabilities [CAP] message
> or other none BGP session destructive messages, it is necessary to
> limit the rate at which the BGP update messages are sent without
> affecting the entire BGP session and without relying on the
> transport (TCP) layer to do so as a normal reaction to data
> congestion of the TCP session across the network.

> When a router enters a state where either its CPU utilization is
> maximized (reaches close to 100%) or its memory is nearly depleted
> (less than 10% of memory left), it cannot handle new or heavy
> streams of updates from its peers and at times unable to send
> messages to its peer in a timely manner (within 30 seconds).

If a router cannot send a message for 30 seconds there is something
very wrong with it. It is not clear to me how your mechanism prevents
that.

> As a
> consequence it cannot keep current with the topological state. In
> some scenarios, a non-congested peer might want to negotiate new
> capabilities with a congested peer. The congested peer is so
> degraded that its TCP session goes into significant flow control off
> conditions and is unable to see the new BGP messages. Peer to peer
> communication is severely hampered and as a result the uncongested
> peer will take corrective action when its hold timer expires and
> drops the session. The uncongested peer computes alternate routing
> paths that are suboptimal in distance or attribute and thus affect
> the forwarding decisions of all routers in this network. It is
> possible that the congested peer's routing (control) plane is badly
> degraded but its forwarding plane is at normal working level. The
> mistake

What mistake ?

> is made by the uncongested peer since it does not see any
> Keepalive/Update messages before its Hold timer expires and drops
> the session thereby dropping all its associated routes. After routes
> are dropped, network instability occurs and suboptimal paths are
> used by the remaining peers.

The scenario that you present above is one where you basically have a
peer that is totally broken. What does it matter ? A decent
implementation will choose to degrade it is service levels
gradually. It may achieve that with the current mechanisms.

> The MinRouteAdvertisementInternal and MinAsOriginationInterval

What does min advert interval have to do with the topic in
discussion... ? These control only the rate at which the sender may
attempt to generate updates, not the rate at which the receiver needs
to accept it.

In BGP a receiver can choose to accept a anyrate of updates it wants.


> timers are inadequate since they are mostly implemented on a peer
> session basis and studies have shown when they are used they
> severely degrade route convergence time. The problem with statically
> defined timers (initialized during system load or session
> establishment phase) is that they do not adjust to peer internal
> dynamically changing congestion conditions.

You are completly missing the point... 

> The problem with the
> congested peer using TCP flow control to reduce its congestion
> condition, is that it completely stops all incoming session traffic
> and thus preventing any messages of high priority nature from being
> seen.

What would be high priority in a BGP session... ?
To my understanding there is no need to introduce priorities in BGP messages.

> This document presents a mechanism by which the BGP session of a
> congested router need not be degraded to the level that
> communication is broken with its peers.

Please do offer more to substantiate this other than your claim that
the problem exists.

  Pedro.


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id QAA26897 for <idr-archive@nic.merit.edu>; Thu, 18 Jul 2002 16:49:01 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id AE744912CD; Thu, 18 Jul 2002 16:48:43 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 77CFC912DE; Thu, 18 Jul 2002 16:48:43 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 18B20912CD for <idr@trapdoor.merit.edu>; Thu, 18 Jul 2002 16:48:04 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id DA0215DDF3; Thu, 18 Jul 2002 16:48:03 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from mailgate.pit.comms.marconi.com (mailgate.pit.comms.marconi.com [169.144.68.6]) by segue.merit.edu (Postfix) with ESMTP id 366B65DDE5 for <idr@merit.edu>; Thu, 18 Jul 2002 16:48:03 -0400 (EDT)
Received: from mailman.pit.comms.marconi.com (mailman.pit.comms.marconi.com [169.144.2.12]) by mailgate.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id QAA16387; Thu, 18 Jul 2002 16:48:01 -0400 (EDT)
Received: from whq-msgrtr-01.pit.comms.marconi.com (whq-msgrtr-01.pit.comms.marconi.com [169.144.2.221]) by mailman.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id QAA00231; Thu, 18 Jul 2002 16:48:02 -0400 (EDT)
Received: by whq-msgrtr-01.pit.comms.marconi.com with Internet Mail Service (5.5.2650.21) id <3W985RBF>; Thu, 18 Jul 2002 16:48:01 -0400
Message-ID: <39469E08BD83D411A3D900204840EC558227B6@vie-msgusr-01.dc.fore.com>
From: "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
To: "'vrishab sikand'" <v_sikand@yahoo.com>, "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>, "'idr@merit.edu'" <idr@merit.edu>
Subject: RE: My BGP Route Update Pacing Draft
Date: Thu, 18 Jul 2002 16:47:59 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2650.21)
Content-Type: multipart/alternative; boundary="----_=_NextPart_001_01C22E9C.66EA1D30"
Sender: owner-idr@merit.edu
Precedence: bulk

This message is in MIME format. Since your mail reader does not understand
this format, some or all of this message may not be legible.

------_=_NextPart_001_01C22E9C.66EA1D30
Content-Type: text/plain;
	charset="iso-8859-1"

Vrishab:

-----Original Message-----
From: vrishab sikand [mailto:v_sikand@yahoo.com]
Sent: Thursday, July 18, 2002 4:37 PM
To: Abarbanel, Benjamin; 'idr@merit.edu'
Subject: Re: My BGP Route Update Pacing Draft



Specific comment: 


Unless the rate of pacing is defined in absolute terms (as opposed to the a
subjectively for e.g 


yellow 4: reduce update message traffic by 40% of your normal rate). It will
be very difficult to verify compliance. 


Also in my experience, the "normal rate" varies drastically from vendor to
vendor. 


 What I meant here, whatever the message/second or route/second transmission
was as an average, reduce it by 40%. So if source router sent 100
messages/second it should now send 60 messages/second. That implies that the
sending router is keeping average statistics of its transmission rate to the
given peer on a message or route per second basis. I dont think this will be
too costly to do. 


General comment: 


 It appears to me that here is an attempt to solve an internal scheduling or
horse power problem with the help of external protocol.   


Yes in way it does, but it could also help scale older routers when they are
placed in a new environment where the loads exceed their previous tuned
expectations. You can think of this as a gracefull way to deal with abnormal
loading conditions without emploding internally.  


This mechanism may put undue burden on the transmitting router, which is in
the best interest of quick convergence in mind is sending prefixes as fast
as possible.   


I am not sure this is the case, it all depends on how people implement the
solution.  


  


Ben 


  "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com> wrote: 


Hi all:

Our recent discussions on this list and my recent work 
experiences have led me to write this draft and offer it 
to the IETF community. I would appreciate any comments 
anyone has to make.

Thanks in advance,
Ben







Network Working Group Ben Abarbanel
Internet Draft Marconi Communicatons
Expiration Date: December 2002 



BGP Route Update Pacing

draft-abarbanel-bgp-route-update-pacing-00.txt


1. Status of this Memo

This document is an Internet-Draft and is in full conformance with
all provisions of Section 10 of RFC2026 except that the right to
produce derivative works is not granted.

Internet-Drafts are working documents of the Internet Engineering
Task Force (IETF), its areas, and its working groups. Note that
other groups may also distribute working documents as Internet-
Drafts.

Internet-Drafts are draft documents valid for a maximum of six months
and may be updated, replaced, or obsoleted by other documents at any
time. It is inappropriate to use Internet-Drafts as reference
material or to cite them other than as ``work in progress.''

The list of current Internet-Drafts can be accessed at
http://www.ietf.org/ietf/1id-abstracts.txt

The list of Internet-Draft Shadow Directories can be accessed at
http://www.ietf.org/shadow.html.


2. Abstract

This document defines a mechanism for controlling or limiting the rate at 
which BGP update messages are sent from one peer to the next when a BGP peer

experiences internal congestion. With the introduction of new dynamic BGP 
protocol capabilities [CAP] message or other none BGP session destructive 
messages, it is necessary to limit the rate at which the BGP update messages

are sent without affecting the entire BGP session and without relying on the

transport (TCP) layer to do so as a normal reaction to data congestion of
the 
TCP session across the network.

3. Specification of Requirements

The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT",
"SHOULD", 
"SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this document are to
be 
interpreted as described in RFC 2119 [RFC2119].




Internet Draft draft-abarbanel-bgp-route-update-pacing-00.txt [Page 2]



4. Introduction

This document defines a mechanism for controlling or limiting the rate at 
which BGP update messages are sent from one peer to the next when a BGP peer

experiences internal congestion. With the introduction of new dynamic BGP 
protocol capabilities [CAP] message or other none BGP session destructive 
messages, it is necessary to limit the rate at which the BGP update messages

are sent without affecting the entire BGP session and without relying on the

transport (TCP) layer to do so as a normal reaction to data congestion of
the 
TCP session across the network.

When a router enters a state where either its CPU utilization is maximized 
(reaches close to 100%) or its memory is nearly depleted (less than 10% of 
memory left), it cannot handle new or heavy streams of updates from its
peers 
and at times unable to send messages to its peer in a timely manner (within 
30 seconds). As a consequence it cannot keep current with the topological 
state. In some scenarios, a non-congested peer might want to negotiate new 
capabilities with a congested peer. The congested peer is so degraded that 
its TCP session goes into significant flow control off conditions and is 
unable to see the new BGP messages. Peer to peer communication is severely 
hampered and as a result the uncongested peer will take corrective action 
when its hold timer expires and drops the session. The uncongested peer 
computes alternate routing paths that are suboptimal in distance or
attribute 
and thus affect the forwarding decisions of all routers in this network. It 
is possible that the congested peer's routing (control) plane is badly 
degraded but its forwarding plane is at normal working level. The mistake is

made by the uncongested peer since it does not see any Keepalive/Update 
messages before its Hold timer expires and drops the session thereby
dropping 
all its associated routes. After routes are dropped, network instability 
occurs and suboptimal paths are used by the remaining peers.

The MinRouteAdvertisementInternal and MinAsOriginationInterval timers are 
inadequate since they are mostly implemented on a peer session basis and 
studies have shown when they are used they severely degrade route
convergence 
time. The problem with statically defined timers (initialized during system 
load or session establishment phase) is that they do not adjust to peer 
internal dynamically changing congestion conditions. The problem with the 
congested peer using TCP flow control to reduce its congestion condition, is

that it completely stops all incoming session traffic and thus preventing
any 
messages of high priority nature from being seen.

TCP Out of Band Data or Urgent Message is one way to bypass the TCP flow 
control condition and allow high priority BGP messages to get to the 
congested peer, assuming it is able to read the session socket queue. This 
solution has its drawbacks as described in [UNIX-NET] chapter 21, p. 568. 








Internet Draft draft-abarbanel-bgp-route-update-pacing-00.txt [Page 3]



This document presents a mechanism by which the BGP session of a congested 
router need not be degraded to the level that communication is broken with 
its peers. By using a peer to peer pacing mechanism which allows one peer to

rate limit the number of update messages per second received from another 
peer, it can avoid the severe congestion conditions and process these 
messages at a manageable level. In addition, the uncongested peer has the 
knowledge to send high priority messages ahead of or in place of the normal 
high volume of update messages.

Usually, topological disturbances are spiky in nature and once they subside,

the network returns to its optimum path oriented level. Whatever caused the 
network to become unstable, such as routers handling too much data in a
small 
period of time or routers loosing their sessions or their links going up and

down, occurs in most live network on an infrequent basis. By using the
pacing 
mechanism as outlined in this draft, a BGP peer can prevent the serious 
congestion long before it is in trouble and thus ride through topological 
disturbances and still regain its stability without causing its sessions to 
drop or its routes/paths to be discarded and recomputed.

The assumption in this spec is that the source of most or all of the
internal 
BGP router congestion is due to the heavy reception of update messages from 
neighboring peers containing large number of routes.


5. BGP Update Pacing Mechanism

The BGP Update Message Pacing Mechanism is used to slow down the rate at 
which a peer sends update messages to another. This extension to the BGP 
protocol is simplified to use session non-destructive messages such as
INFORM 
as described in [INFORM]. The pacing is performed dynamically upon
congestion 
detection and subsidence and thus needs to use the new [INFORM] message that

will not infringe on the underlying BGP protocol or all its semantics and 
rules.


5.1 BGP Update Message Pacing (PACE) Dynamic Capability

The BGP Update Message Pacing capability is dynamically negotiated with all 
BGP speakers as type code=PACE (where, PACE=TBD) per TLV structures as 
defined in [BGP-CAP] of the OPEN message or anytime after session is 
established using the Dynamic Capability Message as defined [DYN-CAP] 
specification. All those peers that accept the PACE capability will be 
expected to support the new INFORM message as defined in [INFORM] 
specification to carry the pacing TLV structure. 









Internet Draft draft-abarbanel-bgp-route-update-pacing-00.txt [Page 4]



All PACE capable routers will provide a configuration option to their 
operator to enable the BGP Update Message Pacing mechanism on a per peer 
basis. 

Any peer wishing to withdraw the PACE capability can do so dynamically using

the Dynamic Capability message as outlined in [DYN-CAP] specification. Once 
withdrawn, affected peering session will remain intact but will not benefit 
from the performance improvements offered by the pacing mechanism.


5.2 Use of INFORM Message

The INFORM message as described in [INFORM] is used to carry the rate 
limiting (pacing TLV) control structure to neighboring peers. This 
information informs the peer that the current BGP router has entered a 
congestion state and it is to rate limit its transmission to the level 
specified. 

The INFORM message contains the following PACE TLV structure:

Type = Pacing Information, type=TBD 
Length = 2
1st Byte of Value = Cmd as shown below 
2nd Byte of Value = Level as shown below

Cmd Description 
------- -----------
11 Request to Pace update messages
12 ACK Response to Pace Update messages
13 NACK Response to Paace Update Messages

A. Request to Pace Update Messages Level Indication (code=11)

The rate is divided into sub levels in term of categories (Gray, Yellow, 
Orange, Red, and Green) to denote the level of pacing which is directly 
related to the level of congestion experienced within the congested peer.
The 
colors are used as simple handles used for flagging the severity of the 
condition. The colors are also used as indicators for display by the NMS to 
identify the rate limiting levels from any neighboring peer to the operator.

Associated Level Management MIBs are defined in section 6. 

The actual pacing control is done via the sub levels within these colors. 

Level Description
------ -----------
Gray (Entering minor congestion state)
1 Reduce update message traffic by 10% your normal rate.
2 Reduce update message traffic by 20% your normal rate.
3 Reduce update message traffic by 30% your normal rate.




Internet Draft draft-abarbanel-bgp-route-update-pacing-00.txt [Page 5]



Yellow (Entering medium congestion state)
4 Reduce update message traffic by 40% your normal rate. 
5 Reduce update message traffic by 50% your normal rate.
6 Reduce update message traffic by 60% your normal rate.

Orange (Entering major congestion state)
7 Reduce update message traffic by 70% your normal rate. 
8 Reduce update message traffic by 80% your normal rate.
9 Reduce update message traffic by 90% your normal rate

Red (Entering critical congestion state)
10 Complete cessation of all update message traffic (flow off). 
At this Level the receiving peer might decide to bypass the 
Congested Router and pick another less optimal router for the
affected Routes.

Green (Exiting any congestion state)
0 Restore update message traffic to your normal level (flow on).
Routes redirected can be resumed to the original peer.

The receiving peer will send an INFORM message with an ACK or NACK
Indicating 
it has received and understood the pacing request and will either comply
with 
the it or forbid it. If the congested peer receives a NACK, it should remove

that peer from its list of PACE capable peers but still maintaining its 
session without the use of Pacing. If the NACKing peer decides at a future 
date to re-enable the Pacing with the local peer, it will renegotiate the 
PACE capability with the local peer at that time.

B. Response Message Error Indication:

Level Description
------ -----------
0 ACK indication. The peer will comply with the 
Pacing level request.
1 NACK indication. The peer is unable to perform Pacing or comply 
with the pacing level requested.


5.3 INFORM Message Response Timer

When a congested peer sends an INFORM message with a "Request to Pace Update

Messages (code=11)" to all peers that support the pacing feature, it will 
also start an associated INFORM Message Response Timer. Any peer that does 
not respond within a 30 second timeout period with either an ACK/NACK INFORM

Response message, its associated session will be dropped. This is done to 
clear any PACE capable peers that are also congested to the point where
their 
communication with the local congested peer is severed.






Internet Draft draft-abarbanel-bgp-route-update-pacing-00.txt [Page 6]



5.4 Congestion Detection Within the Router

There are at least two ways congestion could be detected and measured by a 
BGP router.

- A high CPU utilization condition 

- A Lack of available memory to accept incoming BGP messages or inability 
to get memory to successfully complete BGP processing. 

5.4.1 CPU Utilization Based Level Computation

It is recommended that when the congested router detects its inability to 
perform route calculations or accept new BGP session messages at a normal 
rate meaning the current router's CPU utilization is more than 49%, it
should 
inform all its peers of a rate limiting level and slow them down
accordingly. 

The recommended way for computing the Level, based on percent CPU 
Utilization, is done using the following:

Level = (((%CPU Utilization - 50) x 2) + 5) / 10).
Where, % CPU Utilization is a whole integer number. Use Integer math 
here to remove any fractional value.

Note: If computed Level is negative, go to Green and set Level = 0. If 
Level = 0 send INFORM message with pacing Level = 0, implying return 
to normal (100%) update rate.

Level implies, whatever number of messages you transmitted per second 
before, transmit (100 - (Level x 10))% of that now. 

e.g. If you transmitted 100 messages/second before. A level 2 (20%) 
reduction will cause you to transmit 80 messages/second now.
Assumption is that these messages have an average size (1500 bytes).


5.4.2 Memory Allocation Based Level Computation

It is recommended that when the congested router detects its inability to 
perform route calculations or accept new BGP session messages at a normal 
rate meaning the current router's memory allocation is more than 69%, it 
should inform all its peers of a rate limiting level and slow them down 
accordingly. 

The recommended way for computing the Level, based on percent Memory 
Allocation, is done using the following:

Level = (((%Memory Allocated - 70) x 2) + 5) / 10).
Where, % Memory Allocated is a whole integer number. Use Integer math 
here to remove any fractional value.



Internet Draft draft-abarbanel-bgp-route-update-pacing-00.txt [Page 7]



Note: If computed Level is negative, go to Green sub-group and set 
Level = 0. If Level = 0 send INFORM message with pacing Level = 0, 
implying return to normal (100%) update rate.


5.5 INFORM Message Throttling

Once the congested peer receives acknowledgement from another peer, it will 
send a modification INFORM message with a new Level to that peer after the 
computed pacing Level changes by at least 1 value. This will amount to no 
more than one INFORM modification message every 5 seconds. This is done to 
debounce any spiky bursts of INFORM messages to all PACE negotiated peers 
each time the computed pacing Level changes. Depending on vendor 
implementations, the internal utilization levels could change at the 
Microsecond or Millisecond rate.


6. Implementation Specific Mechanisms

The Memory Allocation and CPU Utilization Level detection algorithms 
discussed in section 5.4.1 and 5.4.2 are suggested ways one can implement 
these solutions. However, each vendor can implement a unique Memory 
Allocation and CPU Utilization Level detection algorithms that best suits 
his/her needs and will not negatively impact the overall BGP Route Update 
Pacing mechanism described in this spec. Any issues relating to internal 
implementation algorithms are outside the scope of this document.


7. Level Indicators Management MIBs

TBD


8. Security Considerations

This extension to BGP does not change the underlying security issues.


9. References

[BGP-CAP] Chandra, R., Scudder, J., "Capabilities Advertisement with 
BGP-4", draft-ietf-idr-rfc2842bis-02.txt

[DYN-CAP] Chen E., Sangli S., "Dynamic Capability for BGP-4", 
draft-ietf-idr-dynamic-cap-02.txt, October 2002.

[INFORM] Nalawade G., Scudder J., "BGPv4 INFORM Message",
draft-nalawade-bgp-inform-00.txt, December 2002

[BGP-4] Rekhter, Y., and T. Li, "A Border Gateway Protocol 4 
(BGP-4)", RFC 1771, March 1995.


Internet Draft draft-abarbanel-bgp-route-update-pacing-00.txt [Page 8]



[BGP-4-DRAFT] Rekhter, Y. and T. Li (editors), "A Border Gateway Protocol 4
(BGP-4)", Internet Draft draft-ietf-idr-bgp4-18.txt, 
January 2002.

[UNIX-NET] Stevens, R., "UNIX Network Programming, Vol 1", Second Edition
1998 Prentice Hall, Inc.


10. Author Information

Ben Abarbanel
Marconi Communications
1595 Spring Hill Road, 5th Floor
Vienna, VA 22182
Email: benjamin.abarbanel@marconi.com







   _____  

Do You Yahoo!?
Yahoo! Autos <http://autos.yahoo.com/>  - Get free new car price quotes


------_=_NextPart_001_01C22E9C.66EA1D30
Content-Type: text/html;
	charset="iso-8859-1"

<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
<HTML><HEAD>
<META HTTP-EQUIV="Content-Type" CONTENT="text/html; charset=iso-8859-1">


<META content="MSHTML 5.00.2920.0" name=GENERATOR></HEAD>
<BODY>
<DIV><FONT color=#0000ff face=Arial size=2><SPAN 
class=316534120-18072002>Vrishab:</SPAN></FONT></DIV>
<BLOCKQUOTE 
style="BORDER-LEFT: #0000ff 2px solid; MARGIN-LEFT: 5px; PADDING-LEFT: 5px">
  <DIV align=left class=OutlookMessageHeader dir=ltr><FONT face=Tahoma 
  size=2>-----Original Message-----<BR><B>From:</B> vrishab sikand 
  [mailto:v_sikand@yahoo.com]<BR><B>Sent:</B> Thursday, July 18, 2002 4:37 
  PM<BR><B>To:</B> Abarbanel, Benjamin; 'idr@merit.edu'<BR><B>Subject:</B> Re: 
  My BGP Route Update Pacing Draft<BR><BR></DIV></FONT>
  <P>Specific comment: 
  <P>Unless the rate of pacing is defined in absolute terms (as opposed to the a 
  subjectively for e.g 
  <P>yellow 4: reduce update message traffic by 40% of your normal rate). It 
  will be very difficult to verify compliance. 
  <P>Also in my experience, the "normal rate" varies drastically from vendor to 
  vendor. 
  <P>&nbsp;<FONT color=#0000ff face=Arial size=2><SPAN 
  class=316534120-18072002>What I meant here, whatever the message/second or 
  route/second transmission was as an average, reduce it by 40%. So if source 
  router sent 100 messages/second&nbsp;it should now send 60 messages/second. 
  That implies that the sending router is keeping average statistics of its 
  transmission rate to the given peer on a message or route per second basis. I 
  dont think this will be too costly to do.</SPAN></FONT>
  <P>General comment: 
  <P>&nbsp;It appears to me that here is an attempt to solve an internal 
  scheduling or horse power problem with the help of external 
  protocol.&nbsp;<FONT color=#0000ff face=Arial size=2><SPAN 
  class=316534120-18072002>&nbsp;</SPAN></FONT>
  <P><FONT color=#0000ff face=Arial size=2><SPAN class=316534120-18072002>Yes in 
  way it does, but it could also help scale older routers when they are placed 
  in a new environment where the loads exceed their previous tuned 
  expectations.&nbsp;You can think of this as a gracefull way to deal with 
  abnormal loading conditions without emploding internally.&nbsp;</SPAN></FONT>
  <P>This mechanism may put undue burden on the transmitting router, which is in 
  the best interest of quick convergence in mind is sending prefixes as fast as 
  possible.&nbsp;<FONT color=#0000ff face=Arial size=2><SPAN 
  class=316534120-18072002>&nbsp;</SPAN></FONT>
  <P><FONT color=#0000ff face=Arial size=2><SPAN class=316534120-18072002>I am 
  not sure this is the case, it all depends on how people implement the 
  solution.&nbsp;</SPAN></FONT>
  <P><FONT color=#0000ff face=Arial size=2><SPAN 
  class=316534120-18072002></SPAN></FONT>&nbsp;
  <P><FONT color=#0000ff face=Arial size=2><SPAN 
  class=316534120-18072002>Ben</SPAN></FONT>
  <P>&nbsp; <B><I>"Abarbanel, Benjamin" 
  &lt;Benjamin.Abarbanel@Marconi.com&gt;</I></B> wrote: 
  <BLOCKQUOTE 
  style="BORDER-LEFT: #1010ff 2px solid; MARGIN-LEFT: 5px; PADDING-LEFT: 5px">Hi 
    all:<BR><BR>Our recent discussions on this list and my recent work 
    <BR>experiences have led me to write this draft and offer it <BR>to the IETF 
    community. I would appreciate any comments <BR>anyone has to 
    make.<BR><BR>Thanks in 
    advance,<BR>Ben<BR><BR><BR><BR><BR><BR><BR><BR>Network Working Group Ben 
    Abarbanel<BR>Internet Draft Marconi Communicatons<BR>Expiration Date: 
    December 2002 <BR><BR><BR><BR>BGP Route Update 
    Pacing<BR><BR>draft-abarbanel-bgp-route-update-pacing-00.txt<BR><BR><BR>1. 
    Status of this Memo<BR><BR>This document is an Internet-Draft and is in full 
    conformance with<BR>all provisions of Section 10 of RFC2026 except that the 
    right to<BR>produce derivative works is not granted.<BR><BR>Internet-Drafts 
    are working documents of the Internet Engineering<BR>Task Force (IETF), its 
    areas, and its working groups. Note that<BR>other groups may also distribute 
    working documents as Internet-<BR>Drafts.<BR><BR>Internet-Drafts are draft 
    documents valid for a maximum of six months<BR>and may be updated, replaced, 
    or obsoleted by other documents at any<BR>time. It is inappropriate to use 
    Internet-Drafts as reference<BR>material or to cite them other than as 
    ``work in progress.''<BR><BR>The list of current Internet-Drafts can be 
    accessed at<BR>http://www.ietf.org/ietf/1id-abstracts.txt<BR><BR>The list of 
    Internet-Draft Shadow Directories can be accessed 
    at<BR>http://www.ietf.org/shadow.html.<BR><BR><BR>2. Abstract<BR><BR>This 
    document defines a mechanism for controlling or limiting the rate at 
    <BR>which BGP update messages are sent from one peer to the next when a BGP 
    peer <BR>experiences internal congestion. With the introduction of new 
    dynamic BGP <BR>protocol capabilities [CAP] message or other none BGP 
    session destructive <BR>messages, it is necessary to limit the rate at which 
    the BGP update messages <BR>are sent without affecting the entire BGP 
    session and without relying on the <BR>transport (TCP) layer to do so as a 
    normal reaction to data congestion of the <BR>TCP session across the 
    network.<BR><BR>3. Specification of Requirements<BR><BR>The key words 
    "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT", "SHOULD", <BR>"SHOULD 
    NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this document are to be 
    <BR>interpreted as described in RFC 2119 
    [RFC2119].<BR><BR><BR><BR><BR>Internet Draft 
    draft-abarbanel-bgp-route-update-pacing-00.txt [Page 2]<BR><BR><BR><BR>4. 
    Introduction<BR><BR>This document defines a mechanism for controlling or 
    limiting the rate at <BR>which BGP update messages are sent from one peer to 
    the next when a BGP peer <BR>experiences internal congestion. With the 
    introduction of new dynamic BGP <BR>protocol capabilities [CAP] message or 
    other none BGP session destructive <BR>messages, it is necessary to limit 
    the rate at which the BGP update messages <BR>are sent without affecting the 
    entire BGP session and without relying on the <BR>transport (TCP) layer to 
    do so as a normal reaction to data congestion of the <BR>TCP session across 
    the network.<BR><BR>When a router enters a state where either its CPU 
    utilization is maximized <BR>(reaches close to 100%) or its memory is nearly 
    depleted (less than 10% of <BR>memory left), it cannot handle new or heavy 
    streams of updates from its peers <BR>and at times unable to send messages 
    to its peer in a timely manner (within <BR>30 seconds). As a consequence it 
    cannot keep current with the topological <BR>state. In some scenarios, a 
    non-congested peer might want to negotiate new <BR>capabilities with a 
    congested peer. The congested peer is so degraded that <BR>its TCP session 
    goes into significant flow control off conditions and is <BR>unable to see 
    the new BGP messages. Peer to peer communication is severely <BR>hampered 
    and as a result the uncongested peer will take corrective action <BR>when 
    its hold timer expires and drops the session. The uncongested peer 
    <BR>computes alternate routing paths that are suboptimal in distance or 
    attribute <BR>and thus affect the forwarding decisions of all routers in 
    this network. It <BR>is possible that the congested peer's routing (control) 
    plane is badly <BR>degraded but its forwarding plane is at normal working 
    level. The mistake is <BR>made by the uncongested peer since it does not see 
    any Keepalive/Update <BR>messages before its Hold timer expires and drops 
    the session thereby dropping <BR>all its associated routes. After routes are 
    dropped, network instability <BR>occurs and suboptimal paths are used by the 
    remaining peers.<BR><BR>The MinRouteAdvertisementInternal and 
    MinAsOriginationInterval timers are <BR>inadequate since they are mostly 
    implemented on a peer session basis and <BR>studies have shown when they are 
    used they severely degrade route convergence <BR>time. The problem with 
    statically defined timers (initialized during system <BR>load or session 
    establishment phase) is that they do not adjust to peer <BR>internal 
    dynamically changing congestion conditions. The problem with the 
    <BR>congested peer using TCP flow control to reduce its congestion 
    condition, is <BR>that it completely stops all incoming session traffic and 
    thus preventing any <BR>messages of high priority nature from being 
    seen.<BR><BR>TCP Out of Band Data or Urgent Message is one way to bypass the 
    TCP flow <BR>control condition and allow high priority BGP messages to get 
    to the <BR>congested peer, assuming it is able to read the session socket 
    queue. This <BR>solution has its drawbacks as described in [UNIX-NET] 
    chapter 21, p. 568. <BR><BR><BR><BR><BR><BR><BR><BR><BR>Internet Draft 
    draft-abarbanel-bgp-route-update-pacing-00.txt [Page 3]<BR><BR><BR><BR>This 
    document presents a mechanism by which the BGP session of a congested 
    <BR>router need not be degraded to the level that communication is broken 
    with <BR>its peers. By using a peer to peer pacing mechanism which allows 
    one peer to <BR>rate limit the number of update messages per second received 
    from another <BR>peer, it can avoid the severe congestion conditions and 
    process these <BR>messages at a manageable level. In addition, the 
    uncongested peer has the <BR>knowledge to send high priority messages ahead 
    of or in place of the normal <BR>high volume of update 
    messages.<BR><BR>Usually, topological disturbances are spiky in nature and 
    once they subside, <BR>the network returns to its optimum path oriented 
    level. Whatever caused the <BR>network to become unstable, such as routers 
    handling too much data in a small <BR>period of time or routers loosing 
    their sessions or their links going up and <BR>down, occurs in most live 
    network on an infrequent basis. By using the pacing <BR>mechanism as 
    outlined in this draft, a BGP peer can prevent the serious <BR>congestion 
    long before it is in trouble and thus ride through topological 
    <BR>disturbances and still regain its stability without causing its sessions 
    to <BR>drop or its routes/paths to be discarded and recomputed.<BR><BR>The 
    assumption in this spec is that the source of most or all of the internal 
    <BR>BGP router congestion is due to the heavy reception of update messages 
    from <BR>neighboring peers containing large number of routes.<BR><BR><BR>5. 
    BGP Update Pacing Mechanism<BR><BR>The BGP Update Message Pacing Mechanism 
    is used to slow down the rate at <BR>which a peer sends update messages to 
    another. This extension to the BGP <BR>protocol is simplified to use session 
    non-destructive messages such as INFORM <BR>as described in [INFORM]. The 
    pacing is performed dynamically upon congestion <BR>detection and subsidence 
    and thus needs to use the new [INFORM] message that <BR>will not infringe on 
    the underlying BGP protocol or all its semantics and 
    <BR>rules.<BR><BR><BR>5.1 BGP Update Message Pacing (PACE) Dynamic 
    Capability<BR><BR>The BGP Update Message Pacing capability is dynamically 
    negotiated with all <BR>BGP speakers as type code=PACE (where, PACE=TBD) per 
    TLV structures as <BR>defined in [BGP-CAP] of the OPEN message or anytime 
    after session is <BR>established using the Dynamic Capability Message as 
    defined [DYN-CAP] <BR>specification. All those peers that accept the PACE 
    capability will be <BR>expected to support the new INFORM message as defined 
    in [INFORM] <BR>specification to carry the pacing TLV structure. 
    <BR><BR><BR><BR><BR><BR><BR><BR><BR><BR>Internet Draft 
    draft-abarbanel-bgp-route-update-pacing-00.txt [Page 4]<BR><BR><BR><BR>All 
    PACE capable routers will provide a configuration option to their 
    <BR>operator to enable the BGP Update Message Pacing mechanism on a per peer 
    <BR>basis. <BR><BR>Any peer wishing to withdraw the PACE capability can do 
    so dynamically using <BR>the Dynamic Capability message as outlined in 
    [DYN-CAP] specification. Once <BR>withdrawn, affected peering session will 
    remain intact but will not benefit <BR>from the performance improvements 
    offered by the pacing mechanism.<BR><BR><BR>5.2 Use of INFORM 
    Message<BR><BR>The INFORM message as described in [INFORM] is used to carry 
    the rate <BR>limiting (pacing TLV) control structure to neighboring peers. 
    This <BR>information informs the peer that the current BGP router has 
    entered a <BR>congestion state and it is to rate limit its transmission to 
    the level <BR>specified. <BR><BR>The INFORM message contains the following 
    PACE TLV structure:<BR><BR>Type = Pacing Information, type=TBD <BR>Length = 
    2<BR>1st Byte of Value = Cmd as shown below <BR>2nd Byte of Value = Level as 
    shown below<BR><BR>Cmd Description <BR>------- -----------<BR>11 Request to 
    Pace update messages<BR>12 ACK Response to Pace Update messages<BR>13 NACK 
    Response to Paace Update Messages<BR><BR>A. Request to Pace Update Messages 
    Level Indication (code=11)<BR><BR>The rate is divided into sub levels in 
    term of categories (Gray, Yellow, <BR>Orange, Red, and Green) to denote the 
    level of pacing which is directly <BR>related to the level of congestion 
    experienced within the congested peer. The <BR>colors are used as simple 
    handles used for flagging the severity of the <BR>condition. The colors are 
    also used as indicators for display by the NMS to <BR>identify the rate 
    limiting levels from any neighboring peer to the operator. <BR>Associated 
    Level Management MIBs are defined in section 6. <BR><BR>The actual pacing 
    control is done via the sub levels within these colors. <BR><BR>Level 
    Description<BR>------ -----------<BR>Gray (Entering minor congestion 
    state)<BR>1 Reduce update message traffic by 10% your normal rate.<BR>2 
    Reduce update message traffic by 20% your normal rate.<BR>3 Reduce update 
    message traffic by 30% your normal rate.<BR><BR><BR><BR><BR>Internet Draft 
    draft-abarbanel-bgp-route-update-pacing-00.txt [Page 
    5]<BR><BR><BR><BR>Yellow (Entering medium congestion state)<BR>4 Reduce 
    update message traffic by 40% your normal rate. <BR>5 Reduce update message 
    traffic by 50% your normal rate.<BR>6 Reduce update message traffic by 60% 
    your normal rate.<BR><BR>Orange (Entering major congestion state)<BR>7 
    Reduce update message traffic by 70% your normal rate. <BR>8 Reduce update 
    message traffic by 80% your normal rate.<BR>9 Reduce update message traffic 
    by 90% your normal rate<BR><BR>Red (Entering critical congestion 
    state)<BR>10 Complete cessation of all update message traffic (flow off). 
    <BR>At this Level the receiving peer might decide to bypass the 
    <BR>Congested Router and pick another less optimal router for 
    the<BR>affected Routes.<BR><BR>Green (Exiting any congestion state)<BR>0 
    Restore update message traffic to your normal level (flow on).<BR>Routes 
    redirected can be resumed to the original peer.<BR><BR>The receiving peer 
    will send an INFORM message with an ACK or NACK Indicating <BR>it has 
    received and understood the pacing request and will either comply with 
    <BR>the it or forbid it. If the congested peer receives a NACK, it should 
    remove <BR>that peer from its list of PACE capable peers but still 
    maintaining its <BR>session without the use of Pacing. If the NACKing peer 
    decides at a future <BR>date to re-enable the Pacing with the local peer, it 
    will renegotiate the <BR>PACE capability with the local peer at that 
    time.<BR><BR>B. Response Message Error Indication:<BR><BR>Level 
    Description<BR>------ -----------<BR>0 ACK indication. The peer will comply 
    with the <BR>Pacing level request.<BR>1 NACK indication. The peer is unable 
    to perform Pacing or comply <BR>with the pacing level 
    requested.<BR><BR><BR>5.3 INFORM Message Response Timer<BR><BR>When a 
    congested peer sends an INFORM message with a "Request to Pace Update 
    <BR>Messages (code=11)" to all peers that support the pacing feature, it 
    will <BR>also start an associated INFORM Message Response Timer. Any peer 
    that does <BR>not respond within a 30 second timeout period with either an 
    ACK/NACK INFORM <BR>Response message, its associated session will be 
    dropped. This is done to <BR>clear any PACE capable peers that are also 
    congested to the point where their <BR>communication with the local 
    congested peer is severed.<BR><BR><BR><BR><BR><BR><BR>Internet Draft 
    draft-abarbanel-bgp-route-update-pacing-00.txt [Page 6]<BR><BR><BR><BR>5.4 
    Congestion Detection Within the Router<BR><BR>There are at least two ways 
    congestion could be detected and measured by a <BR>BGP router.<BR><BR>- A 
    high CPU utilization condition <BR><BR>- A Lack of available memory to 
    accept incoming BGP messages or inability <BR>to get memory to successfully 
    complete BGP processing. <BR><BR>5.4.1 CPU Utilization Based Level 
    Computation<BR><BR>It is recommended that when the congested router detects 
    its inability to <BR>perform route calculations or accept new BGP session 
    messages at a normal <BR>rate meaning the current router's CPU utilization 
    is more than 49%, it should <BR>inform all its peers of a rate limiting 
    level and slow them down accordingly. <BR><BR>The recommended way for 
    computing the Level, based on percent CPU <BR>Utilization, is done using the 
    following:<BR><BR>Level = (((%CPU Utilization - 50) x 2) + 5) / 
    10).<BR>Where, % CPU Utilization is a whole integer number. Use Integer math 
    <BR>here to remove any fractional value.<BR><BR>Note: If computed Level is 
    negative, go to Green and set Level = 0. If <BR>Level = 0 send INFORM 
    message with pacing Level = 0, implying return <BR>to normal (100%) update 
    rate.<BR><BR>Level implies, whatever number of messages you transmitted per 
    second <BR>before, transmit (100 - (Level x 10))% of that now. <BR><BR>e.g. 
    If you transmitted 100 messages/second before. A level 2 (20%) <BR>reduction 
    will cause you to transmit 80 messages/second now.<BR>Assumption is that 
    these messages have an average size (1500 bytes).<BR><BR><BR>5.4.2 Memory 
    Allocation Based Level Computation<BR><BR>It is recommended that when the 
    congested router detects its inability to <BR>perform route calculations or 
    accept new BGP session messages at a normal <BR>rate meaning the current 
    router's memory allocation is more than 69%, it <BR>should inform all its 
    peers of a rate limiting level and slow them down <BR>accordingly. 
    <BR><BR>The recommended way for computing the Level, based on percent Memory 
    <BR>Allocation, is done using the following:<BR><BR>Level = (((%Memory 
    Allocated - 70) x 2) + 5) / 10).<BR>Where, % Memory Allocated is a whole 
    integer number. Use Integer math <BR>here to remove any fractional 
    value.<BR><BR><BR><BR>Internet Draft 
    draft-abarbanel-bgp-route-update-pacing-00.txt [Page 7]<BR><BR><BR><BR>Note: 
    If computed Level is negative, go to Green sub-group and set <BR>Level = 0. 
    If Level = 0 send INFORM message with pacing Level = 0, <BR>implying return 
    to normal (100%) update rate.<BR><BR><BR>5.5 INFORM Message 
    Throttling<BR><BR>Once the congested peer receives acknowledgement from 
    another peer, it will <BR>send a modification INFORM message with a new 
    Level to that peer after the <BR>computed pacing Level changes by at least 1 
    value. This will amount to no <BR>more than one INFORM modification message 
    every 5 seconds. This is done to <BR>debounce any spiky bursts of INFORM 
    messages to all PACE negotiated peers <BR>each time the computed pacing 
    Level changes. Depending on vendor <BR>implementations, the internal 
    utilization levels could change at the <BR>Microsecond or Millisecond 
    rate.<BR><BR><BR>6. Implementation Specific Mechanisms<BR><BR>The Memory 
    Allocation and CPU Utilization Level detection algorithms <BR>discussed in 
    section 5.4.1 and 5.4.2 are suggested ways one can implement <BR>these 
    solutions. However, each vendor can implement a unique Memory <BR>Allocation 
    and CPU Utilization Level detection algorithms that best suits <BR>his/her 
    needs and will not negatively impact the overall BGP Route Update <BR>Pacing 
    mechanism described in this spec. Any issues relating to internal 
    <BR>implementation algorithms are outside the scope of this 
    document.<BR><BR><BR>7. Level Indicators Management 
    MIBs<BR><BR>TBD<BR><BR><BR>8. Security Considerations<BR><BR>This extension 
    to BGP does not change the underlying security issues.<BR><BR><BR>9. 
    References<BR><BR>[BGP-CAP] Chandra, R., Scudder, J., "Capabilities 
    Advertisement with <BR>BGP-4", 
    draft-ietf-idr-rfc2842bis-02.txt<BR><BR>[DYN-CAP] Chen E., Sangli S., 
    "Dynamic Capability for BGP-4", <BR>draft-ietf-idr-dynamic-cap-02.txt, 
    October 2002.<BR><BR>[INFORM] Nalawade G., Scudder J., "BGPv4 INFORM 
    Message",<BR>draft-nalawade-bgp-inform-00.txt, December 2002<BR><BR>[BGP-4] 
    Rekhter, Y., and T. Li, "A Border Gateway Protocol 4 <BR>(BGP-4)", RFC 1771, 
    March 1995.<BR><BR><BR>Internet Draft 
    draft-abarbanel-bgp-route-update-pacing-00.txt [Page 
    8]<BR><BR><BR><BR>[BGP-4-DRAFT] Rekhter, Y. and T. Li (editors), "A Border 
    Gateway Protocol 4<BR>(BGP-4)", Internet Draft draft-ietf-idr-bgp4-18.txt, 
    <BR>January 2002.<BR><BR>[UNIX-NET] Stevens, R., "UNIX Network Programming, 
    Vol 1", Second Edition<BR>1998 Prentice Hall, Inc.<BR><BR><BR>10. Author 
    Information<BR><BR>Ben Abarbanel<BR>Marconi Communications<BR>1595 Spring 
    Hill Road, 5th Floor<BR>Vienna, VA 22182<BR>Email: 
    benjamin.abarbanel@marconi.com<BR><BR><BR></BLOCKQUOTE>
  <P><BR>
  <HR SIZE=1>
  <B>Do You Yahoo!?</B><BR><A href="http://autos.yahoo.com/">Yahoo! Autos</A> - 
  Get free new car price quotes</BLOCKQUOTE></BODY></HTML>

------_=_NextPart_001_01C22E9C.66EA1D30--


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id QAA26554 for <idr-archive@nic.merit.edu>; Thu, 18 Jul 2002 16:38:17 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id D1BFD912D7; Thu, 18 Jul 2002 16:37:52 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 9AD0B912CD; Thu, 18 Jul 2002 16:37:52 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 8D76F912D7 for <idr@trapdoor.merit.edu>; Thu, 18 Jul 2002 16:37:24 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 5EE515DF02; Thu, 18 Jul 2002 16:37:24 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from web12808.mail.yahoo.com (web12808.mail.yahoo.com [216.136.174.43]) by segue.merit.edu (Postfix) with SMTP id 463D15DDD0 for <idr@merit.edu>; Thu, 18 Jul 2002 16:37:23 -0400 (EDT)
Message-ID: <20020718203720.79222.qmail@web12808.mail.yahoo.com>
Received: from [208.246.215.128] by web12808.mail.yahoo.com via HTTP; Thu, 18 Jul 2002 13:37:20 PDT
Date: Thu, 18 Jul 2002 13:37:20 -0700 (PDT)
From: vrishab sikand <v_sikand@yahoo.com>
Subject: Re: My BGP Route Update Pacing Draft
To: "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>, "'idr@merit.edu'" <idr@merit.edu>
In-Reply-To: <39469E08BD83D411A3D900204840EC558227B4@vie-msgusr-01.dc.fore.com>
MIME-Version: 1.0
Content-Type: multipart/alternative; boundary="0-504351559-1027024640=:78832"
Sender: owner-idr@merit.edu
Precedence: bulk

--0-504351559-1027024640=:78832
Content-Type: text/plain; charset=us-ascii


Specific comment:
Unless the rate of pacing is defined in absolute terms (as opposed to the a subjectively for e.g
yellow 4: reduce update message traffic by 40% of your normal rate). It will be very difficult to verify compliance.
Also in my experience, the "normal rate" varies drastically from vendor to vendor.
 
General comment:
 It appears to me that here is an attempt to solve an internal scheduling or horse power problem with the help of external protocol. 
This mechanism may put undue burden on the transmitting router, which is in the best interest of quick convergence in mind is sending prefixes as fast as possible. 
  "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com> wrote: Hi all:

Our recent discussions on this list and my recent work 
experiences have led me to write this draft and offer it 
to the IETF community. I would appreciate any comments 
anyone has to make.

Thanks in advance,
Ben







Network Working Group Ben Abarbanel
Internet Draft Marconi Communicatons
Expiration Date: December 2002 



BGP Route Update Pacing

draft-abarbanel-bgp-route-update-pacing-00.txt


1. Status of this Memo

This document is an Internet-Draft and is in full conformance with
all provisions of Section 10 of RFC2026 except that the right to
produce derivative works is not granted.

Internet-Drafts are working documents of the Internet Engineering
Task Force (IETF), its areas, and its working groups. Note that
other groups may also distribute working documents as Internet-
Drafts.

Internet-Drafts are draft documents valid for a maximum of six months
and may be updated, replaced, or obsoleted by other documents at any
time. It is inappropriate to use Internet-Drafts as reference
material or to cite them other than as ``work in progress.''

The list of current Internet-Drafts can be accessed at
http://www.ietf.org/ietf/1id-abstracts.txt

The list of Internet-Draft Shadow Directories can be accessed at
http://www.ietf.org/shadow.html.


2. Abstract

This document defines a mechanism for controlling or limiting the rate at 
which BGP update messages are sent from one peer to the next when a BGP peer 
experiences internal congestion. With the introduction of new dynamic BGP 
protocol capabilities [CAP] message or other none BGP session destructive 
messages, it is necessary to limit the rate at which the BGP update messages 
are sent without affecting the entire BGP session and without relying on the 
transport (TCP) layer to do so as a normal reaction to data congestion of the 
TCP session across the network.

3. Specification of Requirements

The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT", "SHOULD", 
"SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this document are to be 
interpreted as described in RFC 2119 [RFC2119].




Internet Draft draft-abarbanel-bgp-route-update-pacing-00.txt [Page 2]



4. Introduction

This document defines a mechanism for controlling or limiting the rate at 
which BGP update messages are sent from one peer to the next when a BGP peer 
experiences internal congestion. With the introduction of new dynamic BGP 
protocol capabilities [CAP] message or other none BGP session destructive 
messages, it is necessary to limit the rate at which the BGP update messages 
are sent without affecting the entire BGP session and without relying on the 
transport (TCP) layer to do so as a normal reaction to data congestion of the 
TCP session across the network.

When a router enters a state where either its CPU utilization is maximized 
(reaches close to 100%) or its memory is nearly depleted (less than 10% of 
memory left), it cannot handle new or heavy streams of updates from its peers 
and at times unable to send messages to its peer in a timely manner (within 
30 seconds). As a consequence it cannot keep current with the topological 
state. In some scenarios, a non-congested peer might want to negotiate new 
capabilities with a congested peer. The congested peer is so degraded that 
its TCP session goes into significant flow control off conditions and is 
unable to see the new BGP messages. Peer to peer communication is severely 
hampered and as a result the uncongested peer will take corrective action 
when its hold timer expires and drops the session. The uncongested peer 
computes alternate routing paths that are suboptimal in distance or attribute 
and thus affect the forwarding decisions of all routers in this network. It 
is possible that the congested peer's routing (control) plane is badly 
degraded but its forwarding plane is at normal working level. The mistake is 
made by the uncongested peer since it does not see any Keepalive/Update 
messages before its Hold timer expires and drops the session thereby dropping 
all its associated routes. After routes are dropped, network instability 
occurs and suboptimal paths are used by the remaining peers.

The MinRouteAdvertisementInternal and MinAsOriginationInterval timers are 
inadequate since they are mostly implemented on a peer session basis and 
studies have shown when they are used they severely degrade route convergence 
time. The problem with statically defined timers (initialized during system 
load or session establishment phase) is that they do not adjust to peer 
internal dynamically changing congestion conditions. The problem with the 
congested peer using TCP flow control to reduce its congestion condition, is 
that it completely stops all incoming session traffic and thus preventing any 
messages of high priority nature from being seen.

TCP Out of Band Data or Urgent Message is one way to bypass the TCP flow 
control condition and allow high priority BGP messages to get to the 
congested peer, assuming it is able to read the session socket queue. This 
solution has its drawbacks as described in [UNIX-NET] chapter 21, p. 568. 








Internet Draft draft-abarbanel-bgp-route-update-pacing-00.txt [Page 3]



This document presents a mechanism by which the BGP session of a congested 
router need not be degraded to the level that communication is broken with 
its peers. By using a peer to peer pacing mechanism which allows one peer to 
rate limit the number of update messages per second received from another 
peer, it can avoid the severe congestion conditions and process these 
messages at a manageable level. In addition, the uncongested peer has the 
knowledge to send high priority messages ahead of or in place of the normal 
high volume of update messages.

Usually, topological disturbances are spiky in nature and once they subside, 
the network returns to its optimum path oriented level. Whatever caused the 
network to become unstable, such as routers handling too much data in a small 
period of time or routers loosing their sessions or their links going up and 
down, occurs in most live network on an infrequent basis. By using the pacing 
mechanism as outlined in this draft, a BGP peer can prevent the serious 
congestion long before it is in trouble and thus ride through topological 
disturbances and still regain its stability without causing its sessions to 
drop or its routes/paths to be discarded and recomputed.

The assumption in this spec is that the source of most or all of the internal 
BGP router congestion is due to the heavy reception of update messages from 
neighboring peers containing large number of routes.


5. BGP Update Pacing Mechanism

The BGP Update Message Pacing Mechanism is used to slow down the rate at 
which a peer sends update messages to another. This extension to the BGP 
protocol is simplified to use session non-destructive messages such as INFORM 
as described in [INFORM]. The pacing is performed dynamically upon congestion 
detection and subsidence and thus needs to use the new [INFORM] message that 
will not infringe on the underlying BGP protocol or all its semantics and 
rules.


5.1 BGP Update Message Pacing (PACE) Dynamic Capability

The BGP Update Message Pacing capability is dynamically negotiated with all 
BGP speakers as type code=PACE (where, PACE=TBD) per TLV structures as 
defined in [BGP-CAP] of the OPEN message or anytime after session is 
established using the Dynamic Capability Message as defined [DYN-CAP] 
specification. All those peers that accept the PACE capability will be 
expected to support the new INFORM message as defined in [INFORM] 
specification to carry the pacing TLV structure. 









Internet Draft draft-abarbanel-bgp-route-update-pacing-00.txt [Page 4]



All PACE capable routers will provide a configuration option to their 
operator to enable the BGP Update Message Pacing mechanism on a per peer 
basis. 

Any peer wishing to withdraw the PACE capability can do so dynamically using 
the Dynamic Capability message as outlined in [DYN-CAP] specification. Once 
withdrawn, affected peering session will remain intact but will not benefit 
from the performance improvements offered by the pacing mechanism.


5.2 Use of INFORM Message

The INFORM message as described in [INFORM] is used to carry the rate 
limiting (pacing TLV) control structure to neighboring peers. This 
information informs the peer that the current BGP router has entered a 
congestion state and it is to rate limit its transmission to the level 
specified. 

The INFORM message contains the following PACE TLV structure:

Type = Pacing Information, type=TBD 
Length = 2
1st Byte of Value = Cmd as shown below 
2nd Byte of Value = Level as shown below

Cmd Description 
------- -----------
11 Request to Pace update messages
12 ACK Response to Pace Update messages
13 NACK Response to Pace Update Messages

A. Request to Pace Update Messages Level Indication (code=11)

The rate is divided into sub levels in term of categories (Gray, Yellow, 
Orange, Red, and Green) to denote the level of pacing which is directly 
related to the level of congestion experienced within the congested peer. The 
colors are used as simple handles used for flagging the severity of the 
condition. The colors are also used as indicators for display by the NMS to 
identify the rate limiting levels from any neighboring peer to the operator. 
Associated Level Management MIBs are defined in section 6. 

The actual pacing control is done via the sub levels within these colors. 

Level Description
------ -----------
Gray (Entering minor congestion state)
1 Reduce update message traffic by 10% your normal rate.
2 Reduce update message traffic by 20% your normal rate.
3 Reduce update message traffic by 30% your normal rate.




Internet Draft draft-abarbanel-bgp-route-update-pacing-00.txt [Page 5]



Yellow (Entering medium congestion state)
4 Reduce update message traffic by 40% your normal rate. 
5 Reduce update message traffic by 50% your normal rate.
6 Reduce update message traffic by 60% your normal rate.

Orange (Entering major congestion state)
7 Reduce update message traffic by 70% your normal rate. 
8 Reduce update message traffic by 80% your normal rate.
9 Reduce update message traffic by 90% your normal rate

Red (Entering critical congestion state)
10 Complete cessation of all update message traffic (flow off). 
At this Level the receiving peer might decide to bypass the 
Congested Router and pick another less optimal router for the
affected Routes.

Green (Exiting any congestion state)
0 Restore update message traffic to your normal level (flow on).
Routes redirected can be resumed to the original peer.

The receiving peer will send an INFORM message with an ACK or NACK Indicating 
it has received and understood the pacing request and will either comply with 
the it or forbid it. If the congested peer receives a NACK, it should remove 
that peer from its list of PACE capable peers but still maintaining its 
session without the use of Pacing. If the NACKing peer decides at a future 
date to re-enable the Pacing with the local peer, it will renegotiate the 
PACE capability with the local peer at that time.

B. Response Message Error Indication:

Level Description
------ -----------
0 ACK indication. The peer will comply with the 
Pacing level request.
1 NACK indication. The peer is unable to perform Pacing or comply 
with the pacing level requested.


5.3 INFORM Message Response Timer

When a congested peer sends an INFORM message with a "Request to Pace Update 
Messages (code=11)" to all peers that support the pacing feature, it will 
also start an associated INFORM Message Response Timer. Any peer that does 
not respond within a 30 second timeout period with either an ACK/NACK INFORM 
Response message, its associated session will be dropped. This is done to 
clear any PACE capable peers that are also congested to the point where their 
communication with the local congested peer is severed.






Internet Draft draft-abarbanel-bgp-route-update-pacing-00.txt [Page 6]



5.4 Congestion Detection Within the Router

There are at least two ways congestion could be detected and measured by a 
BGP router.

- A high CPU utilization condition 

- A Lack of available memory to accept incoming BGP messages or inability 
to get memory to successfully complete BGP processing. 

5.4.1 CPU Utilization Based Level Computation

It is recommended that when the congested router detects its inability to 
perform route calculations or accept new BGP session messages at a normal 
rate meaning the current router's CPU utilization is more than 49%, it should 
inform all its peers of a rate limiting level and slow them down accordingly. 

The recommended way for computing the Level, based on percent CPU 
Utilization, is done using the following:

Level = (((%CPU Utilization – 50) x 2) + 5) / 10).
Where, % CPU Utilization is a whole integer number. Use Integer math 
here to remove any fractional value.

Note: If computed Level is negative, go to Green and set Level = 0. If 
Level = 0 send INFORM message with pacing Level = 0, implying return 
to normal (100%) update rate.

Level implies, whatever number of messages you transmitted per second 
before, transmit (100 – (Level x 10))% of that now. 

e.g. If you transmitted 100 messages/second before. A level 2 (20%) 
reduction will cause you to transmit 80 messages/second now.
Assumption is that these messages have an average size (1500 bytes).


5.4.2 Memory Allocation Based Level Computation

It is recommended that when the congested router detects its inability to 
perform route calculations or accept new BGP session messages at a normal 
rate meaning the current router's memory allocation is more than 69%, it 
should inform all its peers of a rate limiting level and slow them down 
accordingly. 

The recommended way for computing the Level, based on percent Memory 
Allocation, is done using the following:

Level = (((%Memory Allocated – 70) x 2) + 5) / 10).
Where, % Memory Allocated is a whole integer number. Use Integer math 
here to remove any fractional value.



Internet Draft draft-abarbanel-bgp-route-update-pacing-00.txt [Page 7]



Note: If computed Level is negative, go to Green sub-group and set 
Level = 0. If Level = 0 send INFORM message with pacing Level = 0, 
implying return to normal (100%) update rate.


5.5 INFORM Message Throttling

Once the congested peer receives acknowledgement from another peer, it will 
send a modification INFORM message with a new Level to that peer after the 
computed pacing Level changes by at least 1 value. This will amount to no 
more than one INFORM modification message every 5 seconds. This is done to 
debounce any spiky bursts of INFORM messages to all PACE negotiated peers 
each time the computed pacing Level changes. Depending on vendor 
implementations, the internal utilization levels could change at the 
Microsecond or Millisecond rate.


6. Implementation Specific Mechanisms

The Memory Allocation and CPU Utilization Level detection algorithms 
discussed in section 5.4.1 and 5.4.2 are suggested ways one can implement 
these solutions. However, each vendor can implement a unique Memory 
Allocation and CPU Utilization Level detection algorithms that best suits 
his/her needs and will not negatively impact the overall BGP Route Update 
Pacing mechanism described in this spec. Any issues relating to internal 
implementation algorithms are outside the scope of this document.


7. Level Indicators Management MIBs

TBD


8. Security Considerations

This extension to BGP does not change the underlying security issues.


9. References

[BGP-CAP] Chandra, R., Scudder, J., "Capabilities Advertisement with 
BGP-4", draft-ietf-idr-rfc2842bis-02.txt

[DYN-CAP] Chen E., Sangli S., "Dynamic Capability for BGP-4", 
draft-ietf-idr-dynamic-cap-02.txt, October 2002.

[INFORM] Nalawade G., Scudder J., "BGPv4 INFORM Message",
draft-nalawade-bgp-inform-00.txt, December 2002

[BGP-4] Rekhter, Y., and T. Li, "A Border Gateway Protocol 4 
(BGP-4)", RFC 1771, March 1995.


Internet Draft draft-abarbanel-bgp-route-update-pacing-00.txt [Page 8]



[BGP-4-DRAFT] Rekhter, Y. and T. Li (editors), "A Border Gateway Protocol 4
(BGP-4)", Internet Draft draft-ietf-idr-bgp4-18.txt, 
January 2002.

[UNIX-NET] Stevens, R., "UNIX Network Programming, Vol 1", Second Edition
1998 Prentice Hall, Inc.


10. Author Information

Ben Abarbanel
Marconi Communications
1595 Spring Hill Road, 5th Floor
Vienna, VA 22182
Email: benjamin.abarbanel@marconi.com





---------------------------------
Do You Yahoo!?
Yahoo! Autos - Get free new car price quotes
--0-504351559-1027024640=:78832
Content-Type: text/html; charset=us-ascii

<P>Specific comment:
<P>Unless the rate of pacing is defined in absolute terms (as opposed to the a subjectively for e.g
<P>yellow 4: reduce update message traffic by 40% of your normal rate). It will be very difficult to verify compliance.
<P>Also in my experience, the "normal rate" varies drastically from vendor to vendor.
<P>&nbsp;
<P>General comment:
<P>&nbsp;It appears to me that here is an attempt to solve an internal scheduling or horse power problem with the help of external protocol. 
<P>This mechanism may put undue burden on the transmitting router, which is in the best interest of quick convergence in mind is sending prefixes as fast as possible. 
<P>&nbsp; <B><I>"Abarbanel, Benjamin" &lt;Benjamin.Abarbanel@Marconi.com&gt;</I></B> wrote: 
<BLOCKQUOTE style="PADDING-LEFT: 5px; MARGIN-LEFT: 5px; BORDER-LEFT: #1010ff 2px solid">Hi all:<BR><BR>Our recent discussions on this list and my recent work <BR>experiences have led me to write this draft and offer it <BR>to the IETF community. I would appreciate any comments <BR>anyone has to make.<BR><BR>Thanks in advance,<BR>Ben<BR><BR><BR><BR><BR><BR><BR><BR>Network Working Group Ben Abarbanel<BR>Internet Draft Marconi Communicatons<BR>Expiration Date: December 2002 <BR><BR><BR><BR>BGP Route Update Pacing<BR><BR>draft-abarbanel-bgp-route-update-pacing-00.txt<BR><BR><BR>1. Status of this Memo<BR><BR>This document is an Internet-Draft and is in full conformance with<BR>all provisions of Section 10 of RFC2026 except that the right to<BR>produce derivative works is not granted.<BR><BR>Internet-Drafts are working documents of the Internet Engineering<BR>Task Force (IETF), its areas, and its working groups. Note that<BR>other groups may also distribute working documents as Internet-<BR>Drafts.<BR><BR>Internet-Drafts are draft documents valid for a maximum of six months<BR>and may be updated, replaced, or obsoleted by other documents at any<BR>time. It is inappropriate to use Internet-Drafts as reference<BR>material or to cite them other than as ``work in progress.''<BR><BR>The list of current Internet-Drafts can be accessed at<BR>http://www.ietf.org/ietf/1id-abstracts.txt<BR><BR>The list of Internet-Draft Shadow Directories can be accessed at<BR>http://www.ietf.org/shadow.html.<BR><BR><BR>2. Abstract<BR><BR>This document defines a mechanism for controlling or limiting the rate at <BR>which BGP update messages are sent from one peer to the next when a BGP peer <BR>experiences internal congestion. With the introduction of new dynamic BGP <BR>protocol capabilities [CAP] message or other none BGP session destructive <BR>messages, it is necessary to limit the rate at which the BGP update messages <BR>are sent without affecting the entire BGP session and without relying on the <BR>transport (TCP) layer to do so as a normal reaction to data congestion of the <BR>TCP session across the network.<BR><BR>3. Specification of Requirements<BR><BR>The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT", "SHOULD", <BR>"SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this document are to be <BR>interpreted as described in RFC 2119 [RFC2119].<BR><BR><BR><BR><BR>Internet Draft draft-abarbanel-bgp-route-update-pacing-00.txt [Page 2]<BR><BR><BR><BR>4. Introduction<BR><BR>This document defines a mechanism for controlling or limiting the rate at <BR>which BGP update messages are sent from one peer to the next when a BGP peer <BR>experiences internal congestion. With the introduction of new dynamic BGP <BR>protocol capabilities [CAP] message or other none BGP session destructive <BR>messages, it is necessary to limit the rate at which the BGP update messages <BR>are sent without affecting the entire BGP session and without relying on the <BR>transport (TCP) layer to do so as a normal reaction to data congestion of the <BR>TCP session across the network.<BR><BR>When a router enters a state where either its CPU utilization is maximized <BR>(reaches close to 100%) or its memory is nearly depleted (less than 10% of <BR>memory left), it cannot handle new or heavy streams of updates from its peers <BR>and at times unable to send messages to its peer in a timely manner (within <BR>30 seconds). As a consequence it cannot keep current with the topological <BR>state. In some scenarios, a non-congested peer might want to negotiate new <BR>capabilities with a congested peer. The congested peer is so degraded that <BR>its TCP session goes into significant flow control off conditions and is <BR>unable to see the new BGP messages. Peer to peer communication is severely <BR>hampered and as a result the uncongested peer will take corrective action <BR>when its hold timer expires and drops the session. The uncongested peer <BR>computes alternate routing paths that are suboptimal in distance or attribute <BR>and thus affect the forwarding decisions of all routers in this network. It <BR>is possible that the congested peer's routing (control) plane is badly <BR>degraded but its forwarding plane is at normal working level. The mistake is <BR>made by the uncongested peer since it does not see any Keepalive/Update <BR>messages before its Hold timer expires and drops the session thereby dropping <BR>all its associated routes. After routes are dropped, network instability <BR>occurs and suboptimal paths are used by the remaining peers.<BR><BR>The MinRouteAdvertisementInternal and MinAsOriginationInterval timers are <BR>inadequate since they are mostly implemented on a peer session basis and <BR>studies have shown when they are used they severely degrade route convergence <BR>time. The problem with statically defined timers (initialized during system <BR>load or session establishment phase) is that they do not adjust to peer <BR>internal dynamically changing congestion conditions. The problem with the <BR>congested peer using TCP flow control to reduce its congestion condition, is <BR>that it completely stops all incoming session traffic and thus preventing any <BR>messages of high priority nature from being seen.<BR><BR>TCP Out of Band Data or Urgent Message is one way to bypass the TCP flow <BR>control condition and allow high priority BGP messages to get to the <BR>congested peer, assuming it is able to read the session socket queue. This <BR>solution has its drawbacks as described in [UNIX-NET] chapter 21, p. 568. <BR><BR><BR><BR><BR><BR><BR><BR><BR>Internet Draft draft-abarbanel-bgp-route-update-pacing-00.txt [Page 3]<BR><BR><BR><BR>This document presents a mechanism by which the BGP session of a congested <BR>router need not be degraded to the level that communication is broken with <BR>its peers. By using a peer to peer pacing mechanism which allows one peer to <BR>rate limit the number of update messages per second received from another <BR>peer, it can avoid the severe congestion conditions and process these <BR>messages at a manageable level. In addition, the uncongested peer has the <BR>knowledge to send high priority messages ahead of or in place of the normal <BR>high volume of update messages.<BR><BR>Usually, topological disturbances are spiky in nature and once they subside, <BR>the network returns to its optimum path oriented level. Whatever caused the <BR>network to become unstable, such as routers handling too much data in a small <BR>period of time or routers loosing their sessions or their links going up and <BR>down, occurs in most live network on an infrequent basis. By using the pacing <BR>mechanism as outlined in this draft, a BGP peer can prevent the serious <BR>congestion long before it is in trouble and thus ride through topological <BR>disturbances and still regain its stability without causing its sessions to <BR>drop or its routes/paths to be discarded and recomputed.<BR><BR>The assumption in this spec is that the source of most or all of the internal <BR>BGP router congestion is due to the heavy reception of update messages from <BR>neighboring peers containing large number of routes.<BR><BR><BR>5. BGP Update Pacing Mechanism<BR><BR>The BGP Update Message Pacing Mechanism is used to slow down the rate at <BR>which a peer sends update messages to another. This extension to the BGP <BR>protocol is simplified to use session non-destructive messages such as INFORM <BR>as described in [INFORM]. The pacing is performed dynamically upon congestion <BR>detection and subsidence and thus needs to use the new [INFORM] message that <BR>will not infringe on the underlying BGP protocol or all its semantics and <BR>rules.<BR><BR><BR>5.1 BGP Update Message Pacing (PACE) Dynamic Capability<BR><BR>The BGP Update Message Pacing capability is dynamically negotiated with all <BR>BGP speakers as type code=PACE (where, PACE=TBD) per TLV structures as <BR>defined in [BGP-CAP] of the OPEN message or anytime after session is <BR>established using the Dynamic Capability Message as defined [DYN-CAP] <BR>specification. All those peers that accept the PACE capability will be <BR>expected to support the new INFORM message as defined in [INFORM] <BR>specification to carry the pacing TLV structure. <BR><BR><BR><BR><BR><BR><BR><BR><BR><BR>Internet Draft draft-abarbanel-bgp-route-update-pacing-00.txt [Page 4]<BR><BR><BR><BR>All PACE capable routers will provide a configuration option to their <BR>operator to enable the BGP Update Message Pacing mechanism on a per peer <BR>basis. <BR><BR>Any peer wishing to withdraw the PACE capability can do so dynamically using <BR>the Dynamic Capability message as outlined in [DYN-CAP] specification. Once <BR>withdrawn, affected peering session will remain intact but will not benefit <BR>from the performance improvements offered by the pacing mechanism.<BR><BR><BR>5.2 Use of INFORM Message<BR><BR>The INFORM message as described in [INFORM] is used to carry the rate <BR>limiting (pacing TLV) control structure to neighboring peers. This <BR>information informs the peer that the current BGP router has entered a <BR>congestion state and it is to rate limit its transmission to the level <BR>specified. <BR><BR>The INFORM message contains the following PACE TLV structure:<BR><BR>Type = Pacing Information, type=TBD <BR>Length = 2<BR>1st Byte of Value = Cmd as shown below <BR>2nd Byte of Value = Level as shown below<BR><BR>Cmd Description <BR>------- -----------<BR>11 Request to Pace update messages<BR>12 ACK Response to Pace Update messages<BR>13 NACK Response to Pace Update Messages<BR><BR>A. Request to Pace Update Messages Level Indication (code=11)<BR><BR>The rate is divided into sub levels in term of categories (Gray, Yellow, <BR>Orange, Red, and Green) to denote the level of pacing which is directly <BR>related to the level of congestion experienced within the congested peer. The <BR>colors are used as simple handles used for flagging the severity of the <BR>condition. The colors are also used as indicators for display by the NMS to <BR>identify the rate limiting levels from any neighboring peer to the operator. <BR>Associated Level Management MIBs are defined in section 6. <BR><BR>The actual pacing control is done via the sub levels within these colors. <BR><BR>Level Description<BR>------ -----------<BR>Gray (Entering minor congestion state)<BR>1 Reduce update message traffic by 10% your normal rate.<BR>2 Reduce update message traffic by 20% your normal rate.<BR>3 Reduce update message traffic by 30% your normal rate.<BR><BR><BR><BR><BR>Internet Draft draft-abarbanel-bgp-route-update-pacing-00.txt [Page 5]<BR><BR><BR><BR>Yellow (Entering medium congestion state)<BR>4 Reduce update message traffic by 40% your normal rate. <BR>5 Reduce update message traffic by 50% your normal rate.<BR>6 Reduce update message traffic by 60% your normal rate.<BR><BR>Orange (Entering major congestion state)<BR>7 Reduce update message traffic by 70% your normal rate. <BR>8 Reduce update message traffic by 80% your normal rate.<BR>9 Reduce update message traffic by 90% your normal rate<BR><BR>Red (Entering critical congestion state)<BR>10 Complete cessation of all update message traffic (flow off). <BR>At this Level the receiving peer might decide to bypass the <BR>Congested Router and pick another less optimal router for the<BR>affected Routes.<BR><BR>Green (Exiting any congestion state)<BR>0 Restore update message traffic to your normal level (flow on).<BR>Routes redirected can be resumed to the original peer.<BR><BR>The receiving peer will send an INFORM message with an ACK or NACK Indicating <BR>it has received and understood the pacing request and will either comply with <BR>the it or forbid it. If the congested peer receives a NACK, it should remove <BR>that peer from its list of PACE capable peers but still maintaining its <BR>session without the use of Pacing. If the NACKing peer decides at a future <BR>date to re-enable the Pacing with the local peer, it will renegotiate the <BR>PACE capability with the local peer at that time.<BR><BR>B. Response Message Error Indication:<BR><BR>Level Description<BR>------ -----------<BR>0 ACK indication. The peer will comply with the <BR>Pacing level request.<BR>1 NACK indication. The peer is unable to perform Pacing or comply <BR>with the pacing level requested.<BR><BR><BR>5.3 INFORM Message Response Timer<BR><BR>When a congested peer sends an INFORM message with a "Request to Pace Update <BR>Messages (code=11)" to all peers that support the pacing feature, it will <BR>also start an associated INFORM Message Response Timer. Any peer that does <BR>not respond within a 30 second timeout period with either an ACK/NACK INFORM <BR>Response message, its associated session will be dropped. This is done to <BR>clear any PACE capable peers that are also congested to the point where their <BR>communication with the local congested peer is severed.<BR><BR><BR><BR><BR><BR><BR>Internet Draft draft-abarbanel-bgp-route-update-pacing-00.txt [Page 6]<BR><BR><BR><BR>5.4 Congestion Detection Within the Router<BR><BR>There are at least two ways congestion could be detected and measured by a <BR>BGP router.<BR><BR>- A high CPU utilization condition <BR><BR>- A Lack of available memory to accept incoming BGP messages or inability <BR>to get memory to successfully complete BGP processing. <BR><BR>5.4.1 CPU Utilization Based Level Computation<BR><BR>It is recommended that when the congested router detects its inability to <BR>perform route calculations or accept new BGP session messages at a normal <BR>rate meaning the current router's CPU utilization is more than 49%, it should <BR>inform all its peers of a rate limiting level and slow them down accordingly. <BR><BR>The recommended way for computing the Level, based on percent CPU <BR>Utilization, is done using the following:<BR><BR>Level = (((%CPU Utilization – 50) x 2) + 5) / 10).<BR>Where, % CPU Utilization is a whole integer number. Use Integer math <BR>here to remove any fractional value.<BR><BR>Note: If computed Level is negative, go to Green and set Level = 0. If <BR>Level = 0 send INFORM message with pacing Level = 0, implying return <BR>to normal (100%) update rate.<BR><BR>Level implies, whatever number of messages you transmitted per second <BR>before, transmit (100 – (Level x 10))% of that now. <BR><BR>e.g. If you transmitted 100 messages/second before. A level 2 (20%) <BR>reduction will cause you to transmit 80 messages/second now.<BR>Assumption is that these messages have an average size (1500 bytes).<BR><BR><BR>5.4.2 Memory Allocation Based Level Computation<BR><BR>It is recommended that when the congested router detects its inability to <BR>perform route calculations or accept new BGP session messages at a normal <BR>rate meaning the current router's memory allocation is more than 69%, it <BR>should inform all its peers of a rate limiting level and slow them down <BR>accordingly. <BR><BR>The recommended way for computing the Level, based on percent Memory <BR>Allocation, is done using the following:<BR><BR>Level = (((%Memory Allocated – 70) x 2) + 5) / 10).<BR>Where, % Memory Allocated is a whole integer number. Use Integer math <BR>here to remove any fractional value.<BR><BR><BR><BR>Internet Draft draft-abarbanel-bgp-route-update-pacing-00.txt [Page 7]<BR><BR><BR><BR>Note: If computed Level is negative, go to Green sub-group and set <BR>Level = 0. If Level = 0 send INFORM message with pacing Level = 0, <BR>implying return to normal (100%) update rate.<BR><BR><BR>5.5 INFORM Message Throttling<BR><BR>Once the congested peer receives acknowledgement from another peer, it will <BR>send a modification INFORM message with a new Level to that peer after the <BR>computed pacing Level changes by at least 1 value. This will amount to no <BR>more than one INFORM modification message every 5 seconds. This is done to <BR>debounce any spiky bursts of INFORM messages to all PACE negotiated peers <BR>each time the computed pacing Level changes. Depending on vendor <BR>implementations, the internal utilization levels could change at the <BR>Microsecond or Millisecond rate.<BR><BR><BR>6. Implementation Specific Mechanisms<BR><BR>The Memory Allocation and CPU Utilization Level detection algorithms <BR>discussed in section 5.4.1 and 5.4.2 are suggested ways one can implement <BR>these solutions. However, each vendor can implement a unique Memory <BR>Allocation and CPU Utilization Level detection algorithms that best suits <BR>his/her needs and will not negatively impact the overall BGP Route Update <BR>Pacing mechanism described in this spec. Any issues relating to internal <BR>implementation algorithms are outside the scope of this document.<BR><BR><BR>7. Level Indicators Management MIBs<BR><BR>TBD<BR><BR><BR>8. Security Considerations<BR><BR>This extension to BGP does not change the underlying security issues.<BR><BR><BR>9. References<BR><BR>[BGP-CAP] Chandra, R., Scudder, J., "Capabilities Advertisement with <BR>BGP-4", draft-ietf-idr-rfc2842bis-02.txt<BR><BR>[DYN-CAP] Chen E., Sangli S., "Dynamic Capability for BGP-4", <BR>draft-ietf-idr-dynamic-cap-02.txt, October 2002.<BR><BR>[INFORM] Nalawade G., Scudder J., "BGPv4 INFORM Message",<BR>draft-nalawade-bgp-inform-00.txt, December 2002<BR><BR>[BGP-4] Rekhter, Y., and T. Li, "A Border Gateway Protocol 4 <BR>(BGP-4)", RFC 1771, March 1995.<BR><BR><BR>Internet Draft draft-abarbanel-bgp-route-update-pacing-00.txt [Page 8]<BR><BR><BR><BR>[BGP-4-DRAFT] Rekhter, Y. and T. Li (editors), "A Border Gateway Protocol 4<BR>(BGP-4)", Internet Draft draft-ietf-idr-bgp4-18.txt, <BR>January 2002.<BR><BR>[UNIX-NET] Stevens, R., "UNIX Network Programming, Vol 1", Second Edition<BR>1998 Prentice Hall, Inc.<BR><BR><BR>10. Author Information<BR><BR>Ben Abarbanel<BR>Marconi Communications<BR>1595 Spring Hill Road, 5th Floor<BR>Vienna, VA 22182<BR>Email: benjamin.abarbanel@marconi.com<BR><BR><BR></BLOCKQUOTE><p><br><hr size=1><b>Do You Yahoo!?</b><br>
<a href="http://autos.yahoo.com/">Yahoo! Autos</a> - Get free new car price quotes
--0-504351559-1027024640=:78832--


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id PAA24470 for <idr-archive@nic.merit.edu>; Thu, 18 Jul 2002 15:33:44 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 9CC87912C9; Thu, 18 Jul 2002 15:32:00 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 62084912D7; Thu, 18 Jul 2002 15:32:00 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id D2A52912C9 for <idr@trapdoor.merit.edu>; Thu, 18 Jul 2002 15:31:58 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id B7EAF5DEA4; Thu, 18 Jul 2002 15:31:58 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from mailgate.pit.comms.marconi.com (mailgate.pit.comms.marconi.com [169.144.68.6]) by segue.merit.edu (Postfix) with ESMTP id 4C1165DDD0 for <idr@merit.edu>; Thu, 18 Jul 2002 15:31:58 -0400 (EDT)
Received: from mailman.pit.comms.marconi.com (mailman.pit.comms.marconi.com [169.144.2.12]) by mailgate.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id PAA10610 for <idr@merit.edu>; Thu, 18 Jul 2002 15:31:55 -0400 (EDT)
Received: from whq-msgrtr-01.pit.comms.marconi.com (whq-msgrtr-01.pit.comms.marconi.com [169.144.2.221]) by mailman.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id PAA14256 for <idr@merit.edu>; Thu, 18 Jul 2002 15:31:56 -0400 (EDT)
Received: by whq-msgrtr-01.pit.comms.marconi.com with Internet Mail Service (5.5.2650.21) id <3W985M72>; Thu, 18 Jul 2002 15:31:56 -0400
Message-ID: <39469E08BD83D411A3D900204840EC558227B4@vie-msgusr-01.dc.fore.com>
From: "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
To: "'idr@merit.edu'" <idr@merit.edu>
Subject: My BGP Route Update Pacing Draft
Date: Thu, 18 Jul 2002 15:31:54 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2650.21)
Content-Type: multipart/mixed; boundary="----_=_NextPart_000_01C22E91.C60E3270"
Sender: owner-idr@merit.edu
Precedence: bulk

This message is in MIME format. Since your mail reader does not understand
this format, some or all of this message may not be legible.

------_=_NextPart_000_01C22E91.C60E3270
Content-Type: text/plain;
	charset="iso-8859-1"

Hi all:

   Our recent discussions on this list and my recent work 
 experiences have led me to write this draft and offer it 
 to the IETF community. I would appreciate any comments 
 anyone has to make.
 
 Thanks in advance,
 Ben

  


------_=_NextPart_000_01C22E91.C60E3270
Content-Type: text/plain;
	name="draft-abarbanel-bgp-update-pacing-txt.txt"
Content-Transfer-Encoding: quoted-printable
Content-Disposition: attachment;
	filename="draft-abarbanel-bgp-update-pacing-txt.txt"





Network Working Group                                     Ben Abarbanel
Internet Draft                                            Marconi =
Communicatons
Expiration Date: December 2002                         =20


          			=09
					BGP Route Update Pacing

                    draft-abarbanel-bgp-route-update-pacing-00.txt


1. Status of this Memo

   This document is an Internet-Draft and is in full conformance with
   all provisions of Section 10 of RFC2026 except that the right to
   produce derivative works is not granted.

   Internet-Drafts are working documents of the Internet Engineering
   Task Force (IETF), its areas, and its working groups.  Note that
   other groups may also distribute working documents as Internet-
   Drafts.

   Internet-Drafts are draft documents valid for a maximum of six =
months
   and may be updated, replaced, or obsoleted by other documents at any
   time.  It is inappropriate to use Internet-Drafts as reference
   material or to cite them other than as ``work in progress.''

   The list of current Internet-Drafts can be accessed at
   http://www.ietf.org/ietf/1id-abstracts.txt

   The list of Internet-Draft Shadow Directories can be accessed at
   http://www.ietf.org/shadow.html.


2. Abstract

This document defines a mechanism for controlling or limiting the rate =
at=20
which BGP update messages are sent from one peer to the next when a BGP =
peer=20
experiences internal congestion. With the introduction of new dynamic =
BGP=20
protocol capabilities [CAP] message or other none BGP session =
destructive=20
messages, it is necessary to limit the rate at which the BGP update =
messages=20
are sent without affecting the entire BGP session and without relying =
on the=20
transport (TCP) layer to do so as a normal reaction to data congestion =
of the=20
TCP session across the network.

3. Specification of Requirements

The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT", =
"SHOULD",=20
"SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this document are =
to be=20
interpreted as described in RFC 2119 [RFC2119].

=20


Internet Draft      draft-abarbanel-bgp-route-update-pacing-00.txt    =
[Page 2]

=20

4. Introduction

This document defines a mechanism for controlling or limiting the rate =
at=20
which BGP update messages are sent from one peer to the next when a BGP =
peer=20
experiences internal congestion. With the introduction of new dynamic =
BGP=20
protocol capabilities [CAP] message or other none BGP session =
destructive=20
messages, it is necessary to limit the rate at which the BGP update =
messages=20
are sent without affecting the entire BGP session and without relying =
on the=20
transport (TCP) layer to do so as a normal reaction to data congestion =
of the=20
TCP session across the network.

When a router enters a state where either its CPU utilization is =
maximized=20
(reaches close to 100%) or its memory is nearly depleted (less than 10% =
of=20
memory left), it cannot handle new or heavy streams of updates from its =
peers=20
and at times unable to send messages to its peer in a timely manner =
(within=20
30 seconds). As a consequence it cannot keep current with the =
topological=20
state. In some scenarios, a non-congested peer might want to negotiate =
new=20
capabilities with a congested peer. The congested peer is so degraded =
that=20
its TCP session goes into significant flow control off conditions and =
is=20
unable to see the new BGP messages. Peer to peer communication is =
severely=20
hampered and as a result the uncongested peer will take corrective =
action=20
when its hold timer expires and drops the session. The uncongested peer =

computes alternate routing paths that are suboptimal in distance or =
attribute=20
and thus affect the forwarding decisions of all routers in this =
network. It=20
is possible that the congested peer's routing (control) plane is badly=20
degraded but its forwarding plane is at normal working level. The =
mistake is=20
made by the uncongested peer since it does not see any Keepalive/Update =

messages before its Hold timer expires and drops the session thereby =
dropping=20
all its associated routes. After routes are dropped, network =
instability=20
occurs and suboptimal paths are used by the remaining peers.

The MinRouteAdvertisementInternal and MinAsOriginationInterval timers =
are=20
inadequate since they are mostly implemented on a peer session basis =
and=20
studies have shown when they are used they severely degrade route =
convergence=20
time. The problem with statically defined timers (initialized during =
system=20
load or session establishment phase) is that they do not adjust to peer =

internal dynamically changing congestion conditions. The problem with =
the=20
congested peer using TCP flow control to reduce its congestion =
condition, is=20
that it completely stops all incoming session traffic and thus =
preventing any=20
messages of high priority nature from being seen.

TCP Out of Band Data or Urgent Message is one way to bypass the TCP =
flow=20
control condition and allow high priority BGP messages to get to the=20
congested peer, assuming it is able to read the session socket queue. =
This=20
solution has its drawbacks as described in [UNIX-NET] chapter 21, p. =
568. =20








Internet Draft      draft-abarbanel-bgp-route-update-pacing-00.txt    =
[Page 3]

=20

This document presents a mechanism by which the BGP session of a =
congested=20
router need not be degraded to the level that communication is broken =
with=20
its peers. By using a peer to peer pacing mechanism which allows one =
peer to=20
rate limit the number of update messages per second received from =
another=20
peer, it can avoid the severe congestion conditions and process these=20
messages at a manageable level. In addition, the uncongested peer has =
the=20
knowledge to send high priority messages ahead of or in place of the =
normal=20
high volume of update messages.

Usually, topological disturbances are spiky in nature and once they =
subside,=20
the network returns to its optimum path oriented level. Whatever caused =
the=20
network to become unstable, such as routers handling too much data in a =
small=20
period of time or routers loosing their sessions or their links going =
up and=20
down, occurs in most live network on an infrequent basis. By using the =
pacing=20
mechanism as outlined in this draft, a BGP peer can prevent the serious =

congestion long before it is in trouble and thus ride through =
topological=20
disturbances and still regain its stability without causing its =
sessions to=20
drop or its routes/paths to be discarded and recomputed.

The assumption in this spec is that the source of most or all of the =
internal=20
BGP router congestion is due to the heavy reception of update messages =
from=20
neighboring peers containing large number of routes.


5. BGP Update Pacing Mechanism

The BGP Update Message Pacing Mechanism is used to slow down the rate =
at=20
which a peer sends update messages to another. This extension to the =
BGP=20
protocol is simplified to use session non-destructive messages such as =
INFORM=20
as described in [INFORM]. The pacing is performed dynamically upon =
congestion=20
detection and subsidence and thus needs to use the new [INFORM] message =
that=20
will not infringe on the underlying BGP protocol or all its semantics =
and=20
rules.
=20

5.1 BGP Update Message Pacing (PACE) Dynamic Capability

The BGP Update Message Pacing capability is dynamically negotiated with =
all=20
BGP speakers as type code=3DPACE (where, PACE=3DTBD) per TLV structures =
as=20
defined in [BGP-CAP] of the OPEN message or anytime after session is=20
established using the Dynamic Capability Message as defined [DYN-CAP]=20
specification. All those peers that accept the PACE capability will be=20
expected to support the new INFORM message as defined in [INFORM]=20
specification to carry the pacing TLV structure.=20









Internet Draft      draft-abarbanel-bgp-route-update-pacing-00.txt    =
[Page 4]

=20

All PACE capable routers will provide a configuration option to their=20
operator to enable the BGP Update Message Pacing mechanism on a per =
peer=20
basis.=20

Any peer wishing to withdraw the PACE capability can do so dynamically =
using=20
the Dynamic Capability message as outlined in [DYN-CAP] specification. =
Once=20
withdrawn, affected peering session will remain intact but will not =
benefit=20
from the performance improvements offered by the pacing mechanism.


5.2 Use of INFORM Message

The INFORM message as described in [INFORM] is used to carry the rate=20
limiting (pacing TLV) control structure to neighboring peers. This=20
information informs the peer that the current BGP router has entered a=20
congestion state and it is to rate limit its transmission to the level=20
specified.=20

The INFORM message contains the following PACE TLV structure:
=20
   Type =3D Pacing Information, type=3DTBD=20
   Length =3D 2
   1st Byte of Value =3D Cmd as shown below=20
   2nd Byte of Value =3D Level as shown below

 Cmd       Description=20
-------    -----------
  11       Request to Pace update messages
  12       ACK Response to Pace Update messages
  13       NACK Response to Pace Update Messages

A. Request to Pace Update Messages Level Indication (code=3D11)

The rate is divided into sub levels in term of categories (Gray, =
Yellow,=20
Orange, Red, and Green) to denote the level of pacing which is directly =

related to the level of congestion experienced within the congested =
peer. The=20
colors are used as simple handles used for flagging the severity of the =

condition. The colors are also used as indicators for display by the =
NMS to=20
identify the rate limiting levels from any neighboring peer to the =
operator.=20
Associated Level Management MIBs are defined in section 6.=20

The actual pacing control is done via the sub levels within these =
colors.=20
 =20
    Level     Description
   ------     -----------
    Gray      (Entering minor congestion state)
         1       Reduce update message traffic by 10% your normal rate.
         2       Reduce update message traffic by 20% your normal rate.
         3       Reduce update message traffic by 30% your normal rate.



                                                   =20
Internet Draft      draft-abarbanel-bgp-route-update-pacing-00.txt    =
[Page 5]



      Yellow     (Entering medium congestion state)
         4       Reduce update message traffic by 40% your normal rate. =

         5       Reduce update message traffic by 50% your normal rate.
         6       Reduce update message traffic by 60% your normal rate.

      Orange     (Entering major congestion state)
         7       Reduce update message traffic by 70% your normal rate. =

         8       Reduce update message traffic by 80% your normal rate.
         9       Reduce update message traffic by 90% your normal rate

        Red      (Entering critical congestion state)
        10       Complete cessation of all update message traffic (flow =
off).=20
                 At this Level the receiving peer might decide to =
bypass the=20
                 Congested Router and pick another less optimal router =
for the
                 affected Routes.

    Green     (Exiting any congestion state)
         0       Restore update message traffic to your normal level =
(flow on).
              Routes redirected can be resumed to the original peer.

The receiving peer will send an INFORM message with an ACK or NACK =
Indicating=20
it has received and understood the pacing request and will either =
comply with=20
the it or forbid it. If the congested peer receives a NACK, it should =
remove=20
that peer from its list of PACE capable peers but still maintaining its =

session without the use of Pacing.  If the NACKing peer decides at a =
future=20
date to re-enable the Pacing with the local peer, it will renegotiate =
the=20
PACE capability with the local peer at that time.

B. Response Message Error Indication:

    Level     Description
   ------     -----------
0	 ACK indication. The peer will comply with the=20
     		     Pacing level request.
1	 NACK indication. The peer is unable to perform Pacing or comply =20
        with the pacing level requested.


5.3 INFORM Message Response Timer
  =20
When a congested peer sends an INFORM message with a "Request to Pace =
Update=20
Messages (code=3D11)" to all peers that support the pacing feature, it =
will=20
also start an associated INFORM Message Response Timer. Any peer that =
does=20
not respond within a 30 second timeout period with either an ACK/NACK =
INFORM=20
Response message, its associated session will be dropped. This is done =
to=20
clear any PACE capable peers that are also congested to the point where =
their=20
communication with the local congested peer is severed.


=20



Internet Draft      draft-abarbanel-bgp-route-update-pacing-00.txt    =
[Page 6]

=20

5.4 Congestion Detection Within the Router

There are at least two ways congestion could be detected and measured =
by a=20
BGP router.

- A high CPU utilization condition=20

- A Lack of available memory to accept incoming BGP messages or =
inability=20
  to get memory to successfully complete BGP processing.=20

5.4.1 CPU Utilization Based Level Computation

It is recommended that when the congested router detects its inability =
to=20
perform route calculations or accept new BGP session messages at a =
normal=20
rate meaning the current router's CPU utilization is more than 49%, it =
should=20
inform all its peers of a rate limiting level and slow them down =
accordingly.=20

The recommended way for computing the Level, based on percent CPU=20
Utilization, is done using the following:

   Level =3D (((%CPU Utilization =96 50) x 2) + 5) / 10).
   Where, % CPU Utilization is a whole integer number. Use Integer math =
=20
   here to remove any fractional value.

  Note: If computed Level is negative, go to Green and set Level =3D 0. =
If  =20
         Level =3D 0 send INFORM message with pacing Level =3D 0, =
implying return=20
         to normal (100%) update rate.

Level implies, whatever number of messages you transmitted per second=20
before, transmit (100 =96 (Level x 10))% of that now.=20

  e.g. If you transmitted 100 messages/second before. A level 2 (20%)=20
       reduction will cause you to transmit 80 messages/second now.
          Assumption is that these messages have an average size (1500 =
bytes).


5.4.2 Memory Allocation Based Level Computation

It is recommended that when the congested router detects its inability =
to=20
perform route calculations or accept new BGP session messages at a =
normal=20
rate meaning the current router's memory allocation is more than 69%, =
it=20
should inform all its peers of a rate limiting level and slow them down =

accordingly.=20

The recommended way for computing the Level, based on percent Memory=20
Allocation, is done using the following:

   Level =3D (((%Memory Allocated =96 70) x 2) + 5) / 10).
    Where, % Memory Allocated is a whole integer number. Use Integer =
math =20
    here to remove any fractional value.



Internet Draft      draft-abarbanel-bgp-route-update-pacing-00.txt    =
[Page 7]



   Note: If computed Level is negative, go to Green sub-group and set=20
         Level =3D 0. If Level =3D 0 send INFORM message with pacing =
Level =3D 0,=20
         implying return to normal (100%) update rate.
=20

5.5 INFORM Message Throttling

Once the congested peer receives acknowledgement from another peer, it =
will=20
send a modification INFORM message with a new Level to that peer after =
the=20
computed pacing Level changes by at least 1 value. This will amount to =
no=20
more than one INFORM modification message every 5 seconds. This is done =
to=20
debounce any spiky bursts of INFORM messages to all PACE negotiated =
peers=20
each time the computed pacing Level changes. Depending on vendor=20
implementations, the internal utilization levels could change at the=20
Microsecond or Millisecond rate.
 =20

6. Implementation Specific Mechanisms

The Memory Allocation and CPU Utilization Level detection algorithms=20
discussed in section 5.4.1 and 5.4.2 are suggested ways one can =
implement=20
these solutions. However, each vendor can implement a unique Memory=20
Allocation and CPU Utilization Level detection algorithms that best =
suits=20
his/her needs and will not negatively impact the overall BGP Route =
Update=20
Pacing mechanism described in this spec. Any issues relating to =
internal=20
implementation algorithms are outside the scope of this document.

=20
7. Level Indicators Management MIBs

   TBD


8. Security Considerations

   This extension to BGP does not change the underlying security =
issues.


9. References
  =20
   [BGP-CAP] Chandra, R., Scudder, J., "Capabilities Advertisement with =

             BGP-4", draft-ietf-idr-rfc2842bis-02.txt

       [DYN-CAP]         Chen E., Sangli S.,  "Dynamic Capability for =
BGP-4",=20
            draft-ietf-idr-dynamic-cap-02.txt, October 2002.

   [INFORM] Nalawade G., Scudder J., "BGPv4 INFORM Message",
            draft-nalawade-bgp-inform-00.txt, December 2002

   [BGP-4]  Rekhter, Y., and T. Li, "A Border Gateway Protocol 4=20
                              (BGP-4)", RFC 1771, March 1995.


Internet Draft      draft-abarbanel-bgp-route-update-pacing-00.txt    =
[Page 8]



   [BGP-4-DRAFT] Rekhter, Y. and T. Li (editors), "A Border Gateway =
Protocol 4
                          (BGP-4)", Internet Draft =
draft-ietf-idr-bgp4-18.txt,=20
                           January 2002.

   [UNIX-NET]  Stevens, R., "UNIX Network Programming, Vol 1", Second =
Edition
                            1998 Prentice Hall, Inc.
  =20

10. Author Information

   Ben Abarbanel
   Marconi Communications
   1595 Spring Hill Road, 5th Floor
   Vienna, VA 22182
   Email: benjamin.abarbanel@marconi.com



------_=_NextPart_000_01C22E91.C60E3270--


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id KAA13496 for <idr-archive@nic.merit.edu>; Thu, 18 Jul 2002 10:06:05 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 4AB3E9125F; Thu, 18 Jul 2002 10:05:36 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 0E2DA91263; Thu, 18 Jul 2002 10:05:35 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 191089125F for <idr@trapdoor.merit.edu>; Thu, 18 Jul 2002 10:05:34 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 00D4B5DDE1; Thu, 18 Jul 2002 10:05:34 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from mail.alcatel.be (alc250.alcatel.be [195.207.101.250]) by segue.merit.edu (Postfix) with ESMTP id 750165DDC6 for <idr@merit.edu>; Thu, 18 Jul 2002 10:05:32 -0400 (EDT)
Received: from Bemail06.net.alcatel.be (relay3 [127.0.0.1]) by mail.alcatel.be (8.11.0/8.11.4) with ESMTP id g6IE1LF26507; Thu, 18 Jul 2002 16:01:21 +0200
Received: from alcatel.be ([138.203.191.116]) by Bemail06.net.alcatel.be (Lotus Domino Release 5.0.8) with ESMTP id 2002071816052502:3158 ; Thu, 18 Jul 2002 16:05:25 +0200 
Message-ID: <3D36CAF8.DB16289@alcatel.be>
Date: Thu, 18 Jul 2002 16:04:40 +0200
From: hans.de_vleeschouwer@alcatel.be
X-Mailer: Mozilla 4.72 [en] (WinNT; I)
X-Accept-Language: en
MIME-Version: 1.0
To: Bill Fenner <fenner@research.att.com>
Cc: idr@merit.edu
References: <200207180614.XAA16290@windsor.research.att.com>
X-MIMETrack: Itemize by SMTP Server on BEMAIL06/BE/ALCATEL(Release 5.0.8 |June 18, 2001) at 07/18/2002 16:05:25, Serialize by Router on BEMAIL06/BE/ALCATEL(Release 5.0.8 |June 18, 2001) at 07/18/2002 16:05:26, Serialize complete at 07/18/2002 16:05:26
Subject: Re: FSM words for the draft-18 
Content-Type: multipart/mixed; boundary="------------149A43A8660ACEFC11F84974"
Sender: owner-idr@merit.edu
Precedence: bulk

This is a multi-part message in MIME format.
--------------149A43A8660ACEFC11F84974
Content-Transfer-Encoding: 7bit
Content-Type: text/plain; charset=us-ascii

All,

I have reviewed the FSM document below, upto the Active state. I have added my
comments inline in the text.
If it felt that it makes sense to continue the review in this way, please let
me know.

Kind regards,
  Hans De Vleeschouwer.


Bill Fenner wrote:

> Sue asked me to forward this since it didn't appear on the list.
> Sorry if this causes duplicates.
>
>   Bill
>
> ----- Begin forwarded message:
>
> From: Susan Hares <skh@nexthop.com>
> Subject: FSM words for the draft-18
> Date: Wed, 17 Jul 2002 23:24:40 -0400
> To: idr@merit.edu
> Cc: yakov Rekhter <yakov@juniper.net>, fenner@research.att.com,
>     zinin@psg.com
>
> I would like to propose that section 8 be replaced with the
> attached words.
>
> Sue Hares
>
>   ------------------------------------------------------------------------
>
>    FSM-words-05b.txtName: FSM-words-05b.txt
>                     Type: Plain Text (text/plain)
>
>   ------------------------------------------------------------------------

--------------149A43A8660ACEFC11F84974
Content-Transfer-Encoding: 7bit
Content-Type: text/plain; charset=us-ascii;
 name="FSM-words-05b.txt"
Content-Disposition: inline;
 filename="FSM-words-05b.txt"

> General questions/doubts/remarks: 
>
> - Several versions of chapter 8 are floating around.
>   The version below was posted by Bill Fenner on July 17th.   
>    
> - I have placed comments inline in the text. The comment
>   lines start with a >. 
>
> - For timers I noticed that several syntaxes are being used.
>   e.g. 
>     Connect Retry Timer
>     ConnectRetry Timer
>     ConnectRetryTimer
>
>   As the second notation seems most used, I assumed that this was    
>   the correct notation.
>
> - for timers I suggest in the text to use consistently:
>   'start timer' and 'stop timer'. 
>    
> - Section 8.1 decribes all possible events. Would it not be good
>   Also to have a section describing the purpose of eah timer.
>
>



Note: (this is version 5 of the changes to
       the text.) 

8.0	BGP Finite State Machine


This section specifies the BGP operation in terms of a 
Finite State Machine (FSM).  The section falls into 2 
parts:

    1)	Description of Events for the State machine (section 
        8.1
    2)	Description of the FSM (section 8.2)

Session Attributes required for each connection are; 

   1)	state
   2)	Connect Retry timer

>>
> Naming Consistency:  ConnectRetry timer
>>

   3)	Hold timer
   4)	Hold time
   5)	Keepalive timer

>>
> Naming Consistency: KeepAlive timer
>>



8.1 Events for the BGP FSM

8.1.1   Administrative Events (events 1-5)

Please note that only Event 1 (manual start) and Event 2 
(manual stop) are mandatory administrative events. All 
other administrative events are optional.

 Event1: Manual start 

	   Definition: Administrator manually starts peer
                       connection
>>
> Syntax: Administrator -> Local system administrator
>         For consistency with Event2 below. 
>>

	   Status: 	Mandatory

 Event2: Manual stop

	   Definition: Local system administrator manually 
                       stops the peer connection.

	   Status: 	mandatory
   

 Event3: Automatic Start

 	   Definition: Local system automatically starts the 
                       BGP peer session

	   Status:     Optional depending on local system

 Event4: Manual start with passive Transport Establishment

	   Definition: Administrator manually start the peer
>>
> Syntax: Administrator maually start->
>          Local system administrator manually starts
>
>>
                       Connection, but has the passive flag 
                       enabled.  The passive flag indicates 
                       that the peer will listen prior to 
                       establishing the connection.

	   Status:     Optional depending on local system
		

 Event5: Automatic start with passive Transport establishment

	  Definition:  Local system automatically starts the
                       BGP session with the passive flag 
                       enabled.  The passive flag indicates 
                       that the peer will listen prior to 
                       establishing a connection.

	  Status:      Optional depending on local system use
		       of a passive connection.

 Event6: Automatic start with bgp_stop_flap option set

	   Definition: Local system automatically starts the 
                       BGP peer session with persistent peer 
                       oscillation damping enabled.  The exact 
                       method of damping persistent peer 
                       oscillations is left up to the 
                       implementation.   These methods of 
                       damping persistent BGP adjacency 
                       flapping are outside the scope of this 
                       document.
 

           Status: 	Optional, used only if the bgp peer has 
	  	        Enabled a method of damping persistent
		        BGP peer flapping.


 Event7: Auto stop

	   Definition: Local system automatically stops the
                       BGP peer session.  
	
	   Status:     Optional depending on local system


8.1.2 Timer Events (events 8-11)

 Event8:  idle Hold timer expires 

>>
> Naming Consistency: idle Hold timer --> IdleHold timer
>>

           Definition: Idle Hold timer expires.  The Idle 
                       Hold Timer is only used when persistent  
>>
> Naming Consistency: idle Hold timer --> IdleHold timer
>>

                       BGP oscillation damping functions are 
                       enabled.  

           Status: 	Optional.  Used when persistent
			BGP peer oscillation damping functions
			are enabled. 



 Event9: connect retry timer expires

>>
> Naming Consistency: connect retry timer --> ConnectRetry timer
>>
           Definition: An event triggered by the expiration of 
                       the ConnectRetry timer.
  
	   Status:     Mandatory

 Event10: hold time expires

	   Definition: An event generated when the HoldTimer 
                       expires.

>>
> Naming COnsistency: HoldTimer --> Hold timer
>>

	   Status:     Mandatory

 Event11: keepalive timer expires


>>
> Naming Consistency: Keepalive --> KeepAlive
>>

	   Definition: A periodic event generated due to the 
                       expiration of the KeepAlive Timer. 
 
	   Status:     Mandatory

 Event12: DelayBGP Open timer expires [changed]


>>
> NamingConsistency: DelayBGP Open -> DelayBGPOpen 
>>
	
	   Definition: A timer that delays sending of the BGP
	               Open message for n seconds after the
                       Transport connection has been completed. 

	   status:     Optional

8.2.3) Tranport Message based (13-16) 

 Event13: Transport Connection Indication & valid remote peer[changed]

	   Definition: Event indicating that transport
                       Request valid source IP address and TCP 
                       port, and valid destination IP address 
                       and TCP Port.  The definition of 
                       invalid Source, and invalid Destination 
                       IP address is left to the implementation.
                       BGP's destination port should be port 
                       179 as defined by IANA.

>>
> Comment:
>   If it is the intention of the text to allow BGP to use other
>   transport protocol besides TCP, then this text will probably 
>   need to be modified a bit in the lines of:
>
>           Event indicating that a transport connection is 
>           initiated by a peer with a valid source and 
>           destination address
>                      
>           The definition of valid source, and valid destination 
>           address is left to the implementation.
>                                              
>           If BGP is using TCP as transport protocol, the event 
>           is denoted by the local system receiving a TCP SYN 
>           on port  179 as defined by IANA from a peer, with a valid 
>           source and destination IP address, and a valid source
>            TCP source port.
>
>>


	   Status:     Mandatory

 Event14: RCV Transport Connection Indication and Invalid Source or 
          Destination [changed]

           Definition: Transport request received with either
 		       an invalid source address or port
		       number or an invalid destination 
                       address or port number. BGP destination 
                       port  number should be 179 as defined 
                       by IANA.

	    Status: 	 Mandatory
	
>>
> Same comment as for Event13. Here the text could become something like:
>
>                       Event indicating that a transport connection is 
>                       initiated by a peer with an invalid source and/or
>                       destination address
>                       
>                       The definition of invalid source, and invalid
>                       destination address is left to the implementation.
>                                              
>                       If BGP is using TCP a transport protocol, the event 
>                       is denoted by the local system receiving a TCP SYN 
>                       on port 179 with an invalid source IP address, 
>                       source TCP port, or destination IP address.
>>


 Event15: Transport Connection request sent received an ACK.

           Definition: Local system's request to establish a transport
		       Connection to the remote side received
		       an ACK.
>>
> Typo:  remote side received an ACK -> remote side, and received an ACK.
>>

	    Status:    Mandatory

 Event16: Transport Connection Confirmed [changed]

	   Definition: The local system has received a Confirmation that
                       the transport connection has been established by
		       the remote site. 

	  Status:      Mandatory

 Event17: Transport connection fails [changed]

           Definition: This BGP peer receives a transport
		       connection failure notice.  This
		       connection notice could be caused by a
		       Transport disconnect message or a
		       Timeout in the transport session.

                       If this connection is using TCP, the
		       remote BGP peer's TCP machine could
		       have sent a FIN.  The local peer would
		       respond with a FIN-ACK. Another
		       alternative is that the local peer
		       indicated a timeout in the TCP session
		       and downed the connection.
>>
> Comment: what about the TCP signal RST. Should this not also be
>          considered as an error?
>>



	    Status:    Mandatory


8.1.4 BGP Messages (18-27)

 Event18: BGPOpen

           Definition:  An event indicating that a valid Open 
>>
> Naming Consistency:  Open -> BGP Open
>>
			message has been received.

	   Status: 	Mandatory

 Event19: BGPOpen with BGP Delay Open Timer running [changed]

           Definition: An event indicating that a valid Open 
                       Message has been successful 
                       established for a peer that is 
                       currently delaying the sending of an 
                       BGP Open message. 
>>
> 
>"...for a peer that is currently delaying the sending of an 
> BGP Open message." 
>
> In fact I guess it is not the peer who is delaying the sending of 
> the BGP Open, but the local system. proposed rephrase"
>
>"...for a peer for which the local peer is currently delaying the
>    sending of a BGP Open message." 
>
>>


	    Status:    Optional

 Event20: BGPHeaderErr

	   Definition: BGP message header is not valid.  

	   Status:     Mandatory

 Event21: BGPOpenMsgErr 
	   Definition: An BGP Open message has been received 
                        with errors. 
 
	   Status: 	Mandatory


 Event22: Open collision discard

           Definition: An event generated administratively 
                       when a connection Collision has been 
                       detected while processing an incoming  
                       Open message. This connection has been 
>>
> Naming Consystency: Open message -> BGP Open message
>>
                       selected to disconnected.  See section 

>>
> Typo: selected to disconnected -> selected to be disconnected.
>>

                       6.8 for more information on collision
	               detection.  

                       Event 22 is an administrative could 
                       occur if FSM is implemented as two 
                       linked state machines.  

>>
> Typo:  Event 22 is an administrative could occur if FSM -> 
>        Event22 is an administrative event that could occur if the FSM
>>
  

	  Status:      Optional

 Event23: NotifMsgVerErr

	    Definition: An event is generated when a 
                        NOTIFICIATION message with "version   
                        error" is received.

	    Status:     Mandatory


 Event24: NotifMsg

	    Definition: An event is generated when a 
                        NOTIFICATION messages is received and
                        the error code is anything but 	 
			"version error".

	     Status:	Mandatory
		        
 Event25: KeepAliveMsg

	    Definition: An event is generated when a KEEPALIVE
			Message is received.

		Status: Mandatory

 Event26: UpdateMsg

	    Definition: An event is generated when a valid 
			Update Message is received.

		Status: Mandatory

 Event27: UpdateMsgErr

	    Definition: An event is generated when an invalid
			Update message is received.

		Status:	Mandatory

 
8.2) Description of FSM


Idle state

   Initially BGP is in the Idle state.   

>>
>
> In fact BGP itself is not in the Idle state, but the FSM for a given 
> peer. Therefore I would suggest a rephrase to:
>
>   Initially the BGP FSM for a peer is in the Idle state. 
>
>
>>


   In this state BGP refuses all incoming BGP connections.  No 
   resources are allocated to the peer.    In response to a 
   manual start event(Event1) or an automatic start 
   event(Event3), the local system 
      -	initializes all BGP resources, 

>>
>
> The text says that "No resources are allocated to the peer. "
> Therefore i would suggest to rephrase this bullet as:
>
>     - allocates and initializes all BGP resources,
>
>  This also matches better with the text furtheron where 
>  sometimes I read " release all BGP resources and return to Idle state"
>>
 


      -	sets the Connect retry counter to zero
      -	starts the ConnectRetry timer with initial value, 
      -	initiates a transport connection to the other BGP 
        peer, 
      -	listens for a connection that may be initiated by 
        the remote BGP peer, and
        [Action A in the FSM table]
      -	changes its state to connect.

>>
>1. Naming Consistency:
>      -	sets the ConnectRetryCnt to zero
>      -	changes its state to Connect.
>
>
>2. I see nowhere a place where the value of the ConnectRetry timer
>   is changed. Is then not enough to say
>     - start the ConnectRetry timer.
>>


  An manual stop event (Event2) is ignored in the Idle state.

  In response to a manual start event with the passive Transport
  flag (Event 4) or automatic start with the passive Transport flag 
  (Event 5), the local system:
      -	initializes all BGP resources,
>>
> Same remark:  suggested new text:
>  
>     - allocates and initializes all BGP resources,
>>

      -	sets the connect retry count to zero,
      -	start the Connect Retry timer with initial value, 
>>
> Naming Consistency: 
>      -	sets the ConnectRetryCnt to zero,
>      -	start the ConnectRetry timer with initial value, 
>>

      -	listens for a connection that may be initiated by 
        the remote peer
        [Action B in the FSM table] 
      -	changes its state to Active.

  The exact value of the ConnectRetry timer is a local 
  matter, but it should be sufficiently large to allow TCP 
  initialization.   

  If a persistent BGP peer oscillation damping function is 
  enabled, two additional events may occur within Idle state:  
      -	Automatic start with bgp_stop_flap set [Event6],
      -	Idle Hold Timer expired [Event 8].

  The method of preventing persistent BGP peer oscillation is 
  outside the scope of this document.

  Any other events [Events 9-27] received in the Idle state, 
  are noted by the MIB processing as FSM Errors [action V]
  and the local peer stays in the Idle State.


Connect State:

  In this state, BGP is waiting for the transport protocol 
  connection to be completed.

  If the transport connection succeeds [Event 15 or 
  Event 16], the local system checks the "Delay Open 
  Flag".  If the Delay Open flag is set, the local system: 
     -	clears the ConnectRetry timer, 
     -	set the BGP Open Delay timer to the initial 
        value.

>>
> Naming Consistency:
>    
>   - stop the ConnectRetry timer, 
>   - start the BGPOpenDelay timer with its initial value
>
>
>>

        [Action ZZ in FSM table]

  If the Delay Open flag is not set, the local system: 
     - clears the ConnectRetry timer, 
     - completes BGP initialization
     - send an Open message to its peer,
     - sets Hold timer to a large value,
>>
> Naming Consistency:
>
>    - stop the ConnectRetry timer,
>    - send a BGP Open message to its peer,
>    - start the Hold timer with a large value
>
>>


     - Change the state to Open Sent 
   [Action H in the FSM table]

>>
> In all other cases the state change is not part of the action itself.
> Should therefore the order not be reversed to:
>
>  [Action H in the FSM table], and
>  - Change the state to Open Sent 
>   
>>


  A hold timer value of 4 minutes is suggested.

>>
> Question:
>The text above assumes the same behavior for both event 15 and 16. In 
>reality I  wonder whether most implementations would not use 2 linked
> FSM machines for this. In this case a scenario could be:
>- if 15 arrives: continue as described in the text;
>- if 16 arrives: As this is in fact a new connection that comes in 
>orginated by the remote peer, create a new FSM.
>  The new FSM then eithers starts the delay timer and goes into connect 
>state, or sends an OPEN message and goes into OPEN sent state.
>
>This behavior is it seems also more in line with section 6.8 of the BGP RFC 
>on collisions.  
>
>Would it make sense to work this out any further?
>
>>


  If the Open Delay timer expires [Event 12] in the connect 
  state,
>>
> Naming consistency BGPOpenDelay
>>

     - send an Open message to its peer,
     - set the Hold timer to a large value,
>>
> Naming consistency:
> - start the Hold timer with a large value
>>

	[Action H in the FSM Table], and
     - change the state to Open Sent. 

  If the BGP port receives a Transport connection indication 
  [Event 13], the Transport connection is processed (actions AA or
  AB in the FSM table) and the connection remains
  in the connected state. 

>>
> If it is the intention to make this text independent of the Transport
> protocol, it might be better to remove the term port:
>
>    If a Transport connection indication [Event13] is received, the event 
>    is processed (actions AA orAB in the FSM table) and the connection
>    remains in the Connect state. 
>>

  If the transport connection receives an Transport indication
  that is invalid or unconfigured. [Event 14]:
     - the transport connection is rejected.
	[Action L in the FSM table]
>>
> This seems to me like a strange case. We are in the Connect state for
> a session. The session is identified sofar with a source and destination 
> IP address and (in case of TCP the correct TCP port). 
>
> If a Transport indication would come in that either has invalid info
> or which is not configured, I can not really see how this event could be
> 'dropped' inside this FSM instance. I would assume that this request
> would be treated outside the scope of any peer FSM, in the
> 'BGP-main-session'
>
>>


  If the transport connection fails (timeout or transport
  disconnect) [Event17], the local system:
      - restarts the ConnectRetry timer, 
      - continues to listen for a connection that may be 
        initiated by the remote BGP peer, and
     [Action G in the FSM table]
      - changes its state to Active. 


  If an Open is received with the BGP Delay Open timer is 
  running [Event 19], the local system:
	- clears the connect retry timer (cleared to zero),
	- completes the BGP initialization, 
	- Stops and clears the BGP Open Delay timer
	- Sends an Open message

>>
>
> 1. I think an action is missing here: The setting of the Hold timer.
>
> 2. For naming consistency: 
>
>       - stops the ConnectRetry timer
>	- completes the BGP initialization, 
>	- Stops the BGPOpenDelay timer
>	- Sends an Open message
>       
>     ? - set the hold timer to a large value?
>>


	[Action H in the FSM table], and
	- changes its state to Open Confirm.


 The start events [Event 1, 3-6] are ignored in connect 
 state.


 A manual stop event[Event2], the local system:
       - drops the transport connection, 
       - releases all BGP resources,
       - sets connect retry count to zero
       - resets the connect retry timer (sets to zero)
      [Action Z in the FSM table]
       - goes to Idle state.

>>
>  Would it not be enough to say:
>
>  If a manual stop event[Event2] is received, the local system:
>    - drops the transport connection, 
>    - stops all running timers,
>    - releases all BGP resources,    
>      [Action Z in the FSM table], and
>    - goes to Idle state.
>
>   Note that it is indicated in the idle state that
>   - the connct retry count is set to 0; and
>   - the ConnectRetry timer is started with its inital value.
>
>   Also note that I moved the clearing of the resources to the end
>   as, I guess , resources are needed to clear the timers.
>
>>


  In response to the ConnectRetry timer expired event(Event 
  9), the local system: 
      - Sets the MIB FSM error information with ConnectRetry 
        expired, 
      - drops the transport connection 
      - restarts the ConnectRetry timer
      - initiates a transport connection to the other BGP 
        peer, 
      - continues to listen for a connection that may be 
        initiated by the remote BGP peer, and
      [Action O in the FSM table]
      - stays in Connect state.

 In response to any other events [Events 7-8, 10-11, 18, 20-
 27] the local system:


     - resets the connect retry timer (sets to zero),
     - drops the Transport connection,
     - release all BGP resources, 
     - increments the ConnectRetryCnt by 1,
     - [optionally] performs bgp peer oscillation damping, and
     [Action D in the FSM table],
     - goes to Idle state.

>>
> 
> Note that in the Idle state 'all resources' for this peer will
> be re-allocated and initialised.
> 
> So, with the current definition of the Idle state you will not be
> able to increment the connectRetryCnt if you move into Idle state.
> A solution might be to create a new init-state where resources are
> allocated an intialised, and which is then braught into idle state.
> In this way the resources intialisation can be removed from the Idle
> state and hence the ConnectRetryCnt can be kept.
>
>>


Active State:

 In this state BGP is trying to acquire a peer by listening 
 for and accepting a transport protocol connection.  

 A transport connection succeeds [Event 15 or Event 16], the 
 local system: process the transport connection flags
	- If the BGP delay open flag is set:
          o clears the ConnectRetry timer, 
          o completes the BGP initialization,
          o sets the BGP delay Open timer
         [Action ZZ]
>>
>
>Naming consistency:
>          o stops the ConnectRetry timer, 
>          o completes the BGP initialization,
>          o starts the BGPOpenDelay timer
>
>> 

        - If the BGP delay open flag is not set:
          o clears the ConnectRetry timer,
          o completes the BGP initialization, 
          o sends the Open message to it's peer, 
          o sets its Hold timer to a large value, 
         [Action H in the FSM table]
          and changes its state to OpenSent.  

>>
>Naming consistency
>
>          o stops the ConnectRetry timer,
>          o sends the BGP Open message to it's peer, 
>          o start the Hold timer with a large value, 
>>

 A Hold timer value of 4 minutes is suggested.  

 If the local system receives a valid Transport Indication
 [Event 13], the local system processes the Transport flags
  (actions aa or ab in FSM section 2.3.4).  

>>
>
> This way of describing actions, means that the complete sepcification
> for basic BGP is not longer complete without the FSM document.
>
> Is this the way the RFC is intended to evolve?
>>


 If the local system receives a Transport indication
 that is invalid for this connection [Event 14]:
      - the transport connection is rejected
        [Action L in the FSM table]

>>
> Same comment as for [event 14] in the previous state.
>>

 If the local system receives a Transport connection
 failed [Event 17] (timeout or receives Transport
 disconnect), the local system will:
	- set Transport disconnect in the MIB reason code,
	- restart ConnectRetry timer (with initial value)
	- release all BGP resources
	- Acknowledge the drop of Transport connection if
          transport disconnect (If TCP, send a FIN ACK),
	- Increment ConnectRetryCnt by 1,
	- perform the BGP peer oscillation damping process [2]
	[Action Y in the FSM table]

>>
>1.  To what state is the FSM going here? As all BGP resources are released
>    it seems to me that we can do little more then to go to Idle state?
>
>2. In case of TCP a RST signal is also possible I guess. In this case
>   no FIN ACK is needed.
>
>> 

 If the local system has the Delay Open timer expired [event 
 12] local system: 
        - clears the Connect Retry timer (set to zero),
	- stops and clears the Delay Open timer (set to zero) 
        - completes the BGP initialization,
        - sends the Open message to it's remote peer,
        - sets it's Hold timer to a large value,
        [Action H in the FSM table]
        - and set the state to Open Confirm.

>>
> 1. Naming consistency (not repeated in detail hereafter.
>
> 2. Should we not move to the state Open Sent? In the State Open Confirm
>    the OPEN message from the peer will not longer be accepted.
>>

 A Hold timer value of 4 minutes is also suggested for this 
 state transition.

 If an Open is received with the BGP Delay Open timer is 
 running [Event 19], the local system 
	- clears the connect retry timer (cleared to zero),
	- Stops and clears the BGP Open Delay timer  
	- completes the BGP initialization, 
	- Stops and clears the BGP Open Delay timer
	- Sends an Open message
	- Set the Hold timer to a large value (4 minutes),
	[Action H in the FSM table], and
	- changes its state to Open Confirm.


 In response the ConnectRetry timer expired event[Event9], 
 the local system:
        - restarts the ConnectRetry timer (with initial value),
        - initiates a transport connection to the other BGP 
          peer,
        - Continues to listen for transport connection that may be 
          initiated by remote BGP peer, 
        [Action F in the FSM table]
	- and changes its state to Connect.  


 The start events [Event1, 3-6] are ignored in the Active 
 state.

 A manual stop event[Event2], the local system:
        - Sets the administrative down in the MIB reason code,
	- Sends a notification with a Cease,
	- If any BGP routes exist, delete the routes
	- release all BGP resources, 
	- drops the Transport connection, 
        - sets connect retry count to zero
        - resets the connect retry timer (sets to zero)
        [Action S in the FSM table]
        - goes to Idle state.

 In response to any other event (Events 7-8, 10-11,18, 20-
 27), the local system:
        - stores the MIB information to indicate appropriate 
          error [FSM for Events 7-8, 10-11, 18, 20-27]
        - reset the connect retry timer (sets to zero),
        - release all BGP resources, 
	- drops the transport connection, 
        - increments the ConnectRetryCnt by one,
        - optionally performs BGP peer oscillation damping, 
  	  [Action D in FSM table],
	- and goes to the idle state


--------------149A43A8660ACEFC11F84974--



Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id CAA28128 for <idr-archive@nic.merit.edu>; Thu, 18 Jul 2002 02:14:37 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 9F9DA91234; Thu, 18 Jul 2002 02:14:07 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 571EB91235; Thu, 18 Jul 2002 02:14:07 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id BC31891234 for <idr@trapdoor.merit.edu>; Thu, 18 Jul 2002 02:14:03 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 892445DD8F; Thu, 18 Jul 2002 02:14:03 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from mail-green.research.att.com (mail-green.research.att.com [135.207.30.103]) by segue.merit.edu (Postfix) with ESMTP id DE6885DE04 for <idr@merit.edu>; Thu, 18 Jul 2002 02:14:01 -0400 (EDT)
Received: from alliance.research.att.com (alliance.research.att.com [135.207.26.26]) by mail-green.research.att.com (Postfix) with ESMTP id 182001E017 for <idr@merit.edu>; Thu, 18 Jul 2002 02:14:01 -0400 (EDT)
Received: from windsor.research.att.com (windsor.research.att.com [135.207.26.46]) by alliance.research.att.com (8.8.7/8.8.7) with ESMTP id CAA18714 for <idr@merit.edu>; Thu, 18 Jul 2002 02:13:59 -0400 (EDT)
From: Bill Fenner <fenner@research.att.com>
Received: (from fenner@localhost) by windsor.research.att.com (8.8.8+Sun/8.8.5) id XAA16290; Wed, 17 Jul 2002 23:14:00 -0700 (PDT)
Message-Id: <200207180614.XAA16290@windsor.research.att.com>
MIME-Version: 1.0
Content-Type: multipart/mixed; boundary=19701020; charset=US-ASCII
To: idr@merit.edu
Subject: FSM words for the draft-18
Date: Wed, 17 Jul 2002 23:13:59 -0700
Versions: dmail (solaris) 2.4c/makemail 2.9d
Sender: owner-idr@merit.edu
Precedence: bulk

--19701020
Content-Type: text/plain; charset=US-ASCII
Content-Disposition: inline


Sue asked me to forward this since it didn't appear on the list.
Sorry if this causes duplicates.

  Bill

----- Begin forwarded message:

From: Susan Hares <skh@nexthop.com>
Subject: FSM words for the draft-18
Date: Wed, 17 Jul 2002 23:24:40 -0400
To: idr@merit.edu
Cc: yakov Rekhter <yakov@juniper.net>, fenner@research.att.com,
    zinin@psg.com



I would like to propose that section 8 be replaced with the
attached words.


Sue Hares


--19701020
Content-Type: text/plain; name="FSM-words-05b.txt"; x-unix-mode=0644
Content-Disposition: attachment; filename="FSM-words-05b.txt"



Note: (this is version 5 of the changes to
       the text.) 

8.0	BGP Finite State Machine


This section specifies the BGP operation in terms of a 
Finite State Machine (FSM).  The section falls into 2 
parts:

    1)	Description of Events for the State machine (section 
        8.1
    2)	Description of the FSM (section 8.2)

Session Attributes required for each connection are; 

   1)	state
   2)	Connect Retry timer
   3)	Hold timer
   4)	Hold time
   5)	Keepalive timer

8.1 Events for the BGP FSM

8.1.1   Administrative Events (events 1-5)

Please note that only Event 1 (manual start) and Event 2 
(manual stop) are mandatory administrative events. All 
other administrative events are optional.

 Event1: Manual start 

	   Definition: Administrator manually starts peer
                       connection
	   Status: 	Mandatory

 Event2: Manual stop

	   Definition: Local system administrator manually 
                       stops the peer connection.

	   Status: 	mandatory
	   

 Event3: Automatic Start

 	   Definition: Local system automatically starts the 
                       BGP peer session

	   Status:     Optional depending on local system

 Event4: Manual start with passive Transport Establishment

	   Definition: Administrator manually start the peer
                       Connection, but has the passive flag 
                       enabled.  The passive flag indicates 
                       that the peer will listen prior to 
                       establishing the connection.

	   Status:     Optional depending on local system
		

 Event5: Automatic start with passive Transport establishment

	  Definition:  Local system automatically starts the
                       BGP session with the passive flag 
                       enabled.  The passive flag indicates 
                       that the peer will listen prior to 
                       establishing a connection.

	  Status:      Optional depending on local system use
		       of a passive connection.

 Event6: Automatic start with bgp_stop_flap option set

	   Definition: Local system automatically starts the 
                       BGP peer session with persistent peer 
                       oscillation damping enabled.  The exact 
                       method of damping persistent peer 
                       oscillations is left up to the 
                       implementation.   These methods of 
                       damping persistent BGP adjacency 
                       flapping are outside the scope of this 
                       document.
 

           Status: 	Optional, used only if the bgp peer has 
	  	        Enabled a method of damping persistent
		        BGP peer flapping.


 Event7: Auto stop

	   Definition: Local system automatically stops the
                       BGP peer session.  
	
	   Status:     Optional depending on local system


8.1.2 Timer Events (events 8-11)

 Event8:  idle Hold timer expires 

           Definition: Idle Hold timer expires.  The Idle 
                       Hold Timer is only used when persistent  
                       BGP oscillation damping functions are 
                       enabled.  

           Status: 	Optional.  Used when persistent
			BGP peer oscillation damping functions
			are enabled. 



 Event9: connect retry timer expires

           Definition: An event triggered by the expiration of 
                       the ConnectRetry timer.
  
	   Status:     Mandatory

 Event10: hold time expires

	   Definition: An event generated when the HoldTimer 
                       expires.

	   Status:     Mandatory

 Event11: keepalive timer expires

	   Definition: A periodic event generated due to the 
                       expiration of the KeepAlive Timer. 
 
	   Status:     Mandatory

 Event12: DelayBGP Open timer expires [changed]
	
	   Definition: A timer that delays sending of the BGP
	               Open message for n seconds after the
                       Transport connection has been completed. 

	   status:     Optional

8.2.3) Tranport Message based (13-16) 

 Event13: Transport Connection Indication & valid remote peer[changed]

	   Definition: Event indicating that transport
                       Request valid source IP address and TCP 
                       port, and valid destination IP address 
                       and TCP Port.  The definition of 
                       invalid Source, and invalid Destination 
                       IP address is left to the implementation.
                       BGP's destination port should be port 
                       179 as defined by IANA.


	   Status:     Mandatory

 Event14: RCV Transport Connection Indication and Invalid Source or 
          Destination [changed]

           Definition: Transport request received with either
 		       an invalid source address or port
		       number or an invalid destination 
                       address or port number. BGP destination 
                       port  number should be 179 as defined 
                       by IANA.

	    Status: 	 Mandatory
	
 Event15: Transport Connection request sent received an ACK.

           Definition: Local system's request to establish a transport
		       Connection to the remote side received
		       an ACK.


	    Status:    Mandatory

 Event16: Transport Connection Confirmed [changed]

	   Definition: The local system has received a Confirmation that
                       the transport connection has been established by
		       the remote site. 

	  Status:      Mandatory

 Event17: Transport connection fails [changed]

           Definition: This BGP peer receives a transport
		       connection failure notice.  This
		       connection notice could be caused by a
		       Transport disconnect message or a
		       Timeout in the transport session.

                       If this connection is using TCP, the
		       remote BGP peer's TCP machine could
		       have sent a FIN.  The local peer would
		       respond with a FIN-ACK. Another
		       alternative is that the local peer
		       indicated a timeout in the TCP session
		       and downed the connection.

	    Status:    Mandatory


8.1.4 BGP Messages (18-27)

 Event18: BGPOpen

           Definition:  An event indicating that a valid Open 
			message has been received.

	   Status: 	Mandatory

 Event19: BGPOpen with BGP Delay Open Timer running [changed]

           Definition: An event indicating that a valid Open 
                       Message has been successful 
                       established for a peer that is 
                       currently delaying the sending of an 
                       BGP Open message. 

	    Status:    Optional

 Event20: BGPHeaderErr

	   Definition: BGP message header is not valid.  

	   Status:     Mandatory

 Event21: BGPOpenMsgErr 
	   Definition: An BGP Open message has been received 
                        with errors. 
 
	   Status: 	Mandatory


 Event22: Open collision discard

           Definition: An event generated administratively 
                       when a connection Collision has been 
                       detected while processing an incoming  
                       Open message. This connection has been 
                       selected to disconnected.  See section 
                       6.8 for more information on collision
	               detection.  

                       Event 22 is an administrative could 
                       occur if FSM is implemented as two 
                       linked state machines.  

	  Status:      Optional

 Event23: NotifMsgVerErr

	    Definition: An event is generated when a 
                        NOTIFICIATION message with "version   
                        error" is received.

	    Status:     Mandatory

 Event24: NotifMsg

	    Definition: An event is generated when a 
                        NOTIFICATION messages is received and
                        the error code is anything but 	 
			"version error".

	     Status:	Mandatory
		        
 Event25: KeepAliveMsg

	    Definition: An event is generated when a KEEPALIVE
			Message is received.

		Status: Mandatory

 Event26: UpdateMsg

	    Definition: An event is generated when a valid 
			Update Message is received.

		Status: Mandatory

 Event27: UpdateMsgErr

	    Definition: An event is generated when an invalid
			Update message is received.

		Status:	Mandatory

 
8.2) Description of FSM


Idle state

   Initially BGP is in the Idle state.   

   In this state BGP refuses all incoming BGP connections.  No 
   resources are allocated to the peer.    In response to a 
   manual start event(Event1) or an automatic start 
   event(Event3), the local system 
      -	initializes all BGP resources, 
      -	sets the Connect retry counter to zero
      -	starts the ConnectRetry timer with initial value, 
      -	initiates a transport connection to the other BGP 
        peer, 
      -	listens for a connection that may be initiated by 
        the remote BGP peer, and
        [Action A in the FSM table]
      -	changes its state to connect.

  An manual stop event (Event2) is ignored in the Idle state.

  In response to a manual start event with the passive Transport
  flag (Event 4) or automatic start with the passive Transport flag 
  (Event 5), the local system:
      -	initializes all BGP resources,
      -	sets the connect retry count to zero,
      -	start the Connect Retry timer with initial value, 
      -	listens for a connection that may be initiated by 
        the remote peer
        [Action B in the FSM table] 
      -	changes its state to Active.

  The exact value of the ConnectRetry timer is a local 
  matter, but it should be sufficiently large to allow TCP 
  initialization.   

  If a persistent BGP peer oscillation damping function is 
  enabled, two additional events may occur within Idle state:  
      -	Automatic start with bgp_stop_flap set [Event6],
      -	Idle Hold Timer expired [Event 8].

  The method of preventing persistent BGP peer oscillation is 
  outside the scope of this document.

  Any other events [Events 9-27] received in the Idle state, 
  are noted by the MIB processing as FSM Errors [action V]
  and the local peer stays in the Idle State.


Connect State:

  In this state, BGP is waiting for the transport protocol 
  connection to be completed.

  If the transport connection succeeds [Event 15 or 
  Event 16], the local system checks the "Delay Open 
  Flag".  If the Delay Open flag is set, the local system: 
     -	clears the ConnectRetry timer, 
     -	set the BGP Open Delay timer to the initial 
        value.
        [Action ZZ in FSM table]

  If the Delay Open flag is not set, the local system: 
     - clears the ConnectRetry timer, 
     - completes BGP initialization
     - send an Open message to its peer,
     - sets Hold timer to a large value,
     - Change the state to Open Sent 
   [Action H in the FSM table]

  A hold timer value of 4 minutes is suggested.

  If the Open Delay timer expires [Event 12] in the connect 
  state,
     - send an Open message to its peer,
     - set the Hold timer to a large value,
	[Action H in the FSM Table], and
     - change the state to Open Sent. 

  If the BGP port receives a Transport connection indication 
  [Event 13], the Transport connection is processed (actions AA or
  AB in the FSM table) and the connection remains
  in the connected state. 

  If the transport connection receives an Transport indication
  that is invalid or unconfigured. [Event 14]:
     - the transport connection is rejected.
	[Action L in the FSM table]

  If the transport connection fails (timeout or transport
  disconnect) [Event17], the local system:
      - restarts the ConnectRetry timer, 
      - continues to listen for a connection that may be 
        initiated by the remote BGP peer, and
     [Action G in the FSM table]
      - changes its state to Active. 


  If an Open is received with the BGP Delay Open timer is 
  running [Event 19], the local system:
	- clears the connect retry timer (cleared to zero),
	- completes the BGP initialization, 
	- Stops and clears the BGP Open Delay timer
	- Sends an Open message

	[Action H in the FSM table], and
	- changes its state to Open Confirm.


 The start events [Event 1, 3-6] are ignored in connect 
 state.

 A manual stop event[Event2], the local system:
       - drops the transport connection, 
       - releases all BGP resources,
       - sets connect retry count to zero
       - resets the connect retry timer (sets to zero)
      [Action Z in the FSM table]
       - goes to Idle state.

  In response to the ConnectRetry timer expired event(Event 
  9), the local system: 
      - Sets the MIB FSM error information with ConnectRetry 
        expired, 
      - drops the transport connection 
      - restarts the ConnectRetry timer
      - initiates a transport connection to the other BGP 
        peer, 
      - continues to listen for a connection that may be 
        initiated by the remote BGP peer, and
      [Action O in the FSM table]
      - stays in Connect state.

 In response to any other events [Events 7-8, 10-11, 18, 20-
 27] the local system:
     - resets the connect retry timer (sets to zero),
     - drops the Transport connection,
     - release all BGP resources, 
     - increments the ConnectRetryCnt by 1,
     - [optionally] performs bgp peer oscillation damping, and
     [Action D in the FSM table],
     - goes to Idle state.


Active State:

 In this state BGP is trying to acquire a peer by listening 
 for and accepting a transport protocol connection.  

 A transport connection succeeds [Event 15 or Event 16], the 
 local system: process the transport connection flags
	- If the BGP delay open flag is set:
          o clears the ConnectRetry timer, 
          o completes the BGP initialization,
          o sets the BGP delay Open timer
         [Action ZZ]

        - If the BGP delay open flag is not set:
          o clears the ConnectRetry timer,
          o completes the BGP initialization, 
          o sends the Open message to it's peer, 
          o sets its Hold timer to a large value, 
         [Action H in the FSM table]
          and changes its state to OpenSent.  

 A Hold timer value of 4 minutes is suggested.  

 If the local system receives a valid Transport Indication
 [Event 13], the local system processes the Transport flags
  (actions aa or ab in FSM section 2.3.4).  

 If the local system receives a Transport indication
 that is invalid for this connection [Event 14]:
      - the transport connection is rejected
        [Action L in the FSM table]

 If the local system receives a Transport connection
 failed [Event 17] (timeout or receives Transport
 disconnect), the local system will:
	- set Transport disconnect in the MIB reason code,
	- restart ConnectRetry timer (with initial value)
	- release all BGP resources
	- Acknowledge the drop of Transport connection if
          transport disconnect (If TCP, send a FIN ACK),
	- Increment ConnectRetryCnt by 1,
	- perform the BGP peer oscillation damping process [2]
	[Action Y in the FSM table]

 If the local system has the Delay Open timer expired [event 
 12] local system: 
        - clears the Connect Retry timer (set to zero),
	- stops and clears the Delay Open timer (set to zero) 
        - completes the BGP initialization,
        - sends the Open message to it's remote peer,
        - sets it's Hold timer to a large value,
        [Action H in the FSM table]
        - and set the state to Open Confirm.

 A Hold timer value of 4 minutes is also suggested for this 
 state transition.

 If an Open is received with the BGP Delay Open timer is 
 running [Event 19], the local system 
	- clears the connect retry timer (cleared to zero),
	- Stops and clears the BGP Open Delay timer  
	- completes the BGP initialization, 
	- Stops and clears the BGP Open Delay timer
	- Sends an Open message
	- Set the Hold timer to a large value (4 minutes),
	[Action H in the FSM table], and
	- changes its state to Open Confirm.


 In response the ConnectRetry timer expired event[Event9], 
 the local system:
        - restarts the ConnectRetry timer (with initial value),
        - initiates a transport connection to the other BGP 
          peer,
        - Continues to listen for transport connection that may be 
          initiated by remote BGP peer, 
        [Action F in the FSM table]
	- and changes its state to Connect.  


 The start events [Event1, 3-6] are ignored in the Active 
 state.

 A manual stop event[Event2], the local system:
        - Sets the administrative down in the MIB reason code,
	- Sends a notification with a Cease,
	- If any BGP routes exist, delete the routes
	- release all BGP resources, 
	- drops the Transport connection, 
        - sets connect retry count to zero
        - resets the connect retry timer (sets to zero)
        [Action S in the FSM table]
        - goes to Idle state.

 In response to any other event (Events 7-8, 10-11,18, 20-
 27), the local system:
        - stores the MIB information to indicate appropriate 
          error [FSM for Events 7-8, 10-11, 18, 20-27]
        - reset the connect retry timer (sets to zero),
        - release all BGP resources, 
	- drops the transport connection, 
        - increments the ConnectRetryCnt by one,
        - optionally performs BGP peer oscillation damping, 
  	  [Action D in FSM table],
	- and goes to the idle state


Open Sent:

 In this state BGP waits for an Open Message from its peer.  

 When an OPEN message is received, all fields are checked 
 for correctness.  If there are no errors in the OPEN message 
 [Event 18] the local system:
       - resets the BGP Delay timer to zero,
       - reset BGP Connect Timer to zero, 
       - sends a KEEPALIVE message and
       - sets a KeepAlive timer (via the text below)
       - sets the Hold timer according to the negotiated value 
         (see section 4.2), and 
         [Action N in the FSM table]
       - sets the state to Open Confirm.


 If the negotiated Hold time value is zero, then the Hold 
 Time timer and KeepAlive timers are not started.   If the 
 value of the Autonomous System field is the same as the 
 local Autonomous System number, then the connection is an 
 "internal" connection; otherwise, it is an "external" 
 connection.   (This will impact UPDATE processing as 
 described below.)  


 If the BGP message header checking [Event20] or OPEN message
 check detects an error (see Section 6.2)[Event21], the local system:
       - sends a NOTIFICATION message with appropriate error 
         code, 
       - reset the connect retry timer (sets to zero),
       - if there are any routes associated with the BGP session,
	 delete these routes
       - release all BGP resources,
       - drop the transport session 
       - increments the ConnectRetryCnt by 1, 
       - bgp peer oscillation damping process,  
       [Actions I, J in FSM table, depending on error],
       - and goes to the Idle state.

 Collision detection mechanisms (section 6.8) need to be 
 applied when a valid BGP Open is received [Event 18 or 
 Event 19].  Please refer to section 6.8 for the details of 
 the comparison. An administrative collision detect is when 
 BGP implementation determines my means outside the scope of 
 this document that a connection collision has occurred.   

 If a connection in Open Sent is determined to be the 
 connection that must be closed, an administrative collision 
 detect [Event 22] is signaled to the state machine. If such 
 an administrative collision detect dump [Event 22] is 
 received in Open Sent, the local system:  
       - sets MIB state information to 
         collision detect closure, 
       - send a NOTIFICATION with a CEASE
       - resets the connect retry timer,
       - release all BGP resources,
       - drop the transport connection,
       - increments ConnectRetryCnt by 1,
       - performs any BGP peer oscillation damp process, and
       [Action R in the FSM],
       - enters Idle state.


  If a NOTIFICATION message is received with a version 
  error[Event23], Notification message without version number 
  [Event 24], the local system: 
       - resets the connect retry timer (sets to zero)
       - drops the Transport connection,
       - releases all BGP resources, 
       - increments the ConnectRetryCnt by 1
       - process any BGP peer oscillation damping,
	[Action Y]  
       - and sets the state to Idle.


 The Start events [Event1, 3-6] are ignored in the OpenSent 
 state.

  If a manual stop event [Event 2] is issued in Open sent 
 state, the local system:
	- Sets administrative down reason in MIB reason,
	- sends the Notification with a cease,
	- if BGP routes exists, delete the routes,
	- Release all BGP resources, 
	- Drops the Transport connection, 
	- set ConnectRetryCnt to zero, 
	- resets the Connect Retry timer (set to zero),
	[Action S in the FSM table], and
	- transitions to the Idle state.

  If an automatic stop event [Event 7] is issued in Open sent 
  state, the local system:
	- Sets administrative down reason in MIB reason,
	- sends the Notification with a cease,
	- if any routes are associated with te BGP session,
	  delete the routes, 
	- release all the BGP resources
	- Drops the Transport connection, 
	- increments the ConnectRetryCnt by 1, 
	- BGP peer oscillation process [2]
	[Action C in the FSM table], and
	- transitions to the Idle state.

 If the Hold Timer expires[Event 10], the local system:
       - set Hold timer expired in MIB Error reason code, 
       - send a NOTIFICATION message with error code Hold 
         Timer Expired,
       - reset the connect retry timer (sets to zero),
       - releases all BGP resources,
       - drops the Transport connection, 
       - increments the ConnectRetryCnt by 1
         [Action K in the FSM table], and
	- transitions to the Idle state.


 If a transport indication is received for valid connection 
 [Event 13] or transport Request Acknowledgement [Event 15]
 is received, or a transport connect confirm [Event 16] is
 received a second TCP session may be in progress.  This
 second TCP session is tracked per the Call Collision 
 processing (section 6.8) until an OPEN message is received.

 A TCP connection for an invalid port [Event 14] is ignored. 

 If a Transport Failure [Event17], is received from the 
 underlying transport protocol, the local system:
       - closes the BGP connection, 
       - restarts the Connect Retry timer, 
       - and continues to listen for a connection that may be 
         initiated by the remote BGP peer,
       [Action O in the FSM table]
       - and goes into Active state.


 In response to any other event [Events 8-9, 11-12, 19, 25-27], 
  the local system:
	- sends the NOTIFICATION with the Error Code Finite 
          state machine error,
        - resets the connect retry timer (sets to zero),
	- releases all BGP resources
        - drops the Transport connection, 
        - increments the ConnectRetryCnt by 1, 
        - process any bgp peer oscillation damping[2] 
        [Action E in the FSM table],
        - and sets the state to idle.


Open Confirm State

 In this state BGP waits for a KEEPALIVE or NOTIFICATION 
 message.

 If the local system receives a KEEPALIVE message[Event 25], 
        - restarts the Hold timer, 
          [Action P in the FSM table], and
        - changes its state to Established. 


 If the local system receives a NOTIFICATION message [event 
 23-24] or receives a TCP Disconnect [event 17] from the 
 underlying transport protocol, the local system:  
        - sets the appropriate MIB information for FSM error, 
        - resets the connect retry timer (sets the timer to 
          zero),
	- releases all BGP resources, 
        - drops the TCP connection,
        - increments the ConnectRetryCnt by 1, 
        [Action Y in the FSM table],
        - and sets the state to idle.

 Any start event [Event1, 3-6] is ignored in the OpenConfirm 
 state.

 In response to a manual stop event[Event 2] initiated by 
 the operator, the local system: 
	- set Administrative down in MIB Reason code,
        - sends the NOTIFICATION message with Cease, 
	- if any BGP routes, dete the routes
        - releases all BGP resources, 
	- drop the transport connection, 
        - sets the ConnectRetryCnt to zero
        - sets the connect retry timer to zero
        [Action S in the FSM table]
        - transitions to Idle state.

 In response to the Automatic stop event initiated by the 
 system[event 7], the local system:
        - sets the MIB entry for this peer to administratively 
          down, 
        - sends the NOTIFICATION message with Cease,
	- connect retry timer reset (set to zero)
	- If any BGP routes exist, delete the routes,
	- release all BGP resources,
        - drops the transport connection,
        - increments the ConnectRetryCnt by 1, 
        [Action C in the FSM table], and
        - transitions to the Idle State.

 If the Hold Timer expires before a KEEPALIVE message is 
 received [event10], the local system:
        - set the MIB reason to Hold time expired, 
        - send the NOTIFICATION message with the error code 
          set to Hold Time Expired, 
        - resets the connect retry timer (sets the timer to to 
          zero),
	- releases all BGP resources, 
        - drops the transport connection, 
        - increments the ConnectRetryCnt by 1 
          [Action K in the FSM table], 
        - and sets the state to Idle.


 If the local system receives a KEEPALIVE timer expires 
 event [Event 11], the system:
        - sends a KEEPALIVE message,
        - restarts the Keepalive timer 
        [Action Q the in FSM table],and 
        - remains in Open Confirmed state.

 In the event of TCP establishment [Event 13], or TCP 
 connection succeeding [Event 15 or Event 16] while in Open 
 Confirm, the local system needs to track the 2nd 
 connection.  

 If a TCP connection is attempted to an invalid port [Event 
 14], the local system will ignore the second connection 
 attempt.  

 If an OPEN message is received, all fields are check for 
 correctness.  If the BGP message header checking [Event20] 
 or OPEN message check detects an error (see Section 
 6.2)[Event21], [the local system:
        - sends a NOTIFICATION message with appropriate error 
          code, 
        - resets the connect retry timer (sets the timer to 
          zero),
        - releases all BGP resources, 
        - drops the TCP connection,
        - increments the ConnectRetryCnt by 1, 
        - runs the BGP peer oscillation damping process [2]  
       [Actions I, J, in the FSM table depending on the error]
        - and goes to the Idle state.

 If the Open messages is valid [Event 18], the collision 
 detect function is processed per section 6.8.  If this 
 connection is to be dropped due to call collision, the 
 local system:
        - sets the Call Collision cease in the MIB reason 
         code,
        - sends a Notification with a Cease 
        - resets the Connect timer (set to zero),
	- releases all BGP resources,
	- Drops the TCP connection (send TCP FIN),
	- increments the ConnectRetryCnt by 1, and
	- performs any BGP peer oscillation damping process [2].
	[action 


 If during the processing of another Open message, the BGP 
 implementation determines my means outside the scope of 
 this document that a connection collision has occurred and 
 this connection is to be closed, the local system will 
 issue a call collision dump [Event 22].  When the local 
 system receives a call collision dump event [Event 22], the 
 local system:
        - Sets the MIB FSM variable to indicate collision 
          detected and dump connection.
        - send a NOTIFICATION with a CEASE
        - deletes all routes associated with connection,
        - resets the connect retry timer,
        - releases all BGP resources
        - drops all TCP connection,
        - increments the ConnectRetryCnt by 1, 
        - and performs any BGP peer oscillation damping,
        [Action R in the FSM table],
        - enters Idle state.

 In response to any other event [Events 8-9, 12, 19, 26-27], 
 the local system:
        - sends a NOTIFICATION with a code of Finite State 
          Machine Error,
        - resets the connect retry timer (sets to zero)
        - drops the TCP connection,
        - releases all BGP resources, 
        - increments the ConnectRetryCnt by 1, 
        - performs any BGP peer oscillation damping,
        [Action E in the FSM table], and 
        - transitions to Idle state.


Established State:

 In the Established state BGP can exchange UPDATE, 
 NOTFICATION, and KEEPALIVE messages with its peer.

 If the local system receives an UPDATE message [Event26], 
 the local system will:
	- process the update packet
	- restarts its Hold timer, if the negotiated Hold Time 
          value is non-zero.
	[Action W in the FSM table], and
	- remain in the Established state.

 
 If the local system receives a NOTIFICATION message 
 [Event23 or Event24] or a disconnect [Event17] from the 
 underlying transport protocol, it:
	- sets the appropriate error code in MIB reason code,
	- if any BGP routes exist, delete all BGP routes,  
        - resets the connect retry timer (sets to zero), 
        - releases all the BGP resources, 
	- drops the TCP connection,
	- increments the ConnectRetryCnt by 1, and 
       [Action T in the FSM table]   
	- Goes to the Idle state.


 If the local system receives a Keepalive message
 [Event 25], the local system will:
        - restarts its Hold Timer, if the negotiated Hold Time 
          value is non-zero.
       [Action P in the FSM table], and
	- remain in the Established state.

 If the local system receives an UPDATE message, and the 
 Update message error handling procedure (see Section 6.3) 
 detects an error [Event27], the local system:
       - sends a NOTIFICATION message with Update error,
       - resets the connect retry timer (sets to zero),
       - drops the TCP connection,
       - releases all BGP resources, 
       - increments the ConnectRetryCnt by 1 
       - performs any BGP peer oscillation damping
      [Action U in FSM table], 
       - and goes to Idle state.


 Any start event (Event 1, 3-6) is ignored in the 
 Established state.

 In response to a manual stop event (initiated by an 
 operator)[Event2]:
        - sets the Administrative stop in MIB reason code, 
	- sends the NOTIFICATION message with Cease, 
        - if BGP routes exist, delete the BGP routes, 
        - release BGP resources,
	- drop TCP connection, 
        - set ConnectRetryCnt to zero (0),
        - reset connect retry timer to zero (0), and
        [Action S in FSM table]
        - transition to the Idle.

 In response to an automatic stop event initiated by the 
 system (automatic) [Event7], the local system:
     - sets Administrative Stop in MIB Reason code, 
     - sends a NOTIFICATION with Cease,
     - resets the connect retry timer (sets to zero)
     - deletes all routes associated with bgp peer session,
     - releases all BGP resources, 
     - drops the transport connection,   
     - increments the ConnectRetryCnt by 1, 
     - performs any BGP peer oscillation damping, and 
     [Action C in FSM table]
     - transitions to the idle state.

 An example automatic stop event is exceeding the number of 
 prefixes for a given peer and the local system 
 automatically disconnecting the peer.


 If the Hold timer expires [Event10], the local system:
      - sends a NOTIFICATION message with Error Code Hold 
        Timer Expired,
      - resets the connect retry timer (sets to zero),
      - releases all BGP resources, 
      - drops the TCP connection, 
      - increments the ConnectRetryCnt by 1,  
      -	performs any BGP peer oscillation damping,
      [Action M in FSM table] 
      - and goes to Idle state.

 If the KeepAlive timer expires [Event11], the local system 
 sends a KEEPALIVE message, it restarts its KeepAlive timer, 
 unless the negotiated Hold Time value is zero [Action Q].

 Each time time the local system sends a KEEPALIVE or UPDATE 
 message, it restarts its KeepAlive timer, unless the 
 negotiated Hold Time value is zero.


 A transport connection indication [Event 13] received
 for a valid port will cause the 2nd connection to be
 tracked.  A transport connection indications for 
 invalid port [Event 14], will be ignored. 
 
 In response to a Transport connection succeeds [Event 15
 or Event 16], the 2nd connection shall be tracked until
 it sends an OPEN message.  

 If a valid Open message [Event 18] is received, it will be 
 checked to see if it collides (section 6.8) with any other 
 session. If the BGP implementation determines that this 
 connection needs to be terminated, it will process an Call 
 Collision dump event[Event 22].  If this session needs to be 
 terminated, the connection will be terminated by:

     - send a NOTIFICATION with a CEASE
     - deletes all routes associated with connection,
     - resets the connect retry timer,
     - if any BGP routes, delete the routes,
     - release all BGP resources, 
     - drops the transport connection,
     - increments ConnectRetryCnt by 1,
     - and performs any BGP peer oscillation damping,
    [Action R in the FSM table],
     - and enters the Idle state


 In response to any other event [Events 8-9,12, 19-21] the 
 local system:
     - sends a NOTIFICATION message with Error Code Finite 
       State Machine Error,
     - deletes all routes associated with BGP peer session, 
     - resets the connect retry timer (sets to zero)
     - releases all BGP resources,
     - drops the TCP connection,
     - increments the ConnectRetryCnt by 1, 
     - performs any BGP peer oscillation damping, and
     [Action E in FSM table],
     - transitions to Idle.



--19701020
Content-Type: text/plain; charset=US-ASCII
Content-Disposition: inline



--19701020
Content-Type: text/plain; name="FSM-words-05b.txt"; x-unix-mode=0644
Content-Disposition: attachment; filename="FSM-words-05b.txt"



Note: (this is version 5 of the changes to
       the text.) 

8.0	BGP Finite State Machine


This section specifies the BGP operation in terms of a 
Finite State Machine (FSM).  The section falls into 2 
parts:

    1)	Description of Events for the State machine (section 
        8.1
    2)	Description of the FSM (section 8.2)

Session Attributes required for each connection are; 

   1)	state
   2)	Connect Retry timer
   3)	Hold timer
   4)	Hold time
   5)	Keepalive timer

8.1 Events for the BGP FSM

8.1.1   Administrative Events (events 1-5)

Please note that only Event 1 (manual start) and Event 2 
(manual stop) are mandatory administrative events. All 
other administrative events are optional.

 Event1: Manual start 

	   Definition: Administrator manually starts peer
                       connection
	   Status: 	Mandatory

 Event2: Manual stop

	   Definition: Local system administrator manually 
                       stops the peer connection.

	   Status: 	mandatory
	   

 Event3: Automatic Start

 	   Definition: Local system automatically starts the 
                       BGP peer session

	   Status:     Optional depending on local system

 Event4: Manual start with passive Transport Establishment

	   Definition: Administrator manually start the peer
                       Connection, but has the passive flag 
                       enabled.  The passive flag indicates 
                       that the peer will listen prior to 
                       establishing the connection.

	   Status:     Optional depending on local system
		

 Event5: Automatic start with passive Transport establishment

	  Definition:  Local system automatically starts the
                       BGP session with the passive flag 
                       enabled.  The passive flag indicates 
                       that the peer will listen prior to 
                       establishing a connection.

	  Status:      Optional depending on local system use
		       of a passive connection.

 Event6: Automatic start with bgp_stop_flap option set

	   Definition: Local system automatically starts the 
                       BGP peer session with persistent peer 
                       oscillation damping enabled.  The exact 
                       method of damping persistent peer 
                       oscillations is left up to the 
                       implementation.   These methods of 
                       damping persistent BGP adjacency 
                       flapping are outside the scope of this 
                       document.
 

           Status: 	Optional, used only if the bgp peer has 
	  	        Enabled a method of damping persistent
		        BGP peer flapping.


 Event7: Auto stop

	   Definition: Local system automatically stops the
                       BGP peer session.  
	
	   Status:     Optional depending on local system


8.1.2 Timer Events (events 8-11)

 Event8:  idle Hold timer expires 

           Definition: Idle Hold timer expires.  The Idle 
                       Hold Timer is only used when persistent  
                       BGP oscillation damping functions are 
                       enabled.  

           Status: 	Optional.  Used when persistent
			BGP peer oscillation damping functions
			are enabled. 



 Event9: connect retry timer expires

           Definition: An event triggered by the expiration of 
                       the ConnectRetry timer.
  
	   Status:     Mandatory

 Event10: hold time expires

	   Definition: An event generated when the HoldTimer 
                       expires.

	   Status:     Mandatory

 Event11: keepalive timer expires

	   Definition: A periodic event generated due to the 
                       expiration of the KeepAlive Timer. 
 
	   Status:     Mandatory

 Event12: DelayBGP Open timer expires [changed]
	
	   Definition: A timer that delays sending of the BGP
	               Open message for n seconds after the
                       Transport connection has been completed. 

	   status:     Optional

8.2.3) Tranport Message based (13-16) 

 Event13: Transport Connection Indication & valid remote peer[changed]

	   Definition: Event indicating that transport
                       Request valid source IP address and TCP 
                       port, and valid destination IP address 
                       and TCP Port.  The definition of 
                       invalid Source, and invalid Destination 
                       IP address is left to the implementation.
                       BGP's destination port should be port 
                       179 as defined by IANA.


	   Status:     Mandatory

 Event14: RCV Transport Connection Indication and Invalid Source or 
          Destination [changed]

           Definition: Transport request received with either
 		       an invalid source address or port
		       number or an invalid destination 
                       address or port number. BGP destination 
                       port  number should be 179 as defined 
                       by IANA.

	    Status: 	 Mandatory
	
 Event15: Transport Connection request sent received an ACK.

           Definition: Local system's request to establish a transport
		       Connection to the remote side received
		       an ACK.


	    Status:    Mandatory

 Event16: Transport Connection Confirmed [changed]

	   Definition: The local system has received a Confirmation that
                       the transport connection has been established by
		       the remote site. 

	  Status:      Mandatory

 Event17: Transport connection fails [changed]

           Definition: This BGP peer receives a transport
		       connection failure notice.  This
		       connection notice could be caused by a
		       Transport disconnect message or a
		       Timeout in the transport session.

                       If this connection is using TCP, the
		       remote BGP peer's TCP machine could
		       have sent a FIN.  The local peer would
		       respond with a FIN-ACK. Another
		       alternative is that the local peer
		       indicated a timeout in the TCP session
		       and downed the connection.

	    Status:    Mandatory


8.1.4 BGP Messages (18-27)

 Event18: BGPOpen

           Definition:  An event indicating that a valid Open 
			message has been received.

	   Status: 	Mandatory

 Event19: BGPOpen with BGP Delay Open Timer running [changed]

           Definition: An event indicating that a valid Open 
                       Message has been successful 
                       established for a peer that is 
                       currently delaying the sending of an 
                       BGP Open message. 

	    Status:    Optional

 Event20: BGPHeaderErr

	   Definition: BGP message header is not valid.  

	   Status:     Mandatory

 Event21: BGPOpenMsgErr 
	   Definition: An BGP Open message has been received 
                        with errors. 
 
	   Status: 	Mandatory


 Event22: Open collision discard

           Definition: An event generated administratively 
                       when a connection Collision has been 
                       detected while processing an incoming  
                       Open message. This connection has been 
                       selected to disconnected.  See section 
                       6.8 for more information on collision
	               detection.  

                       Event 22 is an administrative could 
                       occur if FSM is implemented as two 
                       linked state machines.  

	  Status:      Optional

 Event23: NotifMsgVerErr

	    Definition: An event is generated when a 
                        NOTIFICIATION message with "version   
                        error" is received.

	    Status:     Mandatory

 Event24: NotifMsg

	    Definition: An event is generated when a 
                        NOTIFICATION messages is received and
                        the error code is anything but 	 
			"version error".

	     Status:	Mandatory
		        
 Event25: KeepAliveMsg

	    Definition: An event is generated when a KEEPALIVE
			Message is received.

		Status: Mandatory

 Event26: UpdateMsg

	    Definition: An event is generated when a valid 
			Update Message is received.

		Status: Mandatory

 Event27: UpdateMsgErr

	    Definition: An event is generated when an invalid
			Update message is received.

		Status:	Mandatory

 
8.2) Description of FSM


Idle state

   Initially BGP is in the Idle state.   

   In this state BGP refuses all incoming BGP connections.  No 
   resources are allocated to the peer.    In response to a 
   manual start event(Event1) or an automatic start 
   event(Event3), the local system 
      -	initializes all BGP resources, 
      -	sets the Connect retry counter to zero
      -	starts the ConnectRetry timer with initial value, 
      -	initiates a transport connection to the other BGP 
        peer, 
      -	listens for a connection that may be initiated by 
        the remote BGP peer, and
        [Action A in the FSM table]
      -	changes its state to connect.

  An manual stop event (Event2) is ignored in the Idle state.

  In response to a manual start event with the passive Transport
  flag (Event 4) or automatic start with the passive Transport flag 
  (Event 5), the local system:
      -	initializes all BGP resources,
      -	sets the connect retry count to zero,
      -	start the Connect Retry timer with initial value, 
      -	listens for a connection that may be initiated by 
        the remote peer
        [Action B in the FSM table] 
      -	changes its state to Active.

  The exact value of the ConnectRetry timer is a local 
  matter, but it should be sufficiently large to allow TCP 
  initialization.   

  If a persistent BGP peer oscillation damping function is 
  enabled, two additional events may occur within Idle state:  
      -	Automatic start with bgp_stop_flap set [Event6],
      -	Idle Hold Timer expired [Event 8].

  The method of preventing persistent BGP peer oscillation is 
  outside the scope of this document.

  Any other events [Events 9-27] received in the Idle state, 
  are noted by the MIB processing as FSM Errors [action V]
  and the local peer stays in the Idle State.


Connect State:

  In this state, BGP is waiting for the transport protocol 
  connection to be completed.

  If the transport connection succeeds [Event 15 or 
  Event 16], the local system checks the "Delay Open 
  Flag".  If the Delay Open flag is set, the local system: 
     -	clears the ConnectRetry timer, 
     -	set the BGP Open Delay timer to the initial 
        value.
        [Action ZZ in FSM table]

  If the Delay Open flag is not set, the local system: 
     - clears the ConnectRetry timer, 
     - completes BGP initialization
     - send an Open message to its peer,
     - sets Hold timer to a large value,
     - Change the state to Open Sent 
   [Action H in the FSM table]

  A hold timer value of 4 minutes is suggested.

  If the Open Delay timer expires [Event 12] in the connect 
  state,
     - send an Open message to its peer,
     - set the Hold timer to a large value,
	[Action H in the FSM Table], and
     - change the state to Open Sent. 

  If the BGP port receives a Transport connection indication 
  [Event 13], the Transport connection is processed (actions AA or
  AB in the FSM table) and the connection remains
  in the connected state. 

  If the transport connection receives an Transport indication
  that is invalid or unconfigured. [Event 14]:
     - the transport connection is rejected.
	[Action L in the FSM table]

  If the transport connection fails (timeout or transport
  disconnect) [Event17], the local system:
      - restarts the ConnectRetry timer, 
      - continues to listen for a connection that may be 
        initiated by the remote BGP peer, and
     [Action G in the FSM table]
      - changes its state to Active. 


  If an Open is received with the BGP Delay Open timer is 
  running [Event 19], the local system:
	- clears the connect retry timer (cleared to zero),
	- completes the BGP initialization, 
	- Stops and clears the BGP Open Delay timer
	- Sends an Open message

	[Action H in the FSM table], and
	- changes its state to Open Confirm.


 The start events [Event 1, 3-6] are ignored in connect 
 state.

 A manual stop event[Event2], the local system:
       - drops the transport connection, 
       - releases all BGP resources,
       - sets connect retry count to zero
       - resets the connect retry timer (sets to zero)
      [Action Z in the FSM table]
       - goes to Idle state.

  In response to the ConnectRetry timer expired event(Event 
  9), the local system: 
      - Sets the MIB FSM error information with ConnectRetry 
        expired, 
      - drops the transport connection 
      - restarts the ConnectRetry timer
      - initiates a transport connection to the other BGP 
        peer, 
      - continues to listen for a connection that may be 
        initiated by the remote BGP peer, and
      [Action O in the FSM table]
      - stays in Connect state.

 In response to any other events [Events 7-8, 10-11, 18, 20-
 27] the local system:
     - resets the connect retry timer (sets to zero),
     - drops the Transport connection,
     - release all BGP resources, 
     - increments the ConnectRetryCnt by 1,
     - [optionally] performs bgp peer oscillation damping, and
     [Action D in the FSM table],
     - goes to Idle state.


Active State:

 In this state BGP is trying to acquire a peer by listening 
 for and accepting a transport protocol connection.  

 A transport connection succeeds [Event 15 or Event 16], the 
 local system: process the transport connection flags
	- If the BGP delay open flag is set:
          o clears the ConnectRetry timer, 
          o completes the BGP initialization,
          o sets the BGP delay Open timer
         [Action ZZ]

        - If the BGP delay open flag is not set:
          o clears the ConnectRetry timer,
          o completes the BGP initialization, 
          o sends the Open message to it's peer, 
          o sets its Hold timer to a large value, 
         [Action H in the FSM table]
          and changes its state to OpenSent.  

 A Hold timer value of 4 minutes is suggested.  

 If the local system receives a valid Transport Indication
 [Event 13], the local system processes the Transport flags
  (actions aa or ab in FSM section 2.3.4).  

 If the local system receives a Transport indication
 that is invalid for this connection [Event 14]:
      - the transport connection is rejected
        [Action L in the FSM table]

 If the local system receives a Transport connection
 failed [Event 17] (timeout or receives Transport
 disconnect), the local system will:
	- set Transport disconnect in the MIB reason code,
	- restart ConnectRetry timer (with initial value)
	- release all BGP resources
	- Acknowledge the drop of Transport connection if
          transport disconnect (If TCP, send a FIN ACK),
	- Increment ConnectRetryCnt by 1,
	- perform the BGP peer oscillation damping process [2]
	[Action Y in the FSM table]

 If the local system has the Delay Open timer expired [event 
 12] local system: 
        - clears the Connect Retry timer (set to zero),
	- stops and clears the Delay Open timer (set to zero) 
        - completes the BGP initialization,
        - sends the Open message to it's remote peer,
        - sets it's Hold timer to a large value,
        [Action H in the FSM table]
        - and set the state to Open Confirm.

 A Hold timer value of 4 minutes is also suggested for this 
 state transition.

 If an Open is received with the BGP Delay Open timer is 
 running [Event 19], the local system 
	- clears the connect retry timer (cleared to zero),
	- Stops and clears the BGP Open Delay timer  
	- completes the BGP initialization, 
	- Stops and clears the BGP Open Delay timer
	- Sends an Open message
	- Set the Hold timer to a large value (4 minutes),
	[Action H in the FSM table], and
	- changes its state to Open Confirm.


 In response the ConnectRetry timer expired event[Event9], 
 the local system:
        - restarts the ConnectRetry timer (with initial value),
        - initiates a transport connection to the other BGP 
          peer,
        - Continues to listen for transport connection that may be 
          initiated by remote BGP peer, 
        [Action F in the FSM table]
	- and changes its state to Connect.  


 The start events [Event1, 3-6] are ignored in the Active 
 state.

 A manual stop event[Event2], the local system:
        - Sets the administrative down in the MIB reason code,
	- Sends a notification with a Cease,
	- If any BGP routes exist, delete the routes
	- release all BGP resources, 
	- drops the Transport connection, 
        - sets connect retry count to zero
        - resets the connect retry timer (sets to zero)
        [Action S in the FSM table]
        - goes to Idle state.

 In response to any other event (Events 7-8, 10-11,18, 20-
 27), the local system:
        - stores the MIB information to indicate appropriate 
          error [FSM for Events 7-8, 10-11, 18, 20-27]
        - reset the connect retry timer (sets to zero),
        - release all BGP resources, 
	- drops the transport connection, 
        - increments the ConnectRetryCnt by one,
        - optionally performs BGP peer oscillation damping, 
  	  [Action D in FSM table],
	- and goes to the idle state


Open Sent:

 In this state BGP waits for an Open Message from its peer.  

 When an OPEN message is received, all fields are checked 
 for correctness.  If there are no errors in the OPEN message 
 [Event 18] the local system:
       - resets the BGP Delay timer to zero,
       - reset BGP Connect Timer to zero, 
       - sends a KEEPALIVE message and
       - sets a KeepAlive timer (via the text below)
       - sets the Hold timer according to the negotiated value 
         (see section 4.2), and 
         [Action N in the FSM table]
       - sets the state to Open Confirm.


 If the negotiated Hold time value is zero, then the Hold 
 Time timer and KeepAlive timers are not started.   If the 
 value of the Autonomous System field is the same as the 
 local Autonomous System number, then the connection is an 
 "internal" connection; otherwise, it is an "external" 
 connection.   (This will impact UPDATE processing as 
 described below.)  


 If the BGP message header checking [Event20] or OPEN message
 check detects an error (see Section 6.2)[Event21], the local system:
       - sends a NOTIFICATION message with appropriate error 
         code, 
       - reset the connect retry timer (sets to zero),
       - if there are any routes associated with the BGP session,
	 delete these routes
       - release all BGP resources,
       - drop the transport session 
       - increments the ConnectRetryCnt by 1, 
       - bgp peer oscillation damping process,  
       [Actions I, J in FSM table, depending on error],
       - and goes to the Idle state.

 Collision detection mechanisms (section 6.8) need to be 
 applied when a valid BGP Open is received [Event 18 or 
 Event 19].  Please refer to section 6.8 for the details of 
 the comparison. An administrative collision detect is when 
 BGP implementation determines my means outside the scope of 
 this document that a connection collision has occurred.   

 If a connection in Open Sent is determined to be the 
 connection that must be closed, an administrative collision 
 detect [Event 22] is signaled to the state machine. If such 
 an administrative collision detect dump [Event 22] is 
 received in Open Sent, the local system:  
       - sets MIB state information to 
         collision detect closure, 
       - send a NOTIFICATION with a CEASE
       - resets the connect retry timer,
       - release all BGP resources,
       - drop the transport connection,
       - increments ConnectRetryCnt by 1,
       - performs any BGP peer oscillation damp process, and
       [Action R in the FSM],
       - enters Idle state.


  If a NOTIFICATION message is received with a version 
  error[Event23], Notification message without version number 
  [Event 24], the local system: 
       - resets the connect retry timer (sets to zero)
       - drops the Transport connection,
       - releases all BGP resources, 
       - increments the ConnectRetryCnt by 1
       - process any BGP peer oscillation damping,
	[Action Y]  
       - and sets the state to Idle.


 The Start events [Event1, 3-6] are ignored in the OpenSent 
 state.

  If a manual stop event [Event 2] is issued in Open sent 
 state, the local system:
	- Sets administrative down reason in MIB reason,
	- sends the Notification with a cease,
	- if BGP routes exists, delete the routes,
	- Release all BGP resources, 
	- Drops the Transport connection, 
	- set ConnectRetryCnt to zero, 
	- resets the Connect Retry timer (set to zero),
	[Action S in the FSM table], and
	- transitions to the Idle state.

  If an automatic stop event [Event 7] is issued in Open sent 
  state, the local system:
	- Sets administrative down reason in MIB reason,
	- sends the Notification with a cease,
	- if any routes are associated with te BGP session,
	  delete the routes, 
	- release all the BGP resources
	- Drops the Transport connection, 
	- increments the ConnectRetryCnt by 1, 
	- BGP peer oscillation process [2]
	[Action C in the FSM table], and
	- transitions to the Idle state.

 If the Hold Timer expires[Event 10], the local system:
       - set Hold timer expired in MIB Error reason code, 
       - send a NOTIFICATION message with error code Hold 
         Timer Expired,
       - reset the connect retry timer (sets to zero),
       - releases all BGP resources,
       - drops the Transport connection, 
       - increments the ConnectRetryCnt by 1
         [Action K in the FSM table], and
	- transitions to the Idle state.


 If a transport indication is received for valid connection 
 [Event 13] or transport Request Acknowledgement [Event 15]
 is received, or a transport connect confirm [Event 16] is
 received a second TCP session may be in progress.  This
 second TCP session is tracked per the Call Collision 
 processing (section 6.8) until an OPEN message is received.

 A TCP connection for an invalid port [Event 14] is ignored. 

 If a Transport Failure [Event17], is received from the 
 underlying transport protocol, the local system:
       - closes the BGP connection, 
       - restarts the Connect Retry timer, 
       - and continues to listen for a connection that may be 
         initiated by the remote BGP peer,
       [Action O in the FSM table]
       - and goes into Active state.


 In response to any other event [Events 8-9, 11-12, 19, 25-27], 
  the local system:
	- sends the NOTIFICATION with the Error Code Finite 
          state machine error,
        - resets the connect retry timer (sets to zero),
	- releases all BGP resources
        - drops the Transport connection, 
        - increments the ConnectRetryCnt by 1, 
        - process any bgp peer oscillation damping[2] 
        [Action E in the FSM table],
        - and sets the state to idle.


Open Confirm State

 In this state BGP waits for a KEEPALIVE or NOTIFICATION 
 message.

 If the local system receives a KEEPALIVE message[Event 25], 
        - restarts the Hold timer, 
          [Action P in the FSM table], and
        - changes its state to Established. 


 If the local system receives a NOTIFICATION message [event 
 23-24] or receives a TCP Disconnect [event 17] from the 
 underlying transport protocol, the local system:  
        - sets the appropriate MIB information for FSM error, 
        - resets the connect retry timer (sets the timer to 
          zero),
	- releases all BGP resources, 
        - drops the TCP connection,
        - increments the ConnectRetryCnt by 1, 
        [Action Y in the FSM table],
        - and sets the state to idle.

 Any start event [Event1, 3-6] is ignored in the OpenConfirm 
 state.

 In response to a manual stop event[Event 2] initiated by 
 the operator, the local system: 
	- set Administrative down in MIB Reason code,
        - sends the NOTIFICATION message with Cease, 
	- if any BGP routes, dete the routes
        - releases all BGP resources, 
	- drop the transport connection, 
        - sets the ConnectRetryCnt to zero
        - sets the connect retry timer to zero
        [Action S in the FSM table]
        - transitions to Idle state.

 In response to the Automatic stop event initiated by the 
 system[event 7], the local system:
        - sets the MIB entry for this peer to administratively 
          down, 
        - sends the NOTIFICATION message with Cease,
	- connect retry timer reset (set to zero)
	- If any BGP routes exist, delete the routes,
	- release all BGP resources,
        - drops the transport connection,
        - increments the ConnectRetryCnt by 1, 
        [Action C in the FSM table], and
        - transitions to the Idle State.

 If the Hold Timer expires before a KEEPALIVE message is 
 received [event10], the local system:
        - set the MIB reason to Hold time expired, 
        - send the NOTIFICATION message with the error code 
          set to Hold Time Expired, 
        - resets the connect retry timer (sets the timer to to 
          zero),
	- releases all BGP resources, 
        - drops the transport connection, 
        - increments the ConnectRetryCnt by 1 
          [Action K in the FSM table], 
        - and sets the state to Idle.


 If the local system receives a KEEPALIVE timer expires 
 event [Event 11], the system:
        - sends a KEEPALIVE message,
        - restarts the Keepalive timer 
        [Action Q the in FSM table],and 
        - remains in Open Confirmed state.

 In the event of TCP establishment [Event 13], or TCP 
 connection succeeding [Event 15 or Event 16] while in Open 
 Confirm, the local system needs to track the 2nd 
 connection.  

 If a TCP connection is attempted to an invalid port [Event 
 14], the local system will ignore the second connection 
 attempt.  

 If an OPEN message is received, all fields are check for 
 correctness.  If the BGP message header checking [Event20] 
 or OPEN message check detects an error (see Section 
 6.2)[Event21], [the local system:
        - sends a NOTIFICATION message with appropriate error 
          code, 
        - resets the connect retry timer (sets the timer to 
          zero),
        - releases all BGP resources, 
        - drops the TCP connection,
        - increments the ConnectRetryCnt by 1, 
        - runs the BGP peer oscillation damping process [2]  
       [Actions I, J, in the FSM table depending on the error]
        - and goes to the Idle state.

 If the Open messages is valid [Event 18], the collision 
 detect function is processed per section 6.8.  If this 
 connection is to be dropped due to call collision, the 
 local system:
        - sets the Call Collision cease in the MIB reason 
         code,
        - sends a Notification with a Cease 
        - resets the Connect timer (set to zero),
	- releases all BGP resources,
	- Drops the TCP connection (send TCP FIN),
	- increments the ConnectRetryCnt by 1, and
	- performs any BGP peer oscillation damping process [2].
	[action 


 If during the processing of another Open message, the BGP 
 implementation determines my means outside the scope of 
 this document that a connection collision has occurred and 
 this connection is to be closed, the local system will 
 issue a call collision dump [Event 22].  When the local 
 system receives a call collision dump event [Event 22], the 
 local system:
        - Sets the MIB FSM variable to indicate collision 
          detected and dump connection.
        - send a NOTIFICATION with a CEASE
        - deletes all routes associated with connection,
        - resets the connect retry timer,
        - releases all BGP resources
        - drops all TCP connection,
        - increments the ConnectRetryCnt by 1, 
        - and performs any BGP peer oscillation damping,
        [Action R in the FSM table],
        - enters Idle state.

 In response to any other event [Events 8-9, 12, 19, 26-27], 
 the local system:
        - sends a NOTIFICATION with a code of Finite State 
          Machine Error,
        - resets the connect retry timer (sets to zero)
        - drops the TCP connection,
        - releases all BGP resources, 
        - increments the ConnectRetryCnt by 1, 
        - performs any BGP peer oscillation damping,
        [Action E in the FSM table], and 
        - transitions to Idle state.


Established State:

 In the Established state BGP can exchange UPDATE, 
 NOTFICATION, and KEEPALIVE messages with its peer.

 If the local system receives an UPDATE message [Event26], 
 the local system will:
	- process the update packet
	- restarts its Hold timer, if the negotiated Hold Time 
          value is non-zero.
	[Action W in the FSM table], and
	- remain in the Established state.

 
 If the local system receives a NOTIFICATION message 
 [Event23 or Event24] or a disconnect [Event17] from the 
 underlying transport protocol, it:
	- sets the appropriate error code in MIB reason code,
	- if any BGP routes exist, delete all BGP routes,  
        - resets the connect retry timer (sets to zero), 
        - releases all the BGP resources, 
	- drops the TCP connection,
	- increments the ConnectRetryCnt by 1, and 
       [Action T in the FSM table]   
	- Goes to the Idle state.


 If the local system receives a Keepalive message
 [Event 25], the local system will:
        - restarts its Hold Timer, if the negotiated Hold Time 
          value is non-zero.
       [Action P in the FSM table], and
	- remain in the Established state.

 If the local system receives an UPDATE message, and the 
 Update message error handling procedure (see Section 6.3) 
 detects an error [Event27], the local system:
       - sends a NOTIFICATION message with Update error,
       - resets the connect retry timer (sets to zero),
       - drops the TCP connection,
       - releases all BGP resources, 
       - increments the ConnectRetryCnt by 1 
       - performs any BGP peer oscillation damping
      [Action U in FSM table], 
       - and goes to Idle state.


 Any start event (Event 1, 3-6) is ignored in the 
 Established state.

 In response to a manual stop event (initiated by an 
 operator)[Event2]:
        - sets the Administrative stop in MIB reason code, 
	- sends the NOTIFICATION message with Cease, 
        - if BGP routes exist, delete the BGP routes, 
        - release BGP resources,
	- drop TCP connection, 
        - set ConnectRetryCnt to zero (0),
        - reset connect retry timer to zero (0), and
        [Action S in FSM table]
        - transition to the Idle.

 In response to an automatic stop event initiated by the 
 system (automatic) [Event7], the local system:
     - sets Administrative Stop in MIB Reason code, 
     - sends a NOTIFICATION with Cease,
     - resets the connect retry timer (sets to zero)
     - deletes all routes associated with bgp peer session,
     - releases all BGP resources, 
     - drops the transport connection,   
     - increments the ConnectRetryCnt by 1, 
     - performs any BGP peer oscillation damping, and 
     [Action C in FSM table]
     - transitions to the idle state.

 An example automatic stop event is exceeding the number of 
 prefixes for a given peer and the local system 
 automatically disconnecting the peer.


 If the Hold timer expires [Event10], the local system:
      - sends a NOTIFICATION message with Error Code Hold 
        Timer Expired,
      - resets the connect retry timer (sets to zero),
      - releases all BGP resources, 
      - drops the TCP connection, 
      - increments the ConnectRetryCnt by 1,  
      -	performs any BGP peer oscillation damping,
      [Action M in FSM table] 
      - and goes to Idle state.

 If the KeepAlive timer expires [Event11], the local system 
 sends a KEEPALIVE message, it restarts its KeepAlive timer, 
 unless the negotiated Hold Time value is zero [Action Q].

 Each time time the local system sends a KEEPALIVE or UPDATE 
 message, it restarts its KeepAlive timer, unless the 
 negotiated Hold Time value is zero.


 A transport connection indication [Event 13] received
 for a valid port will cause the 2nd connection to be
 tracked.  A transport connection indications for 
 invalid port [Event 14], will be ignored. 
 
 In response to a Transport connection succeeds [Event 15
 or Event 16], the 2nd connection shall be tracked until
 it sends an OPEN message.  

 If a valid Open message [Event 18] is received, it will be 
 checked to see if it collides (section 6.8) with any other 
 session. If the BGP implementation determines that this 
 connection needs to be terminated, it will process an Call 
 Collision dump event[Event 22].  If this session needs to be 
 terminated, the connection will be terminated by:

     - send a NOTIFICATION with a CEASE
     - deletes all routes associated with connection,
     - resets the connect retry timer,
     - if any BGP routes, delete the routes,
     - release all BGP resources, 
     - drops the transport connection,
     - increments ConnectRetryCnt by 1,
     - and performs any BGP peer oscillation damping,
    [Action R in the FSM table],
     - and enters the Idle state


 In response to any other event [Events 8-9,12, 19-21] the 
 local system:
     - sends a NOTIFICATION message with Error Code Finite 
       State Machine Error,
     - deletes all routes associated with BGP peer session, 
     - resets the connect retry timer (sets to zero)
     - releases all BGP resources,
     - drops the TCP connection,
     - increments the ConnectRetryCnt by 1, 
     - performs any BGP peer oscillation damping, and
     [Action E in FSM table],
     - transitions to Idle.



--19701020
Content-Type: text/plain; charset=US-ASCII
Content-Disposition: inline



--19701020
Content-Type: text/plain; name="ietf-hares-backoff-01.txt"; x-unix-mode=0644
Content-Disposition: attachment; filename="ietf-hares-backoff-01.txt"

                                                                         
   Internet Draft                                              S. Hares 
   Document: draft-hares-bgp-backoff-01.txt                     NextHop 
                                                          Technologies, 
                                                                   Inc. 
   Expires: December 2002                                     July 2002 
 
 
                   BGP Peer Restart Backoff Mechanisms 
 
 
 Status of this Memo 
 
   This document is an Internet-Draft and is in full conformance 
   with all provisions of Section 10 of RFC2026. 
    
   Internet-Drafts are working documents of the Internet Engineering 
   Task Force (IETF), its areas, and its working groups.  Note that      
   other groups may also distribute working documents as Internet-
   Drafts. 
    
   Internet-Drafts are draft documents valid for a maximum of six 
   months and may be updated, replaced, or obsoleted by other documents 
   at any time.  It is inappropriate to use Internet-Drafts as 
   reference material or to cite them other than as "work in progress." 
    
   The list of current Internet-Drafts can be accessed at 
        http://www.ietf.org/ietf/1id-abstracts.txt 
  
   The list of Internet-Draft Shadow Directories can be accessed at 
        http://www.ietf.org/shadow.html. 
    
    
 Abstract 
    
   This document describes mechanisms for methods for damping the 
   oscillations of BGP-4 [1] peers by increasing the time between 
   automatic start functions of the BGP peers.  The increase of time 
   between automatic start functions will be denoted as a backoff of 
   the automatic start functions by this draft. 
    
   This draft is an informational RFC which provides a description of 
   mechanisms for back-off of the automatic start function in bgp-4 
   peer reconnections. This Informational RFC includes descriptions 
   about the back-off mechanisms and how this information is recorded 
   in the BGP-4 MIB version 2 [2]. 
    
   The expontential back-off mechanism contained in this draft was 
   first recorded in a draft of the bgp-4 specification.  The author 
   requests that anyone with information regarding the original authors 
   of that work contact the author.  The lack of credit for those
   authors is based on the author's lack of knowledge the originators
   of the expotential back-off.   




    
 Hares             Informational Expires December 2002                 1 
 




                                                                       
Internet Draft      ietf-skh-bgp-backoff-01.txt           June 20, 2002
    
   Table of Contents 
    
    
   Status of this Memo...........................................1
 
   Abstract     .................................................1 

   1.0 Overview    ..............................................2 

   2.0 Conventions used in this document.........................3 

   3.0 Types of Back-off.........................................4 

   4.0 Exponential back-off......................................4 

   5.0 Step-wise Back-off .......................................5 

   6.0 Recording the Back-off information in the MIB ............5 

   7.0 Back-off Mechanisms in FSM ...............................6

      7.1 Variables to support Back-off functions................6
      7.2 Existing MIB objects used .............................8 
      7.3 Events only utilized by the back-off mechanisms .......8 
      7.4 IdleHold substate in Idle FSM description .............8 
      7.5 Action upon entering the Idle state ...................9
      7.6 Substate processing...................................10
 
   8.0 Security Considerations..................................13

   9.0 References...............................................13

   10. Author's Addresses.......................................13
   

 
 1.0 Overview 

    
   If a BGP speaker detects an error, it shuts down the connection 
   and changes its connection state to Idle. Getting out of the Idle
   state requires generation of the Start event.  If such an event is 
   generated automatically, then persistent BGP errors may result in 
   persistent flapping of the speaker.  To avoid such a condition it  
   is recommended that automatic start events should not be  
   generated immediately for a peer that was previously transitioned  
   to Idle due to an error. 
   
 
   For a peer that was previously transitioned to Idle due to an  
   error, the time between consecutive generation of start events,  
   if such events are generated automatically, shall increase.  This  
   increase of time between automatic start events will be called in  
   backoff of automatic BGP peer start events.   


   The automatic start events listed in the BGP-4 specification are:  
     
         - Event3: Automatic start 
         - Event5: Automatic start with passive flag on 
         - Event6: Automatic start with bgp_flap_stop flag on 

  Event 3 is issued when an automatic stop occurs when no additional
  flags impact the starting of the bgp peer session.  Event 5 is 
  generated when the passive flag is set to allow the local peer to
  listen prior to starting a TCP session. 
    


    
    
     
   Hares         Informational - Expires December 2002                2 
   




Internet Draft      ietf-skh-bgp-backoff-01.txt        June 20, 2002

  

  Event 6 is issued when an automatic start occurs with the
  bgp peer oscillation damping features turned on.  The 
  "bgp_flap_stop" flag engages the damping of bgp peer oscillation.
     

   Event 7: Automatic stop is the only automatic stop event for the 
   BGP finite state machine.   Automatic stop followed by automatic 
   start can also accelerate the BGP peer oscillation. 

   BGP-4 implementations today have a variety of back-off  
   mechanisms.  This document is intended to provide information on  
   these back-off mechanisms and how to record this information in a  
   MIB.  Two mechanisms are recorded in this Internet Draft:  

	1) Exponential back-off (section 2) and 
        2) Step-wise back-off (section 3).  

   The author welcomes input on additional back-off mechanisms.   
   Please send input to the author or the idr list (idr@merit.edu). 
 
   A BGP peer oscillation backoff mechanism requires additional
   processing  in the bgp-4[1] Finite State Machine (FSM).
   The BGP-4 draft contains an  indication of where the FSM would
   process the peer oscillation damping.  The implementation of 
   backoff mechanisms in a BGP-4 is  optional, but wide spread
   in implementations.  The exact  mechanisms vary between
   implementations.   
 
 
   Section 6.0 contains information on how the BGP-4 MIB encodes the  
   time between establishing BGP-4 peer sessions.  Section 7.0  
   contains the additional mechanisms needed in a BGP-4 FSM for  
   back-off implementations.  
         
    
2.0 Conventions used in this document 
    
   The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL  
   NOT", "SHOULD", "SHOULD NOT", "RECOMMENDED",  "MAY", and  
   "OPTIONAL" in this document are to be interpreted as described in  
   RFC-2119 [1]. 
    
   As an informational draft there are no mandatory functions.  This  
   draft hopes to recommend healthy life styles for back-off  
   mechanisms in BGP peers.  As in all recommendations for healthy  
   life styles the user of BGP-4 is not required to do anything,  
   except expire the BGP-4 connection according to the letter of the  
   BGP-4 standard.  







     
   Hares         Informational - Expires December 2002             3  
 



Internet Draft      ietf-skh-bgp-backoff-01.txt           June 20,2002 


3.0 Types of Back-off  
    
    
   The author knows of three types of back-off at this time: 
    
        - Exponential back-off, and 
        - Step-wise back-off,  
        - no back-off. 
    
   The author welcomes any comments on additional back-offs and will 
   hasten to include this information in the next revision of this  
   draft. 
    

    
4.0 Exponential back-off  
     
    
   The original description occurred in RFC 1654 and was included in
   a version of bgp-4 [1] draft. 
   
    
   If a BGP speaker detects an error, it shuts down the connection  
   and changes its state to Idle. Getting out of the Idle state  
   requires generation of the Start event.  If such an event is  
   generated automatically, then persistent BGP errors may result in  
   persistent flapping of the speaker.  To avoid such a condition it  
   is recommended that automatic Start events should not be  
   generated immediately for a peer that was previously transitioned  
   to Idle due to an error.  
    
   For a peer that was previously transitioned to Idle due to an  
   error, the time between consecutive generation of Start events,  
   if such events are generated automatically, shall exponentially  
   increase.   The IdleHoldTimer will be the timer is utilized for  
   the count down of this feature.    The value of the initial timer  
   shall be 60 seconds. The time shall be doubled for each  
   consecutive retry.    The equation for the setting of the Idle  
   time is as follows: 
    
        Idle Hold timer = 2**(ConnectRetryCnt)*60 
    
   If the Idle Hold timer exceeds a local maximum value, the  
   connection shall be held down until a manual start event occurs. 
   
   The Idle Hold timer is initialized to zero.     





 
     

   Hares         Informational - Expires December 2002            4 
   


Internet Draft      ietf-skh-bgp-backoff-01.txt        June 20,2002


   
5.0 Step-wise Back-off 
     
    
   The Step-wise back-off uses the following back-off function. The  
   step-wise back-off functions provide a step function based on the  
   number of retries.   
    
   An example of the Step-wise code, is as follows:  
    
     - For 1-8 retries, the amount of time between retries is 8 
       seconds. 
    
     - For 9-24 retries, the amount of time between retries, the  
       amount time between retries is 64 seconds, 
    
     - For larger than 25 retries, the amount of time between retries 
       is 128 seconds, 
    
     - Maximum number of retries is 35. 

    
    
6.0 Recording the Back-off information in the BGP-4 MIB [3]

    
    
   The BGP-4 MIB version 2[3] will have two variables for the hold time  
   Maximums: 
    
     BGP-4 Maximum retry count value 
     BGP-4 Maximum hold time value 
     
   
   The BGP-4 MIB version 2 will have scalar flag indicating that the
   a backoff implementation is present in a BGP peer implementaiton
 
   The BGP-4 MIB version 2 will have the following table for BGP-4  
   Restart Peer Back-off: 
    
    
   BGP-4 Peer Restart Back-off table 
    
   retry cnt  hold time value (seconds) 
   --------   --------------- 
   0             value-1
   1             Value-1  
   2             Value-2  
   3             Value-3 
   4             Value-4 
   5             Value-5 
   6             Value-6 
   Â… 
   max-cnt    max-cnt-value                           
    
    
   For our example of the step-wise back-off, the table would be  
   filled in as 
    
   Retry cnt    hold time value 
   ---------    ---------------
   1              8 seconds 
   2              8 seconds 
   3              8 seconds 
   4              8 seconds 
   5              8 seconds 



   Hares         Informational - Expires December 2002                5 
 



 Internet Draft      ietf-skh-bgp-backoff-01.txt        June 20,2002

   6              8 seconds 
   7              8 seconds 
   8              8 seconds 
   9             64 seconds 
   10            64 seconds 
   Â… 
   24            64 seconds 
   25           128 seconds 
   26           128 seconds 
   .. 
   35           128 seconds 
    
   BGP-max retry cnt =35 
   BGP-max hold time = 128 seconds 
    
      

7.0) Back-off Mechanisms in FSM 
    
 7.1) Per BGP Peer Variables to support Back-off functions
    
   1) Idle Hold timer 
    
        Definition: Timer that keeps track of exponential back-    
                    Off.  This timer must expire before the bgp peer
                    generates an automatic start event.    
    
        Units:      Seconds 
        Status:     Needs to be added to BGP-4 MIB version 2[3]
                    In running timers. [new category] 
    
    
   2) bgp-4 Last Hold Timer value 
    
        Definition: Time last set in the Idle Hold timer 
    
        Units:      seconds 
        Status:     Needs to be added to BGP-4 MIB version 2 [3] 
                    In running timers [new category] 
 
    
   3) Idle Hold Time Maximum  
    
        Definition: Maximum value for Idle Hold timer 
        Units:      Seconds 
        Status:     Needs to be added to BGP-4 MIB version 2 [3]
                    In configured timers.   


    
 


    
   Hares         Informational - Expires December 2002            6 
   



Internet Draft      ietf-skh-bgp-backoff-01.txt        June 20,2002

    
 
   
   4) Keep Idle Flag  
    
        Definition: Flag set if the idle Hold timer value 
                    exceeds the Idle hold maximum value. 
         
        Units:      true/false 
        Status:     Needs to be added to BGP-4 MIB version 2 in 
                    Running status. 


  5) bgp_stop_flap flag  
    
        Definition: Flag to enable the back off mechanisms 
                    on automatic restarts.  These mechanism 
                    prevent persistent bgp peer oscillation. 
    
        Units:      True/False 
        Status:     Needs to be added to BGP-4 MIB Version 2 
                    In configured options.   
                           
   6) BGP-4 max retry count 
      
       Definition:  maximum number of times a bgp-4 peer will 
                    Automatically try to restart the bgp-4 connection. 
    
       Units:       non-zero, positive integer 
       Status:      Needs to be added to BGP-4 MIB in configured 
                    Parameters  
 
    
   7) BGP retry count 
      Definition:   Count of times the bgp-4 peer automatically 
                    Attempts to restart the connection. 
    
      Units:        non-zero, positive integer 
      Status:       Needs to be added to BGP-4 MIB in running 
                    Running parameters.  
 
 
   8) BGP-Backoff-RetryTimeTable 
 
      Definition:   Table that specifies the exact ConnectRetryInterval 
                    Per each time the bgp-4 peer attempts to  
                     automatically retart. 
 
      Units:         Table-index = cnt 
                     Table-value = timer in seconds 
 
   9) Idle Substate  
 
      Definition:    Idle Substate value (defined in section 7.3) 
      Units:         0 = Idle hold Null 
                     1 = Idle Hold wait  
                     2 = Idle Hold down 
                     3 = Idle Hold ticking 
		 
   
   Hares         Informational - Expires August 2002               7 
   



Internet Draft      ietf-skh-bgp-backoff-01.txt     February 28,2002
         

 
 7.2 Existing MIB objects used 
    
   The back-off functions are implemented as a processing within the 
   Idle state in the BGP-4 FSM.  The FSM processing for the BGP-4 also 
   uses the following variables (found in BGP-4 MIB version 2). 
    
    
   1)   bgpPeerState  
         
        Definition: From the BGP-4 MIB version 2 [2] "The BGP Peer's  
                    FSM state". 
    
        status:     Exists in BGP-4 MIB version 2. 
    
   2)   bgpPeerConnectRetryInterval  

 
        Definition: Time interval in seconds for the ConnectRetry  
                    Timer.  The suggested value for this timer is 120 
                    Seconds if no back-off mechanism is engaged.  If a 
                    Back-off mechanism is engaged, the bgpPeerBackoff 
                    Table entry is used for the value of  
                     ConnectRetryCnt. 
    
        status:     Exists in BGP-4 MIB version 2. 
 
    
    
    
7.3 Events only utilized by the back-off mechanisms 
    
    
   The following optional events are defined in section 8.1 of the BGP-4 Draft [1]. 
    
        Event3: automatic start with bgp_stop_flap option 
        Event6: Idle Hold timer expires 
    
    
 7.4 IdleHold substate in Idle FSM description 
    
    
   The Idle Hold has the following four substates: 
    
   IdleHoldWait 
   IdleHoldDown 
   IdleHoldTicking 
   IdleHoldNull
    
    The definitions of these sub-states are:  

     
     
   Hares         Informational - Expires December 2002               8 
   



Internet Draft      ietf-skh-bgp-backoff-01.txt            June 20, 2002

    
   Substate1:    Idle Hold Wait  
    
    Definition:  Idle Hold Down timer has expired. Local system is  
                 waiting for auto-start request on bgp peer session. 


     Sub-state 
     Status  
     flags:      Idle Hold Down timer is zero. 
                 Keep Idle flag is clear. 


   
   Substate2:    Idle Hold down: 
                  
     Definition: Keep Idle flag is set, and Automatic start is  
                 disabled for peer session.   
    
     Sub-state 
     status 
     flags:      Keep Idle flag is set.


   Substate3:    Idle Hold timer ticking: 
    
    Definition:  Idle Hold Timer is ticking (running) 
                 and local system will not automatically 
                 restart the BGP session. 
    
    Variable:    Idle Hold Timer is non-zero. 
    
    
  Substate4:     Idle Hold Null

    Definition:   No Idle Hold functions are being performed.

    Sub-state 
     status 
      flags:	"Keep Idle flag" is clear,
	         Idle Hold timer is zero
		 Idle Hold off  

    
   How these sub-states are represented internally is an implementation 
   matter.  The important part is that these sub-states be available to 
   the BGP MIB. 
    

7.5 Action upon entering the Idle state

    
   Processing upon entering the Idle state: 
    
   If bgp_stop_flap flag on, the Idle Hold sub state is entered: 
    
   1)Idle Hold time = connect-retry-table[connect-retry-count] 
    
   2)Idle Hold Timer = Idle Hold time 
    
   If (Idle hold time > Idle Hold time maximum), then: 
         
        1) bgp peer sets the keep-idle-flag 
        2) set substate to Idle Hold down 
        3) clear Idle Hold timer 
          
   else 
        1) increment ConnectRetryCnt, 
        2) start Idle Hold timer with value set above 
        3) set Idle-hold substate to "Idle Timer ticking" 
    
    
   If bgp_stop_flag is off, this sub-state action should not occur.  

         


   Hares         Informational - Expires November 2002            9 
   


Internet Draft      ietf-skh-bgp-backoff-01.txt          June 20,2002

    
7.6 Substate processing  
    
    
   The events are defined from the BGP-4 draft in section 8 on 
   BGP-4 FSM [2].  This processing occurs within the IDLE state if the 
   Bgp peer oscillation back-off mechanism are used. 

   The form of the table is state/sub-state/action. 

        
                Idle State      Idle State  Idle State    Idle state    
                Substate 1      Substate2   Substate3     Substate4 
   # Event      Idle Hold       Idle Hold   Idle Hold     null
                Wait            Down        timer         
                                            ticking 
    
   ------------------------------------------------------------     
   1 Manual    Connect/         Connect /    Connect/      Connect 
      start    null/            null/        null/         null/ 
		   A1           A2               A3        A1
   ------------------------------------------------------------ 
   2  Manual   Idle/           Idle/         Idle/         Idle/    
        stop   null/           null/         null/         null/
                Ignore          A4            A5           Ignore 
   ------------------------------------------------------------ 
   3  Auto     Connect         Connect/      Connect      Connect
       start   null/           null/         null/         null/ 
                  F1            F2                F3       F1 
   ------------------------------------------------------------- 
   4  Manual   Active /       Active /      Active/       Active/ 
      start    null/          null/         null/         null/ 
      with     B1             B2            B3            B1
      passive              
   ------------------------------------------------------------- 
   5  Auto      Active /      Active /      Active /      Active 
      start     null/         null/         null/         null/
      with      B1            B2            B3            B1
      passive              
   ------------------------------------------------------------ 
   6  Automatic Connect/     Idle/          Idle/       Connect/
      start &   null/        Idle hold      Idle hold   null/ 
      bgp_stop_              down/          ticking/  
      _flap on  B1           ignore          ignore      B1
   -------------------------------------------------------------- 
   7  Automatic Idle/        Idle/          Idle/        Idle/       
      Stop      Idle Hold    Idle Hold      Idle Hold          
                Wait/        down/          tick/   
                Ignore       Ignore          Ignore 
   --------------------------------------------------------------- 
   8  Idle Hold Idle/        Idle/          Idle/        Idle/ 
      timer     Idle Hold    Idle Hold      Idle Hold    null/
      expires   wait/        down/          wait/        null/
                V            Ignore         A6           V   
   ---------------------------------------------------------------- 
  


   Hares         Informational - Expires August 2002               10 
   



Internet Draft      ietf-skh-bgp-backoff-01.txt     February 20,2002

      
   Action A1:  
     
   1.   Initialize all BGP resources
   2.   Set ConnectRetryCnt to zero
   3.   Start Connect retry timer with initial value 
   4.   Initiate transport connection Request to the remote BGP peer
   5.   Listen for connection indication from remote BGP peer 
    
   Action A2:  
    
   1.   Stop and Clear Idle Hold timer 
   2.   Clear Keep Idle flag 
   3.   initialize all BGP resources 
   4.   ConnectRetryCnt set to zero 
   5.   Start Connect retry counter with initial value  
   6.   Initiate transport connection request to remote BGP peer
   7.   Listen for connection indication from remote BGP peer 
    
	Note: items 3-7 are the same actions as Action A of the 
	      BGP FSM.

   Action A3: 
 
   1.   Stop and Clear Idle Hold Timer 
   2.   initialize all BGP resources 
   3.   ConnectRetryCnt set to zero 
   4.   Start Connect retry counter with initial value  
   5.   Initiate transport connection to BGP peer by
        send a TCP syn
   6.   Listen for connection (TCP syn, ack) from remote Peer
    
 	Note: items 2-6 are the same actions as action A of the BGP
	      FSM.
    
    
   Action A4:    
   1.   clear Keep Idle flag 
    
    
   Action A5: 
   1.   Clear Idle Hold Timer 

   
   Action A6:  
    
   1.  Idle Hold substate = Idle Hold Wait 
   2.  Clear Idle Hold Timer 
    
 
     
   Hares         Informational - Expires December 2002               11 
   

  Internet Draft      ietf-skh-bgp-backoff-01.txt     June 20, 2002


 
   Action B1: 
    
   1.   Initialize all BGP resources 
   2.   ConnectRetryCnt set to 0 
   3.   Start Connect Retry timer with initial value 
   4.   Listen for connection set-up by the remote BGP peer 
	
	Note: B1 are the same actions as action B
    
   Action B2: 
    
   1.   Clear Keep Idle Flag 
   2.   Clear Idle Hold Timer 
   3.   Initialize all BGP resources 
   4.   ConnectRetryCnt set to 0 
   5.   Start Connect Retry timer with initial value 
   6.   Listen for connection set-up from the remote BGP peer 
	[TCP Syn]
	
	Note: items 3-6 are the same actions as action B. 
    
   Action B3: 
     
   1.   Clear Idle Hold Timer
   2.   Initialize all BGP resources 
   3.   ConnectRetryCnt set to 0 
   4.   Start Connect Retry timer with initial value 
   5.   Listen for transport connection indication from the remote BGP peer 

	Note: items 2-5 are the same acttions as action B. 

   Action F1:   
   1.   Restart ConnectRetry timer (with initial value)  
   2.   Initiates a transport connection request to the other bgp peer  
           [if TCP send a TCP SYN] 
   3.   Listen for remote transport connection indication 
             may be initiated from the remote BGP peer (TCP Syn)

	Note: items 1-3 are the same actions as action F	
 
   Action F2:  
   1.   Clear Idle Hold Timer 
   2.   Clear Keep Idle Flag  
   3.   Restart ConnectRetry timer (with initial value)  
   4.   Initiates a transport connection request to the remote bgp peer  
   5.   Listen for a remote transport connection indication 
	     from the remote BGP peer 

	Note: items 3-6 are the same as action F.  
		
   Action F3 
   1.   Clear Idle Hold Timer 
   2.   Restart ConnectRetry timer (with initial value)  
   3.   Initiates a transport connection request to the
        remote bgp peer  
   4.   Listen for remote transport connection that 
             may be initiated by the remote BGP peer

	Note: items 2-4 are the same as action F.

   Action v:
    
   1.   Set FSM error in MIB reason code.  
   
     
   Hares         Informational - Expires August 2002               12 
   




Internet Draft      ietf-skh-bgp-backoff-01.txt     June 20, 2002
  
    
    
8.0 Security Considerations 
   Security concerns for BGP-4 are addressed in the BGP-4 
   specification, and accompanying specifications on TCP MD5 [3] and 
   IP Security[4].  No additional considerations need to be made for 
   the BGP-4 state machine description. 
    
9.0 References 
    
   [1] RFC 2119 Bradner, S., "Key words for use in RFCs to Indicate 
       Requirement Levels", BCP 14, RFC 2119, March 1997 
    
   [2] "A Border Gateway Protocol 4 (BGP-4)" Y. Rekhter, T. Li Editors 
        http://www.ietf.org/internet-drafts/draft-ietf-idr-bgp4-17.txt 
         
         
   [3] "Definitions of Managed Objects for the Fourth Version of Border  
       Gateway Protocol (BGP-4),Second Version",  
      http://www.ietf.org/internet-drafts/draft-ietf-idr-bgp4-mibv2- 
         01.txt 
    
   [5] "Protection of BGP Sessions via the TCP MD5 Signature Option" 
        A. Heffernan, rfc2385.txt 
    
   [6] Securing BGPv4 using IPSec , D. Ward,  
        draft-ward-bgp-ipsec-00.txt 
    
    
10. Author's Addresses 
     Susan Hares 
   NextHop Technologies, Inc 
   825 Victors Way              Phone:  1-734-222-1610 
   Ann Arbor, MI USA            Email:  skh@nexthop.com 
     
   Hares         Informational - Expires December 2002             13

--19701020
Content-Type: text/plain; charset=US-ASCII
Content-Disposition: inline



----- End forwarded message:

--19701020--


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id MAA00990 for <idr-archive@nic.merit.edu>; Wed, 17 Jul 2002 12:19:32 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id C6746912B2; Wed, 17 Jul 2002 12:19:08 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 94003912B4; Wed, 17 Jul 2002 12:19:08 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 4B2DA912B2 for <idr@trapdoor.merit.edu>; Wed, 17 Jul 2002 12:19:07 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 26A905DE33; Wed, 17 Jul 2002 12:19:07 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from hotmail.com (f125.pav1.hotmail.com [64.4.31.125]) by segue.merit.edu (Postfix) with ESMTP id C98195DE20 for <idr@merit.edu>; Wed, 17 Jul 2002 12:19:06 -0400 (EDT)
Received: from mail pickup service by hotmail.com with Microsoft SMTPSVC; Wed, 17 Jul 2002 09:19:06 -0700
Received: from 57.250.229.136 by pv1fd.pav1.hotmail.msn.com with HTTP; Wed, 17 Jul 2002 16:19:06 GMT
X-Originating-IP: [57.250.229.136]
From: "M. ELK" <elkou141061@hotmail.com>
To: idr@merit.edu
Subject: Route oscillation 
Date: Wed, 17 Jul 2002 16:19:06 +0000
Mime-Version: 1.0
Content-Type: text/html
Message-ID: <F1255TbVx6nBlijWjp3000199f8@hotmail.com>
X-OriginalArrivalTime: 17 Jul 2002 16:19:06.0406 (UTC) FILETIME=[AC5B6C60:01C22DAD]
Sender: owner-idr@merit.edu
Precedence: bulk

<html><div style='background-color:'><DIV><FONT face=Helv size=1>
<P>Hi </P>
<P>Just an&nbsp;idea from non-expert ,</P>
<P>&nbsp;</P>
<P>2 choice :</P>
<P>Choice1- </P>
<P>Assume R1 and R2 in AS1 </P>
<P>R3 in AS3 , R4 in AS4</P>
<P>R1/R3 have eBGP session , R2/R4 have eBGP session . R1/R1 have iBGP session .</P>
<P>R3 advertise path to 10.0.0.0/8 to R1 . </P>
<P>R1 advertise 10.0.0.0/8 from AS3 to R2 . </P>
<P>R4 advertise 10.0.0.0/8 to R2 . </P>
<P>R2 select the path from AS4 as the best . </P>
<P>R2 advertise the path from AS4 to R1 . </P>
<P>R1 also select the path from As4 as the best ,but instead of sending to R2 a withdraw of the path from AS3 it re-advertise the path from AS3 but this time with Community=0xFFFFFFFF </P>
<P>Community 0xFFFFFFFF = The path is "HOLD/Poison" advertise because U provide me a better path ,pls assign a distance (CISCO terminology) of 255 to</P>
<P>this path . </P>
<P>R1 associate a flag "F" with the path from As3 in the ADJ-RIB-OUT </P>
<P>The distance=255 will prevent this path to be inserted in the routing table even if it is the best path based on BGP selection . </P>
<P>Case1: R3 withdraw the path to 10.0.0.0/8 . </P>
<P>R1 will send normal withdrwal to R2 . </P>
<P>R2 delete the path from AS3</P>
<P>Case 2: R4 withdraw the path to 10.0.0.0/8 . </P>
<P>R2 now is left with the path from As3 as the best (it is the only one) but since it can not be installed in the routing table R3 just delete the entry </P>
<P>of 10.0.0.0/8 from the routing table . . </P>
<P>R2 advertise the withdraw of 10/8 from AS4 to R1 . </P>
<P>R1 select the path from AS3 as the best (the only one) . </P>
<P>Since this path from AS3 is associated with flag "F" in ADJ-RIB-OUT so R1 need to consider as if this path was not advertised to R2 before and </P>
<P>as such to be advertise (this time as normal ie: without 0xFFFFFFFF community) to R2 . </P>
<P>R2 insert the new route in the BGP ,and it will be inserted as normal in the routing table . </P>
<P>&nbsp;</P>
<P>The use of community 0xFFFFFF is exactly as if R1 is sending to R2 "distance 255 10/8" . </P>
<P>&nbsp;</P>
<P>&nbsp;</P>
<P>Choice2- </P>
<P>Assume R1 and R2 in AS1 </P>
<P>R3 in AS3 , R4 in AS4</P>
<P>R1/R3 have eBGP session , R2/R4 have eBGP session . </P>
<P>R3 advertise path to 10.0.0.0/8 to R1 . </P>
<P>R1 advertise 10.0.0.0/8 from AS3 to R2 . </P>
<P>R4 advertise 10.0.0.0/8 to R2 . </P>
<P>R2 select the path from AS4 as the best . </P>
<P>R2 advertise the path from AS4 to R1 . </P>
<P>R1 also select the path from As4 as the best , R1 send to R2 the withdraw of the path from AS3 but with Community=0xFFFFFFFF </P>
<P>Community 0xFFFFFFFF = The path is withdrwan because U provide me a better path ,pls do not delete from ADJ-RIB-IN just assign a distance (CISCO terminology) of 255 to this path . </P>
<P>R2 upon recieveing the withdraw of the path from AS3 with community 0xFFFFFFFF ,it will not remove it from the ADJ-RIB-IN but simply will </P>
<P>assign a distance of 255 and start a timer "T" . say Timer T= 60 min . </P>
<P>on expiry of timer T , the path is deleted from ADJ-RIB-IN . </P>
<P>The distance=255 will prevent this path to be inserted in the routing table even if it is the best path based on BGP selection . </P>
<P>Case1: R3 withdraw the path to 10.0.0.0/8 .</P>
<P>since R1 is already withdraw the path from As3 toward R2 so it will do nothing . </P>
<P>Providing that R3 will not advertise back again the path for a period &gt; T . The path will be stored in R2 for a max duration of 60 min . </P>
<P>ie: this residual path will be clean after a max period of 60 min . </P>
<P></P>
<P>Case 2: R4 withdraw the path to 10.0.0.0/8 . </P>
<P>R2 now is left with the path from As3 as the best (it is the only one) but since it can not be installed in the routing table R3 just delete the entry </P>
<P>of 10.0.0.0/8 from the routing table . . </P>
<P>R2 advertise the withdraw of 10/8 from AS4 to R1 . </P>
<P>R1 select the path from AS3 as the best (the only one) . </P>
<P>Since this path from AS3 is associated with flag "F" in ADJ-RIB-OUT so R1 need to consider as if this path was not advertised to R2 before and </P>
<P>as such to be advertise (this time normally without 0xFFFFFFFF community) to R2 . </P>
<P>R2 insert the new route in the BGP ,and it will be inserted as normal in the routing table . </P>
<P>&nbsp;</P>
<P>The use of community 0xFFFFFF is exactly as if R1 is sending to R2 "distance 255 10/8" . </P>
<P>&nbsp;</P>
<P>The idea of choice "1" or "2" is just to differentiate betweeen 1)a path is withdrawn because it is no longer valid , 2) the path is withdrawn because the router just got a better route from his peer . </P>
<P>Now for the RR : </P>
<P>using the&nbsp;doc avail on ESnet&nbsp; &nbsp;<A href="http://www.es.net/hypertext/welcome/pr/BGP/BGP_Route_Oscillation_Type1a_v1.0.pdf">http://www.es.net/hypertext/welcome/pr/BGP/BGP_Route_Oscillation_Type1a_v1.0.pdf</A></P>
<P></P>
<P>On page 7 of BGP_Route_Oscillation_Type1a_v1.0[1].pdf :</P>
<P>instead of RD(RR2) withdraw route from AS6b :</P>
<P>1)it re-advertise the path from AS6b but with community 0xFFFFFFFF (choice 1 above) </P>
<P>OR </P>
<P>2)it withdraw but with community 0xFFFFFFFF . (choice 2 above) </P>
<P>I guess this could block the oscillation .When using choice 2 (due to the use of timer T) the oscillation will start again when timer T expire but within few exchange it will be stoped . ie: 1 cycle of oscillation each T . </P>
<P>For scalability :</P>
<P>This feature to be deployed only on the RR .</P>
<P>The RR (say RR1)&nbsp; send such community only to the&nbsp; IBGP perr&nbsp;&nbsp;which advertised the best route providing that peer is not one of RR1&nbsp;clients . </P>
<P>If the above is workable we have avoided to change the BGP standard such as </P>
<P>advertising multi path . </P>
<P>&nbsp;</P>
<P>Brgds EL-Koussy </P>
<P>&nbsp;</P></FONT></DIV></div><br clear=all><hr>MSN Photos is the easiest way to share and print your photos: <a href='http://g.msn.com/1HM1ENXX/c156??PI=31901'>Click Here</a><br></html>


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id LAA29161 for <idr-archive@nic.merit.edu>; Wed, 17 Jul 2002 11:26:06 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id BC84B912B4; Wed, 17 Jul 2002 11:25:28 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 922EA912B5; Wed, 17 Jul 2002 11:25:28 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 8E60D912B4 for <idr@trapdoor.merit.edu>; Wed, 17 Jul 2002 11:25:27 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 7ED5C5DE6C; Wed, 17 Jul 2002 11:25:27 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from presque.djinesys.com (dns.nexthop.com [64.211.218.216]) by segue.merit.edu (Postfix) with ESMTP id 23E855DE33 for <idr@merit.edu>; Wed, 17 Jul 2002 11:25:27 -0400 (EDT)
Received: (from root@localhost) by presque.djinesys.com (8.11.3/8.11.1) id g6HFPMd38451; Wed, 17 Jul 2002 11:25:22 -0400 (EDT) (envelope-from jhaas@jhaas.nexthop.com)
Received: from jhaas.nexthop.com (jhaas.nexthop.com [64.211.218.31]) by presque.djinesys.com (8.11.3/8.11.1) with ESMTP id g6HFPJ138444; Wed, 17 Jul 2002 11:25:19 -0400 (EDT) (envelope-from jhaas@jhaas.nexthop.com)
Received: (from jhaas@localhost) by jhaas.nexthop.com (8.11.3nb1/8.11.3) id g6HFPJs16431; Wed, 17 Jul 2002 11:25:19 -0400 (EDT)
Date: Wed, 17 Jul 2002 11:25:19 -0400
From: Jeffrey Haas <jhaas@nexthop.com>
To: Bruno Rijsman <bwarijsman@hotmail.com>
Cc: idr@merit.edu
Subject: Re: Questions about the "Authentication Information" optional parameter
Message-ID: <20020717112519.A15487@nexthop.com>
References: <F79SmLAztxc1o8yfogk00015c76@hotmail.com>
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
User-Agent: Mutt/1.2.5i
In-Reply-To: <F79SmLAztxc1o8yfogk00015c76@hotmail.com>; from bwarijsman@hotmail.com on Wed, Jul 17, 2002 at 11:18:48AM -0400
X-Virus-Scanned: by AMaViS perl-11
Sender: owner-idr@merit.edu
Precedence: bulk

On Wed, Jul 17, 2002 at 11:18:48AM -0400, Bruno Rijsman wrote:
> If no "Authentication Information" parameter was exchanged, it seems that 
> the 16 marker bytes are a complete waste of space.

Note that in bgp 1-3, this field was used for synchronizing the start
of packet in the data stream.  It was in bgp 4 that this was changed
to be capable of being used for authentication.

There was also a change at some point of the size of the marker field.

Just following the rfc's in the evolution of bgp and reading the IDRP
field, the whole "authentication" bit has a very ISO feel to it.
The authors may be able to shed more light on this. :-)

In any case, you can't get rid of it without breaking backwards 
compatability.

> Would it make sense to 
> define a capability (or maybe an authentication code) which means 
> "subsequent messages after the OPEN message will not contain the marker 
> bytes".

Why bother?  Routing data doesn't consume that much bandwidth.

-- 
Jeff Haas 
NextHop Technologies


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id LAA28965 for <idr-archive@nic.merit.edu>; Wed, 17 Jul 2002 11:19:26 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id B4081912B2; Wed, 17 Jul 2002 11:18:50 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 879C0912B4; Wed, 17 Jul 2002 11:18:50 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 57360912B2 for <idr@trapdoor.merit.edu>; Wed, 17 Jul 2002 11:18:49 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 397A75DE6C; Wed, 17 Jul 2002 11:18:49 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from hotmail.com (f79.law4.hotmail.com [216.33.149.79]) by segue.merit.edu (Postfix) with ESMTP id EF84B5DE33 for <idr@merit.edu>; Wed, 17 Jul 2002 11:18:48 -0400 (EDT)
Received: from mail pickup service by hotmail.com with Microsoft SMTPSVC; Wed, 17 Jul 2002 08:18:48 -0700
Received: from 65.194.140.2 by lw4fd.law4.hotmail.msn.com with HTTP; Wed, 17 Jul 2002 15:18:48 GMT
X-Originating-IP: [65.194.140.2]
From: "Bruno Rijsman" <bwarijsman@hotmail.com>
To: idr@merit.edu
Subject: Questions about the "Authentication Information" optional parameter
Date: Wed, 17 Jul 2002 11:18:48 -0400
Mime-Version: 1.0
Content-Type: text/plain; format=flowed
Message-ID: <F79SmLAztxc1o8yfogk00015c76@hotmail.com>
X-OriginalArrivalTime: 17 Jul 2002 15:18:48.0477 (UTC) FILETIME=[3FE744D0:01C22DA5]
Sender: owner-idr@merit.edu
Precedence: bulk

I have some questions about the "Authentication Information" optional 
parameter in the BGP OPEN message.

Have any authentication codes been defined anywhere?

Does any implementation actually support this method of authentication (as 
far as I know everyone uses RFC 2385 which uses a TCP MD5 signature 
instead)?

If it is not used in real life should we remove the "Authentication 
Information" optional parameter from the draft (or at least deprecate it)?

If no "Authentication Information" parameter was exchanged, it seems that 
the 16 marker bytes are a complete waste of space. Would it make sense to 
define a capability (or maybe an authentication code) which means 
"subsequent messages after the OPEN message will not contain the marker 
bytes".



_________________________________________________________________
Join the worldÂ’s largest e-mail service with MSN Hotmail. 
http://www.hotmail.com



Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id EAA15771 for <idr-archive@nic.merit.edu>; Wed, 17 Jul 2002 04:35:21 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 2304591227; Wed, 17 Jul 2002 04:34:43 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id D6D7491228; Wed, 17 Jul 2002 04:34:42 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id B440791227 for <idr@trapdoor.merit.edu>; Wed, 17 Jul 2002 04:34:41 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 9E4A35DDC1; Wed, 17 Jul 2002 04:34:41 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from presque.djinesys.com (dns.nexthop.com [64.211.218.216]) by segue.merit.edu (Postfix) with ESMTP id 4AF675DDBE for <idr@merit.edu>; Wed, 17 Jul 2002 04:34:41 -0400 (EDT)
Received: (from root@localhost) by presque.djinesys.com (8.11.3/8.11.1) id g6H8Yba27868; Wed, 17 Jul 2002 04:34:37 -0400 (EDT) (envelope-from skh@nexthop.com)
Received: from SKH.nexthop.com (SKH.corp.nexthop.com [133.93.74.227]) by presque.djinesys.com (8.11.3/8.11.1) with ESMTP id g6H8YP127861; Wed, 17 Jul 2002 04:34:26 -0400 (EDT) (envelope-from skh@nexthop.com)
Message-Id: <5.0.0.25.0.20020717043303.03727c10@mail.nexthop.com>
X-Sender: skh@mail.nexthop.com
X-Mailer: QUALCOMM Windows Eudora Version 5.0
Date: Wed, 17 Jul 2002 04:34:38 -0400
To: idr@merit.edu
From: Susan Hares <skh@nexthop.com>
Subject: New agenda - addition
Cc: yakov Rekhter <yakov@juniper.net>, fenner@research.att.com, zinin@psg.com, kireeti@juniper.net
Mime-Version: 1.0
Content-Type: text/plain; charset="us-ascii"; format=flowed
X-Virus-Scanned: by AMaViS perl-11
Sender: owner-idr@merit.edu
Precedence: bulk

IDR Agenda for 7/18/02

Time of meeting: 13:00 - 15:00
Room: 501

    Agenda Bashing  13:00 - 13:10

    Route Oscillation Fixes [13:10-14:00]

       1) Wilfong,BAsu, Shepherd, Ong, RAsala
	
	  presentor: Anindya Basu
	  time: 13:10 - 13:25
	
	 draft: "Eliminating IBGP oscillations"
                  draft-wilfong-ibgp-oscillations-00.txt 	
	
      2) Walton, Retano, Scudder, Cook drafts

	presentor: Alvaro Retano
	time: 13:25 - 13:40
          Advertisement of Multiple Paths in BGP
             draft-walton-bgp-add-paths-00.txt

          BGP Persistent Route Oscillation Solution
              draft-walton-bgp-route-oscillation-stop-00.txt

      3) Chen drafts overview
	
	presentor: Sue Hares
	time: 13:40 - 14:00
	
	a) BGP Route Oscillation Reduction with Confederation
             draft-chen-confed-oscillation-reduce-01.txt

	b) BGP Route Oscillation Avoidance for Route Reflection and Confederation
             draft-chen-route-oscillation-avoid-00.txt

	c) BGP Route Oscillation Reduction with Route Reflection
            draft-chen-rr-oscillation-reduce-01.txt

      4) Discussion time
	
	time:14:00 - 14:15


    Inform proposal [14:00 - 14:15]

      1) "BGPv4 INFORM message"

	draft-nalawade-bgp-inform-00.txt
         presentor: Gargi Nalawade
	time: 14:00 - 14:15


        Extended Community Issues [14:15 - 14:30]
	presentor: Andrew Lange


    Status of BGP documents [14:30 - 14:45]
	
	1) BGP-4 base document status
	2) Status of all BGP drafts
      	



Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id DAA14456 for <idr-archive@nic.merit.edu>; Wed, 17 Jul 2002 03:53:18 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 78D2F91227; Wed, 17 Jul 2002 03:52:57 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 3877791228; Wed, 17 Jul 2002 03:52:57 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id E5CD491227 for <idr@trapdoor.merit.edu>; Wed, 17 Jul 2002 03:52:55 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id D020A5DDEE; Wed, 17 Jul 2002 03:52:55 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from presque.djinesys.com (dns.nexthop.com [64.211.218.216]) by segue.merit.edu (Postfix) with ESMTP id 6A7515DDC1 for <idr@merit.edu>; Wed, 17 Jul 2002 03:52:55 -0400 (EDT)
Received: (from root@localhost) by presque.djinesys.com (8.11.3/8.11.1) id g6H7qlo27046; Wed, 17 Jul 2002 03:52:47 -0400 (EDT) (envelope-from skh@nexthop.com)
Received: from SKH.nexthop.com (SKH.corp.nexthop.com [133.93.74.227]) by presque.djinesys.com (8.11.3/8.11.1) with ESMTP id g6H7qh127039; Wed, 17 Jul 2002 03:52:43 -0400 (EDT) (envelope-from skh@nexthop.com)
Message-Id: <5.0.0.25.0.20020717035238.01d862c8@mail.nexthop.com>
X-Sender: skh@mail.nexthop.com
X-Mailer: QUALCOMM Windows Eudora Version 5.0
Date: Wed, 17 Jul 2002 03:52:55 -0400
To: andrewl@exodus.net
From: Susan Hares <skh@nexthop.com>
Subject: Re: BGP Extended Communities
Cc: idr@merit.edu, andrewl@xix-w.bengi.exodus.net
In-Reply-To: <20020716221414.B1256@demiurge.exodus.net>
Mime-Version: 1.0
Content-Type: text/plain; charset="us-ascii"; format=flowed
X-Virus-Scanned: by AMaViS perl-11
Sender: owner-idr@merit.edu
Precedence: bulk

Andrew:

You are on the agenda.  Could you send me your last name as well?

Sue


At 10:14 PM 7/16/2002 -0700, andrewl@exodus.net wrote:
>Greetings,
>
>I was going over the draft-ietf-idr-bgp-ext-communities-05.txt.  I
>think we can make significant enhancements to the functionality,
>efficency and flexibility of this draft.  If we step back and look
>at defining a more generic solution which can address both
>current and future needs we will be better served.
>
>In order to do that, I'd like to propose a couple of changes to the
>current draft.  Here are the enhancements these changes will provide:
>
>- Support for IPv6.  If we are going to move to IPv6 at some point,
>we should include support for it in the protocol extentions we define.
>- More efficient packing of values in the data field.  A good example
>of where this enhancement would be applicable is in the
>draft-ietf-ptomaine-bgp-redistribution-00.txt document.
>- Cleaner support for a broad range of future data field structures,
>and interpretations.
>- Support for locally defined community structures (types).
>- Support for locally defined community sub-types.
>
>Below I have proposed some modifications to the text of the draft to
>support these enhancements.
>
>(Section 5)
>
>Each Extended Community is encoded as a variable length quantity, as
>follows:
>
>   - Type Field: 1 octet
>   - Subtype Field: 1 octet
>   - Length Field: 1 octet
>   - Value Field: 0-63 octets
>
>        0                   1                   2                   3
>        0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
>       +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
>       |     Type      |    Subtype    |    Length     |               |
>       +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+     Value     |
>       |                                               (0-63 octets)   |
>       +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
>
>**  1) This eliminates the "regular" and the "extended" variations in favor
>**  of a single definition.  Since all of the current proposals are or can be
>**  represented as extended types, and the addition of a variable length
>**  value field allows as many bytes are needed for data, the "regular" type
>**  can be removed.
>**  2) The Value field is made variable length.  This key enhancement allows
>**  us to efficently encode lists of ASN's, IPv4 and IPv6 addresses.
>
>Type Field:
>
>   The Type field is one octet.  This octect is split into three sections:
>
>              0 1 2 3 4 5 6 7
>             +-+-+-+-+-+-+-+-+
>             |I|T|           |
>             +-+-+-+-+-+-+-+-+
><< all the same as the existing draft until: >>
>
>   Remaining 6 bits:  Indicates the structure of the community.  Specifically
>    the type defined by these bits indicates how the data in the Value field
>    is structured.  For example, a type 1 (000001) might represent a 2 part
>    value field where the first part is expected to be a 2-octet ASN and
>    the second part would be a variable-length value.  A basic set of
>    useful types are defined later in this document.
>
>Subtype Field:
>
>   The Subtype field is one octet.  This contents of this octet are used
>   to clarify the interpretation of the type field.  For example, a
>   subtype 2, which would apply to both the IP-addres types defined later
>   in this document, might indicate that the IP addresses contained in
>   part one of the value field indicate that the BGP route originated
>   at the router with that IP address.
>
>(Section 6)
>
>Base BGP Extended Community Type
>
>   This section defines the base BGP community specification.  Since the
>   root value of communities is the ability to tag a route with arbitrary
>   information, and then create a system to give that information meaning,
>   the base extended community type (type 0), is very simple.  This
>   base type could also be described as the "opaque type."
>
>   Like all extended commnuities, this type can be transitive or
>   non-transitive.
>
>        0                   1                   2                   3
>        0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
>       +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
>       |     Type      |      0x00     |    Length     |               |
>       +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+     Value     |
>       |                                               (0-63 octets)   |
>       +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
>
>** The differentiation of the base type from the other types defined in
>** this document is important.  The base type is what the rest are built
>** on.  Similar functionality to all the other types can be recreated
>** with sufficient manual configuration and this base type.
>
>(Section 7)
>
>Extended Community Type Summary:
>
>   Not including the base type, there are four community types defined in
>   this document.  Each represents a different structure a community can
>   take.
>
>   Type 1:
>     2 part Value Field.  The first part is 2 octets.  The second part
>     is variable.  If this type is transitive, the first part
>     MUST contain the 2-octet ASN of the originating network.
>   Type 2:
>     2 part Value Field.  The first part is 4 octets.  The second part
>     is variable.  If the type is transitive, the first part
>     MUST contain the 4-octet ASN of the originating network.
>   Type 3:
>     2 part Value Field.  The first part is 4 octets.  The second part
>     is variable.  The first part contains an IPv4 address.
>   Type 4:
>     2 part Value Field.  The first part is 16 octets. The second part
>     is variable.  The first part contains an IPv6 address.
>
>(Section 8)
>
>Extended Community Subtype Summary:
>
>   Type 0:
>     No Subtype Defined.
>   Type 1:
>     This is the Route Target Subtype.  This subtype applies to the
>     IP address type communities (types 3 & 4). The first part of a community
>     with this subtype represents the loopback of the router where this
>     route is to be announced.  Please see RFC2547 for more details.
>
>** Correct me if my interperetation of the Route Target Community is
>** incorrect.
>
>   Type 2:
>     This is the Route Origin Subtype.  This subtype applies to the
>     IP address type communities (types 3 & 4). The first part of a community
>     with this subtype represents the loopback of the router where this
>     route is announced.
>   Type 3:
>     This is the Link Bandwidth Subtype.  This subtype applies to the
>     ASN type communities (types 1 & 2). The first part od a community
>     with this subtype represents the ASN of the eBGP neighbor whose
>     link bandwidth you wish to distribute.  The second part of a
>     community with this subtype represents the bandwidth of the link
>     in bits-per-second, encoded in IEEE floating point format.
>
>(Section 9)
>
>Locally Defined Commnuity Types
>
>   This section reserves a subgroup of the Vendor-Specific/Experimental
>   type values (I bit = 1), for locally defined types.  This allows
>   operators to define their own community structures.  To prevent type
>   collision, these types MUST be non-transitive.
>
>   Experimental Non-Transitive Type 32-63 (0xE0-0xFF)
>
>   A locally defined type will have a syntax for interpretation defined
>   on the routers that need to interpret it.  If a router receives a
>   community type associated with this range, and does not have a value
>   defined, it should pass it along normally, and ignore the contents.
>
>(Section 10)
>
>Locally Defined Community Subtypes
>
>   This section reserves a subgroup of the subtype space for local
>   definition.  This allows operator to define varriant interpertations
>   for exiting types (locally defined or otherwise).  These subtypes
>   MUST be attached to non-transitive community values to prevent
>   subtype collision.
>
>   Non-Transitive Subtypes 192-255 (0xC0-0xFF)
>
>   A locally defined subtype will have an syntax for interpertation
>   defined on the routers that need to interpret it.  If a router
>   receives a community subtype associated with this range, and does
>   not have a value defined, it should pass the it along normally, and
>   ignore the contents.




Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id BAA09355 for <idr-archive@nic.merit.edu>; Wed, 17 Jul 2002 01:18:14 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 2DF1A91221; Wed, 17 Jul 2002 01:17:45 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id F051A91262; Wed, 17 Jul 2002 01:17:44 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id E822B91221 for <idr@trapdoor.merit.edu>; Wed, 17 Jul 2002 01:17:01 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id C72245DEC8; Wed, 17 Jul 2002 01:17:01 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from demiurge.exodus.net (demiurge.exodus.net [216.32.171.82]) by segue.merit.edu (Postfix) with ESMTP id 3727B5DE50 for <idr@merit.edu>; Wed, 17 Jul 2002 01:17:01 -0400 (EDT)
Received: (from andrewl@localhost) by demiurge.exodus.net (8.9.3+Sun/8.9.3) id WAA01457; Tue, 16 Jul 2002 22:14:15 -0700 (PDT)
Date: Tue, 16 Jul 2002 22:14:14 -0700
From: andrewl@exodus.net
To: idr@merit.edu
Cc: andrewl@xix-w.bengi.exodus.net
Subject: BGP Extended Communities
Message-ID: <20020716221414.B1256@demiurge.exodus.net>
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
User-Agent: Mutt/1.2.5i
Sender: owner-idr@merit.edu
Precedence: bulk

Greetings,

I was going over the draft-ietf-idr-bgp-ext-communities-05.txt.  I
think we can make significant enhancements to the functionality,
efficency and flexibility of this draft.  If we step back and look
at defining a more generic solution which can address both
current and future needs we will be better served.

In order to do that, I'd like to propose a couple of changes to the
current draft.  Here are the enhancements these changes will provide:

- Support for IPv6.  If we are going to move to IPv6 at some point,
we should include support for it in the protocol extentions we define.
- More efficient packing of values in the data field.  A good example
of where this enhancement would be applicable is in the
draft-ietf-ptomaine-bgp-redistribution-00.txt document.
- Cleaner support for a broad range of future data field structures,
and interpretations.
- Support for locally defined community structures (types).
- Support for locally defined community sub-types.

Below I have proposed some modifications to the text of the draft to
support these enhancements.

(Section 5)

Each Extended Community is encoded as a variable length quantity, as
follows:

  - Type Field: 1 octet
  - Subtype Field: 1 octet
  - Length Field: 1 octet
  - Value Field: 0-63 octets

       0                   1                   2                   3
       0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
      +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
      |     Type      |    Subtype    |    Length     |               |
      +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+     Value     |
      |                                               (0-63 octets)   |
      +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+

**  1) This eliminates the "regular" and the "extended" variations in favor
**  of a single definition.  Since all of the current proposals are or can be
**  represented as extended types, and the addition of a variable length
**  value field allows as many bytes are needed for data, the "regular" type
**  can be removed.
**  2) The Value field is made variable length.  This key enhancement allows
**  us to efficently encode lists of ASN's, IPv4 and IPv6 addresses.

Type Field:

  The Type field is one octet.  This octect is split into three sections:

             0 1 2 3 4 5 6 7
            +-+-+-+-+-+-+-+-+
            |I|T|           |
            +-+-+-+-+-+-+-+-+
<< all the same as the existing draft until: >>

  Remaining 6 bits:  Indicates the structure of the community.  Specifically
   the type defined by these bits indicates how the data in the Value field
   is structured.  For example, a type 1 (000001) might represent a 2 part
   value field where the first part is expected to be a 2-octet ASN and
   the second part would be a variable-length value.  A basic set of
   useful types are defined later in this document.

Subtype Field:

  The Subtype field is one octet.  This contents of this octet are used
  to clarify the interpretation of the type field.  For example, a
  subtype 2, which would apply to both the IP-addres types defined later
  in this document, might indicate that the IP addresses contained in
  part one of the value field indicate that the BGP route originated
  at the router with that IP address.

(Section 6)

Base BGP Extended Community Type

  This section defines the base BGP community specification.  Since the
  root value of communities is the ability to tag a route with arbitrary
  information, and then create a system to give that information meaning,
  the base extended community type (type 0), is very simple.  This
  base type could also be described as the "opaque type."

  Like all extended commnuities, this type can be transitive or
  non-transitive.

       0                   1                   2                   3
       0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
      +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
      |     Type      |      0x00     |    Length     |               |
      +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+     Value     |
      |                                               (0-63 octets)   |
      +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+

** The differentiation of the base type from the other types defined in
** this document is important.  The base type is what the rest are built
** on.  Similar functionality to all the other types can be recreated
** with sufficient manual configuration and this base type.

(Section 7)

Extended Community Type Summary:

  Not including the base type, there are four community types defined in
  this document.  Each represents a different structure a community can
  take.

  Type 1:
    2 part Value Field.  The first part is 2 octets.  The second part
    is variable.  If this type is transitive, the first part
    MUST contain the 2-octet ASN of the originating network.
  Type 2:
    2 part Value Field.  The first part is 4 octets.  The second part
    is variable.  If the type is transitive, the first part
    MUST contain the 4-octet ASN of the originating network.
  Type 3:
    2 part Value Field.  The first part is 4 octets.  The second part
    is variable.  The first part contains an IPv4 address.
  Type 4:
    2 part Value Field.  The first part is 16 octets. The second part
    is variable.  The first part contains an IPv6 address.

(Section 8)

Extended Community Subtype Summary:

  Type 0:
    No Subtype Defined.
  Type 1:
    This is the Route Target Subtype.  This subtype applies to the
    IP address type communities (types 3 & 4). The first part of a community
    with this subtype represents the loopback of the router where this
    route is to be announced.  Please see RFC2547 for more details.

** Correct me if my interperetation of the Route Target Community is
** incorrect.

  Type 2:
    This is the Route Origin Subtype.  This subtype applies to the
    IP address type communities (types 3 & 4). The first part of a community
    with this subtype represents the loopback of the router where this
    route is announced.
  Type 3:
    This is the Link Bandwidth Subtype.  This subtype applies to the
    ASN type communities (types 1 & 2). The first part od a community
    with this subtype represents the ASN of the eBGP neighbor whose
    link bandwidth you wish to distribute.  The second part of a
    community with this subtype represents the bandwidth of the link
    in bits-per-second, encoded in IEEE floating point format.

(Section 9)

Locally Defined Commnuity Types

  This section reserves a subgroup of the Vendor-Specific/Experimental
  type values (I bit = 1), for locally defined types.  This allows
  operators to define their own community structures.  To prevent type
  collision, these types MUST be non-transitive.

  Experimental Non-Transitive Type 32-63 (0xE0-0xFF)

  A locally defined type will have a syntax for interpretation defined
  on the routers that need to interpret it.  If a router receives a
  community type associated with this range, and does not have a value
  defined, it should pass it along normally, and ignore the contents.

(Section 10)

Locally Defined Community Subtypes

  This section reserves a subgroup of the subtype space for local
  definition.  This allows operator to define varriant interpertations
  for exiting types (locally defined or otherwise).  These subtypes
  MUST be attached to non-transitive community values to prevent
  subtype collision.

  Non-Transitive Subtypes 192-255 (0xC0-0xFF)

  A locally defined subtype will have an syntax for interpertation
  defined on the routers that need to interpret it.  If a router
  receives a community subtype associated with this range, and does
  not have a value defined, it should pass the it along normally, and
  ignore the contents.




Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id LAA12886 for <idr-archive@nic.merit.edu>; Tue, 16 Jul 2002 11:40:03 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 0E458912B6; Tue, 16 Jul 2002 11:39:41 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id CDEEA912B7; Tue, 16 Jul 2002 11:39:40 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id B20BC912B6 for <idr@trapdoor.merit.edu>; Tue, 16 Jul 2002 11:39:39 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 95F835DE16; Tue, 16 Jul 2002 11:39:39 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from presque.djinesys.com (dns.nexthop.com [64.211.218.216]) by segue.merit.edu (Postfix) with ESMTP id 3C2065DDCF for <idr@merit.edu>; Tue, 16 Jul 2002 11:39:39 -0400 (EDT)
Received: (from root@localhost) by presque.djinesys.com (8.11.3/8.11.1) id g6GFdbT98481; Tue, 16 Jul 2002 11:39:37 -0400 (EDT) (envelope-from jhaas@jhaas.nexthop.com)
Received: from jhaas.nexthop.com (jhaas.nexthop.com [64.211.218.31]) by presque.djinesys.com (8.11.3/8.11.1) with ESMTP id g6GFdY198474; Tue, 16 Jul 2002 11:39:34 -0400 (EDT) (envelope-from jhaas@jhaas.nexthop.com)
Received: (from jhaas@localhost) by jhaas.nexthop.com (8.11.3nb1/8.11.3) id g6GFdYr06701; Tue, 16 Jul 2002 11:39:34 -0400 (EDT)
Date: Tue, 16 Jul 2002 11:39:33 -0400
From: Jeffrey Haas <jhaas@nexthop.com>
To: "Fryczke, Pawel" <Pawel.Fryczke@intel.com>
Cc: idr@merit.edu
Subject: Re: Aggregation of the overlapping routes and ATOMIC_AGGREGATE
Message-ID: <20020716113933.C6551@nexthop.com>
References: <413FBB0BA5AED1119122000083234B1A015A942E@alpha.igk.intel.com>
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
User-Agent: Mutt/1.2.5i
In-Reply-To: <413FBB0BA5AED1119122000083234B1A015A942E@alpha.igk.intel.com>; from Pawel.Fryczke@intel.com on Thu, Jul 11, 2002 at 12:32:14PM +0200
X-Virus-Scanned: by AMaViS perl-11
Sender: owner-idr@merit.edu
Precedence: bulk

On Thu, Jul 11, 2002 at 12:32:14PM +0200, Fryczke, Pawel wrote:
> What does happen if BGP router installs both routes in RIB and next
> aggregates them ?
> Should ATOMIC_AGGREGATE be added to aggregate or not ?
> 
> Consider scenario:
> BGP Router is configured to aggregate routes 192.0.0.0/8
> This router receives 2 NLRIs: 192.0.0.0/8 and 192.1.0.0/16
> Received NLRIs are installed in RIB, but are suppressed by aggregation and
> are not advertised.

Assuming you mean "suppresed by export policy because they
are contributors to an aggregate"...

> As a result of aggregation the router sends update that contains only one
> NLRI: 192.0.0.0/8
> Does this update should contain ATOMIC_AGGREGATE attached ?

According to the spec?  No.
Logically, yes.
An attempt was made to address this in -16, but it was determined
that the rules of whether or not AA was to be attached to make
proper semantic (rather than spec-defined) sense imposed the necessity
of keeping too much state.

My recommendation - treat AA as a piece of fluff.  Add it like
everyone else does when suppressing as's from an aggregated as-path.
Also add it when you suppress more specifics in your adj-ribs-in if
you want to be compliant, but its work that is unappreciated by
the Internet at large.

> Why does RFC assume that aggregation is done during decision process?

Logically, because routes are installed in LocRib/Adj-Ribs-Out during
this phase and Update-Send just sends Adj-Ribs-Out.

> Pawel

-- 
Jeff Haas 
NextHop Technologies


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id LAA12677 for <idr-archive@nic.merit.edu>; Tue, 16 Jul 2002 11:34:42 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 73E21912B5; Tue, 16 Jul 2002 11:34:16 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 3B553912B6; Tue, 16 Jul 2002 11:34:16 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 2D601912B5 for <idr@trapdoor.merit.edu>; Tue, 16 Jul 2002 11:34:15 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 1CEE25DE16; Tue, 16 Jul 2002 11:34:15 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from presque.djinesys.com (dns.nexthop.com [64.211.218.216]) by segue.merit.edu (Postfix) with ESMTP id B51155DDCF for <idr@merit.edu>; Tue, 16 Jul 2002 11:34:14 -0400 (EDT)
Received: (from root@localhost) by presque.djinesys.com (8.11.3/8.11.1) id g6GFYDF98261; Tue, 16 Jul 2002 11:34:13 -0400 (EDT) (envelope-from jhaas@jhaas.nexthop.com)
Received: from jhaas.nexthop.com (jhaas.nexthop.com [64.211.218.31]) by presque.djinesys.com (8.11.3/8.11.1) with ESMTP id g6GFYA198254; Tue, 16 Jul 2002 11:34:10 -0400 (EDT) (envelope-from jhaas@jhaas.nexthop.com)
Received: (from jhaas@localhost) by jhaas.nexthop.com (8.11.3nb1/8.11.3) id g6GFY9t06682; Tue, 16 Jul 2002 11:34:09 -0400 (EDT)
Date: Tue, 16 Jul 2002 11:34:09 -0400
From: Jeffrey Haas <jhaas@nexthop.com>
To: Bruno Rijsman <bwarijsman@hotmail.com>
Cc: idr@merit.edu, Pawel.Fryczke@intel.com
Subject: Re: Aggregation of the overlapping routes and ATOMIC_AGGREGATE
Message-ID: <20020716113409.B6551@nexthop.com>
References: <F127Aabut81LU0DxJHN0000ea48@hotmail.com>
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
User-Agent: Mutt/1.2.5i
In-Reply-To: <F127Aabut81LU0DxJHN0000ea48@hotmail.com>; from bwarijsman@hotmail.com on Thu, Jul 11, 2002 at 06:47:26AM -0400
X-Virus-Scanned: by AMaViS perl-11
Sender: owner-idr@merit.edu
Precedence: bulk

On Thu, Jul 11, 2002 at 06:47:26AM -0400, Bruno Rijsman wrote:
> 1) If the aggregate is an "as-set" aggregate (i.e. the as-path and
>    other path attributes of the aggregate are constructed by
>    "summarizing" the attributes of the covered routes) then the
>    ATOMIC_AGGREGATE attribute is not attached.

Is this actually the case if one of the contributing routes already
ATOMIC_AGGREGATE?

> 2) If the aggregate is not an "as-set" aggregate (i.e. the as-path
>    and other path attributes are constructed as if the aggregate
>    route is a locally originated route) then the ATOMIC_AGGREGATE
>    attribute is attached.
> 
> I have seen nothing in RFC1771 or the current draft that describes
> this behavior but -as far as I know- that is what happens in real
> life.

>From draft -17:
: 5.1.6 ATOMIC_AGGREGATE
: 
: 
:    ATOMIC_AGGREGATE is a well-known discretionary attribute.
: 
:    When a router aggregates several routes for the purpose of
:    advertisement to a particular peer, and the AS_PATH of the aggregated
:    route excludes at least some of the AS numbers present in the AS_PATH
:    of the routes that are aggregated, the aggregated route, when
:    advertised to the peer, MUST include the ATOMIC_AGGREGATE attribute.

What was only really mentioned in the more recent drafts was the
option of not generating as-sets in the first place.  Prior to
this, implementations that truncated paths were non-compliant.

-- 
Jeff Haas 
NextHop Technologies


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id LAA12413 for <idr-archive@nic.merit.edu>; Tue, 16 Jul 2002 11:25:08 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id BF1BB9125A; Tue, 16 Jul 2002 11:24:30 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 8ED31912B5; Tue, 16 Jul 2002 11:24:30 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 7EB409125A for <idr@trapdoor.merit.edu>; Tue, 16 Jul 2002 11:24:29 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 641EB5DE17; Tue, 16 Jul 2002 11:24:29 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from presque.djinesys.com (dns.nexthop.com [64.211.218.216]) by segue.merit.edu (Postfix) with ESMTP id 094FA5DE14 for <idr@merit.edu>; Tue, 16 Jul 2002 11:24:29 -0400 (EDT)
Received: (from root@localhost) by presque.djinesys.com (8.11.3/8.11.1) id g6GFOEB97826; Tue, 16 Jul 2002 11:24:14 -0400 (EDT) (envelope-from jhaas@jhaas.nexthop.com)
Received: from jhaas.nexthop.com (jhaas.nexthop.com [64.211.218.31]) by presque.djinesys.com (8.11.3/8.11.1) with ESMTP id g6GFOB197818; Tue, 16 Jul 2002 11:24:11 -0400 (EDT) (envelope-from jhaas@jhaas.nexthop.com)
Received: (from jhaas@localhost) by jhaas.nexthop.com (8.11.3nb1/8.11.3) id g6GFOBj06627; Tue, 16 Jul 2002 11:24:11 -0400 (EDT)
Date: Tue, 16 Jul 2002 11:24:11 -0400
From: Jeffrey Haas <jhaas@nexthop.com>
To: "John G. Scudder" <jgs@cisco.com>
Cc: Pedro Roque Marques <roque@juniper.net>, idr@merit.edu
Subject: Re: INFORM proposal and MP-BGP (RFC 2858)
Message-ID: <20020716112411.A6551@nexthop.com>
References: <1117F7D44159934FB116E36F4ABF221B020AF86F@celox-ma1-ems1.celoxnetworks.com <20020712145641.A25494@nexthop.com> <200207121914.g6CJEcO59697@roque-bsd.juniper.net> <20020712162014.A25656@nexthop.com> <p05111a25b958ef63859e@[171.69.182.142]>
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
User-Agent: Mutt/1.2.5i
In-Reply-To: <p05111a25b958ef63859e@[171.69.182.142]>; from jgs@cisco.com on Mon, Jul 15, 2002 at 05:39:45PM -0400
X-Virus-Scanned: by AMaViS perl-11
Sender: owner-idr@merit.edu
Precedence: bulk

On Mon, Jul 15, 2002 at 05:39:45PM -0400, John G. Scudder wrote:
> - Router B ignores the replacement because of the posited "discard 
> updates that contain unrecognized data" rule.

You've just given the strongest example of why not NOTIFYing can be
a bad idea.

As you mention later:

> Let me grant you up front that you can fix this specific case by 
> being more careful in stating your rule 

This leaves me asking again, "Is there some way we can gracefully
deal with some errors?"  In the case of dealing with the non-prefixed
confederation segments, the answer was yes.  The example you give
could also have worked had it been worded more carefully to only
apply to path attributes and unrecognized/semantically incorrect
attributes yet still use the NLRI for purposes of discarding those
prefixes from your adj-ribs-in.

My general concern is that we already have vendors implementing
various forms of graceful error handling that aren't strictly
conformant with the spec.  If this is the norm, we should codify it.
If we don't, we risk interoperability issues due to one vendor's
bugs killing other vendor's peering sessions "far away".

-- 
Jeff Haas 
NextHop Technologies


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id KAA10821 for <idr-archive@nic.merit.edu>; Tue, 16 Jul 2002 10:35:37 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 9DA30912B4; Tue, 16 Jul 2002 10:35:09 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 6AEEC912B5; Tue, 16 Jul 2002 10:35:09 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 19FC9912B4 for <idr@trapdoor.merit.edu>; Tue, 16 Jul 2002 10:35:08 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 0256F5DDFF; Tue, 16 Jul 2002 10:35:08 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from mailwest2.unispherenetworks.com (mailwest2.unispherenetworks.com [65.194.140.146]) by segue.merit.edu (Postfix) with ESMTP id C0ACD5DD96 for <idr@merit.edu>; Tue, 16 Jul 2002 10:35:07 -0400 (EDT)
Received: by uniwest2.it.west.unispherenetworks.com with Internet Mail Service (5.5.2653.19) id <N80SPHHX>; Tue, 16 Jul 2002 10:29:26 -0400
Message-ID: <9BF027DAD859D511A383000347713E20854272@uniwest2.it.west.unispherenetworks.com>
From: "Shah, VJ" <vshah@juniper.net>
To: "'idr@merit.edu'" <idr@merit.edu>
Subject: FW:  dynamic capabilities and ORF
Date: Tue, 16 Jul 2002 10:29:16 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
Content-Type: text/plain; charset="iso-8859-1"
Sender: owner-idr@merit.edu
Precedence: bulk

I had earlier posted some queries regarding use of dynamic capability
negotiation for ORF capability.  I hope these will be addressed in the new
version of the draft.

thanks

VJ





With reference to drafts "draft-ietf-idr-dynamic-cap-01.txt and
"draft-ietf-idr-route-filter-05.txt", I have the following question:

Is dynamic capability supported for Cooperative Route Filters(ORF)?

If yes, It is not clear how to advertise or withdraw ORF capability for a
particular ORF Type in an address family using Dynamic capability. 

Consider the following example:

1) When BGP session is established, we advertise 

ORF capability 
	address family IPV4-Unicast 
		ORF Type: Prefix-List, Send/Receive: Both
            ORF Type: Extended-Community, Send/Receive: Both

2) User turns on ORF capability for Communities on the local BGP system


In this case, using dynamic capability, do we need to advertise all three
Types 

OR 

we advertise only the new type i.e Community Type.

3) Now, the user turns off ORF capability on the local BGP system for ORF
Type: Extended-Community

In this case, do we send a dynamic capability message withdrawing all the
ORF capabilities and then send another capability message advertising the
new set of ORF Types being supported.

OR

Send a Dynamic capability message removing only the Extended Community ORF
Type.  In this case, does the Send/Receive field in Capability data indicate
the mode being withdrawn.


Thanks,

VJ Shah


   

>>-----Original Message-----
>>From: Srihari R. Sangli [mailto:srihari@procket.com]
>>Sent: Thursday, July 11, 2002 4:36 PM
>>To: Abarbanel, Benjamin
>>Cc: 'Tsillas, Jim'; 'idr@merit.edu'; 'enke@redback.com'
>>Subject: RE: Re>Are dynamic capabilities really necessary if peers
>>support gracef ul restart?
>>
>>
>>
>>Ben,
>>
>>> Srihari:
>>>   I looked over the draft again and I have to apologize I 
>>misunderstood you
>>> responses below. I would recommend rewording the two 
>>paragraph so that
>>> others
>>> do not get confused like me.
>>
>>Ok. Will be added in the next revision. Thanks for the feedback.
>>
>>srihari...
>>


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id KAA10631 for <idr-archive@nic.merit.edu>; Tue, 16 Jul 2002 10:29:07 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 11461912B2; Tue, 16 Jul 2002 10:28:38 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id CEE72912B4; Tue, 16 Jul 2002 10:28:37 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 7FFDB912B2 for <idr@trapdoor.merit.edu>; Tue, 16 Jul 2002 10:28:36 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 5AA9B5DDF1; Tue, 16 Jul 2002 10:28:36 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from celox-ma1-ems1.celoxnetworks.com (celox-ma1-imap1.celoxnetworks.com [12.40.60.233]) by segue.merit.edu (Postfix) with ESMTP id 201705DD96 for <idr@merit.edu>; Tue, 16 Jul 2002 10:28:36 -0400 (EDT)
Received: by celox-ma1-ems1.celoxnetworks.com with Internet Mail Service (5.5.2653.19) id <3T6KD8N2>; Tue, 16 Jul 2002 10:28:35 -0400
Message-ID: <1117F7D44159934FB116E36F4ABF221B020AF884@celox-ma1-ems1.celoxnetworks.com>
From: "Natale, Jonathan" <JNatale@celoxnetworks.com>
To: "'Pedro Roque Marques'" <roque@juniper.net>, Jeffrey Haas <jhaas@nexthop.com>
Cc: idr@merit.edu
Subject: RE: INFORM proposal and MP-BGP (RFC 2858)
Date: Tue, 16 Jul 2002 10:28:34 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
Content-Type: text/plain
Sender: owner-idr@merit.edu
Precedence: bulk

Actually, that is not what I proposed; I was mearly reporting the
behavior I observed which was to ignore the attribute, but not the
route--it got installed.  Also, this was an UNEXPECTED/mis-configured
CONFED AS_PATH (sorry for not making this clear).  Whether this is a
bug, or "correct" could certainly be debated.  But I thought it was
related to:

    "As you infer, there are situations where we should just assume
    that the buffer contains cruft and that the implementation needs
    a strong reset.

    However, there are situations where we could get stuff that we
    can deal with and just ignore the "Bad" things.  This was the
    case when someone's implementation started leaking non-prefixed
    confederation segments into the internet.  The packets were well
    formed, but had cruft in them."

    -- Jeff, Fri 7/12/2002 2:29 PM

Seems to me that there are certainly cases where it is better to
ignore something then flap the whole session.


Another case where this happens is RXing a route to yourself (if
a neighbor runs PPP and redistributes connected)--route gets dropped.
Juniper sends the routes back so the neighbor would have a better
chance of knowing what got dropped, Cisco does not do this, the
RFC/Draft is unclear on this.  Both have
advantages/diaadvantages--my vote is to make it user configurable
(split-horrizon), this is already an option for other protocols.

What to do is not [publicly] defined, and should be.  Or maybe user
configurable would be good (I think so), but that could be
[publicly] defined as well.  There are many ways things could go
wrong.  The two basic categories are syntactical and semantically.
It seems that generally the RFCs/Drafts indicate that for
syntactical errors you should send a NOTIFICATION and close the
session (for how long and how many times?), and for semantical
errors you should ignore [the route or the attribute].  In cases
where things get ignored due to the fact that they are not
appropriate for the particular scenario, then an INFORM would not
hurt and may help troubleshoot the misconfig/bug.  The first step
would be to categorize all (lol) the scenarios, and then
have a vote/decree and come to a final complete protocol (lol).

-----Original Message-----
From: Pedro Roque Marques [mailto:roque@juniper.net] 
Sent: Friday, July 12, 2002 3:15 PM
To: Jeffrey Haas
Cc: idr@merit.edu
Subject: Re: INFORM proposal and MP-BGP (RFC 2858)

Jeffrey Haas writes:

> On Fri, Jul 12, 2002 at 02:33:05PM -0400, Natale, Jonathan wrote:
>> This is already done--if CONFED path is received the peer stays up
>> (and the local pref gets ignored).  This is good, but not in an rfc
>> nor a draft.  My vote is for an INFORM.

> IMO, this is a case where the route's local pref shouldn't be just
> ignored, but the entire packet should be discarded.

>From both quotes above it apears to me that you are proposing to
discard an UPDATE and leave the session up. Since BGP state is
incremental, that would mean the two peers would now potentially be
out of sync.

  Pedro.


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id EAA29477 for <idr-archive@nic.merit.edu>; Tue, 16 Jul 2002 04:53:43 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id C20E5912AE; Tue, 16 Jul 2002 04:53:15 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 93AA9912AF; Tue, 16 Jul 2002 04:53:15 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 4CE06912AE for <idr@trapdoor.merit.edu>; Tue, 16 Jul 2002 04:53:14 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 336B05DDFC; Tue, 16 Jul 2002 04:53:14 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from rs-eu-exc1.rs.riverstonenet.com (unknown [193.128.15.220]) by segue.merit.edu (Postfix) with ESMTP id C45A35DD9C for <idr@merit.edu>; Tue, 16 Jul 2002 04:53:13 -0400 (EDT)
content-class: urn:content-classes:message
MIME-Version: 1.0
Content-Type: text/plain; charset="iso-8859-1"
Subject: ORF implementation  issue #2
X-MimeOLE: Produced By Microsoft Exchange V6.0.5762.3
Date: Tue, 16 Jul 2002 09:53:09 +0100
Message-ID: <D2ECDCD8A528BF43A01EECCBEB191DD108F33F@rs-eu-exc1.rs.riverstonenet.com>
X-MS-Has-Attach: 
X-MS-TNEF-Correlator: 
Thread-Topic: ORF implementation  issue #2
Thread-Index: AcIspt2kCnj7qpiTEdaEoQDAT0MeLQ==
From: "Stephen Waters" <Stephen.Waters@riverstonenet.com>
To: <idr@merit.edu>
Sender: owner-idr@merit.edu
Precedence: bulk
Content-Transfer-Encoding: 8bit
X-MIME-Autoconverted: from quoted-printable to 8bit by nic.merit.edu id EAA29477

Here's another issue that came up when implementing ORF:

The filters can be Normal, or Exact matching.  For the Normal case the draft says:

"NORMAL scope
   indicates that the remote peer should consider only those routes
   whose Communities attribute either is equal to the Communities list
   specified in the ORF, or exhibit a subset relation with the
   Communities list specified in the ORF."


This gives the manager a support issue with the defined ORFs. Since community IDs will come and go, and the local manager has no control over what community IDs a route will collect, it is not feasible to set up a filter (as defined in the draft) that will be a stable superset of the IDs carried by a route.

It is more likely that a core set of IDs carried by a route will remain constant, so allowing the Normal filter community-list to be a subset of the route's community-list is more useful/stable.

Alternatively, a third filter class is added:  Exact/Subset/Superset. All of these match 'exact',
subset => filter can be a subset of the route's IDs, superset is the old 'Normal'.

At the moment, I've take the approach to allow Normal filters to be a subset of the route, but unlike issue #1, this does need to be agreed. So, I should at least have a control to allow a peer to work in subset or superset mode when using ORFs to filter.

Steve.





Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id EAA28940 for <idr-archive@nic.merit.edu>; Tue, 16 Jul 2002 04:36:06 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 9DAFE91256; Tue, 16 Jul 2002 04:35:36 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 5D2A4912AE; Tue, 16 Jul 2002 04:35:36 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 1A4D891256 for <idr@trapdoor.merit.edu>; Tue, 16 Jul 2002 04:35:35 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 066335DDFC; Tue, 16 Jul 2002 04:35:35 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from rs-eu-exc1.rs.riverstonenet.com (unknown [193.128.15.220]) by segue.merit.edu (Postfix) with ESMTP id 9537B5DD9C for <idr@merit.edu>; Tue, 16 Jul 2002 04:35:34 -0400 (EDT)
content-class: urn:content-classes:message
MIME-Version: 1.0
Content-Type: text/plain; charset="iso-8859-1"
Subject: FW: ORF implementation issue
X-MimeOLE: Produced By Microsoft Exchange V6.0.5762.3
Date: Tue, 16 Jul 2002 09:35:30 +0100
Message-ID: <D2ECDCD8A528BF43A01EECCBEB191DD108F33E@rs-eu-exc1.rs.riverstonenet.com>
X-MS-Has-Attach: 
X-MS-TNEF-Correlator: 
Thread-Topic: ORF implementation issue
Thread-Index: AcIsDY1FSpBv3JfZEdaEoADAT0MeLQAlNX9g
From: "Stephen Waters" <Stephen.Waters@riverstonenet.com>
To: <idr@merit.edu>
Sender: owner-idr@merit.edu
Precedence: bulk
Content-Transfer-Encoding: 8bit
X-MIME-Autoconverted: from quoted-printable to 8bit by nic.merit.edu id EAA28940

Thanks for the responses to the mail below.

I just thought I summarize the views expressed in the mail I received:

1) don't use ORF if it doesn't suit your filtering needs.

This translates to:  If want to accept most routes, and filter the odd annoying one, you probably don't want ORF since this could involve entering a large number of permit filters.  

Regardless of your policy model, you also have to live with the fact that if routes you like start picking up new community IDs along their merry way, you will lose routes until you fix the filters.


2) we need a 'permit-all' alternative to the 'deny-all' default.


My view on this is that the default could be changed to 'permit-all'. This is not a 'firewall'; it's a feature to help save processing bandwidth where back-stop local policy is there to make the final call.

Steve.


> -----Original Message-----
> From: Stephen Waters 
> Sent: Monday, July 15, 2002 3:36 PM
> To: idr@merit.edu
> Subject: ORF implementation issue
> 
> 
> 
> Having implemented ORF, I have a few issues with the draft. 
> Here is the first one...
> 
> 
> "If a particular route maintained by a BGP speaker doesn't match any
>    of the ORF entries of any of the (non-empty) ORFs associated with a
>    particular peer, then this route should not be advertised to the
>    peer."
> 
> I think this makes things very difficult for the manager.  
> The idea with ORF (I think) is to 
> reduce the local processing of routes by warning your peer 
> about routes you are definitely not interested in.
> 
> Since there is no support for wild cards in spec., this text 
> would require the manager to set up ORF entries (to be sent 
> to the peer) for every conceivable community-mix that may be 
> of interest. If he misses some, or a new community appears on 
> the block, useful routes may be lost.
> 
> For example, I am will to accept routes with any one of 300 
> different communities, and am not interested in 20 other 
> community groups.  I would need to send 20 DENY filters, and 
> 300 PERMIT filters.
> 
> Now the problem:
> 
> A route can carry a number of communities, and new community 
> numbers will come and go.  This would cause we to have to 
> update my filters constantly.
> 
> I have taken the view that blocking useful routes would be 
> very bad, and if the ORF filtering is in any doubt, the route 
> should be sent and the local filtering on the remote peer can 
> make the final decision.
> 
> So, in my implementation, if there is no matching ORF filter, 
> the route is passed. Taking this approach also means that I 
> can no reduce my ORF to a list of explicit DENY filters (with 
> the occasional imbedded PERMIT).  This approach also means 
> that if community lists change, the result is much less 
> likely to result in lost useful routes, and unwanted routes 
> carrying strange new community lists can be filtered as and 
> when the manager has time.
> 
> Comments?
> 
> Regards, Steve.
> 


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id SAA08501 for <idr-archive@nic.merit.edu>; Mon, 15 Jul 2002 18:15:11 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 0CA1B9129B; Mon, 15 Jul 2002 18:14:52 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id CA4339129C; Mon, 15 Jul 2002 18:14:51 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 89D3E9129B for <idr@trapdoor.merit.edu>; Mon, 15 Jul 2002 18:14:50 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 6BEC25DEB6; Mon, 15 Jul 2002 18:14:50 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from guelah.shrubbery.net (guelah.shrubbery.net [198.58.5.1]) by segue.merit.edu (Postfix) with ESMTP id BD4C05DD90 for <idr@merit.edu>; Mon, 15 Jul 2002 18:14:49 -0400 (EDT)
Received: (from heas@localhost) by guelah.shrubbery.net (8.11.6/8.11.1) id g6FMEh417765; Mon, 15 Jul 2002 22:14:43 GMT
Date: Mon, 15 Jul 2002 15:14:43 -0700
From: john heasley <heas@shrubbery.net>
To: Jeffrey Haas <jhaas@nexthop.com>
Cc: john heasley <heas@shrubbery.net>, "Srihari R. Sangli" <srihari@procket.com>, Stephen Waters <Stephen.Waters@riverstonenet.com>, "Abarbanel, Benjamin" <Benjamin.Abarbanel@marconi.com>, idr@merit.edu
Subject: Re: Re>Are dynamic capabilities really necessary if peers support gracef ul restart?
Message-ID: <20020715151442.L16765@shrubbery.net>
References: <D2ECDCD8A528BF43A01EECCBEB191DD108F332@rs-eu-exc1.rs.riverstonenet.com> <Pine.GSO.4.40.0207111123500.192-100000@miata.procket.com> <20020711114409.P16814@shrubbery.net> <20020712132855.A25315@nexthop.com>
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
User-Agent: Mutt/1.2.5.1i
In-Reply-To: <20020712132855.A25315@nexthop.com>; from jhaas@nexthop.com on Fri, Jul 12, 2002 at 01:28:55PM -0400
X-note: live free, or die!
X-homer: mmmm, forbidden doughnut.
Sender: owner-idr@merit.edu
Precedence: bulk

Fri, Jul 12, 2002 at 01:28:55PM -0400, Jeffrey Haas:
> On Thu, Jul 11, 2002 at 11:44:09AM -0700, john heasley wrote:
> > 	"it must be possible to turn such options off and force
> > 	 values/capabilities that will be required."
> 
> Considered from a slightly different angle, do you perhaps also mean:
> 1. If I said I supported some AFI/SAFI
> 2. I decide that your implementation of said AFI/SAFI is buggy
> 3. I send you a dynamic capability update withdrawing that AFI/SAFI
> 4. If you keep sending it after I sent you the capability update, what
>    should I do?
> 
> To 4, answers are:
> 1. Wait a certain amount of time for some queues to clear.  If we
>    still receive the unwanted messages, we should then NOTIFY.
> 2. Grin and bear it.

s/bear/ignore/ ?

> Other observations?

there really are two cases, right?  1) its safe to ignore continued msgs
for a capability (eg: SAFI multicast prefixes) and 2) it isnt.  though
some capability may exist that i am unaware of, i dont believe there are
any cases where 2 applies at the moment, though some such capability may
be added in the future.

so, in most cases it can just be ignored with the little cost of extra
messages/process time.  but, for example the cisco knob

router bgp N
 neighbor x.x.x.x version 4

will cause it to not establish peering with a version 3 speaker or to
"force version 4"; the ability to force that option was/is desirable.

i understand that such a knob would be an implementation option, but the
framework has to exist to support it.  some way for the peer to say NAK
(and ACK, which could be used to for your suggested timing answer #1) to
a dynamic change where the initiator can either accept that or close the
session.  correct me if i am wrong, but that is not part of the current
draft.

so, in context; if i determine that a particular peer's router is sending
botched filters (due to a bug),

router bgp N
 no neighbor x.x.x.x rx-ORF

might disallow ORF updates from the peer; but there is no reason to reset
the session if the peer refuses or continues to send filter updates.  or
maybe i think the customer is using route-refresh in an abusive manner....

make sense?


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id RAA07970 for <idr-archive@nic.merit.edu>; Mon, 15 Jul 2002 17:59:16 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 230619129D; Mon, 15 Jul 2002 17:58:24 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id E2ACC912A1; Mon, 15 Jul 2002 17:58:23 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 8F95A9129D for <idr@trapdoor.merit.edu>; Mon, 15 Jul 2002 17:58:19 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 7AED35DDEA; Mon, 15 Jul 2002 17:58:19 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from mailgate.pit.comms.marconi.com (mailgate.pit.comms.marconi.com [169.144.68.6]) by segue.merit.edu (Postfix) with ESMTP id 047475DD90 for <idr@merit.edu>; Mon, 15 Jul 2002 17:58:19 -0400 (EDT)
Received: from mailman.pit.comms.marconi.com (mailman.pit.comms.marconi.com [169.144.2.12]) by mailgate.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id RAA09451; Mon, 15 Jul 2002 17:58:16 -0400 (EDT)
Received: from whq-msgrtr-01.pit.comms.marconi.com (whq-msgrtr-01.pit.comms.marconi.com [169.144.2.221]) by mailman.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id RAA27886; Mon, 15 Jul 2002 17:58:18 -0400 (EDT)
Received: by whq-msgrtr-01.pit.comms.marconi.com with Internet Mail Service (5.5.2650.21) id <3W98V54X>; Mon, 15 Jul 2002 17:58:17 -0400
Message-ID: <39469E08BD83D411A3D900204840EC558227A3@vie-msgusr-01.dc.fore.com>
From: "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
To: "'John G. Scudder'" <jgs@cisco.com>, Jeffrey Haas <jhaas@nexthop.com>
Cc: Pedro Roque Marques <roque@juniper.net>, idr@merit.edu
Subject: RE: INFORM proposal and MP-BGP (RFC 2858)
Date: Mon, 15 Jul 2002 17:58:14 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2650.21)
Content-Type: text/plain; charset="iso-8859-1"
Sender: owner-idr@merit.edu
Precedence: bulk

John:
 I dont think people who suggest to use INFORM message will argue with 
you on the merits of not keeping the route. I do think the tendency in
BGP is, if you see something that looks strange drop the session. This
religious approach leads to much more serious consequences, which 
sometimes could be remedied by not dropping the session but by providing 
a mechanism for resynchronizing the corrrect information.

$.02 Worth,
Ben

> -----Original Message-----
> From: John G. Scudder [mailto:jgs@cisco.com]
> Sent: Monday, July 15, 2002 5:40 PM
> To: Jeffrey Haas
> Cc: Pedro Roque Marques; idr@merit.edu
> Subject: Re: INFORM proposal and MP-BGP (RFC 2858)
> 
> 
> At 4:20 PM -0400 7/12/02, Jeffrey Haas wrote:
> >On Fri, Jul 12, 2002 at 12:14:38PM -0700, Pedro Roque Marques wrote:
> >>  >From both quotes above it apears to me that you are proposing to
> >>  discard an UPDATE and leave the session up. Since BGP state is
> >>  incremental, that would mean the two peers would now 
> potentially be
> >>  out of sync.
> >
> >And no less intentionally than if I discarded everything via policy.
> >In this case, my policy is "discard bad (for some value) packets".
> 
> No, it's worse than that.  Here's an example of harm using your 
> "let's introduce a new origin code of 4" case.
> 
> - Router A sends router B an announcement of prefix foo with origin 1.
> - Router A now sends router B a replacement for prefix foo, but with 
> newfangled origin 4.  The replacement has a different next hop from 
> the previous announcement.
> - Router B ignores the replacement because of the posited "discard 
> updates that contain unrecognized data" rule.
> 
> Result?  Router B retains router A's old route, whose next hop is no 
> longer valid.  This goes on forever, or until router A sends an 
> update with a recognized origin, or until router A withdraws the 
> route.  The bad next hop may result in nothing serious, but it may 
> equally well result in a black hole or a persistent forwarding loop. 
> This seems to a cure which is much worse than the disease.
> 
> Of course, refusing routes by policy doesn't have this problem at all.
> 
> Let me grant you up front that you can fix this specific case by 
> being more careful in stating your rule -- instead of "discard 
> updates that contain unrecognized data" make it "treat updates that 
> contain unrecognized data as if they were withdraws".  But whenever 
> (and if ever) you want to gloss over a protocol error like this IMO 
> you need to analyze the consequences quite carefully.
> 
> --John
> 


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id RAA07309 for <idr-archive@nic.merit.edu>; Mon, 15 Jul 2002 17:40:28 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 6F29391293; Mon, 15 Jul 2002 17:39:50 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 3CAE491295; Mon, 15 Jul 2002 17:39:50 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 3AEF891293 for <idr@trapdoor.merit.edu>; Mon, 15 Jul 2002 17:39:49 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 1F2AB5DD92; Mon, 15 Jul 2002 17:39:49 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from cisco.com (router.cisco.com [171.69.182.20]) by segue.merit.edu (Postfix) with ESMTP id 5BB7D5DD90 for <idr@merit.edu>; Mon, 15 Jul 2002 17:39:48 -0400 (EDT)
Received: from [171.69.182.142] (dhcp-171-69-182-142.cisco.com [171.69.182.142]) by cisco.com (8.8.8/2.6/Cisco List Logging/8.8.8) with ESMTP id RAA05678; Mon, 15 Jul 2002 17:39:44 -0400 (EDT)
Mime-Version: 1.0
X-Sender: jgs@router
Message-Id: <p05111a25b958ef63859e@[171.69.182.142]>
In-Reply-To: <20020712162014.A25656@nexthop.com>
References:  <1117F7D44159934FB116E36F4ABF221B020AF86F@celox-ma1-ems1.celoxnetworks.com > <20020712145641.A25494@nexthop.com> <200207121914.g6CJEcO59697@roque-bsd.juniper.net> <20020712162014.A25656@nexthop.com>
Date: Mon, 15 Jul 2002 17:39:45 -0400
To: Jeffrey Haas <jhaas@nexthop.com>
From: "John G. Scudder" <jgs@cisco.com>
Subject: Re: INFORM proposal and MP-BGP (RFC 2858)
Cc: Pedro Roque Marques <roque@juniper.net>, idr@merit.edu
Content-Type: text/plain; charset="us-ascii" ; format="flowed"
Sender: owner-idr@merit.edu
Precedence: bulk

At 4:20 PM -0400 7/12/02, Jeffrey Haas wrote:
>On Fri, Jul 12, 2002 at 12:14:38PM -0700, Pedro Roque Marques wrote:
>>  >From both quotes above it apears to me that you are proposing to
>>  discard an UPDATE and leave the session up. Since BGP state is
>>  incremental, that would mean the two peers would now potentially be
>>  out of sync.
>
>And no less intentionally than if I discarded everything via policy.
>In this case, my policy is "discard bad (for some value) packets".

No, it's worse than that.  Here's an example of harm using your 
"let's introduce a new origin code of 4" case.

- Router A sends router B an announcement of prefix foo with origin 1.
- Router A now sends router B a replacement for prefix foo, but with 
newfangled origin 4.  The replacement has a different next hop from 
the previous announcement.
- Router B ignores the replacement because of the posited "discard 
updates that contain unrecognized data" rule.

Result?  Router B retains router A's old route, whose next hop is no 
longer valid.  This goes on forever, or until router A sends an 
update with a recognized origin, or until router A withdraws the 
route.  The bad next hop may result in nothing serious, but it may 
equally well result in a black hole or a persistent forwarding loop. 
This seems to a cure which is much worse than the disease.

Of course, refusing routes by policy doesn't have this problem at all.

Let me grant you up front that you can fix this specific case by 
being more careful in stating your rule -- instead of "discard 
updates that contain unrecognized data" make it "treat updates that 
contain unrecognized data as if they were withdraws".  But whenever 
(and if ever) you want to gloss over a protocol error like this IMO 
you need to analyze the consequences quite carefully.

--John


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id NAA28318 for <idr-archive@nic.merit.edu>; Mon, 15 Jul 2002 13:04:12 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id B99F99128E; Mon, 15 Jul 2002 13:03:56 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 8943A9128D; Mon, 15 Jul 2002 13:03:56 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id DE9599128E for <idr@trapdoor.merit.edu>; Mon, 15 Jul 2002 13:03:49 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id B89DF5DDB5; Mon, 15 Jul 2002 13:03:49 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from celox-ma1-ems1.celoxnetworks.com (celox-ma1-imap1.celoxnetworks.com [12.40.60.233]) by segue.merit.edu (Postfix) with ESMTP id 8018D5DDAF for <idr@merit.edu>; Mon, 15 Jul 2002 13:03:49 -0400 (EDT)
Received: by celox-ma1-ems1.celoxnetworks.com with Internet Mail Service (5.5.2653.19) id <3T6KD62X>; Mon, 15 Jul 2002 13:03:49 -0400
Message-ID: <1117F7D44159934FB116E36F4ABF221B020AF87C@celox-ma1-ems1.celoxnetworks.com>
From: "Natale, Jonathan" <JNatale@celoxnetworks.com>
To: "'Stephen Waters'" <Stephen.Waters@riverstonenet.com>, idr@merit.edu
Subject: RE: ORF implementation issue
Date: Mon, 15 Jul 2002 13:03:48 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
Content-Type: text/plain
Sender: owner-idr@merit.edu
Precedence: bulk

I see this as equivalent to the "implied deny all" rule.  I am assuming
there is a way to replace/preempt this with a "permit all" in ORF.  If not,
then I would agree that this makes things very difficult for the manager.

-----Original Message-----
From: Stephen Waters [mailto:Stephen.Waters@riverstonenet.com] 
Sent: Monday, July 15, 2002 10:36 AM
To: idr@merit.edu
Subject: ORF implementation issue


Having implemented ORF, I have a few issues with the draft. Here is the
first one...


"If a particular route maintained by a BGP speaker doesn't match any
   of the ORF entries of any of the (non-empty) ORFs associated with a
   particular peer, then this route should not be advertised to the
   peer."

I think this makes things very difficult for the manager.  The idea with ORF
(I think) is to 
reduce the local processing of routes by warning your peer about routes you
are definitely not interested in.

Since there is no support for wild cards in spec., this text would require
the manager to set up ORF entries (to be sent to the peer) for every
conceivable community-mix that may be of interest. If he misses some, or a
new community appears on the block, useful routes may be lost.

For example, I am will to accept routes with any one of 300 different
communities, and am not interested in 20 other community groups.  I would
need to send 20 DENY filters, and 300 PERMIT filters.

Now the problem:

A route can carry a number of communities, and new community numbers will
come and go.  This would cause we to have to update my filters constantly.

I have taken the view that blocking useful routes would be very bad, and if
the ORF filtering is in any doubt, the route should be sent and the local
filtering on the remote peer can make the final decision.

So, in my implementation, if there is no matching ORF filter, the route is
passed. Taking this approach also means that I can no reduce my ORF to a
list of explicit DENY filters (with the occasional imbedded PERMIT).  This
approach also means that if community lists change, the result is much less
likely to result in lost useful routes, and unwanted routes carrying strange
new community lists can be filtered as and when the manager has time.

Comments?

Regards, Steve.


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id LAA24368 for <idr-archive@nic.merit.edu>; Mon, 15 Jul 2002 11:04:36 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 55FAE91284; Mon, 15 Jul 2002 11:04:12 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 20FD491286; Mon, 15 Jul 2002 11:04:11 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id E1DE491284 for <idr@trapdoor.merit.edu>; Mon, 15 Jul 2002 11:03:34 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id C38725DE02; Mon, 15 Jul 2002 11:03:34 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from launch.server101.com (ns1.giga-sj-001.net [216.218.210.201]) by segue.merit.edu (Postfix) with ESMTP id 662CB5DD9F for <idr@merit.edu>; Mon, 15 Jul 2002 11:03:34 -0400 (EDT)
Received: from LYSANDER (sentinel.sockeye.com [65.209.66.235]) (authenticated as dunn@dunn.org with LOGIN) by launch.server101.com (8.11.6/8.11.6) with ESMTP id g6FF5qU00362; Tue, 16 Jul 2002 01:05:52 +1000
Message-ID: <006501c22c10$caaf4c20$5801a8c0@LYSANDER>
From: "Bradley Dunn" <bradley@dunn.org>
To: "Parag Deshpande" <paragdeshpande@sdksoft.com>
Cc: <idr@merit.edu>
References: <012301c22c0f$b979e420$cbc8c8c8@sdksoft.com>
Subject: Re: BGP-Figures
Date: Mon, 15 Jul 2002 11:03:30 -0400
MIME-Version: 1.0
Content-Type: text/plain; charset="iso-8859-1"
Content-Transfer-Encoding: 7bit
X-Priority: 3
X-MSMail-Priority: Normal
X-Mailer: Microsoft Outlook Express 6.00.2600.0000
X-MimeOLE: Produced By Microsoft MimeOLE V6.00.2600.0000
Sender: owner-idr@merit.edu
Precedence: bulk

> I would appreciate if someone could provide the current figures for the
> following:
> 1. No. of Networks in the Internet =
> 2. Avg No. of BGP peers in an AS =
> 3. No of Autonomous Systems in the Internet =
> 4. Typical (No. of) BGP Routing table entries =

3 and 4 can be answered by browsing http://bgp.potaroo.net/.

I'm not sure what you mean by 1 and how it would differ from 3 or 4.

2 is nearly impossible to measure, but I'd guess the average is relatively
low (<10), weighed down by lots of multi-homed edge networks with only a few
routers and upstreams. Service provider networks might push into the low
100s on border routers, but even for large networks the number will be
moderated by deployment of route reflectors and/or confederations.

Bradley



Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id KAA24008 for <idr-archive@nic.merit.edu>; Mon, 15 Jul 2002 10:53:34 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 9C43291249; Mon, 15 Jul 2002 10:53:13 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 63CB991283; Mon, 15 Jul 2002 10:53:13 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 55C5F91249 for <idr@trapdoor.merit.edu>; Mon, 15 Jul 2002 10:53:12 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 3E9715DEAA; Mon, 15 Jul 2002 10:53:12 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from presque.djinesys.com (dns.nexthop.com [64.211.218.216]) by segue.merit.edu (Postfix) with ESMTP id A91F85DD9F for <idr@merit.edu>; Mon, 15 Jul 2002 10:53:11 -0400 (EDT)
Received: (from root@localhost) by presque.djinesys.com (8.11.3/8.11.1) id g6FEr8p52532; Mon, 15 Jul 2002 10:53:08 -0400 (EDT) (envelope-from jhaas@jhaas.nexthop.com)
Received: from jhaas.nexthop.com (jhaas.nexthop.com [64.211.218.31]) by presque.djinesys.com (8.11.3/8.11.1) with ESMTP id g6FEr6152525; Mon, 15 Jul 2002 10:53:06 -0400 (EDT) (envelope-from jhaas@jhaas.nexthop.com)
Received: (from jhaas@localhost) by jhaas.nexthop.com (8.11.3nb1/8.11.3) id g6FEr5t02717; Mon, 15 Jul 2002 10:53:06 -0400 (EDT)
Date: Mon, 15 Jul 2002 10:53:05 -0400
From: Jeffrey Haas <jhaas@nexthop.com>
To: Stephen Waters <Stephen.Waters@riverstonenet.com>
Cc: idr@merit.edu
Subject: Re: ORF implementation issue
Message-ID: <20020715105305.A2589@nexthop.com>
References: <D2ECDCD8A528BF43A01EECCBEB191DD108F338@rs-eu-exc1.rs.riverstonenet.com>
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
User-Agent: Mutt/1.2.5i
In-Reply-To: <D2ECDCD8A528BF43A01EECCBEB191DD108F338@rs-eu-exc1.rs.riverstonenet.com>; from Stephen.Waters@riverstonenet.com on Mon, Jul 15, 2002 at 03:35:45PM +0100
X-Virus-Scanned: by AMaViS perl-11
Sender: owner-idr@merit.edu
Precedence: bulk

On Mon, Jul 15, 2002 at 03:35:45PM +0100, Stephen Waters wrote:
> I think this makes things very difficult for the manager.  The
> idea with ORF (I think) is to reduce the local processing of routes
> by warning your peer about routes you are definitely not interested in.

I may be reading too much into things, however this is my opinion:

I think that ORFs were only ever intended to be a very coarse grained
method of sending around policy.  The lack of a useful policy algebra
means that all you can ever send is one coarse level of a given
policy element (communities, prefix lists, etc.) and see if it goes
beyond there.  This makes in the responsibility of the entity
constructing the ORFs to make them perhaps more permissive that one would
want them to be.

Now given that they are only good for coarse grained policy, what are
they good for?  You cite the lack of wild cards as an example of
why they are coarse.

Making rfc 2547 scale is probably the most obvious.  The route target
communities provide exactly the coarse filter element that is needed.
One could probably construct other examples.

> Since there is no support for wild cards in spec., this text
> would require the manager to set up ORF entries (to be sent to the
> peer) for every conceivable community-mix that may be of interest.
> If he misses some, or a new community appears on the block, useful
> routes may be lost.

Given the coarseness of the tool, it is perhaps better to allow the
user to manually construct the list rather than to derive the list
from policy.

Note that given a particular regular expression, you *could* construct
the list of all possibile matches.  Its just combinatorically messy.

> So, in my implementation, if there is no matching ORF filter,
> the route is passed. Taking this approach also means that I can no
> reduce my ORF to a list of explicit DENY filters (with the occasional
> imbedded PERMIT).  This approach also means that if community lists
> change, the result is much less likely to result in lost useful
> routes, and unwanted routes carrying strange new community lists
> can be filtered as and when the manager has time.

It would be useful to have default policies for each ORF type
(or not) on a configurable basis along with a default policy when
running ORFs.  Does this adequately restate the above?

-- 
Jeff Haas 
NextHop Technologies


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id KAA23972 for <idr-archive@nic.merit.edu>; Mon, 15 Jul 2002 10:52:21 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 7CA5991247; Mon, 15 Jul 2002 10:51:54 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 4636891249; Mon, 15 Jul 2002 10:51:54 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 2C02691247 for <idr@trapdoor.merit.edu>; Mon, 15 Jul 2002 10:51:53 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 146935DEAA; Mon, 15 Jul 2002 10:51:53 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from mailgate.pit.comms.marconi.com (mailgate.pit.comms.marconi.com [169.144.68.6]) by segue.merit.edu (Postfix) with ESMTP id 8DA575DD9F for <idr@merit.edu>; Mon, 15 Jul 2002 10:51:52 -0400 (EDT)
Received: from mailman.pit.comms.marconi.com (mailman.pit.comms.marconi.com [169.144.2.12]) by mailgate.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id KAA11092; Mon, 15 Jul 2002 10:51:31 -0400 (EDT)
Received: from whq-msgrtr-01.pit.comms.marconi.com (whq-msgrtr-01.pit.comms.marconi.com [169.144.2.221]) by mailman.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id KAA24519; Mon, 15 Jul 2002 10:51:32 -0400 (EDT)
Received: by whq-msgrtr-01.pit.comms.marconi.com with Internet Mail Service (5.5.2650.21) id <3W98V12Y>; Mon, 15 Jul 2002 10:51:31 -0400
Message-ID: <39469E08BD83D411A3D900204840EC5582279F@vie-msgusr-01.dc.fore.com>
From: "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
To: "'Susan Hares'" <skh@nexthop.com>, Kunihiro Ishiguro <kunihiro@zebra.org>
Cc: hans.de_vleeschouwer@alcatel.be, BGP mailing list <idr@merit.edu>, Gilles Geerts <gilles.geerts@alcatel.be>
Subject: RE: TCP state machine doubts
Date: Mon, 15 Jul 2002 10:51:19 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2650.21)
Content-Type: text/plain; charset="iso-8859-1"
Sender: owner-idr@merit.edu
Precedence: bulk

Hi Sue:
  I was looking for "ietf-hares-bgp-statemt-02.txt" in the draft depository
and could not find it. I did find verion ietf-hares-bgp-statemt-01.txt.
Should that one suffice for review?

Regards,
Ben

> -----Original Message-----
> From: Susan Hares [mailto:skh@nexthop.com]
> Sent: Saturday, July 13, 2002 7:15 AM
> To: Kunihiro Ishiguro
> Cc: hans.de_vleeschouwer@alcatel.be; BGP mailing list; Gilles Geerts
> Subject: Re: TCP state machine doubts
> 
> 
> Kunihiro:
> 
> I've got the FSM text for draft-18 revised.  I'm looking
> for a group of people to review 5th round of the FSM
> text along with the associated documents:
> 
>          1) BGP state machine (ietf-hares-statemt-02.txt)
>          2) BGP backoff draft (ietf-hares-backoff-01.txt)
>          3) Passive and Call Collision input
>          4) BGP update draft (ietf-hares-update-00.txt]
> 
>   Thanks for volunteering to review the documents prior to
>   me flooding the information to the mail list.  I welcome
>   any other early reviewers.
> 
>   I submitted the first two as Internet drafts, but I found a few
>   typos.
> 
>   I've answered your questions below:
> 
> At 08:49 PM 7/5/2002 -0700, Kunihiro Ishiguro wrote:
> > >i have a few questions related to the new state machine 
> proposed in the
> > >latest BGP draft.
> >
> >Current draft-17 has some known problems.  I think Sue 
> aleady prepared
> >draft of draft-18.  So best way is updating the draft to 
> draft-18 then
> >process IDR working group review to it.
> >
> > >1. related to terminology related to the connect retry counter.
> > >I find back in the state descriptions following statement:
> > >- "clears the ConnectRetry timer" (e.g. page 35, state connect)
> > >- "Set connect retry timer to zero" (e.g. page 37, state Open sent)
> > >- "Connect retry timer = 0," (e.g. page 38, state Open sent)
> > > I assume that the meaning in all of the above cases is : "the
> > >ConnectRetry timer is stopped".  If so, would this not be a better
> > >wording for the action?
> 
> I've changed the text to be common.  Let me know if you like the
> new wording.  Otherwise, I can add a little more text.
> 
> 
> > >
> > >2. related to the terminology for the holdtimer.
> > >In state Connect on page 34 I find the following text: 
> "set Hold timer
> > >to a large value". I expect that this means "The Hold 
> timer value is set
> > >to a large value, and the Hold timer is started".
> 
> Could you look at the current text
> 
>          - Set the Hold timer to a large value (4 minutes),
> 
> 
> > >
> > >3. related to the ConnectRetryCnt.
> > >- Should the value of this counter not be initialsed in 
> the Idle state
> > >as well?
> >
> >These are right.  I believes all of these are fixed in the draft of
> >draft-18.
> 
> 
> See the FSM words and state machine.  I could use another
> pair of eyes to look at the FSM.
> 
> > >4. General doubt  related to the TCP backoff strategy
> > >Suppose I have a BGP peer that goes down say once every 
> 2hours. With the
> > >current sheme the ConnectRetryCnt keeps on increasing, and as a
> > >consequence the connection takes increasinly more time to be
> > >re-establsihed, even if actually it only fails like every 2 hours.
> 
> The backoff mechanisms can actually change that.  I would
> suggest that is another mechanism we could add in the
> backoff section.   We could add a timer at that point or
> agree to add another state to the FSM's timer.
> 
> Please comment on which one you'd like to see.  Please
> suggest text for the backoff draft.
> 
> 
> >My understanding is once BGP peer is establieshed, the 
> ConnectRetryCnt
> >is cleared.
> 
> This mechanism does not fix the persistent peer oscillation.
> Again, I think we can change this in the backoff draft, but
> maybe we need another timer to reduce the connect retry count.
> 
> Could you suggest some text based on the backoff draft?
> 
> 
> Sue Hares
> 


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id KAA23765 for <idr-archive@nic.merit.edu>; Mon, 15 Jul 2002 10:46:45 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 17D1991281; Mon, 15 Jul 2002 10:45:41 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id CF52691283; Mon, 15 Jul 2002 10:45:40 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 7237591281 for <idr@trapdoor.merit.edu>; Mon, 15 Jul 2002 10:45:39 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 5672A5DEAA; Mon, 15 Jul 2002 10:45:39 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from mplspop2.mpls.uswest.net (mplspop2.mpls.uswest.net [204.147.80.4]) by segue.merit.edu (Postfix) with SMTP id CBBC25DEA8 for <idr@merit.edu>; Mon, 15 Jul 2002 10:45:38 -0400 (EDT)
Received: (qmail 62772 invoked from network); 15 Jul 2002 14:45:39 -0000
Received: from mplsapanas01poolb115.mpls.uswest.net (HELO charita) (65.103.0.115) by mplspop2.mpls.uswest.net with SMTP; 15 Jul 2002 14:45:39 -0000
Date: Mon, 15 Jul 2002 09:55:56 -0500
Message-ID: <012301c22c0f$b979e420$cbc8c8c8@sdksoft.com>
From: "Parag Deshpande" <paragdeshpande@sdksoft.com>
To: idr@merit.edu
Subject: BGP-Figures
MIME-Version: 1.0
Content-Type: multipart/alternative; boundary="----=_NextPart_000_0120_01C22BE5.D0718180"
X-Priority: 3
X-MSMail-Priority: Normal
X-Mailer: Microsoft Outlook Express 5.00.2615.200
X-MimeOLE: Produced By Microsoft MimeOLE V5.00.2615.200
Sender: owner-idr@merit.edu
Precedence: bulk

This is a multi-part message in MIME format.

------=_NextPart_000_0120_01C22BE5.D0718180
Content-Type: text/plain;
	charset="iso-8859-1"
Content-Transfer-Encoding: quoted-printable

Hi,

I would appreciate if someone could provide the current figures for the =
following:
1. No. of Networks in the Internet =3D=20
2. Avg No. of BGP peers in an AS =3D=20
3. No of Autonomous Systems in the Internet =3D=20
4. Typical (No. of) BGP Routing table entries =3D=20

Thanks
Parag

------=_NextPart_000_0120_01C22BE5.D0718180
Content-Type: text/html;
	charset="iso-8859-1"
Content-Transfer-Encoding: quoted-printable

<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
<HTML><HEAD>
<META content=3D"text/html; charset=3Diso-8859-1" =
http-equiv=3DContent-Type>
<META content=3D"MSHTML 5.00.2614.3500" name=3DGENERATOR>
<STYLE></STYLE>
</HEAD>
<BODY bgColor=3D#ffffff>
<DIV><FONT size=3D2>Hi,</FONT></DIV>
<DIV>&nbsp;</DIV>
<DIV><FONT size=3D2>I would appreciate if someone could&nbsp;provide the =
current=20
figures for the following:</FONT></DIV>
<DIV><FONT size=3D2>1. No. of Networks in the Internet =3D </FONT></DIV>
<DIV><FONT size=3D2>2. Avg No. of BGP peers in an AS =3D </FONT></DIV>
<DIV><FONT size=3D2>3. No of Autonomous Systems in the Internet =3D =
</FONT></DIV>
<DIV><FONT size=3D2>4. Typical (No. of) BGP Routing table entries =3D =
</FONT></DIV>
<DIV>&nbsp;</DIV>
<DIV><FONT size=3D2>Thanks</FONT></DIV>
<DIV><FONT size=3D2>Parag</FONT></DIV></BODY></HTML>

------=_NextPart_000_0120_01C22BE5.D0718180--



Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id KAA23473 for <idr-archive@nic.merit.edu>; Mon, 15 Jul 2002 10:36:29 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 2ABD89124A; Mon, 15 Jul 2002 10:35:51 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id EA7A691281; Mon, 15 Jul 2002 10:35:50 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id CC3449124A for <idr@trapdoor.merit.edu>; Mon, 15 Jul 2002 10:35:49 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id BCEE45DE8B; Mon, 15 Jul 2002 10:35:49 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from rs-eu-exc1.rs.riverstonenet.com (unknown [193.128.15.220]) by segue.merit.edu (Postfix) with ESMTP id 43C8A5DE9D for <idr@merit.edu>; Mon, 15 Jul 2002 10:35:49 -0400 (EDT)
content-class: urn:content-classes:message
MIME-Version: 1.0
Content-Type: text/plain; charset="iso-8859-1"
Subject: ORF implementation issue
X-MimeOLE: Produced By Microsoft Exchange V6.0.5762.3
Date: Mon, 15 Jul 2002 15:35:45 +0100
Message-ID: <D2ECDCD8A528BF43A01EECCBEB191DD108F338@rs-eu-exc1.rs.riverstonenet.com>
X-MS-Has-Attach: 
X-MS-TNEF-Correlator: 
Thread-Topic: ORF implementation issue
Thread-Index: AcIsDY1FSpBv3JfZEdaEoADAT0MeLQ==
From: "Stephen Waters" <Stephen.Waters@riverstonenet.com>
To: <idr@merit.edu>
Sender: owner-idr@merit.edu
Precedence: bulk
Content-Transfer-Encoding: 8bit
X-MIME-Autoconverted: from quoted-printable to 8bit by nic.merit.edu id KAA23473

Having implemented ORF, I have a few issues with the draft. Here is the first one...


"If a particular route maintained by a BGP speaker doesn't match any
   of the ORF entries of any of the (non-empty) ORFs associated with a
   particular peer, then this route should not be advertised to the
   peer."

I think this makes things very difficult for the manager.  The idea with ORF (I think) is to 
reduce the local processing of routes by warning your peer about routes you are definitely not interested in.

Since there is no support for wild cards in spec., this text would require the manager to set up ORF entries (to be sent to the peer) for every conceivable community-mix that may be of interest. If he misses some, or a new community appears on the block, useful routes may be lost.

For example, I am will to accept routes with any one of 300 different communities, and am not interested in 20 other community groups.  I would need to send 20 DENY filters, and 300 PERMIT filters.

Now the problem:

A route can carry a number of communities, and new community numbers will come and go.  This would cause we to have to update my filters constantly.

I have taken the view that blocking useful routes would be very bad, and if the ORF filtering is in any doubt, the route should be sent and the local filtering on the remote peer can make the final decision.

So, in my implementation, if there is no matching ORF filter, the route is passed. Taking this approach also means that I can no reduce my ORF to a list of explicit DENY filters (with the occasional imbedded PERMIT).  This approach also means that if community lists change, the result is much less likely to result in lost useful routes, and unwanted routes carrying strange new community lists can be filtered as and when the manager has time.

Comments?

Regards, Steve.


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id IAA19721 for <idr-archive@nic.merit.edu>; Mon, 15 Jul 2002 08:43:00 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id D163A9127C; Mon, 15 Jul 2002 08:42:41 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 98FBF9127D; Mon, 15 Jul 2002 08:42:41 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 41D129127C for <idr@trapdoor.merit.edu>; Mon, 15 Jul 2002 08:42:40 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 326A35DE5B; Mon, 15 Jul 2002 08:42:40 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from celox-ma1-ems1.celoxnetworks.com (celox-ma1-imap1.celoxnetworks.com [12.40.60.233]) by segue.merit.edu (Postfix) with ESMTP id F132B5DE02 for <idr@merit.edu>; Mon, 15 Jul 2002 08:42:39 -0400 (EDT)
Received: by celox-ma1-ems1.celoxnetworks.com with Internet Mail Service (5.5.2653.19) id <3T6KD5P4>; Mon, 15 Jul 2002 08:42:39 -0400
Message-ID: <1117F7D44159934FB116E36F4ABF221B020AF875@celox-ma1-ems1.celoxnetworks.com>
From: "Natale, Jonathan" <JNatale@celoxnetworks.com>
To: "'Stephen Waters'" <Stephen.Waters@riverstonenet.com>, idr@merit.edu
Subject: RE: ORF implementations?
Date: Mon, 15 Jul 2002 08:42:38 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
Content-Type: text/plain
Sender: owner-idr@merit.edu
Precedence: bulk

Yes:
Cisco-12000-RtrB(config-router)#nei 1.1.1.1 capability ?
  orf  Advertise ORF capability to the peer

Cisco-12000-RtrB#sho ver
Cisco Internetwork Operating System Software
IOS (tm) GS Software (GSR-P-M), Version 12.0(18.6)ST1,

...this is ED s/w but it should be in 12.0(19)ST1 and/or other IOS.

Also:
  
http://www.cisco.com/warp/public/cc/pd/iosw/iore/iomjv122/prodlit/1363_pp.ht
m

http://www.cisco.com/univercd/cc/td/doc/product/software/ios122/122newft/122
t/122t4/ftbgporf.htm

...

http://www.cisco.com/ (search for "orf"--lot's of hits)


-----Original Message-----
From: Stephen Waters [mailto:Stephen.Waters@riverstonenet.com] 
Sent: Monday, July 15, 2002 5:39 AM
To: idr@merit.edu
Subject: ORF implementations?


Hi,

Are there any ORF implementations out there? I have a few 'problems'
following the 
draft that I'd like to discuss with any fellow ORFers.

Cheers, Steve.


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id IAA18997 for <idr-archive@nic.merit.edu>; Mon, 15 Jul 2002 08:18:53 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 3009091203; Mon, 15 Jul 2002 08:18:33 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id EFF6191245; Mon, 15 Jul 2002 08:18:32 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id A8E9A91203 for <idr@trapdoor.merit.edu>; Mon, 15 Jul 2002 08:18:31 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 89C8B5DDE9; Mon, 15 Jul 2002 08:18:31 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from celox-ma1-ems1.celoxnetworks.com (celox-ma1-imap1.celoxnetworks.com [12.40.60.233]) by segue.merit.edu (Postfix) with ESMTP id 4468F5DD8C for <idr@merit.edu>; Mon, 15 Jul 2002 08:18:31 -0400 (EDT)
Received: by celox-ma1-ems1.celoxnetworks.com with Internet Mail Service (5.5.2653.19) id <3T6KD5N5>; Mon, 15 Jul 2002 08:18:30 -0400
Message-ID: <1117F7D44159934FB116E36F4ABF221B020AF872@celox-ma1-ems1.celoxnetworks.com>
From: "Natale, Jonathan" <JNatale@celoxnetworks.com>
To: "'Susan Hares'" <skh@nexthop.com>, "Natale, Jonathan" <JNatale@celoxnetworks.com>
Cc: "'Fryczke, Pawel'" <Pawel.Fryczke@intel.com>, idr@merit.edu
Subject: RE: Aggregation and MULTI_EXIT_DISC, NEXT_HOP
Date: Mon, 15 Jul 2002 08:18:29 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
Content-Type: text/plain
Sender: owner-idr@merit.edu
Precedence: bulk

Ok

-----Original Message-----
From: Susan Hares [mailto:skh@nexthop.com] 
Sent: Sunday, July 14, 2002 4:27 AM
To: Natale, Jonathan
Cc: 'Fryczke, Pawel'; idr@merit.edu
Subject: RE: Aggregation and MULTI_EXIT_DISC, NEXT_HOP

Natale:

Draft-18 of the BGP specification is awaiting the FSM work.
Watch for the FSM wording within a week.

Please send the laundry list in two forms:

    1) Where the current draft-17 + FSM words does not
         match currently implemented code.

    2) what changes you like to see

Sue Hares

At 10:48 AM 7/9/2002 -0400, Natale, Jonathan wrote:
>I raised this issue more than two years ago.  What is the story on bgp?
Why
>is the draft/rfc so far off base?  I would be happy to provide a laundry
>list of things, if that will help.
>
>I think the MED was fixed in a major vendor's code with the ~=
deterministic
>med cmd (or so I was told then, but now the story on that cmd seem
>different).
>
>The next hop was never "fixed".  The only answer for why not to aggregate
if
>nxhp not the same was that it has something to do w/ mcast RPF... but I
>never fully digested this.
>
>-$0.02
>
>-----Original Message-----
>From: Fryczke, Pawel [mailto:Pawel.Fryczke@intel.com]
>Sent: Thursday, July 04, 2002 8:08 AM
>To: idr@merit.edu
>Subject: Aggregation and MULTI_EXIT_DISC, NEXT_HOP
>
>The latest draft, draft-ietf-idr-bgp4-17.txt states:
>Routes that have the following attributes shall not be aggregated
>unless the corresponding attributes of each route are identical:
>MULTI_EXIT_DISC, NEXT_HOP.
>
>Many vendors (e.g. Cisco/Juniper) does not follow this requirement. Why?
>What is the purpose of this requirement?
>Pawel



Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id FAA13898 for <idr-archive@nic.merit.edu>; Mon, 15 Jul 2002 05:39:22 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id BAF4B91226; Mon, 15 Jul 2002 05:39:01 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 88C8D9127C; Mon, 15 Jul 2002 05:39:01 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 49D4891226 for <idr@trapdoor.merit.edu>; Mon, 15 Jul 2002 05:39:00 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 2CA475DE22; Mon, 15 Jul 2002 05:39:00 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from rs-eu-exc1.rs.riverstonenet.com (unknown [193.128.15.220]) by segue.merit.edu (Postfix) with ESMTP id BF06A5DD8C for <idr@merit.edu>; Mon, 15 Jul 2002 05:38:59 -0400 (EDT)
content-class: urn:content-classes:message
MIME-Version: 1.0
Content-Type: text/plain; charset="iso-8859-1"
Subject: ORF implementations?
X-MimeOLE: Produced By Microsoft Exchange V6.0.5762.3
Date: Mon, 15 Jul 2002 10:38:55 +0100
Message-ID: <D2ECDCD8A528BF43A01EECCBEB191DD115ECA6@rs-eu-exc1.rs.riverstonenet.com>
X-MS-Has-Attach: 
X-MS-TNEF-Correlator: 
Thread-Topic: ORF implementations?
Thread-Index: AcIr5BYrMmBHWZfWEdaEoADAT0MeLQ==
From: "Stephen Waters" <Stephen.Waters@riverstonenet.com>
To: <idr@merit.edu>
Sender: owner-idr@merit.edu
Precedence: bulk
Content-Transfer-Encoding: 8bit
X-MIME-Autoconverted: from quoted-printable to 8bit by nic.merit.edu id FAA13898

Hi,

Are there any ORF implementations out there? I have a few 'problems' following the 
draft that I'd like to discuss with any fellow ORFers.

Cheers, Steve.


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id BAA04717 for <idr-archive@nic.merit.edu>; Mon, 15 Jul 2002 01:02:03 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 39E679120A; Mon, 15 Jul 2002 01:01:40 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id E9A2791218; Mon, 15 Jul 2002 01:01:39 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 7EA949120A for <idr@trapdoor.merit.edu>; Mon, 15 Jul 2002 01:01:38 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id E875C5DEAB; Mon, 15 Jul 2002 01:01:37 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from presque.djinesys.com (dns.nexthop.com [64.211.218.216]) by segue.merit.edu (Postfix) with ESMTP id 8BC4D5DDA6 for <idr@merit.edu>; Mon, 15 Jul 2002 01:01:37 -0400 (EDT)
Received: (from root@localhost) by presque.djinesys.com (8.11.3/8.11.1) id g6F51Y438264; Mon, 15 Jul 2002 01:01:34 -0400 (EDT) (envelope-from skh@nexthop.com)
Received: from SKH.nexthop.com (SKH.corp.nexthop.com [133.93.74.227]) by presque.djinesys.com (8.11.3/8.11.1) with ESMTP id g6F51B138238; Mon, 15 Jul 2002 01:01:11 -0400 (EDT) (envelope-from skh@nexthop.com)
Message-Id: <5.0.0.25.0.20020715005614.01d68288@mail.nexthop.com>
X-Sender: skh@mail.nexthop.com
X-Mailer: QUALCOMM Windows Eudora Version 5.0
Date: Mon, 15 Jul 2002 01:01:04 -0400
To: idr@merit.edu, ietf-agenda@nexthop.com
From: Susan Hares <skh@nexthop.com>
Subject: Agenda for IDR meeting on 
Cc: fenner@research.att.com, zinin@psg.com
Mime-Version: 1.0
Content-Type: text/plain; charset="us-ascii"; format=flowed
X-Virus-Scanned: by AMaViS perl-11
Sender: owner-idr@merit.edu
Precedence: bulk

Below is the agenda for the Thursday IDR meeting.
Please send any requests for additions to me.


================
IDR Agenda for 7/18/02

Time of meeting: 13:00 - 15:00
Room: 501

    Agenda Bashing  13:00 - 13:10

    Route Oscillation Fixes [13:10-14:00]

       1) Wilfong,BAsu, Shepherd, Ong, RAsala
	
	  presentor: Anindya Basu
	  time: 13:10 - 13:25
	
	 draft: "Eliminating IBGP oscillations"
                  draft-wilfong-ibgp-oscillations-00.txt 	
	
      2) Walton, Retano, Scudder, Cook drafts

	presentor: Alvaro Retano
	time: 13:25 - 13:40
          Advertisement of Multiple Paths in BGP
             draft-walton-bgp-add-paths-00.txt

          BGP Persistent Route Oscillation Solution
              draft-walton-bgp-route-oscillation-stop-00.txt

      3) Chen drafts overview
	
	presentor: Sue Hares
	time: 13:40 - 13:50
	
	a) BGP Route Oscillation Reduction with Confederation
             draft-chen-confed-oscillation-reduce-01.txt

	b) BGP Route Oscillation Avoidance for Route Reflection and Confederation
             draft-chen-route-oscillation-avoid-00.txt

	c) BGP Route Oscillation Reduction with Route Reflection
            draft-chen-rr-oscillation-reduce-01.txt

      4) Discussion time
	
	time: 13:50 - 14:00


    Inform proposal [14:00 - 14:15]

      1) "BGPv4 INFORM message"

	draft-nalawade-bgp-inform-00.txt
         presentor: Gargi Nalawade
	time: 14:00 - 14:15


    Status of BGP documents [14:15 - 14:30]
	
	1) BGP-4 base document status
	2) Status of all BGP drafts



Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id EAA25193 for <idr-archive@nic.merit.edu>; Sun, 14 Jul 2002 04:27:41 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 39B629123F; Sun, 14 Jul 2002 04:27:21 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id EF8889127C; Sun, 14 Jul 2002 04:27:20 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 6B8D09123F for <idr@trapdoor.merit.edu>; Sun, 14 Jul 2002 04:27:19 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 4A6E45DDB8; Sun, 14 Jul 2002 04:27:19 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from presque.djinesys.com (dns.nexthop.com [64.211.218.216]) by segue.merit.edu (Postfix) with ESMTP id E35565DD9A for <idr@merit.edu>; Sun, 14 Jul 2002 04:27:18 -0400 (EDT)
Received: (from root@localhost) by presque.djinesys.com (8.11.3/8.11.1) id g6E8RIZ16719; Sun, 14 Jul 2002 04:27:18 -0400 (EDT) (envelope-from skh@nexthop.com)
Received: from SKH.nexthop.com (SKH.corp.nexthop.com [133.93.74.227]) by presque.djinesys.com (8.11.3/8.11.1) with ESMTP id g6E8RD116712; Sun, 14 Jul 2002 04:27:13 -0400 (EDT) (envelope-from skh@nexthop.com)
Message-Id: <5.0.0.25.0.20020714041828.01dacb00@mail.nexthop.com>
X-Sender: skh@mail.nexthop.com
X-Mailer: QUALCOMM Windows Eudora Version 5.0
Date: Sun, 14 Jul 2002 04:27:25 -0400
To: "Natale, Jonathan" <JNatale@celoxnetworks.com>
From: Susan Hares <skh@nexthop.com>
Subject: RE: Aggregation and MULTI_EXIT_DISC, NEXT_HOP
Cc: "'Fryczke, Pawel'" <Pawel.Fryczke@intel.com>, idr@merit.edu
In-Reply-To: <1117F7D44159934FB116E36F4ABF221B020AF843@celox-ma1-ems1.ce loxnetworks.com>
Mime-Version: 1.0
Content-Type: text/plain; charset="us-ascii"; format=flowed
X-Virus-Scanned: by AMaViS perl-11
Sender: owner-idr@merit.edu
Precedence: bulk

Natale:

Draft-18 of the BGP specification is awaiting the FSM work.
Watch for the FSM wording within a week.

Please send the laundry list in two forms:

    1) Where the current draft-17 + FSM words does not
         match currently implemented code.

    2) what changes you like to see

Sue Hares

At 10:48 AM 7/9/2002 -0400, Natale, Jonathan wrote:
>I raised this issue more than two years ago.  What is the story on bgp?  Why
>is the draft/rfc so far off base?  I would be happy to provide a laundry
>list of things, if that will help.
>
>I think the MED was fixed in a major vendor's code with the ~= deterministic
>med cmd (or so I was told then, but now the story on that cmd seem
>different).
>
>The next hop was never "fixed".  The only answer for why not to aggregate if
>nxhp not the same was that it has something to do w/ mcast RPF... but I
>never fully digested this.
>
>-$0.02
>
>-----Original Message-----
>From: Fryczke, Pawel [mailto:Pawel.Fryczke@intel.com]
>Sent: Thursday, July 04, 2002 8:08 AM
>To: idr@merit.edu
>Subject: Aggregation and MULTI_EXIT_DISC, NEXT_HOP
>
>The latest draft, draft-ietf-idr-bgp4-17.txt states:
>Routes that have the following attributes shall not be aggregated
>unless the corresponding attributes of each route are identical:
>MULTI_EXIT_DISC, NEXT_HOP.
>
>Many vendors (e.g. Cisco/Juniper) does not follow this requirement. Why?
>What is the purpose of this requirement?
>Pawel




Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id AAA16871 for <idr-archive@nic.merit.edu>; Sun, 14 Jul 2002 00:08:29 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id A83259127A; Sun, 14 Jul 2002 00:07:52 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 7BDE99127B; Sun, 14 Jul 2002 00:07:52 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 474119127A for <idr@trapdoor.merit.edu>; Sun, 14 Jul 2002 00:07:51 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 3107B5DE30; Sun, 14 Jul 2002 00:07:51 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from presque.djinesys.com (dns.nexthop.com [64.211.218.216]) by segue.merit.edu (Postfix) with ESMTP id 9ECA35DD9A for <idr@merit.edu>; Sun, 14 Jul 2002 00:07:50 -0400 (EDT)
Received: (from root@localhost) by presque.djinesys.com (8.11.3/8.11.1) id g6E47mR11368; Sun, 14 Jul 2002 00:07:48 -0400 (EDT) (envelope-from skh@nexthop.com)
Received: from SKH.nexthop.com (ietf-wireless-dhcp-74-227.dyn.ietf54.wide.ad.jp [133.93.74.227]) by presque.djinesys.com (8.11.3/8.11.1) with ESMTP id g6E47h111360; Sun, 14 Jul 2002 00:07:44 -0400 (EDT) (envelope-from skh@nexthop.com)
Message-Id: <5.0.0.25.0.20020714000139.047f65b8@mail.nexthop.com>
X-Sender: skh@mail.nexthop.com
X-Mailer: QUALCOMM Windows Eudora Version 5.0
Date: Sun, 14 Jul 2002 00:07:55 -0400
To: Anindya Basu <basu@research.bell-labs.com>
From: Susan Hares <skh@nexthop.com>
Subject: Re: IDR agenda
Cc: Susan Hares <skh@nexthop.com>, idr@merit.edu, kmosley@nexthop.com, yakov Rekhter <yakov@juniper.net>
In-Reply-To: <3D2AEE71.15547C32@research.bell-labs.com>
References: <5.0.0.25.0.20020709062010.035bab38@mail.nexthop.com>
Mime-Version: 1.0
Content-Type: text/plain; charset="us-ascii"; format=flowed
X-Virus-Scanned: by AMaViS perl-11
Sender: owner-idr@merit.edu
Precedence: bulk

Anindya:

We'll reserve a 10-15 slot for your paper.

Sue

At 10:08 AM 7/9/2002 -0400, Anindya Basu wrote:
>Dear WG Chairs,
>
>I would like a 10-15 minute slot for presenting the work in
>draft-wilfong-ibgp-oscillations-00.txt that addresses the route
>oscillation problem identified in
>draft-ietf-idr-route-oscillation-00.txt.
>
>Thanks for your consideration,
>-Anindya Basu
>  MTS, Bell Laboratories
>  Lucent Technologies
>
>Susan Hares wrote:
> >
> > Please send any requests for the agenda to the list.
> > The final agenda will be sent on Sunday.
> > (I am traveling to Japan until that date.)
> >
> > Sue Hares




Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id XAA16560 for <idr-archive@nic.merit.edu>; Sat, 13 Jul 2002 23:59:40 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 2EF5A91279; Sat, 13 Jul 2002 23:59:12 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id E8A9A9127A; Sat, 13 Jul 2002 23:59:11 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 958F691279 for <idr@trapdoor.merit.edu>; Sat, 13 Jul 2002 23:59:10 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 020755DF0A; Sat, 13 Jul 2002 23:59:10 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from presque.djinesys.com (dns.nexthop.com [64.211.218.216]) by segue.merit.edu (Postfix) with ESMTP id 7F1B55DF0F for <idr@merit.edu>; Sat, 13 Jul 2002 23:59:09 -0400 (EDT)
Received: (from root@localhost) by presque.djinesys.com (8.11.3/8.11.1) id g6E3wq511107; Sat, 13 Jul 2002 23:58:52 -0400 (EDT) (envelope-from skh@nexthop.com)
Received: from SKH.nexthop.com (ietf-wireless-dhcp-74-227.dyn.ietf54.wide.ad.jp [133.93.74.227]) by presque.djinesys.com (8.11.3/8.11.1) with ESMTP id g6E3wf111074; Sat, 13 Jul 2002 23:58:41 -0400 (EDT) (envelope-from skh@nexthop.com)
Message-Id: <5.0.0.25.0.20020713070127.037d0668@mail.nexthop.com>
X-Sender: skh@mail.nexthop.com
X-Mailer: QUALCOMM Windows Eudora Version 5.0
Date: Sat, 13 Jul 2002 07:15:17 -0400
To: Kunihiro Ishiguro <kunihiro@zebra.org>
From: Susan Hares <skh@nexthop.com>
Subject: Re: TCP state machine doubts
Cc: hans.de_vleeschouwer@alcatel.be, BGP mailing list <idr@merit.edu>, Gilles Geerts <gilles.geerts@alcatel.be>
In-Reply-To: <m2znx5qzwh.wl@titanium.zebra.org>
References: <3D25733C.B5F1D90@alcatel.be> <3D25733C.B5F1D90@alcatel.be>
Mime-Version: 1.0
Content-Type: text/plain; charset="us-ascii"; format=flowed
X-Virus-Scanned: by AMaViS perl-11
Sender: owner-idr@merit.edu
Precedence: bulk

Kunihiro:

I've got the FSM text for draft-18 revised.  I'm looking
for a group of people to review 5th round of the FSM
text along with the associated documents:

         1) BGP state machine (ietf-hares-statemt-02.txt)
         2) BGP backoff draft (ietf-hares-backoff-01.txt)
         3) Passive and Call Collision input
         4) BGP update draft (ietf-hares-update-00.txt]

  Thanks for volunteering to review the documents prior to
  me flooding the information to the mail list.  I welcome
  any other early reviewers.

  I submitted the first two as Internet drafts, but I found a few
  typos.

  I've answered your questions below:

At 08:49 PM 7/5/2002 -0700, Kunihiro Ishiguro wrote:
> >i have a few questions related to the new state machine proposed in the
> >latest BGP draft.
>
>Current draft-17 has some known problems.  I think Sue aleady prepared
>draft of draft-18.  So best way is updating the draft to draft-18 then
>process IDR working group review to it.
>
> >1. related to terminology related to the connect retry counter.
> >I find back in the state descriptions following statement:
> >- "clears the ConnectRetry timer" (e.g. page 35, state connect)
> >- "Set connect retry timer to zero" (e.g. page 37, state Open sent)
> >- "Connect retry timer = 0," (e.g. page 38, state Open sent)
> > I assume that the meaning in all of the above cases is : "the
> >ConnectRetry timer is stopped".  If so, would this not be a better
> >wording for the action?

I've changed the text to be common.  Let me know if you like the
new wording.  Otherwise, I can add a little more text.


> >
> >2. related to the terminology for the holdtimer.
> >In state Connect on page 34 I find the following text: "set Hold timer
> >to a large value". I expect that this means "The Hold timer value is set
> >to a large value, and the Hold timer is started".

Could you look at the current text

         - Set the Hold timer to a large value (4 minutes),


> >
> >3. related to the ConnectRetryCnt.
> >- Should the value of this counter not be initialsed in the Idle state
> >as well?
>
>These are right.  I believes all of these are fixed in the draft of
>draft-18.


See the FSM words and state machine.  I could use another
pair of eyes to look at the FSM.

> >4. General doubt  related to the TCP backoff strategy
> >Suppose I have a BGP peer that goes down say once every 2hours. With the
> >current sheme the ConnectRetryCnt keeps on increasing, and as a
> >consequence the connection takes increasinly more time to be
> >re-establsihed, even if actually it only fails like every 2 hours.

The backoff mechanisms can actually change that.  I would
suggest that is another mechanism we could add in the
backoff section.   We could add a timer at that point or
agree to add another state to the FSM's timer.

Please comment on which one you'd like to see.  Please
suggest text for the backoff draft.


>My understanding is once BGP peer is establieshed, the ConnectRetryCnt
>is cleared.

This mechanism does not fix the persistent peer oscillation.
Again, I think we can change this in the backoff draft, but
maybe we need another timer to reduce the connect retry count.

Could you suggest some text based on the backoff draft?


Sue Hares



Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id QAA14156 for <idr-archive@nic.merit.edu>; Fri, 12 Jul 2002 16:20:58 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id DDF3391231; Fri, 12 Jul 2002 16:20:20 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id AFD0F913CA; Fri, 12 Jul 2002 16:20:20 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id A5A9C91231 for <idr@trapdoor.merit.edu>; Fri, 12 Jul 2002 16:20:19 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 8BFA05DDD4; Fri, 12 Jul 2002 16:20:19 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from presque.djinesys.com (dns.nexthop.com [64.211.218.216]) by segue.merit.edu (Postfix) with ESMTP id 09F9C5DD8E for <idr@merit.edu>; Fri, 12 Jul 2002 16:20:19 -0400 (EDT)
Received: (from root@localhost) by presque.djinesys.com (8.11.3/8.11.1) id g6CKKHO72805; Fri, 12 Jul 2002 16:20:17 -0400 (EDT) (envelope-from jhaas@jhaas.nexthop.com)
Received: from jhaas.nexthop.com (jhaas.nexthop.com [64.211.218.31]) by presque.djinesys.com (8.11.3/8.11.1) with ESMTP id g6CKKE172798; Fri, 12 Jul 2002 16:20:14 -0400 (EDT) (envelope-from jhaas@jhaas.nexthop.com)
Received: (from jhaas@localhost) by jhaas.nexthop.com (8.11.3nb1/8.11.3) id g6CKKEl25670; Fri, 12 Jul 2002 16:20:14 -0400 (EDT)
Date: Fri, 12 Jul 2002 16:20:14 -0400
From: Jeffrey Haas <jhaas@nexthop.com>
To: Pedro Roque Marques <roque@juniper.net>
Cc: idr@merit.edu
Subject: Re: INFORM proposal and MP-BGP (RFC 2858)
Message-ID: <20020712162014.A25656@nexthop.com>
References: <1117F7D44159934FB116E36F4ABF221B020AF86F@celox-ma1-ems1.celoxnetworks.com> <20020712145641.A25494@nexthop.com> <200207121914.g6CJEcO59697@roque-bsd.juniper.net>
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
User-Agent: Mutt/1.2.5i
In-Reply-To: <200207121914.g6CJEcO59697@roque-bsd.juniper.net>; from roque@juniper.net on Fri, Jul 12, 2002 at 12:14:38PM -0700
X-Virus-Scanned: by AMaViS perl-11
Sender: owner-idr@merit.edu
Precedence: bulk

On Fri, Jul 12, 2002 at 12:14:38PM -0700, Pedro Roque Marques wrote:
> >From both quotes above it apears to me that you are proposing to
> discard an UPDATE and leave the session up. Since BGP state is
> incremental, that would mean the two peers would now potentially be
> out of sync.

And no less intentionally than if I discarded everything via policy.
In this case, my policy is "discard bad (for some value) packets".

Currently, we can only discard and move along.
The proposal with INFORM at least lets us tell our upstream that
they did something we didn't like, if they even care.

>   Pedro.

-- 
Jeff Haas 
NextHop Technologies


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id PAA11904 for <idr-archive@nic.merit.edu>; Fri, 12 Jul 2002 15:15:09 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id B044F913C5; Fri, 12 Jul 2002 15:14:40 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 87161913C6; Fri, 12 Jul 2002 15:14:40 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 5B903913C5 for <idr@trapdoor.merit.edu>; Fri, 12 Jul 2002 15:14:39 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 4A9055DDF6; Fri, 12 Jul 2002 15:14:39 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from merlot.juniper.net (natint.juniper.net [207.17.136.129]) by segue.merit.edu (Postfix) with ESMTP id E72835DD8E for <idr@merit.edu>; Fri, 12 Jul 2002 15:14:38 -0400 (EDT)
Received: from roque-bsd.juniper.net (roque-bsd.juniper.net [172.17.12.183]) by merlot.juniper.net (8.11.3/8.11.3) with ESMTP id g6CJEcm54543; Fri, 12 Jul 2002 12:14:38 -0700 (PDT) (envelope-from roque@juniper.net)
Received: (from roque@localhost) by roque-bsd.juniper.net (8.11.6/8.9.3) id g6CJEcO59697; Fri, 12 Jul 2002 12:14:38 -0700 (PDT) (envelope-from roque)
Date: Fri, 12 Jul 2002 12:14:38 -0700 (PDT)
Message-Id: <200207121914.g6CJEcO59697@roque-bsd.juniper.net>
From: Pedro Roque Marques <roque@juniper.net>
MIME-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Transfer-Encoding: 7bit
To: Jeffrey Haas <jhaas@nexthop.com>
Cc: idr@merit.edu
Subject: Re: INFORM proposal and MP-BGP (RFC 2858)
In-Reply-To: <20020712145641.A25494@nexthop.com>
References: <1117F7D44159934FB116E36F4ABF221B020AF86F@celox-ma1-ems1.celoxnetworks.com> <20020712145641.A25494@nexthop.com>
X-Mailer: VM 6.34 under 19.16 "Lille" XEmacs Lucid
Sender: owner-idr@merit.edu
Precedence: bulk

Jeffrey Haas writes:

> On Fri, Jul 12, 2002 at 02:33:05PM -0400, Natale, Jonathan wrote:
>> This is already done--if CONFED path is received the peer stays up
>> (and the local pref gets ignored).  This is good, but not in an rfc
>> nor a draft.  My vote is for an INFORM.

> IMO, this is a case where the route's local pref shouldn't be just
> ignored, but the entire packet should be discarded.

>From both quotes above it apears to me that you are proposing to
discard an UPDATE and leave the session up. Since BGP state is
incremental, that would mean the two peers would now potentially be
out of sync.

  Pedro.


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id OAA11250 for <idr-archive@nic.merit.edu>; Fri, 12 Jul 2002 14:57:13 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id B55D39120D; Fri, 12 Jul 2002 14:56:50 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 8B30091224; Fri, 12 Jul 2002 14:56:50 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 853289120D for <idr@trapdoor.merit.edu>; Fri, 12 Jul 2002 14:56:49 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 6555D5DDD3; Fri, 12 Jul 2002 14:56:49 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from presque.djinesys.com (dns.nexthop.com [64.211.218.216]) by segue.merit.edu (Postfix) with ESMTP id D7A765DD8E for <idr@merit.edu>; Fri, 12 Jul 2002 14:56:48 -0400 (EDT)
Received: (from root@localhost) by presque.djinesys.com (8.11.3/8.11.1) id g6CIujj69345; Fri, 12 Jul 2002 14:56:45 -0400 (EDT) (envelope-from jhaas@jhaas.nexthop.com)
Received: from jhaas.nexthop.com (jhaas.nexthop.com [64.211.218.31]) by presque.djinesys.com (8.11.3/8.11.1) with ESMTP id g6CIuf169333; Fri, 12 Jul 2002 14:56:41 -0400 (EDT) (envelope-from jhaas@jhaas.nexthop.com)
Received: (from jhaas@localhost) by jhaas.nexthop.com (8.11.3nb1/8.11.3) id g6CIufM25498; Fri, 12 Jul 2002 14:56:41 -0400 (EDT)
Date: Fri, 12 Jul 2002 14:56:41 -0400
From: Jeffrey Haas <jhaas@nexthop.com>
To: "Natale, Jonathan" <JNatale@celoxnetworks.com>
Cc: Alex Zinin <zinin@psg.com>, idr@merit.edu
Subject: Re: INFORM proposal and MP-BGP (RFC 2858)
Message-ID: <20020712145641.A25494@nexthop.com>
References: <1117F7D44159934FB116E36F4ABF221B020AF86F@celox-ma1-ems1.celoxnetworks.com>
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
User-Agent: Mutt/1.2.5i
In-Reply-To: <1117F7D44159934FB116E36F4ABF221B020AF86F@celox-ma1-ems1.celoxnetworks.com>; from JNatale@celoxnetworks.com on Fri, Jul 12, 2002 at 02:33:05PM -0400
X-Virus-Scanned: by AMaViS perl-11
Sender: owner-idr@merit.edu
Precedence: bulk

On Fri, Jul 12, 2002 at 02:33:05PM -0400, Natale, Jonathan wrote:
> This is already done--if CONFED path is received the peer stays up (and the
> local pref gets ignored).  This is good, but not in an rfc nor a draft.  My
> vote is for an INFORM.

IMO, this is a case where the route's local pref shouldn't be just ignored,
but the entire packet should be discarded.

-- 
Jeff Haas 
NextHop Technologies


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id OAA10521 for <idr-archive@nic.merit.edu>; Fri, 12 Jul 2002 14:34:44 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 0C2C2913C2; Fri, 12 Jul 2002 14:33:20 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 35532913C9; Fri, 12 Jul 2002 14:33:19 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 885A1913C2 for <idr@trapdoor.merit.edu>; Fri, 12 Jul 2002 14:33:12 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 765BA5DD8E; Fri, 12 Jul 2002 14:33:12 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from celox-ma1-ems1.celoxnetworks.com (celox-ma1-imap1.celoxnetworks.com [12.40.60.233]) by segue.merit.edu (Postfix) with ESMTP id 3E16B5DDD3 for <idr@merit.edu>; Fri, 12 Jul 2002 14:33:12 -0400 (EDT)
Received: by celox-ma1-ems1.celoxnetworks.com with Internet Mail Service (5.5.2653.19) id <3T6KDWYS>; Fri, 12 Jul 2002 14:33:11 -0400
Message-ID: <1117F7D44159934FB116E36F4ABF221B020AF86F@celox-ma1-ems1.celoxnetworks.com>
From: "Natale, Jonathan" <JNatale@celoxnetworks.com>
To: "'Jeffrey Haas'" <jhaas@nexthop.com>, Alex Zinin <zinin@psg.com>
Cc: idr@merit.edu
Subject: RE: INFORM proposal and MP-BGP (RFC 2858)
Date: Fri, 12 Jul 2002 14:33:05 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
Content-Type: text/plain
Sender: owner-idr@merit.edu
Precedence: bulk

This is already done--if CONFED path is received the peer stays up (and the
local pref gets ignored).  This is good, but not in an rfc nor a draft.  My
vote is for an INFORM.

-----Original Message-----
From: Jeffrey Haas [mailto:jhaas@nexthop.com] 
Sent: Friday, July 12, 2002 2:29 PM
To: Alex Zinin
Cc: idr@merit.edu
Subject: Re: INFORM proposal and MP-BGP (RFC 2858)

Alex,

There are some circumstances under which we currently send notifications
where some weaker mechanism may be sufficient.  Almost every one of
these situations fall under the category of "well formed packet,
well-formed recognized attributes, weird/bogus data inside them".

As you infer, there are situations where we should just assume that
the buffer contains cruft and that the implementation needs a 
strong reset.

However, there are situations where we could get stuff that we can
deal with and just ignore the "Bad" things.  This was the case when
someone's implementation started leaking non-prefixed confederation
segments into the internet.  The packets were well formed, but had
cruft in them.

Similarly, if we ever add an origin type of 4, we'll have migration
issues.  If implementations just discarded updates that contained 
unrecognized data....

It may be worth our while to spend some time analyzing the protocol
for places where we can gracefully deal with failures and move along.

Or...
We could just say that NOTIFY is the right answer to deal with
confused implementations and trust to graceful restart to minimize
the impact on the Internet as a whole.  (But note that this wouldn't
have dealt with the non-prefixed confeds issue.)

On Mon, Jul 08, 2002 at 03:22:59PM -0700, Alex Zinin wrote:
>  BTW, a more generic question: if we can't trust an attribute
>  as a whole (since it's invalid), how can we trust specific fields
>  (AFI/SAFI) inside, i.e., if an MP_* attribute is malformed, it is
>  unsafe to assume that AFI/SAFI fields are correct... so, we may
>  very well end up deleting and ignoring a wrong (but valid) AFI/SAFI.

-- 
Jeff Haas 
NextHop Technologies


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id OAA10299 for <idr-archive@nic.merit.edu>; Fri, 12 Jul 2002 14:29:11 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id C1752913C0; Fri, 12 Jul 2002 14:28:43 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 8E2F2913C2; Fri, 12 Jul 2002 14:28:43 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 4ACBA913C0 for <idr@trapdoor.merit.edu>; Fri, 12 Jul 2002 14:28:41 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 2D5D45DDC7; Fri, 12 Jul 2002 14:28:41 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from presque.djinesys.com (dns.nexthop.com [64.211.218.216]) by segue.merit.edu (Postfix) with ESMTP id CDF655DD8E for <idr@merit.edu>; Fri, 12 Jul 2002 14:28:40 -0400 (EDT)
Received: (from root@localhost) by presque.djinesys.com (8.11.3/8.11.1) id g6CIScB68557; Fri, 12 Jul 2002 14:28:38 -0400 (EDT) (envelope-from jhaas@jhaas.nexthop.com)
Received: from jhaas.nexthop.com (jhaas.nexthop.com [64.211.218.31]) by presque.djinesys.com (8.11.3/8.11.1) with ESMTP id g6CISX168536; Fri, 12 Jul 2002 14:28:33 -0400 (EDT) (envelope-from jhaas@jhaas.nexthop.com)
Received: (from jhaas@localhost) by jhaas.nexthop.com (8.11.3nb1/8.11.3) id g6CISXD25432; Fri, 12 Jul 2002 14:28:33 -0400 (EDT)
Date: Fri, 12 Jul 2002 14:28:33 -0400
From: Jeffrey Haas <jhaas@nexthop.com>
To: Alex Zinin <zinin@psg.com>
Cc: idr@merit.edu
Subject: Re: INFORM proposal and MP-BGP (RFC 2858)
Message-ID: <20020712142833.B25315@nexthop.com>
References: <20020708144248.C11658@nexthop.com> <13164442886.20020708152259@psg.com>
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
User-Agent: Mutt/1.2.5i
In-Reply-To: <13164442886.20020708152259@psg.com>; from zinin@psg.com on Mon, Jul 08, 2002 at 03:22:59PM -0700
X-Virus-Scanned: by AMaViS perl-11
Sender: owner-idr@merit.edu
Precedence: bulk

Alex,

There are some circumstances under which we currently send notifications
where some weaker mechanism may be sufficient.  Almost every one of
these situations fall under the category of "well formed packet,
well-formed recognized attributes, weird/bogus data inside them".

As you infer, there are situations where we should just assume that
the buffer contains cruft and that the implementation needs a 
strong reset.

However, there are situations where we could get stuff that we can
deal with and just ignore the "Bad" things.  This was the case when
someone's implementation started leaking non-prefixed confederation
segments into the internet.  The packets were well formed, but had
cruft in them.

Similarly, if we ever add an origin type of 4, we'll have migration
issues.  If implementations just discarded updates that contained 
unrecognized data....

It may be worth our while to spend some time analyzing the protocol
for places where we can gracefully deal with failures and move along.

Or...
We could just say that NOTIFY is the right answer to deal with
confused implementations and trust to graceful restart to minimize
the impact on the Internet as a whole.  (But note that this wouldn't
have dealt with the non-prefixed confeds issue.)

On Mon, Jul 08, 2002 at 03:22:59PM -0700, Alex Zinin wrote:
>  BTW, a more generic question: if we can't trust an attribute
>  as a whole (since it's invalid), how can we trust specific fields
>  (AFI/SAFI) inside, i.e., if an MP_* attribute is malformed, it is
>  unsafe to assume that AFI/SAFI fields are correct... so, we may
>  very well end up deleting and ignoring a wrong (but valid) AFI/SAFI.

-- 
Jeff Haas 
NextHop Technologies


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id NAA08261 for <idr-archive@nic.merit.edu>; Fri, 12 Jul 2002 13:29:44 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id DB01B913BC; Fri, 12 Jul 2002 13:29:14 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id A5CCA913BD; Fri, 12 Jul 2002 13:29:14 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 4D77E913BC for <idr@trapdoor.merit.edu>; Fri, 12 Jul 2002 13:29:13 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 2EE775DDA2; Fri, 12 Jul 2002 13:29:13 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from presque.djinesys.com (dns.nexthop.com [64.211.218.216]) by segue.merit.edu (Postfix) with ESMTP id CF9515DD8E for <idr@merit.edu>; Fri, 12 Jul 2002 13:29:12 -0400 (EDT)
Received: (from root@localhost) by presque.djinesys.com (8.11.3/8.11.1) id g6CHT0D66954; Fri, 12 Jul 2002 13:29:00 -0400 (EDT) (envelope-from jhaas@jhaas.nexthop.com)
Received: from jhaas.nexthop.com (jhaas.nexthop.com [64.211.218.31]) by presque.djinesys.com (8.11.3/8.11.1) with ESMTP id g6CHSu166947; Fri, 12 Jul 2002 13:28:56 -0400 (EDT) (envelope-from jhaas@jhaas.nexthop.com)
Received: (from jhaas@localhost) by jhaas.nexthop.com (8.11.3nb1/8.11.3) id g6CHStx25343; Fri, 12 Jul 2002 13:28:55 -0400 (EDT)
Date: Fri, 12 Jul 2002 13:28:55 -0400
From: Jeffrey Haas <jhaas@nexthop.com>
To: john heasley <heas@shrubbery.net>
Cc: "Srihari R. Sangli" <srihari@procket.com>, Stephen Waters <Stephen.Waters@riverstonenet.com>, "Abarbanel, Benjamin" <Benjamin.Abarbanel@marconi.com>, idr@merit.edu
Subject: Re: Re>Are dynamic capabilities really necessary if peers support gracef ul restart?
Message-ID: <20020712132855.A25315@nexthop.com>
References: <D2ECDCD8A528BF43A01EECCBEB191DD108F332@rs-eu-exc1.rs.riverstonenet.com> <Pine.GSO.4.40.0207111123500.192-100000@miata.procket.com> <20020711114409.P16814@shrubbery.net>
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
User-Agent: Mutt/1.2.5i
In-Reply-To: <20020711114409.P16814@shrubbery.net>; from heas@shrubbery.net on Thu, Jul 11, 2002 at 11:44:09AM -0700
X-Virus-Scanned: by AMaViS perl-11
Sender: owner-idr@merit.edu
Precedence: bulk

On Thu, Jul 11, 2002 at 11:44:09AM -0700, john heasley wrote:
> 	"it must be possible to turn such options off and force
> 	 values/capabilities that will be required."

Considered from a slightly different angle, do you perhaps also mean:
1. If I said I supported some AFI/SAFI
2. I decide that your implementation of said AFI/SAFI is buggy
3. I send you a dynamic capability update withdrawing that AFI/SAFI
4. If you keep sending it after I sent you the capability update, what
   should I do?

To 4, answers are:
1. Wait a certain amount of time for some queues to clear.  If we
   still receive the unwanted messages, we should then NOTIFY.
2. Grin and bear it.

Other observations?

-- 
Jeff Haas 
NextHop Technologies


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id LAA03492 for <idr-archive@nic.merit.edu>; Fri, 12 Jul 2002 11:06:58 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id D524A913B6; Fri, 12 Jul 2002 11:06:28 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 9DE7A913B7; Fri, 12 Jul 2002 11:06:28 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 72396913B6 for <idr@trapdoor.merit.edu>; Fri, 12 Jul 2002 11:06:27 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 6272E5DEE0; Fri, 12 Jul 2002 11:06:27 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from tenornetworks.com (rtu.tenornetworks.com [63.77.213.2]) by segue.merit.edu (Postfix) with ESMTP id D097D5DDA2 for <idr@merit.edu>; Fri, 12 Jul 2002 11:06:26 -0400 (EDT)
Received: from tenornet.com (newman [192.168.0.185]) by tenornetworks.com (Switch-2.1.3/Switch-2.1.0) with ESMTP id g6CF6Nd09223; Fri, 12 Jul 2002 11:06:23 -0400 (EDT)
Received: from 192.168.0.185 by tenornet.com; FRI, 12 Jul 2002 11:01:38 -0700
Received: by newman.tenornet.com with Internet Mail Service (5.5.2650.21) id <N1HXZ5BJ>; Fri, 12 Jul 2002 11:01:38 -0400
Message-ID: <6B190B34070BD411ACA000B0D0214E560165B817@newman.tenornet.com>
From: "Tsillas, Jim" <jtsillas@tenornetworks.com>
To: "'Bruno Rijsman'" <bwarijsman@hotmail.com>, idr@merit.edu
Subject: RE: Re> Unsupported capability notification
Date: Fri, 12 Jul 2002 11:01:37 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2650.21)
Content-Type: text/plain; charset="iso-8859-1"
Sender: owner-idr@merit.edu
Precedence: bulk

Bruno,

how do most vendors handle the case where the MP-BGP
capabilities are disjoint? One could make a case for
just letting the session stay alive and not sending
the NOTIFY. After all a peer may be interested in receiving
a certain MP-BGP AF but not sending it.

I agree that in most cases sending the "unsupported
capability" notification is the wrong things to do if
you receive an unsupported capability. The text in the
draft is clear on this point, but a concrete example
would help.

-Jim.

-----Original Message-----
From: Bruno Rijsman [mailto:bwarijsman@hotmail.com]
Sent: Friday, July 12, 2002 9:38 AM
To: idr@merit.edu
Cc: JNatale@celoxnetworks.com
Subject: Re> Unsupported capability notification


The usage of the unsupported capability notification is widely
misunderstood and this is causing real problems in real networks.

If you receive a capability code X which you do not recognize,
you MUST NOT send back an unsupported capability notification.
It is sufficient for you to simply not advertise capability X
and as a result the peer will known you do not support capability X.

If you receive a capability code Y which you do recognize but
for some reason you do not want to use capability Y on that session
you SHOULD NOT send back an unsupported capability notification. It is
sufficient for you to simply not advertise capability Y and as a result
the peer will knwon you do not support capability Y.

I known of only one case where the sending of an "unsupported
capability" notification is justified: if both sides propose the
multi-protocol capability but the proposed set of AFI/SAFIs is
completely disjunct.

There are many implementations out there that send "unsupported
capability" notifications when they should not. For example,
there is one widely deployed implementation that sends an
"unsupported capability" notification when it receives the
four-octet-as-numbers capability. Even worse: this implementation
sends the notification without any data (i.e. we don't know which
is the offending capability) and often sends the notification before
the open (i.e. we can't guess which capabilities they support by
looking at their open).









--- Original message ---

"If a BGP speaker that supports a certain capability determines that
   its peer doesn't support this capability, the speaker may send a
   NOTIFICATION message to the peer, and terminate peering. The Error
   Subcode in the message is set to Unsupported Capability. The message
   should contain the capability (capabilities) that causes the speaker
   to send the message.  The decision to send the message and terminate
   peering is local to the speaker.  Such peering should not be re-
   established automatically"
--RFC2842, "Capabilities Advertisement with BGP-4"


This sounds like:
    if rtrA is doing capC   and rtrB is not doing CapC,
        then rtrA will send an UnsupCapNotifErrorSubcode=capC

    ...so if you were to send UnsupCapNotifErrorSubcode=capC
        then you would/could/should re-open with the offending capability

It seems to me that it should be:
    if rtrA is doing capC   and rtrB is not doing CapC,
        then ***rtrB*** will send an UnsupCapNotifErrorSubcode=capC

    ...so if you were to ***receive*** UnsupCapNotifErrorSubcode=capC
        then you would/could/should re-open without the offending capability

Thoughts?

Thank you very much for your time.


_________________________________________________________________
MSN Photos is the easiest way to share and print your photos: 
http://photos.msn.com/support/worldwide.aspx



Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id LAA03455 for <idr-archive@nic.merit.edu>; Fri, 12 Jul 2002 11:05:56 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id E2515913BA; Fri, 12 Jul 2002 11:05:27 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id AD1FC913C2; Fri, 12 Jul 2002 11:05:27 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 45406913BA for <idr@trapdoor.merit.edu>; Fri, 12 Jul 2002 11:05:21 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 33C3E5DEE0; Fri, 12 Jul 2002 11:05:21 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from celox-ma1-ems1.celoxnetworks.com (celox-ma1-imap1.celoxnetworks.com [12.40.60.233]) by segue.merit.edu (Postfix) with ESMTP id E3B4E5DDA2 for <idr@merit.edu>; Fri, 12 Jul 2002 11:05:20 -0400 (EDT)
Received: by celox-ma1-ems1.celoxnetworks.com with Internet Mail Service (5.5.2653.19) id <3T6KDW2B>; Fri, 12 Jul 2002 11:05:20 -0400
Message-ID: <1117F7D44159934FB116E36F4ABF221B020AF869@celox-ma1-ems1.celoxnetworks.com>
From: "Natale, Jonathan" <JNatale@celoxnetworks.com>
To: "'Tsillas, Jim'" <jtsillas@tenornetworks.com>, "'Mathew Richardson'" <mrr@nexthop.com>, idr@merit.edu
Subject: RE: Unsupported capability notification
Date: Fri, 12 Jul 2002 11:05:18 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
Content-Type: text/plain
Sender: owner-idr@merit.edu
Precedence: bulk

Yes, I think it should

-----Original Message-----
From: Tsillas, Jim [mailto:jtsillas@tenornetworks.com] 
Sent: Friday, July 12, 2002 10:40 AM
To: 'Mathew Richardson'; idr@merit.edu
Subject: RE: Unsupported capability notification

Maybe the draft should be updated to make this clear. I know
it took me a while to understand the intent and then only after
I observed other implementations.

-Jim.

-----Original Message-----
From: Mathew Richardson [mailto:mrr@nexthop.com]
Sent: Friday, July 12, 2002 10:41 AM
To: idr@merit.edu
Subject: Re: Unsupported capability notification


> Natale, Jonathan <JNatale@celoxnetworks.com> [Fri, Jul 12, 2002 at
09:50:43AM -0400]:
>Thanks.
>
>My question is:
>Does TXing unsupported capability=C mean:
>A) I want to RX open with cap. C
>OR
>B) I want to RX open without cap. C

<snip>

A.

>Before you answer B), which seems to be the obvious answer,

<snip>

It may seem obvious, but it is incorrect.  The router sending unsupported
capability is saying "I won't peer with you because you don't support
this capability..."

mrr


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id KAA03240 for <idr-archive@nic.merit.edu>; Fri, 12 Jul 2002 10:59:51 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id E2A4A913B4; Fri, 12 Jul 2002 10:59:28 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id AB5FF913B5; Fri, 12 Jul 2002 10:59:28 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id B8914913B4 for <idr@trapdoor.merit.edu>; Fri, 12 Jul 2002 10:59:27 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 852395DEE0; Fri, 12 Jul 2002 10:59:27 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from mailgate.pit.comms.marconi.com (mailgate.pit.comms.marconi.com [169.144.68.6]) by segue.merit.edu (Postfix) with ESMTP id 32E9F5DDA2 for <idr@merit.edu>; Fri, 12 Jul 2002 10:59:27 -0400 (EDT)
Received: from mailman.pit.comms.marconi.com (mailman.pit.comms.marconi.com [169.144.2.12]) by mailgate.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id KAA02727; Fri, 12 Jul 2002 10:59:24 -0400 (EDT)
Received: from whq-msgrtr-01.pit.comms.marconi.com (whq-msgrtr-01.pit.comms.marconi.com [169.144.2.221]) by mailman.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id KAA14189; Fri, 12 Jul 2002 10:59:25 -0400 (EDT)
Received: by whq-msgrtr-01.pit.comms.marconi.com with Internet Mail Service (5.5.2650.21) id <3W98TLV4>; Fri, 12 Jul 2002 10:59:24 -0400
Message-ID: <39469E08BD83D411A3D900204840EC5582279B@vie-msgusr-01.dc.fore.com>
From: "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
To: "'Bruno Rijsman'" <bwarijsman@hotmail.com>, idr@merit.edu
Cc: JNatale@celoxnetworks.com
Subject: RE: Re> Unsupported capability notification
Date: Fri, 12 Jul 2002 10:59:24 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2650.21)
Content-Type: text/plain; charset="iso-8859-1"
Sender: owner-idr@merit.edu
Precedence: bulk

Agreed, if either router has no common ground to communicate on a basic
level
say IPv4 routes, then the session should not be established. But if there
are many
capabilities that are not agreed but one that does, the session should be
allowed
to be established. Again, rejecting router should send some sort of
indication to
offending router of the problem so there is no confusion on what the issue
is. This
notification should NOT cause a session drop.

If they simply state that in the spec, all confusion will be eliminated.

Ben

> -----Original Message-----
> From: Bruno Rijsman [mailto:bwarijsman@hotmail.com]
> Sent: Friday, July 12, 2002 10:11 AM
> To: idr@merit.edu
> Cc: JNatale@celoxnetworks.com
> Subject: RE: Re> Unsupported capability notification
> 
> 
> Re> My question is:
> Re> Does TXing unsupported capability=C mean:
> Re> A) I want to RX open with cap. C
> Re> OR
> Re> B) I want to RX open without cap. C
> 
> Sending an "unsupported capability notification containing
> capability C" means: "The fact that you support capability
> C means I don't want to have a session with you" or to put
> it differently "I want you to stop advertising capability
> C to me".
> 
> There are very few situations where this is really what you want.
> 
> If I simply don't support capability C or I don't want to use
> capability C on that session, I could simply refrain from advertising
> capability C in my open.
> 
> The one case that I know where sending a notification is justified is
> in multi-protocol negotiation. Say router A proposes multi-protocol
> ipv4 unicast. Router B proposes multi-protocol ipv6 multicast. When B
> receives the open from A (or vice versa) it may send an "unsupported
> capability" notification because they have no address families in
> common to exchange ("well if you are only interested in talking about
> ipv4 I don't want to talk to you").
> 
> 
> _________________________________________________________________
> Send and receive Hotmail on your mobile device: http://mobile.msn.com
> 


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id KAA02929 for <idr-archive@nic.merit.edu>; Fri, 12 Jul 2002 10:51:20 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id B98DC913B3; Fri, 12 Jul 2002 10:50:52 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 8863B913B4; Fri, 12 Jul 2002 10:50:52 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 23DE1913B3 for <idr@trapdoor.merit.edu>; Fri, 12 Jul 2002 10:50:51 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 121225DE77; Fri, 12 Jul 2002 10:50:51 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from celox-ma1-ems1.celoxnetworks.com (celox-ma1-imap1.celoxnetworks.com [12.40.60.233]) by segue.merit.edu (Postfix) with ESMTP id CBC225DDA2 for <idr@merit.edu>; Fri, 12 Jul 2002 10:50:50 -0400 (EDT)
Received: by celox-ma1-ems1.celoxnetworks.com with Internet Mail Service (5.5.2653.19) id <3T6KDWGH>; Fri, 12 Jul 2002 10:50:50 -0400
Message-ID: <1117F7D44159934FB116E36F4ABF221B020AF868@celox-ma1-ems1.celoxnetworks.com>
From: "Natale, Jonathan" <JNatale@celoxnetworks.com>
To: "'Abarbanel, Benjamin'" <Benjamin.Abarbanel@Marconi.com>, "'Bruno Rijsman'" <bwarijsman@hotmail.com>, idr@merit.edu
Cc: "Natale, Jonathan" <JNatale@celoxnetworks.com>
Subject: RE: Re> Unsupported capability notification
Date: Fri, 12 Jul 2002 10:50:45 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
Content-Type: text/plain
Sender: owner-idr@merit.edu
Precedence: bulk

True.

But in the case of the capability, the other side will not need to guess, it
knows what you support by you OPEN:
" A BGP speaker determines the capabilities supported by its peer by
   examining the list of capabilities present in the Capabilities
   Optional Parameter carried by the OPEN message that the speaker
   receives from the peer."

...This part of the spec seems very clear and very sensible.


-----Original Message-----
From: Abarbanel, Benjamin [mailto:Benjamin.Abarbanel@Marconi.com] 
Sent: Friday, July 12, 2002 10:40 AM
To: 'Bruno Rijsman'; idr@merit.edu
Cc: JNatale@celoxnetworks.com
Subject: RE: Re> Unsupported capability notification

Guys:
  I really think the NOTIFY message without clearing the session can take 
care of all of these conflicts and still let both routers know what they 
can or cannot do. Just ignoring something and hoping the other side will
figure it out by deduction, usually leads to assumptions that are wrong.

Ben


> -----Original Message-----
> From: Bruno Rijsman [mailto:bwarijsman@hotmail.com]
> Sent: Friday, July 12, 2002 9:38 AM
> To: idr@merit.edu
> Cc: JNatale@celoxnetworks.com
> Subject: Re> Unsupported capability notification
> 
> 
> The usage of the unsupported capability notification is widely
> misunderstood and this is causing real problems in real networks.
> 
> If you receive a capability code X which you do not recognize,
> you MUST NOT send back an unsupported capability notification.
> It is sufficient for you to simply not advertise capability X
> and as a result the peer will known you do not support capability X.
> 
> If you receive a capability code Y which you do recognize but
> for some reason you do not want to use capability Y on that session
> you SHOULD NOT send back an unsupported capability notification. It is
> sufficient for you to simply not advertise capability Y and 
> as a result
> the peer will knwon you do not support capability Y.
> 
> I known of only one case where the sending of an "unsupported
> capability" notification is justified: if both sides propose the
> multi-protocol capability but the proposed set of AFI/SAFIs is
> completely disjunct.
> 
> There are many implementations out there that send "unsupported
> capability" notifications when they should not. For example,
> there is one widely deployed implementation that sends an
> "unsupported capability" notification when it receives the
> four-octet-as-numbers capability. Even worse: this implementation
> sends the notification without any data (i.e. we don't know which
> is the offending capability) and often sends the notification before
> the open (i.e. we can't guess which capabilities they support by
> looking at their open).
> 
> 
> 
> 
> 
> 
> 
> 
> 
> --- Original message ---
> 
> "If a BGP speaker that supports a certain capability determines that
>    its peer doesn't support this capability, the speaker may send a
>    NOTIFICATION message to the peer, and terminate peering. The Error
>    Subcode in the message is set to Unsupported Capability. 
> The message
>    should contain the capability (capabilities) that causes 
> the speaker
>    to send the message.  The decision to send the message and 
> terminate
>    peering is local to the speaker.  Such peering should not be re-
>    established automatically"
> --RFC2842, "Capabilities Advertisement with BGP-4"
> 
> 
> This sounds like:
>     if rtrA is doing capC   and rtrB is not doing CapC,
>         then rtrA will send an UnsupCapNotifErrorSubcode=capC
> 
>     ...so if you were to send UnsupCapNotifErrorSubcode=capC
>         then you would/could/should re-open with the 
> offending capability
> 
> It seems to me that it should be:
>     if rtrA is doing capC   and rtrB is not doing CapC,
>         then ***rtrB*** will send an UnsupCapNotifErrorSubcode=capC
> 
>     ...so if you were to ***receive*** UnsupCapNotifErrorSubcode=capC
>         then you would/could/should re-open without the 
> offending capability
> 
> Thoughts?
> 
> Thank you very much for your time.
> 
> 
> _________________________________________________________________
> MSN Photos is the easiest way to share and print your photos: 
> http://photos.msn.com/support/worldwide.aspx
> 


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id KAA02798 for <idr-archive@nic.merit.edu>; Fri, 12 Jul 2002 10:46:56 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id E0CDE913B2; Fri, 12 Jul 2002 10:46:02 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id A7A7C913B3; Fri, 12 Jul 2002 10:46:02 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 45827913B2 for <idr@trapdoor.merit.edu>; Fri, 12 Jul 2002 10:46:01 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 26AF85DDA2; Fri, 12 Jul 2002 10:46:01 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from celox-ma1-ems1.celoxnetworks.com (celox-ma1-imap1.celoxnetworks.com [12.40.60.233]) by segue.merit.edu (Postfix) with ESMTP id DDDA45DE77 for <idr@merit.edu>; Fri, 12 Jul 2002 10:46:00 -0400 (EDT)
Received: by celox-ma1-ems1.celoxnetworks.com with Internet Mail Service (5.5.2653.19) id <3T6KDWFD>; Fri, 12 Jul 2002 10:46:00 -0400
Message-ID: <1117F7D44159934FB116E36F4ABF221B020AF867@celox-ma1-ems1.celoxnetworks.com>
From: "Natale, Jonathan" <JNatale@celoxnetworks.com>
To: "'Abarbanel, Benjamin'" <Benjamin.Abarbanel@Marconi.com>, "Natale, Jonathan" <JNatale@celoxnetworks.com>, "'Bgp (isp-bgp@isp-bgp.com)'" <isp-bgp@isp-bgp.com>, "'idr@merit.edu'" <idr@merit.edu>, "'isp-routing@isp-routing.com'" <isp-routing@isp-routing.com>
Cc: "'Bruno Rijsman'" <bwarijsman@hotmail.com>
Subject: RE: BGP Capability Negotiation
Date: Fri, 12 Jul 2002 10:45:51 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
Content-Type: text/plain
Sender: owner-idr@merit.edu
Precedence: bulk

Thanks,  I agree with Bruno:
" The one case that I know where sending a notification is justified is in
multi-protocol negotiation. Say router A proposes multi-protocol ipv4
unicast. Router B proposes multi-protocol ipv6 multicast. When B receives
the open from A (or vice versa) it may send an "unsupported capability"
notification because they have no address families in common to exchange
("well if you are only interested in talking about ipv4 I don't want to talk
to you")."

--this seems to be the only case.

-----Original Message-----
From: Abarbanel, Benjamin [mailto:Benjamin.Abarbanel@Marconi.com] 
Sent: Friday, July 12, 2002 10:36 AM
To: 'Natale, Jonathan'; 'Bgp (isp-bgp@isp-bgp.com)'; 'idr@merit.edu';
'isp-routing@isp-routing.com'
Subject: RE: BGP Capability Negotiation

Jonathan:

as the original spec says. 
> 
> "If a BGP speaker that supports a certain capability determines that
>    its peer doesn't support this capability, the speaker may send a
>    NOTIFICATION message to the peer, and terminate peering. The Error
>    Subcode in the message is set to Unsupported Capability. 
> The message
>    should contain the capability (capabilities) that causes 
> the speaker
>    to send the message.  The decision to send the message and 
> terminate
>    peering is local to the speaker.  Such peering should not be re-
>    established automatically"
> --RFC2842, "Capabilities Advertisement with BGP-4"
> 
> 

It should say, "such peering with the offending capability should not be 
re-established automatically. Thus if the offending router drops the
capability
from the OPEN message, it can retry to establish the session."

If I were a network operator, I rather have a session up with less
capabilities, 
than not have one at all and loose reachability through a perfectly good
router.
Or till the operator manually unconfigured the capability from the offending
router.

Ben


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id KAA02760 for <idr-archive@nic.merit.edu>; Fri, 12 Jul 2002 10:45:57 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 125F3913B1; Fri, 12 Jul 2002 10:45:08 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 7B416913B2; Fri, 12 Jul 2002 10:45:07 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 50D79913B1 for <idr@trapdoor.merit.edu>; Fri, 12 Jul 2002 10:45:05 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id ECAC35DEE0; Fri, 12 Jul 2002 10:45:04 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from tenornetworks.com (rtu.tenornetworks.com [63.77.213.2]) by segue.merit.edu (Postfix) with ESMTP id 3F3235DDA2 for <idr@merit.edu>; Fri, 12 Jul 2002 10:45:04 -0400 (EDT)
Received: from tenornet.com (newman [192.168.0.185]) by tenornetworks.com (Switch-2.1.3/Switch-2.1.0) with ESMTP id g6CEj2d08306; Fri, 12 Jul 2002 10:45:02 -0400 (EDT)
Received: from 192.168.0.185 by tenornet.com; FRI, 12 Jul 2002 10:40:22 -0700
Received: by newman.tenornet.com with Internet Mail Service (5.5.2650.21) id <N1HXZ5A2>; Fri, 12 Jul 2002 10:40:22 -0400
Message-ID: <6B190B34070BD411ACA000B0D0214E560165B815@newman.tenornet.com>
From: "Tsillas, Jim" <jtsillas@tenornetworks.com>
To: "'Mathew Richardson'" <mrr@nexthop.com>, idr@merit.edu
Subject: RE: Unsupported capability notification
Date: Fri, 12 Jul 2002 10:40:21 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2650.21)
Content-Type: text/plain; charset="iso-8859-1"
Sender: owner-idr@merit.edu
Precedence: bulk

Maybe the draft should be updated to make this clear. I know
it took me a while to understand the intent and then only after
I observed other implementations.

-Jim.

-----Original Message-----
From: Mathew Richardson [mailto:mrr@nexthop.com]
Sent: Friday, July 12, 2002 10:41 AM
To: idr@merit.edu
Subject: Re: Unsupported capability notification


> Natale, Jonathan <JNatale@celoxnetworks.com> [Fri, Jul 12, 2002 at
09:50:43AM -0400]:
>Thanks.
>
>My question is:
>Does TXing unsupported capability=C mean:
>A) I want to RX open with cap. C
>OR
>B) I want to RX open without cap. C

<snip>

A.

>Before you answer B), which seems to be the obvious answer,

<snip>

It may seem obvious, but it is incorrect.  The router sending unsupported
capability is saying "I won't peer with you because you don't support
this capability..."

mrr


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id KAA02571 for <idr-archive@nic.merit.edu>; Fri, 12 Jul 2002 10:41:16 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id B0C59913B0; Fri, 12 Jul 2002 10:41:02 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 7F8FD913B1; Fri, 12 Jul 2002 10:41:02 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 58469913B0 for <idr@trapdoor.merit.edu>; Fri, 12 Jul 2002 10:41:01 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 45F755DEE0; Fri, 12 Jul 2002 10:41:01 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from presque.djinesys.com (dns.nexthop.com [64.211.218.216]) by segue.merit.edu (Postfix) with ESMTP id B70AE5DDA2 for <idr@merit.edu>; Fri, 12 Jul 2002 10:41:00 -0400 (EDT)
Received: (from root@localhost) by presque.djinesys.com (8.11.3/8.11.1) id g6CEf0r60997 for idr@merit.edu; Fri, 12 Jul 2002 10:41:00 -0400 (EDT) (envelope-from mrr@mrichardson.nexthop.com)
Received: from paradigm.nexthop.com (mrichardson.nexthop.com [64.211.218.45]) by presque.djinesys.com (8.11.3/8.11.1) with ESMTP id g6CEev160989 for <idr@merit.edu>; Fri, 12 Jul 2002 10:40:57 -0400 (EDT) (envelope-from mrr@mrichardson.nexthop.com)
Received: (from mrr@localhost) by paradigm.nexthop.com (8.11.6/8.11.6) id g6CEeuE27660 for idr@merit.edu; Fri, 12 Jul 2002 10:40:56 -0400 (EDT)
Date: Fri, 12 Jul 2002 10:40:55 -0400
From: Mathew Richardson <mrr@nexthop.com>
To: idr@merit.edu
Subject: Re: Unsupported capability notification
Message-ID: <20020712104055.C27251@nexthop.com>
References: <1117F7D44159934FB116E36F4ABF221B020AF862@celox-ma1-ems1.celoxnetworks.com>
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
User-Agent: Mutt/1.2.5.1i
In-Reply-To: <1117F7D44159934FB116E36F4ABF221B020AF862@celox-ma1-ems1.celoxnetworks.com>; from JNatale@celoxnetworks.com on Fri, Jul 12, 2002 at 09:50:43AM -0400
X-Virus-Scanned: by AMaViS perl-11
Sender: owner-idr@merit.edu
Precedence: bulk

> Natale, Jonathan <JNatale@celoxnetworks.com> [Fri, Jul 12, 2002 at 09:50:43AM -0400]:
>Thanks.
>
>My question is:
>Does TXing unsupported capability=C mean:
>A) I want to RX open with cap. C
>OR
>B) I want to RX open without cap. C

<snip>

A.

>Before you answer B), which seems to be the obvious answer,

<snip>

It may seem obvious, but it is incorrect.  The router sending unsupported
capability is saying "I won't peer with you because you don't support
this capability..."

mrr


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id KAA02534 for <idr-archive@nic.merit.edu>; Fri, 12 Jul 2002 10:40:02 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 6285F913AF; Fri, 12 Jul 2002 10:39:43 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 23015913B0; Fri, 12 Jul 2002 10:39:43 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 21F49913AF for <idr@trapdoor.merit.edu>; Fri, 12 Jul 2002 10:39:42 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 11C055DEE0; Fri, 12 Jul 2002 10:39:42 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from mailgate.pit.comms.marconi.com (mailgate.pit.comms.marconi.com [169.144.68.6]) by segue.merit.edu (Postfix) with ESMTP id 8AB4A5DDA2 for <idr@merit.edu>; Fri, 12 Jul 2002 10:39:41 -0400 (EDT)
Received: from mailman.pit.comms.marconi.com (mailman.pit.comms.marconi.com [169.144.2.12]) by mailgate.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id KAA01332; Fri, 12 Jul 2002 10:39:39 -0400 (EDT)
Received: from whq-msgrtr-01.pit.comms.marconi.com (whq-msgrtr-01.pit.comms.marconi.com [169.144.2.221]) by mailman.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id KAA08921; Fri, 12 Jul 2002 10:39:39 -0400 (EDT)
Received: by whq-msgrtr-01.pit.comms.marconi.com with Internet Mail Service (5.5.2650.21) id <3W98TK19>; Fri, 12 Jul 2002 10:39:38 -0400
Message-ID: <39469E08BD83D411A3D900204840EC55822799@vie-msgusr-01.dc.fore.com>
From: "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
To: "'Bruno Rijsman'" <bwarijsman@hotmail.com>, idr@merit.edu
Cc: JNatale@celoxnetworks.com
Subject: RE: Re> Unsupported capability notification
Date: Fri, 12 Jul 2002 10:39:37 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2650.21)
Content-Type: text/plain; charset="iso-8859-1"
Sender: owner-idr@merit.edu
Precedence: bulk

Guys:
  I really think the NOTIFY message without clearing the session can take 
care of all of these conflicts and still let both routers know what they 
can or cannot do. Just ignoring something and hoping the other side will
figure it out by deduction, usually leads to assumptions that are wrong.

Ben


> -----Original Message-----
> From: Bruno Rijsman [mailto:bwarijsman@hotmail.com]
> Sent: Friday, July 12, 2002 9:38 AM
> To: idr@merit.edu
> Cc: JNatale@celoxnetworks.com
> Subject: Re> Unsupported capability notification
> 
> 
> The usage of the unsupported capability notification is widely
> misunderstood and this is causing real problems in real networks.
> 
> If you receive a capability code X which you do not recognize,
> you MUST NOT send back an unsupported capability notification.
> It is sufficient for you to simply not advertise capability X
> and as a result the peer will known you do not support capability X.
> 
> If you receive a capability code Y which you do recognize but
> for some reason you do not want to use capability Y on that session
> you SHOULD NOT send back an unsupported capability notification. It is
> sufficient for you to simply not advertise capability Y and 
> as a result
> the peer will knwon you do not support capability Y.
> 
> I known of only one case where the sending of an "unsupported
> capability" notification is justified: if both sides propose the
> multi-protocol capability but the proposed set of AFI/SAFIs is
> completely disjunct.
> 
> There are many implementations out there that send "unsupported
> capability" notifications when they should not. For example,
> there is one widely deployed implementation that sends an
> "unsupported capability" notification when it receives the
> four-octet-as-numbers capability. Even worse: this implementation
> sends the notification without any data (i.e. we don't know which
> is the offending capability) and often sends the notification before
> the open (i.e. we can't guess which capabilities they support by
> looking at their open).
> 
> 
> 
> 
> 
> 
> 
> 
> 
> --- Original message ---
> 
> "If a BGP speaker that supports a certain capability determines that
>    its peer doesn't support this capability, the speaker may send a
>    NOTIFICATION message to the peer, and terminate peering. The Error
>    Subcode in the message is set to Unsupported Capability. 
> The message
>    should contain the capability (capabilities) that causes 
> the speaker
>    to send the message.  The decision to send the message and 
> terminate
>    peering is local to the speaker.  Such peering should not be re-
>    established automatically"
> --RFC2842, "Capabilities Advertisement with BGP-4"
> 
> 
> This sounds like:
>     if rtrA is doing capC   and rtrB is not doing CapC,
>         then rtrA will send an UnsupCapNotifErrorSubcode=capC
> 
>     ...so if you were to send UnsupCapNotifErrorSubcode=capC
>         then you would/could/should re-open with the 
> offending capability
> 
> It seems to me that it should be:
>     if rtrA is doing capC   and rtrB is not doing CapC,
>         then ***rtrB*** will send an UnsupCapNotifErrorSubcode=capC
> 
>     ...so if you were to ***receive*** UnsupCapNotifErrorSubcode=capC
>         then you would/could/should re-open without the 
> offending capability
> 
> Thoughts?
> 
> Thank you very much for your time.
> 
> 
> _________________________________________________________________
> MSN Photos is the easiest way to share and print your photos: 
> http://photos.msn.com/support/worldwide.aspx
> 


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id KAA02499 for <idr-archive@nic.merit.edu>; Fri, 12 Jul 2002 10:39:08 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 521E7913AE; Fri, 12 Jul 2002 10:36:05 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 4CAFD913B0; Fri, 12 Jul 2002 10:36:03 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 6FC79913AF for <idr@trapdoor.merit.edu>; Fri, 12 Jul 2002 10:36:01 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 566845DEE0; Fri, 12 Jul 2002 10:36:01 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from mailgate.pit.comms.marconi.com (mailgate.pit.comms.marconi.com [169.144.68.6]) by segue.merit.edu (Postfix) with ESMTP id D15065DDA2 for <idr@merit.edu>; Fri, 12 Jul 2002 10:36:00 -0400 (EDT)
Received: from mailman.pit.comms.marconi.com (mailman.pit.comms.marconi.com [169.144.2.12]) by mailgate.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id KAA00979; Fri, 12 Jul 2002 10:35:58 -0400 (EDT)
Received: from whq-msgrtr-01.pit.comms.marconi.com (whq-msgrtr-01.pit.comms.marconi.com [169.144.2.221]) by mailman.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id KAA07907; Fri, 12 Jul 2002 10:35:59 -0400 (EDT)
Received: by whq-msgrtr-01.pit.comms.marconi.com with Internet Mail Service (5.5.2650.21) id <3W98TJ70>; Fri, 12 Jul 2002 10:35:58 -0400
Message-ID: <39469E08BD83D411A3D900204840EC55822798@vie-msgusr-01.dc.fore.com>
From: "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
To: "'Natale, Jonathan'" <JNatale@celoxnetworks.com>, "'Bgp (isp-bgp@isp-bgp.com)'" <isp-bgp@isp-bgp.com>, "'idr@merit.edu'" <idr@merit.edu>, "'isp-routing@isp-routing.com'" <isp-routing@isp-routing.com>
Subject: RE: BGP Capability Negotiation
Date: Fri, 12 Jul 2002 10:35:57 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2650.21)
Content-Type: text/plain; charset="iso-8859-1"
Sender: owner-idr@merit.edu
Precedence: bulk

Jonathan:

as the original spec says. 
> 
> "If a BGP speaker that supports a certain capability determines that
>    its peer doesn't support this capability, the speaker may send a
>    NOTIFICATION message to the peer, and terminate peering. The Error
>    Subcode in the message is set to Unsupported Capability. 
> The message
>    should contain the capability (capabilities) that causes 
> the speaker
>    to send the message.  The decision to send the message and 
> terminate
>    peering is local to the speaker.  Such peering should not be re-
>    established automatically"
> --RFC2842, "Capabilities Advertisement with BGP-4"
> 
> 

It should say, "such peering with the offending capability should not be 
re-established automatically. Thus if the offending router drops the
capability
from the OPEN message, it can retry to establish the session."

If I were a network operator, I rather have a session up with less
capabilities, 
than not have one at all and loose reachability through a perfectly good
router.
Or till the operator manually unconfigured the capability from the offending
router.

Ben



Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id KAA01456 for <idr-archive@nic.merit.edu>; Fri, 12 Jul 2002 10:11:13 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id E95C8913A9; Fri, 12 Jul 2002 10:10:45 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id B5EE5913AA; Fri, 12 Jul 2002 10:10:45 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 8628A913A9 for <idr@trapdoor.merit.edu>; Fri, 12 Jul 2002 10:10:44 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 6D73B5DE53; Fri, 12 Jul 2002 10:10:44 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from hotmail.com (f97.law4.hotmail.com [216.33.149.97]) by segue.merit.edu (Postfix) with ESMTP id 2F57B5DDA2 for <idr@merit.edu>; Fri, 12 Jul 2002 10:10:44 -0400 (EDT)
Received: from mail pickup service by hotmail.com with Microsoft SMTPSVC; Fri, 12 Jul 2002 07:10:43 -0700
Received: from 65.194.140.2 by lw4fd.law4.hotmail.msn.com with HTTP; Fri, 12 Jul 2002 14:10:43 GMT
X-Originating-IP: [65.194.140.2]
From: "Bruno Rijsman" <bwarijsman@hotmail.com>
To: idr@merit.edu
Cc: JNatale@celoxnetworks.com
Subject: RE: Re> Unsupported capability notification
Date: Fri, 12 Jul 2002 10:10:43 -0400
Mime-Version: 1.0
Content-Type: text/plain; format=flowed
Message-ID: <F97uCMEQJi9AJpn8JH600010259@hotmail.com>
X-OriginalArrivalTime: 12 Jul 2002 14:10:43.0798 (UTC) FILETIME=[E92DF760:01C229AD]
Sender: owner-idr@merit.edu
Precedence: bulk

Re> My question is:
Re> Does TXing unsupported capability=C mean:
Re> A) I want to RX open with cap. C
Re> OR
Re> B) I want to RX open without cap. C

Sending an "unsupported capability notification containing
capability C" means: "The fact that you support capability
C means I don't want to have a session with you" or to put
it differently "I want you to stop advertising capability
C to me".

There are very few situations where this is really what you want.

If I simply don't support capability C or I don't want to use
capability C on that session, I could simply refrain from advertising
capability C in my open.

The one case that I know where sending a notification is justified is
in multi-protocol negotiation. Say router A proposes multi-protocol
ipv4 unicast. Router B proposes multi-protocol ipv6 multicast. When B
receives the open from A (or vice versa) it may send an "unsupported
capability" notification because they have no address families in
common to exchange ("well if you are only interested in talking about
ipv4 I don't want to talk to you").


_________________________________________________________________
Send and receive Hotmail on your mobile device: http://mobile.msn.com



Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id JAA00793 for <idr-archive@nic.merit.edu>; Fri, 12 Jul 2002 09:53:01 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id AE4DB913A8; Fri, 12 Jul 2002 09:52:41 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 73091913A9; Fri, 12 Jul 2002 09:52:41 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 18A25913A8 for <idr@trapdoor.merit.edu>; Fri, 12 Jul 2002 09:52:40 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 036065DEE7; Fri, 12 Jul 2002 09:52:40 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from celox-ma1-ems1.celoxnetworks.com (celox-ma1-imap1.celoxnetworks.com [12.40.60.233]) by segue.merit.edu (Postfix) with ESMTP id B0B1B5DDFA for <idr@merit.edu>; Fri, 12 Jul 2002 09:52:39 -0400 (EDT)
Received: by celox-ma1-ems1.celoxnetworks.com with Internet Mail Service (5.5.2653.19) id <3T6KDV9P>; Fri, 12 Jul 2002 09:52:39 -0400
Message-ID: <1117F7D44159934FB116E36F4ABF221B020AF863@celox-ma1-ems1.celoxnetworks.com>
From: "Natale, Jonathan" <JNatale@celoxnetworks.com>
To: "'Bgp (isp-bgp@isp-bgp.com)'" <isp-bgp@isp-bgp.com>, "'idr@merit.edu'" <idr@merit.edu>, "'isp-routing@isp-routing.com'" <isp-routing@isp-routing.com>
Subject: RE: recursion flap
Date: Fri, 12 Jul 2002 09:52:35 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
Content-Type: text/plain
Sender: owner-idr@merit.edu
Precedence: bulk

Here is another twist on the same issue:

This seems to be an outgrowth of <another vendor> practice of assigning
<another vendor> point-to-point interface IP addresses with a /32 mask and
then redistributing direct/connected.  I believe the /32 for yourself is
incorrect, but:
"A third option is to <another vendor> e host addresses on both ends of a
point-to-
   point link.  This option provides the same address space savings as
   <another vendor> ing a 31-bit subnet mask, but may only be <another
vendor> ed in links <another vendor> ing PPP
   encapsulation [RFC1332].  The <another vendor> e of host addresses allows
for the
   assignment of IP addresses belonging to different networks at each
   side of the link, causing link and network management not to be
   straight forward."
    --RFC3021, "Using 31-Bit Prefixes on IPv4 Point-to-Point Links"
(PROPOSED STANDARD)

Using BGP, <another vendor> told the <a major vendor> that to get to
50.1.1.2/32 it should use the next hop of 50.1.1.2.  A host route with a
nexthop equal to itself is not mentioned in the bgp RFC nor in the draft.
At least I did not find it [yet], but I did not really look that hard
because it is not that important.  Intuitively, it should not be sent and if
received it should be ignored.  In this regard it IS a <a major vendor> bug
(but that may not help--<another vendor> should not advertise such a route,
and/or <another vendor> e the "work-around" I propose below).


RFC:
    ...blah blah blah...
    "Mutually recursive routes (routes
    resolving each other or themselves), also fail the resolvability
    check"
    ...blah blah blah...

--draft-ietf-idr-bgp4-17.txt

I suspect that, if you watch it, you will see the bgp 50.1.1.2/32 route get
removed due to the recursiveness.  Then it will get re-installed based on
the 1.1.0/24 directly connected route.  This will repeat based on the
["nettable walker" or "scan" ???] <a major vendor> timer.  The interval
(half the period) is 60 seconds and is not configurable (again, not in any
RFCs). 

Also:
    ...blah blah blah...
    "implementations must
    take care that before any packets are forwarded along a BGP route,
    its associated NEXT_HOP address is resolved to the immediate
    (directly connected) next-hop address and this address"
    ...blah blah blah...

--draft-ietf-idr-bgp4-17.txt

And at one angle, the 50.1.1.2 address cannot be directly connected to
anybody but <another vendor>  since it has a /32 mask and <another vendor>
are its 1 and only 1 host.  So <another vendor> should not send it since it
will never be resolved.  There is some mention of the 32  I suppose it would
be ok to send the bgp 50.1.1.2/32 route to peers that are not on the same
point-to-point link, since in that case the next hop and prefix would not be
recursive; but then <another vendor> might break
the: "care must be taken to ensure a consistent view of routing within the
AS." (RFC1771) rule.  My vote is against allowing the bgp 50.1.1.2/32 route
to be sent to peers that are not on the same point-to-point link (or any
other link).  Also, I think maybe bgp should try a next longest bit match to
resolve the next hop.

The Work around is to add:
ip route 50.1.1.2 255.255.255.255 atm 4/0.1 on <major vendor>

Thoughts?

-----Original Message-----
From: Natale, Jonathan 
Sent: Thursday, July 11, 2002 7:47 PM
To: Bgp (isp-bgp@isp-bgp.com); idr@merit.edu; isp-routing@isp-routing.com
Subject: recursion flap

if the peer is reachable via OPSF and EBGP then the route tab will have the
EBGP route (since the admin dist of EBGP is 20; OSPF is 110) so it will be
flagged as a recursion error and removed.  Then, when a 1 minute timer
triggers, the BGP route will be removed and the OPSF route will be installed
due to the recursion.  Then, when the 1 minute timer triggers again, the
OPSF route will be removed and the BGP route will be installed due to the
lower admin dist. ...Etc... (2 minute period)  Not sure how they resolve
this, how <another vendor> should, and/or how "real" this problem is

???:
timer in rfc?
A "persistent flap"?
Notification/inform?
Juniper?


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id JAA00744 for <idr-archive@nic.merit.edu>; Fri, 12 Jul 2002 09:51:29 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id E4D3E913A7; Fri, 12 Jul 2002 09:50:51 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id AB452913A8; Fri, 12 Jul 2002 09:50:51 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 2C60E913A7 for <idr@trapdoor.merit.edu>; Fri, 12 Jul 2002 09:50:50 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 12EB55DEF2; Fri, 12 Jul 2002 09:50:50 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from celox-ma1-ems1.celoxnetworks.com (celox-ma1-imap1.celoxnetworks.com [12.40.60.233]) by segue.merit.edu (Postfix) with ESMTP id CCE305DEE7 for <idr@merit.edu>; Fri, 12 Jul 2002 09:50:49 -0400 (EDT)
Received: by celox-ma1-ems1.celoxnetworks.com with Internet Mail Service (5.5.2653.19) id <3T6KDV9J>; Fri, 12 Jul 2002 09:50:49 -0400
Message-ID: <1117F7D44159934FB116E36F4ABF221B020AF862@celox-ma1-ems1.celoxnetworks.com>
From: "Natale, Jonathan" <JNatale@celoxnetworks.com>
To: "'Bruno Rijsman'" <bwarijsman@hotmail.com>, idr@merit.edu
Cc: "Natale, Jonathan" <JNatale@celoxnetworks.com>
Subject: RE: Re> Unsupported capability notification
Date: Fri, 12 Jul 2002 09:50:43 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
Content-Type: text/plain
Sender: owner-idr@merit.edu
Precedence: bulk

Thanks.

My question is:
Does TXing unsupported capability=C mean:
A) I want to RX open with cap. C
OR
B) I want to RX open without cap. C

Before you answer B), which seems to be the obvious answer, please read:
""If a BGP speaker that supports a certain capability determines that
   its peer doesn't support this capability, the speaker MAY send a
   NOTIFICATION message to the peer, and terminate peering (see Section
   "Extensions to Error Handling" for more details). The Error Subcode
   in the message is set to Unsupported Capability. The message SHOULD
   contain the capability (capabilities) that causes the speaker to send
   the message.  The decision to send the message and terminate peering
   is local to the speaker. If terminated, such peering SHOULD NOT be
   re-established automatically." -- draft-ietf-idr-rfc2842bis-02.txt

-----Original Message-----
From: Bruno Rijsman [mailto:bwarijsman@hotmail.com] 
Sent: Friday, July 12, 2002 9:38 AM
To: idr@merit.edu
Cc: JNatale@celoxnetworks.com
Subject: Re> Unsupported capability notification

The usage of the unsupported capability notification is widely
misunderstood and this is causing real problems in real networks.

If you receive a capability code X which you do not recognize,
you MUST NOT send back an unsupported capability notification.
It is sufficient for you to simply not advertise capability X
and as a result the peer will known you do not support capability X.

If you receive a capability code Y which you do recognize but
for some reason you do not want to use capability Y on that session
you SHOULD NOT send back an unsupported capability notification. It is
sufficient for you to simply not advertise capability Y and as a result
the peer will knwon you do not support capability Y.

I known of only one case where the sending of an "unsupported
capability" notification is justified: if both sides propose the
multi-protocol capability but the proposed set of AFI/SAFIs is
completely disjunct.

There are many implementations out there that send "unsupported
capability" notifications when they should not. For example,
there is one widely deployed implementation that sends an
"unsupported capability" notification when it receives the
four-octet-as-numbers capability. Even worse: this implementation
sends the notification without any data (i.e. we don't know which
is the offending capability) and often sends the notification before
the open (i.e. we can't guess which capabilities they support by
looking at their open).









--- Original message ---

"If a BGP speaker that supports a certain capability determines that
   its peer doesn't support this capability, the speaker may send a
   NOTIFICATION message to the peer, and terminate peering. The Error
   Subcode in the message is set to Unsupported Capability. The message
   should contain the capability (capabilities) that causes the speaker
   to send the message.  The decision to send the message and terminate
   peering is local to the speaker.  Such peering should not be re-
   established automatically"
--RFC2842, "Capabilities Advertisement with BGP-4"


This sounds like:
    if rtrA is doing capC   and rtrB is not doing CapC,
        then rtrA will send an UnsupCapNotifErrorSubcode=capC

    ...so if you were to send UnsupCapNotifErrorSubcode=capC
        then you would/could/should re-open with the offending capability

It seems to me that it should be:
    if rtrA is doing capC   and rtrB is not doing CapC,
        then ***rtrB*** will send an UnsupCapNotifErrorSubcode=capC

    ...so if you were to ***receive*** UnsupCapNotifErrorSubcode=capC
        then you would/could/should re-open without the offending capability

Thoughts?

Thank you very much for your time.


_________________________________________________________________
MSN Photos is the easiest way to share and print your photos: 
http://photos.msn.com/support/worldwide.aspx


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id JAA00302 for <idr-archive@nic.merit.edu>; Fri, 12 Jul 2002 09:38:15 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id BFB15913A6; Fri, 12 Jul 2002 09:37:52 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 9092A913A7; Fri, 12 Jul 2002 09:37:52 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 58B6F913A6 for <idr@trapdoor.merit.edu>; Fri, 12 Jul 2002 09:37:51 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 420EB5DEF2; Fri, 12 Jul 2002 09:37:51 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from hotmail.com (f113.law4.hotmail.com [216.33.149.113]) by segue.merit.edu (Postfix) with ESMTP id E6F855DEF0 for <idr@merit.edu>; Fri, 12 Jul 2002 09:37:50 -0400 (EDT)
Received: from mail pickup service by hotmail.com with Microsoft SMTPSVC; Fri, 12 Jul 2002 06:37:50 -0700
Received: from 65.194.140.2 by lw4fd.law4.hotmail.msn.com with HTTP; Fri, 12 Jul 2002 13:37:50 GMT
X-Originating-IP: [65.194.140.2]
From: "Bruno Rijsman" <bwarijsman@hotmail.com>
To: idr@merit.edu
Cc: JNatale@celoxnetworks.com
Subject: Re> Unsupported capability notification
Date: Fri, 12 Jul 2002 09:37:50 -0400
Mime-Version: 1.0
Content-Type: text/plain; format=flowed
Message-ID: <F113AqyYkbQZAVGRelN000102c4@hotmail.com>
X-OriginalArrivalTime: 12 Jul 2002 13:37:50.0539 (UTC) FILETIME=[510689B0:01C229A9]
Sender: owner-idr@merit.edu
Precedence: bulk

The usage of the unsupported capability notification is widely
misunderstood and this is causing real problems in real networks.

If you receive a capability code X which you do not recognize,
you MUST NOT send back an unsupported capability notification.
It is sufficient for you to simply not advertise capability X
and as a result the peer will known you do not support capability X.

If you receive a capability code Y which you do recognize but
for some reason you do not want to use capability Y on that session
you SHOULD NOT send back an unsupported capability notification. It is
sufficient for you to simply not advertise capability Y and as a result
the peer will knwon you do not support capability Y.

I known of only one case where the sending of an "unsupported
capability" notification is justified: if both sides propose the
multi-protocol capability but the proposed set of AFI/SAFIs is
completely disjunct.

There are many implementations out there that send "unsupported
capability" notifications when they should not. For example,
there is one widely deployed implementation that sends an
"unsupported capability" notification when it receives the
four-octet-as-numbers capability. Even worse: this implementation
sends the notification without any data (i.e. we don't know which
is the offending capability) and often sends the notification before
the open (i.e. we can't guess which capabilities they support by
looking at their open).









--- Original message ---

"If a BGP speaker that supports a certain capability determines that
   its peer doesn't support this capability, the speaker may send a
   NOTIFICATION message to the peer, and terminate peering. The Error
   Subcode in the message is set to Unsupported Capability. The message
   should contain the capability (capabilities) that causes the speaker
   to send the message.  The decision to send the message and terminate
   peering is local to the speaker.  Such peering should not be re-
   established automatically"
--RFC2842, "Capabilities Advertisement with BGP-4"


This sounds like:
    if rtrA is doing capC   and rtrB is not doing CapC,
        then rtrA will send an UnsupCapNotifErrorSubcode=capC

    ...so if you were to send UnsupCapNotifErrorSubcode=capC
        then you would/could/should re-open with the offending capability

It seems to me that it should be:
    if rtrA is doing capC   and rtrB is not doing CapC,
        then ***rtrB*** will send an UnsupCapNotifErrorSubcode=capC

    ...so if you were to ***receive*** UnsupCapNotifErrorSubcode=capC
        then you would/could/should re-open without the offending capability

Thoughts?

Thank you very much for your time.


_________________________________________________________________
MSN Photos is the easiest way to share and print your photos: 
http://photos.msn.com/support/worldwide.aspx



Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id JAA29697 for <idr-archive@nic.merit.edu>; Fri, 12 Jul 2002 09:19:26 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 5CC74913A4; Fri, 12 Jul 2002 09:19:00 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 27899913A5; Fri, 12 Jul 2002 09:19:00 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 148AF913A4 for <idr@trapdoor.merit.edu>; Fri, 12 Jul 2002 09:18:59 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id F247F5DEEE; Fri, 12 Jul 2002 09:18:58 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from tenornetworks.com (rtu.tenornetworks.com [63.77.213.2]) by segue.merit.edu (Postfix) with ESMTP id 7F9DE5DDFA for <idr@merit.edu>; Fri, 12 Jul 2002 09:18:58 -0400 (EDT)
Received: from tenornet.com (newman [192.168.0.185]) by tenornetworks.com (Switch-2.1.3/Switch-2.1.0) with ESMTP id g6CDIud05004; Fri, 12 Jul 2002 09:18:56 -0400 (EDT)
Received: from 192.168.0.185 by tenornet.com; FRI, 12 Jul 2002 09:14:17 -0700
Received: by newman.tenornet.com with Internet Mail Service (5.5.2650.21) id <N1HXZZ7K>; Fri, 12 Jul 2002 09:14:16 -0400
Message-ID: <6B190B34070BD411ACA000B0D0214E560165B814@newman.tenornet.com>
From: "Tsillas, Jim" <jtsillas@tenornetworks.com>
To: "'Natale, Jonathan'" <JNatale@celoxnetworks.com>, "'Bgp (isp-bgp@isp-bgp.com)'" <isp-bgp@isp-bgp.com>, "'idr@merit.edu'" <idr@merit.edu>, "'isp-routing@isp-routing.com'" <isp-routing@isp-routing.com>
Subject: RE: BGP Capability Negotiation
Date: Fri, 12 Jul 2002 09:14:15 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2650.21)
Content-Type: text/plain; charset="iso-8859-1"
Sender: owner-idr@merit.edu
Precedence: bulk

Jon,

I think the draft is OK on this point. An example:
rtrA supports a route-refresh capability and tells rtrB.
rtrB doesn't support this capability. rtrB will not
notify rtrA because it doesn't care to use rtrA's advertised
capability (it ignores the capability). rtrA may notify
rtrB because it may require that rtrB support route-refresh
too. On the other hand, rtrA may not care to notify rtrB
since the capability may be "one way": in the case of
route-refresh rtrB doesn't need to support the capability
in order to use rtrA's refresh capability. In the case
of an MP-BGP capability, rtrA will probably want to
notify rtrB since the capability will probably need to
be bidirectional (what is the common practice in this case??)

-Jim.

-----Original Message-----
From: Natale, Jonathan [mailto:JNatale@celoxnetworks.com]
Sent: Friday, July 12, 2002 9:05 AM
To: 'Bgp (isp-bgp@isp-bgp.com)'; 'idr@merit.edu';
'isp-routing@isp-routing.com'
Subject: BGP Capability Negotiation


"If a BGP speaker that supports a certain capability determines that
   its peer doesn't support this capability, the speaker may send a
   NOTIFICATION message to the peer, and terminate peering. The Error
   Subcode in the message is set to Unsupported Capability. The message
   should contain the capability (capabilities) that causes the speaker
   to send the message.  The decision to send the message and terminate
   peering is local to the speaker.  Such peering should not be re-
   established automatically"
--RFC2842, "Capabilities Advertisement with BGP-4"


This sounds like:
    if rtrA is doing capC   and rtrB is not doing CapC,
        then rtrA will send an UnsupCapNotifErrorSubcode=capC

    ...so if you were to send UnsupCapNotifErrorSubcode=capC
        then you would/could/should re-open with the offending capability

It seems to me that it should be:
    if rtrA is doing capC   and rtrB is not doing CapC,
        then ***rtrB*** will send an UnsupCapNotifErrorSubcode=capC

    ...so if you were to ***receive*** UnsupCapNotifErrorSubcode=capC
        then you would/could/should re-open without the offending capability

Thoughts?

Thank you very much for your time.


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id JAA29238 for <idr-archive@nic.merit.edu>; Fri, 12 Jul 2002 09:05:37 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 75C1D913A3; Fri, 12 Jul 2002 09:05:10 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 44A38913A4; Fri, 12 Jul 2002 09:05:10 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 16E50913A3 for <idr@trapdoor.merit.edu>; Fri, 12 Jul 2002 09:05:09 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 06A4A5DEE1; Fri, 12 Jul 2002 09:05:09 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from celox-ma1-ems1.celoxnetworks.com (celox-ma1-imap1.celoxnetworks.com [12.40.60.233]) by segue.merit.edu (Postfix) with ESMTP id C266E5DDFA for <idr@merit.edu>; Fri, 12 Jul 2002 09:05:08 -0400 (EDT)
Received: by celox-ma1-ems1.celoxnetworks.com with Internet Mail Service (5.5.2653.19) id <3T6KDV51>; Fri, 12 Jul 2002 09:05:08 -0400
Message-ID: <1117F7D44159934FB116E36F4ABF221B020AF861@celox-ma1-ems1.celoxnetworks.com>
From: "Natale, Jonathan" <JNatale@celoxnetworks.com>
To: "'Bgp (isp-bgp@isp-bgp.com)'" <isp-bgp@isp-bgp.com>, "'idr@merit.edu'" <idr@merit.edu>, "'isp-routing@isp-routing.com'" <isp-routing@isp-routing.com>
Subject: BGP Capability Negotiation
Date: Fri, 12 Jul 2002 09:05:06 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
Content-Type: text/plain
Sender: owner-idr@merit.edu
Precedence: bulk

"If a BGP speaker that supports a certain capability determines that
   its peer doesn't support this capability, the speaker may send a
   NOTIFICATION message to the peer, and terminate peering. The Error
   Subcode in the message is set to Unsupported Capability. The message
   should contain the capability (capabilities) that causes the speaker
   to send the message.  The decision to send the message and terminate
   peering is local to the speaker.  Such peering should not be re-
   established automatically"
--RFC2842, "Capabilities Advertisement with BGP-4"


This sounds like:
    if rtrA is doing capC   and rtrB is not doing CapC,
        then rtrA will send an UnsupCapNotifErrorSubcode=capC

    ...so if you were to send UnsupCapNotifErrorSubcode=capC
        then you would/could/should re-open with the offending capability

It seems to me that it should be:
    if rtrA is doing capC   and rtrB is not doing CapC,
        then ***rtrB*** will send an UnsupCapNotifErrorSubcode=capC

    ...so if you were to ***receive*** UnsupCapNotifErrorSubcode=capC
        then you would/could/should re-open without the offending capability

Thoughts?

Thank you very much for your time.


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id TAA04217 for <idr-archive@nic.merit.edu>; Thu, 11 Jul 2002 19:47:56 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 4161B9121E; Thu, 11 Jul 2002 19:47:37 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 0D3E291388; Thu, 11 Jul 2002 19:47:36 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id C9B939121E for <idr@trapdoor.merit.edu>; Thu, 11 Jul 2002 19:47:30 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id B039A5DE1B; Thu, 11 Jul 2002 19:47:30 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from celox-ma1-ems1.celoxnetworks.com (celox-ma1-imap1.celoxnetworks.com [12.40.60.233]) by segue.merit.edu (Postfix) with ESMTP id 7B23C5DDAB for <idr@merit.edu>; Thu, 11 Jul 2002 19:47:30 -0400 (EDT)
Received: by celox-ma1-ems1.celoxnetworks.com with Internet Mail Service (5.5.2653.19) id <3T6KD4ZL>; Thu, 11 Jul 2002 19:47:29 -0400
Message-ID: <1117F7D44159934FB116E36F4ABF221B020AF85C@celox-ma1-ems1.celoxnetworks.com>
From: "Natale, Jonathan" <JNatale@celoxnetworks.com>
To: "Bgp (isp-bgp@isp-bgp.com)" <isp-bgp@isp-bgp.com>, idr@merit.edu, isp-routing@isp-routing.com
Subject: recursion flap
Date: Thu, 11 Jul 2002 19:47:28 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
Content-Type: text/plain
Sender: owner-idr@merit.edu
Precedence: bulk

if the peer is reachable via OPSF and EBGP then the route tab will have the
EBGP route (since the admin dist of EBGP is 20; OSPF is 110) so it will be
flagged as a recursion error and removed.  Then, when a 1 minute timer
triggers, the BGP route will be removed and the OPSF route will be installed
due to the recursion.  Then, when the 1 minute timer triggers again, the
OPSF route will be removed and the BGP route will be installed due to the
lower admin dist. ...Etc... (2 minute period)  Not sure how they resolve
this, how we should, and/or how "real" this problem is

???:
timer in rfc?
A "persistent flap"?
Notification/inform?
Juniper?


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id QAA28462 for <idr-archive@nic.merit.edu>; Thu, 11 Jul 2002 16:53:58 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id A9C579137E; Thu, 11 Jul 2002 16:53:33 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 74B9D9137F; Thu, 11 Jul 2002 16:53:33 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 778AE9137E for <idr@trapdoor.merit.edu>; Thu, 11 Jul 2002 16:53:32 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 5C50F5DE58; Thu, 11 Jul 2002 16:53:32 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from mailgate.pit.comms.marconi.com (mailgate.pit.comms.marconi.com [169.144.68.6]) by segue.merit.edu (Postfix) with ESMTP id D267A5DE3E for <idr@merit.edu>; Thu, 11 Jul 2002 16:53:31 -0400 (EDT)
Received: from mailman.pit.comms.marconi.com (mailman.pit.comms.marconi.com [169.144.2.12]) by mailgate.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id QAA28766; Thu, 11 Jul 2002 16:53:29 -0400 (EDT)
Received: from whq-msgrtr-01.pit.comms.marconi.com (whq-msgrtr-01.pit.comms.marconi.com [169.144.2.221]) by mailman.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id QAA23911; Thu, 11 Jul 2002 16:53:29 -0400 (EDT)
Received: by whq-msgrtr-01.pit.comms.marconi.com with Internet Mail Service (5.5.2650.21) id <3W98SSPP>; Thu, 11 Jul 2002 16:53:28 -0400
Message-ID: <39469E08BD83D411A3D900204840EC55822797@vie-msgusr-01.dc.fore.com>
From: "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
To: "'Tsillas, Jim'" <jtsillas@tenornetworks.com>
Cc: "'idr@merit.edu'" <idr@merit.edu>
Subject: RE: graceful restart TCP reset discovery (was Re>Are dynamic capa bili ties really necessary ...)
Date: Thu, 11 Jul 2002 16:53:20 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2650.21)
Content-Type: text/plain; charset="iso-8859-1"
Sender: owner-idr@merit.edu
Precedence: bulk

Jim:
  Understood, you make good points.

Regards,
Ben

> -----Original Message-----
> From: Tsillas, Jim [mailto:jtsillas@tenornetworks.com]
> Sent: Thursday, July 11, 2002 3:21 PM
> To: 'Abarbanel, Benjamin'
> Cc: 'idr@merit.edu'
> Subject: graceful restart TCP reset discovery (was Re>Are dynamic
> capabili ties really necessary ...)
> 
> 
> Ben,
> 
> You are correct. I was thinking about the case where TCP is
> reset due to "fast external fallover". In this case, the
> forwarding plane is down and if it stays down for more than
> the graceful restart timeout, the receiving speaker resorts to
> normal restart.
> 
> Sections 6.1 and 6.2 go into good detail on this. The length
> of time it takes to determine if TCP has reset is implementation
> dependent. One way to detect TCP reset is to look for a new
> session for the same peer (since the neighbor will try to
> intiate a new session when it restarts). Another way is to rely
> on "fast-external-fallover" being turned on. Otherwise you just
> have to wait for either the keepalive expiration or the TCP timeout.
> 
> There is also an obscure case with some (probably most) TCPs where
> the restarting speaker has not yet attempted to open the session but
> the receiving speaker attempts to send data on a non-existing session.
> This causes the restarting speaker to send a FIN back. There are
> probably other equally obscure cases which are implementation 
> dependent.
> 
> Once TCP reset is detected, we start the graceful restart timeout
> period. This means that in some cases, the total timeout can
> also include the amount of time it took to discover the TCP
> session has been reset. The draft makes no distinction as far
> as how the session was reset, as far as timeout is concerned.
> 
> -Jim.
> 
> -----Original Message-----
> From: Abarbanel, Benjamin [mailto:Benjamin.Abarbanel@marconi.com]
> Sent: Thursday, July 11, 2002 2:43 PM
> To: 'Tsillas, Jim'; Abarbanel, Benjamin; 'Yakov Rekhter'; Srihari R.
> Sangli
> Cc: 'idr@merit.edu'
> Subject: RE: Are dynamic capabilities really necessary if 
> peers support
> gr acef ul restart? 
> 
> 
> 
> 
> > graceful restart can also be useful when the control plane
> > is implemented separately from the forwarding plane. When
> > this is true, the control place can reset (or fail-over)
> > while the forwarding plane is unaffected. The reset, can
> > be made transparent to the forwarding plane. Graceful
> > restart can also serve to limit the scope of the reset to
> > just the given speaker and its neighbors. In other words,
> > the reset state doesn't propagate through the network.
> >
> 
> Thanks for the clarification. It would be nice if applicable uses for
> gracefull restart were added to the draft. 
> 
> > The case which you point out (where TCP is broken) is actually
> > not covered since a broken TCP session implies a broken
> > forwarding plane.
> > 
> 
> I disagree, this is taken out of draft-ietf-idr-restart-05.txt, 
> Abstract section 2.
>                                     
>  "Finally,procedures are outlined for temporarily retaining routing
> information
>   across a TCP transport reset." 
> 
> I assume it means we hold routing information after the 
> forwarding plane
> is broken (BGP peering session dropped), right?
> 
> 
> > -Jim.
> > 
> > -----Original Message-----
> > From: Abarbanel, Benjamin [mailto:Benjamin.Abarbanel@marconi.com]
> > Sent: Thursday, July 11, 2002 1:05 PM
> > To: 'Yakov Rekhter'; Srihari R. Sangli
> > Cc: Tsillas, Jim; 'idr@merit.edu'
> > Subject: RE: Are dynamic capabilities really necessary if 
> > peers support
> > gr acef ul restart? 
> > 
> > 
> > Yakov:
> >   The only time I think gracefull restart is usefull is if 
> > two peers are
> > operating
> > fine, but the TCP connection is broken by the network. In the 
> > case that
> > either peer drops the session or either peer reboot/crash, I 
> > dont see much
> > value in gracefull restart.
> > 
> > Did I miss something?
> > 
> > Ben
> > 
> > > -----Original Message-----
> > > From: Yakov Rekhter [mailto:yakov@juniper.net]
> > > Sent: Thursday, July 11, 2002 12:55 PM
> > > To: Srihari R. Sangli
> > > Cc: Tsillas, Jim; 'idr@merit.edu'
> > > Subject: Re: Are dynamic capabilities really necessary if 
> > > peers support
> > > gracef ul restart? 
> > > 
> > > 
> > > Srihari,
> > > 
> > > > > It seems we are entering major discussion on dynamic
> > > > > capabilities and the can of worms is open. Can anyone
> > > > > think of a scenario where a dynamic capabilities
> > > > > negotiation is still needed if we have graceful restart?
> > > > 
> > > > Talking about address-families exchanged between BGP 
> speakers, the
> > > > dynamic capability is useful for enabling a new 
> address-family or
> > > > disabling a address-family w/o disrupting the session 
> and/or other
> > > > address-families. It is also useful for turning ON/OFF 
> > > other capabilities
> > > > (like ORF, etc)  without much disruption. 
> > > 
> > > Just to clarify, in the presence of graceful restart the 
> disruption
> > > you refer to above is just the disruption in the routing peering
> > > between the two speakers. There is no disruption in the 
> > > data/forwarding
> > > plane and no disruption in any other routing peering. Correct ?
> > > 
> > > Yakov.
> > > 
> > 
> 


Received: from trapdoor.merit.edu (trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id QAA27941 for <idr-archive@nic.merit.edu>; Thu, 11 Jul 2002 16:38:19 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id E87E391389; Thu, 11 Jul 2002 16:35:56 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id AB5369137E; Thu, 11 Jul 2002 16:35:56 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 094A191389 for <idr@trapdoor.merit.edu>; Thu, 11 Jul 2002 16:35:52 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id E18F15DDAD; Thu, 11 Jul 2002 16:35:51 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from dmz2.procket.com (dmz2.procket.com [65.174.124.37]) by segue.merit.edu (Postfix) with ESMTP id 996EF5DDED for <idr@merit.edu>; Thu, 11 Jul 2002 16:35:51 -0400 (EDT)
Received: from miata.procket.com (zeus-d-1.procket.com [65.174.124.60]) by dmz2.procket.com (Postfix) with ESMTP id A7F693442D; Thu, 11 Jul 2002 13:42:12 -0700 (PDT)
Received: from localhost (srihari@localhost) by miata.procket.com (8.12.1/8.12.1) with ESMTP id g6BKZpp5015270; Thu, 11 Jul 2002 13:35:51 -0700 (PDT)
Date: Thu, 11 Jul 2002 13:35:50 -0700 (PDT)
From: "Srihari R. Sangli" <srihari@procket.com>
To: "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
Cc: "'Tsillas, Jim'" <jtsillas@tenornetworks.com>, "'idr@merit.edu'" <idr@merit.edu>, "'enke@redback.com'" <enke@redback.com>
Subject: RE: Re>Are dynamic capabilities really necessary if peers support  gracef ul restart?
In-Reply-To: <39469E08BD83D411A3D900204840EC55822794@vie-msgusr-01.dc.fore.com>
Message-ID: <Pine.GSO.4.40.0207111335060.192-100000@miata.procket.com>
MIME-Version: 1.0
Content-Type: TEXT/PLAIN; charset=US-ASCII
Sender: owner-idr@merit.edu
Precedence: bulk

Ben,

> Srihari:
>   I looked over the draft again and I have to apologize I misunderstood you
> responses below. I would recommend rewording the two paragraph so that
> others
> do not get confused like me.

Ok. Will be added in the next revision. Thanks for the feedback.

srihari...



Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id QAA27768 for <idr-archive@nic.merit.edu>; Thu, 11 Jul 2002 16:32:50 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 280979137B; Thu, 11 Jul 2002 16:32:28 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id E32069137D; Thu, 11 Jul 2002 16:32:27 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id B1CDA9137B for <idr@trapdoor.merit.edu>; Thu, 11 Jul 2002 16:32:15 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 99E8F5DDED; Thu, 11 Jul 2002 16:32:15 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from dmz2.procket.com (dmz2.procket.com [65.174.124.37]) by segue.merit.edu (Postfix) with ESMTP id 7163E5DDAD for <idr@merit.edu>; Thu, 11 Jul 2002 16:32:15 -0400 (EDT)
Received: from miata.procket.com (zeus-d-1.procket.com [65.174.124.60]) by dmz2.procket.com (Postfix) with ESMTP id 9903A3442B; Thu, 11 Jul 2002 13:38:36 -0700 (PDT)
Received: from localhost (srihari@localhost) by miata.procket.com (8.12.1/8.12.1) with ESMTP id g6BKWECO014881; Thu, 11 Jul 2002 13:32:14 -0700 (PDT)
Date: Thu, 11 Jul 2002 13:32:14 -0700 (PDT)
From: "Srihari R. Sangli" <srihari@procket.com>
To: john heasley <heas@shrubbery.net>
Cc: Stephen Waters <Stephen.Waters@riverstonenet.com>, "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>, <idr@merit.edu>
Subject: Re: Re>Are dynamic capabilities really necessary if peers support gracef ul restart?
In-Reply-To: <20020711114409.P16814@shrubbery.net>
Message-ID: <Pine.GSO.4.40.0207111329180.192-100000@miata.procket.com>
MIME-Version: 1.0
Content-Type: TEXT/PLAIN; charset=US-ASCII
Sender: owner-idr@merit.edu
Precedence: bulk

Hi John,

> >
> > > A network manager wanting to change the state or extent of a
> > > capability would appreciate doing that within a bgp session, wouldn't
> > > they?
> > >
> >
> > The network manager can turn off other features (or capabilities) and thus
> > communicate such an event to the remote peers via this message. To keep it
> > simple, the draft does not allow turning OFF "Dynamic-capability" via the
> > capability message.
> >
> > srihari...
> > p.s. If I've misinterpreted your message, I apologise.
>
> these functions run chills up my spine.  i fear not being able to disable
> various capabilities and auto-negotiation when faced with a business policy
> or buggy or abusive peer/customer (or bugs in my own equipment, requiring
> s/w upgrades - requiring a maintenance window to be scheduled).
>
> so, not a complaint, but a comment on design and to vendors from an
> operator perspective:
>
> 	"it must be possible to turn such options off and force
> 	 values/capabilities that will be required."
>
> eg: if i do not want to accept filters from a peer, then i ought to be able
> disable that functionality.  or, i want nlri unicast and multicast, then i
> ought to be able to force that negotiation, else the session is reset or the
> dynamic re-negotiation is denied.
>
> the more flexible the configuration knobs the better, imiho.

Agreed. What you are suggesting is still implementable given the current
state of the draft.

srihari...



Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id PAA26369 for <idr-archive@nic.merit.edu>; Thu, 11 Jul 2002 15:51:17 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id EA19F91260; Thu, 11 Jul 2002 15:48:58 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id AD9649137A; Thu, 11 Jul 2002 15:48:58 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id DC9D991260 for <idr@trapdoor.merit.edu>; Thu, 11 Jul 2002 15:48:53 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id C970B5DE2F; Thu, 11 Jul 2002 15:48:53 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from mailgate.pit.comms.marconi.com (mailgate.pit.comms.marconi.com [169.144.68.6]) by segue.merit.edu (Postfix) with ESMTP id E2F295DDC1 for <idr@merit.edu>; Thu, 11 Jul 2002 15:48:52 -0400 (EDT)
Received: from mailman.pit.comms.marconi.com (mailman.pit.comms.marconi.com [169.144.2.12]) by mailgate.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id PAA23936; Thu, 11 Jul 2002 15:48:49 -0400 (EDT)
Received: from whq-msgrtr-01.pit.comms.marconi.com (whq-msgrtr-01.pit.comms.marconi.com [169.144.2.221]) by mailman.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id PAA11499; Thu, 11 Jul 2002 15:48:50 -0400 (EDT)
Received: by whq-msgrtr-01.pit.comms.marconi.com with Internet Mail Service (5.5.2650.21) id <3W98S38W>; Thu, 11 Jul 2002 15:48:49 -0400
Message-ID: <39469E08BD83D411A3D900204840EC55822795@vie-msgusr-01.dc.fore.com>
From: "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
To: "'Mathew Richardson'" <mrr@nexthop.com>, "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
Cc: "'Srihari R. Sangli'" <srihari@procket.com>, "'Tsillas, Jim'" <jtsillas@tenornetworks.com>, "'idr@merit.edu'" <idr@merit.edu>, "'enke@redback.com'" <enke@redback.com>
Subject: RE: Re>Are dynamic capabilities really necessary if peers support gracef ul restart?
Date: Thu, 11 Jul 2002 15:48:46 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2650.21)
Content-Type: text/plain; charset="iso-8859-1"
Sender: owner-idr@merit.edu
Precedence: bulk

Hi Mathew:
 Did not mean to sound like I am sniping. My second mail 
to srihari should cover it all.

Regards,
Ben

> -----Original Message-----
> From: Mathew Richardson [mailto:mrr@nexthop.com]
> Sent: Thursday, July 11, 2002 3:35 PM
> To: Abarbanel, Benjamin
> Cc: 'Srihari R. Sangli'; 'Tsillas, Jim'; 'idr@merit.edu';
> 'enke@redback.com'
> Subject: Re: Re>Are dynamic capabilities really necessary if peers
> support gracef ul restart?
> 
> 
> > Abarbanel, Benjamin <Benjamin.Abarbanel@Marconi.com> [Thu, 
> Jul 11, 2002 at 02:52:19PM -0400]:
> >> > I have a question on the current 
> >> draft-ietf-idr-dynamic-cap-02.txt, section
> >> > 7 - Operation, 3rd and 4th paragraph:
> >> >
> >> >     "The Dynamic Capability itself can not be revised 
> >> dynamically via a
> 
> May I suggest that this sentence be changed to read:
> 
> "The Dynamic Capability Capability can not be revised
> dynamically via a ..."
> 
> ?
> 
> Would this change address your concerns, Mr. Abarbanel?
> 
> >> >      CAPABILITY message. The lifetime of the Dynamic 
> >> Capability is the
> >> >      duration of the BGP session in which the capability is 
> >> advertised.
> >> 
> >> This is referring to the "Dynamic-Capability" itself.  
> This capability
> >> can ONLY appear in the OPEN message.
> >> 
> >
> >I thought this spec was talking about dynamic capabilities 
> not static 
> >ones, so why are we assuming its the one in the OPEN message?
> 
> I don't think that was an assumption... it was an assertion.  He's
> saying that the Capability that advertises the ability to perform
> Dynamic-Capability negotiations MUST NOT be negotiated dynamically;
> it is only legal to advertise the Dynamic-Capability Capability in
> OPEN messages.
> 
> >What purpose does it serve to talk about static capabilities in this
> >draft, other than confuse the reader?
> 
> This is the draft in which the Dynamic-Capability Capability is
> introduced.  Hence, it is the logical place to talk about the
> limitations of the Dynamic-Capability Capability.
> 
> <snip>
> 
> mrr
> 


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id PAA26193 for <idr-archive@nic.merit.edu>; Thu, 11 Jul 2002 15:45:31 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id F3AE491375; Thu, 11 Jul 2002 15:43:15 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id BD88C91379; Thu, 11 Jul 2002 15:43:14 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id EDBBF91375 for <idr@trapdoor.merit.edu>; Thu, 11 Jul 2002 15:43:10 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id DD1135DE4C; Thu, 11 Jul 2002 15:43:10 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from mailgate.pit.comms.marconi.com (mailgate.pit.comms.marconi.com [169.144.68.6]) by segue.merit.edu (Postfix) with ESMTP id 965115DE47 for <idr@merit.edu>; Thu, 11 Jul 2002 15:43:10 -0400 (EDT)
Received: from mailman.pit.comms.marconi.com (mailman.pit.comms.marconi.com [169.144.2.12]) by mailgate.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id PAA23463; Thu, 11 Jul 2002 15:43:07 -0400 (EDT)
Received: from whq-msgrtr-01.pit.comms.marconi.com (whq-msgrtr-01.pit.comms.marconi.com [169.144.2.221]) by mailman.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id PAA09689; Thu, 11 Jul 2002 15:43:08 -0400 (EDT)
Received: by whq-msgrtr-01.pit.comms.marconi.com with Internet Mail Service (5.5.2650.21) id <3W98S3L7>; Thu, 11 Jul 2002 15:43:08 -0400
Message-ID: <39469E08BD83D411A3D900204840EC55822794@vie-msgusr-01.dc.fore.com>
From: "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
To: "'Srihari R. Sangli'" <srihari@procket.com>, "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
Cc: "'Tsillas, Jim'" <jtsillas@tenornetworks.com>, "'idr@merit.edu'" <idr@merit.edu>, "'enke@redback.com'" <enke@redback.com>
Subject: RE: Re>Are dynamic capabilities really necessary if peers support gracef ul restart?
Date: Thu, 11 Jul 2002 15:43:02 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2650.21)
Content-Type: text/plain; charset="iso-8859-1"
Sender: owner-idr@merit.edu
Precedence: bulk

Srihari:
  I looked over the draft again and I have to apologize I misunderstood you 
responses below. I would recommend rewording the two paragraph so that
others
do not get confused like me.

I have added the implied changes in the paragraph to fix the issue. Below.

Ben
 
 
> >
> > I have a question on the current 
> draft-ietf-idr-dynamic-cap-02.txt, section
> > 7 - Operation, 3rd and 4th paragraph:
> >
> >     "The Dynamic Capability itself can not be revised 
> dynamically via a
> >      CAPABILITY message. The lifetime of the Dynamic 
> Capability is the
> >      duration of the BGP session in which the capability is 
> advertised.
> 

I would reword it as follows:

"The Dynamic Capability feature defined in the OPEN message can not
 be revised dynamically via a CAPABILITY message. The lifetime of the
 Dynamic Capabaility feature is the duration of the BGP session."

> This is referring to the "Dynamic-Capability" itself.  This capability
> can ONLY appear in the OPEN message.
> 
> >      Upon receiving a CAPABILITY message from its peer, the 
> BGP speaker
> >      shall update the capabilities previously received from 
> that peer
> >      based on the "Action" in the message, and then 
> function in accordance
> >      with the revised capabilities for the peer. Any unrecognized
> >      capabilities in the message should be ignored."
> 

"Upon receiving a CAPABILITY message with specific capabilities from
 its peer, the BGP speaker shall update those specific capabilities 
 previously received from that peer based on the "Action" in the 
 message, and then function in accordance with the revised capabilities
 for the peer. 

 Any unrecognized capabilities found in the CAPABALITY message should be 
 ignored".

Regards,
Ben


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id PAA25842 for <idr-archive@nic.merit.edu>; Thu, 11 Jul 2002 15:36:12 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 5745591370; Thu, 11 Jul 2002 15:35:34 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 2448A91372; Thu, 11 Jul 2002 15:35:34 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id C478D91370 for <idr@trapdoor.merit.edu>; Thu, 11 Jul 2002 15:35:31 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id AF7295DE3C; Thu, 11 Jul 2002 15:35:31 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from presque.djinesys.com (dns.nexthop.com [64.211.218.216]) by segue.merit.edu (Postfix) with ESMTP id 129085DDD2 for <idr@merit.edu>; Thu, 11 Jul 2002 15:35:31 -0400 (EDT)
Received: (from root@localhost) by presque.djinesys.com (8.11.3/8.11.1) id g6BJZ6v31828; Thu, 11 Jul 2002 15:35:06 -0400 (EDT) (envelope-from mrr@mrichardson.nexthop.com)
Received: from paradigm.nexthop.com (mrichardson.nexthop.com [64.211.218.45]) by presque.djinesys.com (8.11.3/8.11.1) with ESMTP id g6BJZ3131821; Thu, 11 Jul 2002 15:35:03 -0400 (EDT) (envelope-from mrr@mrichardson.nexthop.com)
Received: (from mrr@localhost) by paradigm.nexthop.com (8.11.6/8.11.6) id g6BJZ3A05368; Thu, 11 Jul 2002 15:35:03 -0400 (EDT)
Date: Thu, 11 Jul 2002 15:35:03 -0400
From: Mathew Richardson <mrr@nexthop.com>
To: "Abarbanel, Benjamin" <Benjamin.Abarbanel@marconi.com>
Cc: "'Srihari R. Sangli'" <srihari@procket.com>, "'Tsillas, Jim'" <jtsillas@tenornetworks.com>, "'idr@merit.edu'" <idr@merit.edu>, "'enke@redback.com'" <enke@redback.com>
Subject: Re: Re>Are dynamic capabilities really necessary if peers support gracef ul restart?
Message-ID: <20020711153503.A17603@nexthop.com>
References: <39469E08BD83D411A3D900204840EC55822792@vie-msgusr-01.dc.fore.com>
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
User-Agent: Mutt/1.2.5.1i
In-Reply-To: <39469E08BD83D411A3D900204840EC55822792@vie-msgusr-01.dc.fore.com>; from Benjamin.Abarbanel@Marconi.com on Thu, Jul 11, 2002 at 02:52:19PM -0400
X-Virus-Scanned: by AMaViS perl-11
Sender: owner-idr@merit.edu
Precedence: bulk

> Abarbanel, Benjamin <Benjamin.Abarbanel@Marconi.com> [Thu, Jul 11, 2002 at 02:52:19PM -0400]:
>> > I have a question on the current 
>> draft-ietf-idr-dynamic-cap-02.txt, section
>> > 7 - Operation, 3rd and 4th paragraph:
>> >
>> >     "The Dynamic Capability itself can not be revised 
>> dynamically via a

May I suggest that this sentence be changed to read:

"The Dynamic Capability Capability can not be revised
dynamically via a ..."

?

Would this change address your concerns, Mr. Abarbanel?

>> >      CAPABILITY message. The lifetime of the Dynamic 
>> Capability is the
>> >      duration of the BGP session in which the capability is 
>> advertised.
>> 
>> This is referring to the "Dynamic-Capability" itself.  This capability
>> can ONLY appear in the OPEN message.
>> 
>
>I thought this spec was talking about dynamic capabilities not static 
>ones, so why are we assuming its the one in the OPEN message?

I don't think that was an assumption... it was an assertion.  He's
saying that the Capability that advertises the ability to perform
Dynamic-Capability negotiations MUST NOT be negotiated dynamically;
it is only legal to advertise the Dynamic-Capability Capability in
OPEN messages.

>What purpose does it serve to talk about static capabilities in this
>draft, other than confuse the reader?

This is the draft in which the Dynamic-Capability Capability is
introduced.  Hence, it is the logical place to talk about the
limitations of the Dynamic-Capability Capability.

<snip>

mrr


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id PAA25473 for <idr-archive@nic.merit.edu>; Thu, 11 Jul 2002 15:26:03 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 1656B91371; Thu, 11 Jul 2002 15:25:26 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id D365F91370; Thu, 11 Jul 2002 15:25:25 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 72D5F91372 for <idr@trapdoor.merit.edu>; Thu, 11 Jul 2002 15:25:24 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 4B42A5DE3C; Thu, 11 Jul 2002 15:25:24 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from tenornetworks.com (rtu.tenornetworks.com [63.77.213.2]) by segue.merit.edu (Postfix) with ESMTP id AC9BF5DDD2 for <idr@merit.edu>; Thu, 11 Jul 2002 15:25:23 -0400 (EDT)
Received: from tenornet.com (newman [192.168.0.185]) by tenornetworks.com (Switch-2.1.3/Switch-2.1.0) with ESMTP id g6BJPKd10125; Thu, 11 Jul 2002 15:25:21 -0400 (EDT)
Received: from 192.168.0.185 by tenornet.com; THU, 11 Jul 2002 15:20:36 -0700
Received: by newman.tenornet.com with Internet Mail Service (5.5.2650.21) id <N1HXZZ1Y>; Thu, 11 Jul 2002 15:20:35 -0400
Message-ID: <6B190B34070BD411ACA000B0D0214E560165B811@newman.tenornet.com>
From: "Tsillas, Jim" <jtsillas@tenornetworks.com>
To: "'Abarbanel, Benjamin'" <Benjamin.Abarbanel@marconi.com>
Cc: "'idr@merit.edu'" <idr@merit.edu>
Subject: graceful restart TCP reset discovery (was Re>Are dynamic capabili ties really necessary ...)
Date: Thu, 11 Jul 2002 15:20:34 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2650.21)
Content-Type: text/plain; charset="iso-8859-1"
Sender: owner-idr@merit.edu
Precedence: bulk

Ben,

You are correct. I was thinking about the case where TCP is
reset due to "fast external fallover". In this case, the
forwarding plane is down and if it stays down for more than
the graceful restart timeout, the receiving speaker resorts to
normal restart.

Sections 6.1 and 6.2 go into good detail on this. The length
of time it takes to determine if TCP has reset is implementation
dependent. One way to detect TCP reset is to look for a new
session for the same peer (since the neighbor will try to
intiate a new session when it restarts). Another way is to rely
on "fast-external-fallover" being turned on. Otherwise you just
have to wait for either the keepalive expiration or the TCP timeout.

There is also an obscure case with some (probably most) TCPs where
the restarting speaker has not yet attempted to open the session but
the receiving speaker attempts to send data on a non-existing session.
This causes the restarting speaker to send a FIN back. There are
probably other equally obscure cases which are implementation dependent.

Once TCP reset is detected, we start the graceful restart timeout
period. This means that in some cases, the total timeout can
also include the amount of time it took to discover the TCP
session has been reset. The draft makes no distinction as far
as how the session was reset, as far as timeout is concerned.

-Jim.

-----Original Message-----
From: Abarbanel, Benjamin [mailto:Benjamin.Abarbanel@marconi.com]
Sent: Thursday, July 11, 2002 2:43 PM
To: 'Tsillas, Jim'; Abarbanel, Benjamin; 'Yakov Rekhter'; Srihari R.
Sangli
Cc: 'idr@merit.edu'
Subject: RE: Are dynamic capabilities really necessary if peers support
gr acef ul restart? 




> graceful restart can also be useful when the control plane
> is implemented separately from the forwarding plane. When
> this is true, the control place can reset (or fail-over)
> while the forwarding plane is unaffected. The reset, can
> be made transparent to the forwarding plane. Graceful
> restart can also serve to limit the scope of the reset to
> just the given speaker and its neighbors. In other words,
> the reset state doesn't propagate through the network.
>

Thanks for the clarification. It would be nice if applicable uses for
gracefull restart were added to the draft. 

> The case which you point out (where TCP is broken) is actually
> not covered since a broken TCP session implies a broken
> forwarding plane.
> 

I disagree, this is taken out of draft-ietf-idr-restart-05.txt, 
Abstract section 2.
                                    
 "Finally,procedures are outlined for temporarily retaining routing
information
  across a TCP transport reset." 

I assume it means we hold routing information after the forwarding plane
is broken (BGP peering session dropped), right?


> -Jim.
> 
> -----Original Message-----
> From: Abarbanel, Benjamin [mailto:Benjamin.Abarbanel@marconi.com]
> Sent: Thursday, July 11, 2002 1:05 PM
> To: 'Yakov Rekhter'; Srihari R. Sangli
> Cc: Tsillas, Jim; 'idr@merit.edu'
> Subject: RE: Are dynamic capabilities really necessary if 
> peers support
> gr acef ul restart? 
> 
> 
> Yakov:
>   The only time I think gracefull restart is usefull is if 
> two peers are
> operating
> fine, but the TCP connection is broken by the network. In the 
> case that
> either peer drops the session or either peer reboot/crash, I 
> dont see much
> value in gracefull restart.
> 
> Did I miss something?
> 
> Ben
> 
> > -----Original Message-----
> > From: Yakov Rekhter [mailto:yakov@juniper.net]
> > Sent: Thursday, July 11, 2002 12:55 PM
> > To: Srihari R. Sangli
> > Cc: Tsillas, Jim; 'idr@merit.edu'
> > Subject: Re: Are dynamic capabilities really necessary if 
> > peers support
> > gracef ul restart? 
> > 
> > 
> > Srihari,
> > 
> > > > It seems we are entering major discussion on dynamic
> > > > capabilities and the can of worms is open. Can anyone
> > > > think of a scenario where a dynamic capabilities
> > > > negotiation is still needed if we have graceful restart?
> > > 
> > > Talking about address-families exchanged between BGP speakers, the
> > > dynamic capability is useful for enabling a new address-family or
> > > disabling a address-family w/o disrupting the session and/or other
> > > address-families. It is also useful for turning ON/OFF 
> > other capabilities
> > > (like ORF, etc)  without much disruption. 
> > 
> > Just to clarify, in the presence of graceful restart the disruption
> > you refer to above is just the disruption in the routing peering
> > between the two speakers. There is no disruption in the 
> > data/forwarding
> > plane and no disruption in any other routing peering. Correct ?
> > 
> > Yakov.
> > 
> 


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id OAA24241 for <idr-archive@nic.merit.edu>; Thu, 11 Jul 2002 14:52:56 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id D12AD9136A; Thu, 11 Jul 2002 14:52:28 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id A235B9136C; Thu, 11 Jul 2002 14:52:28 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id AD2AF9136A for <idr@trapdoor.merit.edu>; Thu, 11 Jul 2002 14:52:27 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 920555DDAD; Thu, 11 Jul 2002 14:52:27 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from mailgate.pit.comms.marconi.com (mailgate.pit.comms.marconi.com [169.144.68.6]) by segue.merit.edu (Postfix) with ESMTP id 112615DE2F for <idr@merit.edu>; Thu, 11 Jul 2002 14:52:27 -0400 (EDT)
Received: from mailman.pit.comms.marconi.com (mailman.pit.comms.marconi.com [169.144.2.12]) by mailgate.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id OAA19206; Thu, 11 Jul 2002 14:52:24 -0400 (EDT)
Received: from whq-msgrtr-01.pit.comms.marconi.com (whq-msgrtr-01.pit.comms.marconi.com [169.144.2.221]) by mailman.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id OAA27408; Thu, 11 Jul 2002 14:52:24 -0400 (EDT)
Received: by whq-msgrtr-01.pit.comms.marconi.com with Internet Mail Service (5.5.2650.21) id <3W98SLJF>; Thu, 11 Jul 2002 14:52:23 -0400
Message-ID: <39469E08BD83D411A3D900204840EC55822792@vie-msgusr-01.dc.fore.com>
From: "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
To: "'Srihari R. Sangli'" <srihari@procket.com>, "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
Cc: "'Tsillas, Jim'" <jtsillas@tenornetworks.com>, "'idr@merit.edu'" <idr@merit.edu>, "'enke@redback.com'" <enke@redback.com>
Subject: RE: Re>Are dynamic capabilities really necessary if peers support gracef ul restart?
Date: Thu, 11 Jul 2002 14:52:19 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2650.21)
Content-Type: text/plain; charset="iso-8859-1"
Sender: owner-idr@merit.edu
Precedence: bulk

Srihari:

see below
 
> 
> 
> Handling the race condition will be clarified in the draft in the next
> revision.
> 

Thanks for this.

> >
> > I have a question on the current 
> draft-ietf-idr-dynamic-cap-02.txt, section
> > 7 - Operation, 3rd and 4th paragraph:
> >
> >     "The Dynamic Capability itself can not be revised 
> dynamically via a
> >      CAPABILITY message. The lifetime of the Dynamic 
> Capability is the
> >      duration of the BGP session in which the capability is 
> advertised.
> 
> This is referring to the "Dynamic-Capability" itself.  This capability
> can ONLY appear in the OPEN message.
> 

I thought this spec was talking about dynamic capabilities not static 
ones, so why are we assuming its the one in the OPEN message?

What purpose does it serve to talk about static capabilities in this
draft, other than confuse the reader?

> >      Upon receiving a CAPABILITY message from its peer, the 
> BGP speaker
> >      shall update the capabilities previously received from 
> that peer
> >      based on the "Action" in the message, and then 
> function in accordance
> >      with the revised capabilities for the peer. Any unrecognized
> >      capabilities in the message should be ignored."
> 
> This is referring to capabilities other than "Dynamic-Capability".
> 
> >
> > Sounds like the second paragraph contradicts the first one.
> 
> No, it does not.

Same comment as state above.

Ben


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id OAA24074 for <idr-archive@nic.merit.edu>; Thu, 11 Jul 2002 14:47:16 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id AF7839136B; Thu, 11 Jul 2002 14:46:40 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 8298F9136A; Thu, 11 Jul 2002 14:46:40 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id AEDD79136B for <idr@trapdoor.merit.edu>; Thu, 11 Jul 2002 14:46:33 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 3665F5DE39; Thu, 11 Jul 2002 14:46:33 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from mailgate.pit.comms.marconi.com (mailgate.pit.comms.marconi.com [169.144.68.6]) by segue.merit.edu (Postfix) with ESMTP id CCAE85DDAD for <idr@merit.edu>; Thu, 11 Jul 2002 14:46:32 -0400 (EDT)
Received: from mailman.pit.comms.marconi.com (mailman.pit.comms.marconi.com [169.144.2.12]) by mailgate.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id OAA18711; Thu, 11 Jul 2002 14:46:29 -0400 (EDT)
Received: from whq-msgrtr-01.pit.comms.marconi.com (whq-msgrtr-01.pit.comms.marconi.com [169.144.2.221]) by mailman.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id OAA25884; Thu, 11 Jul 2002 14:46:30 -0400 (EDT)
Received: by whq-msgrtr-01.pit.comms.marconi.com with Internet Mail Service (5.5.2650.21) id <3W98SK7G>; Thu, 11 Jul 2002 14:46:29 -0400
Message-ID: <39469E08BD83D411A3D900204840EC55822791@vie-msgusr-01.dc.fore.com>
From: "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
To: "'john heasley'" <heas@shrubbery.net>, "Srihari R. Sangli" <srihari@procket.com>
Cc: Stephen Waters <Stephen.Waters@riverstonenet.com>, "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>, idr@merit.edu
Subject: RE: Re>Are dynamic capabilities really necessary if peers support gracef ul restart?
Date: Thu, 11 Jul 2002 14:46:28 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2650.21)
Content-Type: text/plain; charset="iso-8859-1"
Sender: owner-idr@merit.edu
Precedence: bulk

Excellent Points, Stephen

Ben

> -----Original Message-----
> From: john heasley [mailto:heas@shrubbery.net]
> Sent: Thursday, July 11, 2002 2:44 PM
> To: Srihari R. Sangli
> Cc: Stephen Waters; Abarbanel, Benjamin; idr@merit.edu
> Subject: Re: Re>Are dynamic capabilities really necessary if peers
> support gracef ul restart?
> 
> 
> Thu, Jul 11, 2002 at 11:27:44AM -0700, Srihari R. Sangli:
> > 
> > > A network manager wanting to change the state or extent of a
> > > capability would appreciate doing that within a bgp 
> session, wouldn't
> > > they?
> > >
> > 
> > The network manager can turn off other features (or 
> capabilities) and thus
> > communicate such an event to the remote peers via this 
> message. To keep it
> > simple, the draft does not allow turning OFF 
> "Dynamic-capability" via the
> > capability message.
> > 
> > srihari...
> > p.s. If I've misinterpreted your message, I apologise.
> 
> these functions run chills up my spine.  i fear not being 
> able to disable
> various capabilities and auto-negotiation when faced with a 
> business policy
> or buggy or abusive peer/customer (or bugs in my own 
> equipment, requiring
> s/w upgrades - requiring a maintenance window to be scheduled).
> 
> so, not a complaint, but a comment on design and to vendors from an
> operator perspective:
> 
> 	"it must be possible to turn such options off and force
> 	 values/capabilities that will be required."
> 
> eg: if i do not want to accept filters from a peer, then i 
> ought to be able
> disable that functionality.  or, i want nlri unicast and 
> multicast, then i
> ought to be able to force that negotiation, else the session 
> is reset or the
> dynamic re-negotiation is denied.
> 
> the more flexible the configuration knobs the better, imiho.
> 


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id OAA23987 for <idr-archive@nic.merit.edu>; Thu, 11 Jul 2002 14:44:36 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 1812191369; Thu, 11 Jul 2002 14:44:15 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id DD2679136A; Thu, 11 Jul 2002 14:44:14 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id DA03991369 for <idr@trapdoor.merit.edu>; Thu, 11 Jul 2002 14:44:13 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id C02185DE2F; Thu, 11 Jul 2002 14:44:13 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from guelah.shrubbery.net (guelah.shrubbery.net [198.58.5.1]) by segue.merit.edu (Postfix) with ESMTP id 310D85DDAD for <idr@merit.edu>; Thu, 11 Jul 2002 14:44:13 -0400 (EDT)
Received: (from heas@localhost) by guelah.shrubbery.net (8.11.6/8.11.1) id g6BIi9v24647; Thu, 11 Jul 2002 18:44:09 GMT
Date: Thu, 11 Jul 2002 11:44:09 -0700
From: john heasley <heas@shrubbery.net>
To: "Srihari R. Sangli" <srihari@procket.com>
Cc: Stephen Waters <Stephen.Waters@riverstonenet.com>, "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>, idr@merit.edu
Subject: Re: Re>Are dynamic capabilities really necessary if peers support gracef ul restart?
Message-ID: <20020711114409.P16814@shrubbery.net>
References: <D2ECDCD8A528BF43A01EECCBEB191DD108F332@rs-eu-exc1.rs.riverstonenet.com> <Pine.GSO.4.40.0207111123500.192-100000@miata.procket.com>
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
User-Agent: Mutt/1.2.5.1i
In-Reply-To: <Pine.GSO.4.40.0207111123500.192-100000@miata.procket.com>; from srihari@procket.com on Thu, Jul 11, 2002 at 11:27:44AM -0700
X-note: live free, or die!
X-homer: mmmm, forbidden doughnut.
Sender: owner-idr@merit.edu
Precedence: bulk

Thu, Jul 11, 2002 at 11:27:44AM -0700, Srihari R. Sangli:
> 
> > A network manager wanting to change the state or extent of a
> > capability would appreciate doing that within a bgp session, wouldn't
> > they?
> >
> 
> The network manager can turn off other features (or capabilities) and thus
> communicate such an event to the remote peers via this message. To keep it
> simple, the draft does not allow turning OFF "Dynamic-capability" via the
> capability message.
> 
> srihari...
> p.s. If I've misinterpreted your message, I apologise.

these functions run chills up my spine.  i fear not being able to disable
various capabilities and auto-negotiation when faced with a business policy
or buggy or abusive peer/customer (or bugs in my own equipment, requiring
s/w upgrades - requiring a maintenance window to be scheduled).

so, not a complaint, but a comment on design and to vendors from an
operator perspective:

	"it must be possible to turn such options off and force
	 values/capabilities that will be required."

eg: if i do not want to accept filters from a peer, then i ought to be able
disable that functionality.  or, i want nlri unicast and multicast, then i
ought to be able to force that negotiation, else the session is reset or the
dynamic re-negotiation is denied.

the more flexible the configuration knobs the better, imiho.



Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id OAA23959 for <idr-archive@nic.merit.edu>; Thu, 11 Jul 2002 14:43:42 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 9D73191368; Thu, 11 Jul 2002 14:43:21 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 6866891369; Thu, 11 Jul 2002 14:43:21 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 52E4891368 for <idr@trapdoor.merit.edu>; Thu, 11 Jul 2002 14:43:20 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 38E6F5DE31; Thu, 11 Jul 2002 14:43:20 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from mailgate.pit.comms.marconi.com (mailgate.pit.comms.marconi.com [169.144.68.6]) by segue.merit.edu (Postfix) with ESMTP id B89FC5DE2F for <idr@merit.edu>; Thu, 11 Jul 2002 14:43:19 -0400 (EDT)
Received: from mailman.pit.comms.marconi.com (mailman.pit.comms.marconi.com [169.144.2.12]) by mailgate.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id OAA18514; Thu, 11 Jul 2002 14:43:17 -0400 (EDT)
Received: from whq-msgrtr-01.pit.comms.marconi.com (whq-msgrtr-01.pit.comms.marconi.com [169.144.2.221]) by mailman.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id OAA25332; Thu, 11 Jul 2002 14:43:17 -0400 (EDT)
Received: by whq-msgrtr-01.pit.comms.marconi.com with Internet Mail Service (5.5.2650.21) id <3W98SKXS>; Thu, 11 Jul 2002 14:43:16 -0400
Message-ID: <39469E08BD83D411A3D900204840EC55822790@vie-msgusr-01.dc.fore.com>
From: "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
To: "'Tsillas, Jim'" <jtsillas@tenornetworks.com>, "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>, "'Yakov Rekhter'" <yakov@juniper.net>, "Srihari R. Sangli" <srihari@procket.com>
Cc: "'idr@merit.edu'" <idr@merit.edu>
Subject: RE: Are dynamic capabilities really necessary if peers support gr acef ul restart? 
Date: Thu, 11 Jul 2002 14:43:11 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2650.21)
Content-Type: text/plain; charset="iso-8859-1"
Sender: owner-idr@merit.edu
Precedence: bulk

> graceful restart can also be useful when the control plane
> is implemented separately from the forwarding plane. When
> this is true, the control place can reset (or fail-over)
> while the forwarding plane is unaffected. The reset, can
> be made transparent to the forwarding plane. Graceful
> restart can also serve to limit the scope of the reset to
> just the given speaker and its neighbors. In other words,
> the reset state doesn't propagate through the network.
>

Thanks for the clarification. It would be nice if applicable uses for
gracefull restart were added to the draft. 

> The case which you point out (where TCP is broken) is actually
> not covered since a broken TCP session implies a broken
> forwarding plane.
> 

I disagree, this is taken out of draft-ietf-idr-restart-05.txt, 
Abstract section 2.
                                    
 "Finally,procedures are outlined for temporarily retaining routing
information
  across a TCP transport reset." 

I assume it means we hold routing information after the forwarding plane
is broken (BGP peering session dropped), right?


> -Jim.
> 
> -----Original Message-----
> From: Abarbanel, Benjamin [mailto:Benjamin.Abarbanel@marconi.com]
> Sent: Thursday, July 11, 2002 1:05 PM
> To: 'Yakov Rekhter'; Srihari R. Sangli
> Cc: Tsillas, Jim; 'idr@merit.edu'
> Subject: RE: Are dynamic capabilities really necessary if 
> peers support
> gr acef ul restart? 
> 
> 
> Yakov:
>   The only time I think gracefull restart is usefull is if 
> two peers are
> operating
> fine, but the TCP connection is broken by the network. In the 
> case that
> either peer drops the session or either peer reboot/crash, I 
> dont see much
> value in gracefull restart.
> 
> Did I miss something?
> 
> Ben
> 
> > -----Original Message-----
> > From: Yakov Rekhter [mailto:yakov@juniper.net]
> > Sent: Thursday, July 11, 2002 12:55 PM
> > To: Srihari R. Sangli
> > Cc: Tsillas, Jim; 'idr@merit.edu'
> > Subject: Re: Are dynamic capabilities really necessary if 
> > peers support
> > gracef ul restart? 
> > 
> > 
> > Srihari,
> > 
> > > > It seems we are entering major discussion on dynamic
> > > > capabilities and the can of worms is open. Can anyone
> > > > think of a scenario where a dynamic capabilities
> > > > negotiation is still needed if we have graceful restart?
> > > 
> > > Talking about address-families exchanged between BGP speakers, the
> > > dynamic capability is useful for enabling a new address-family or
> > > disabling a address-family w/o disrupting the session and/or other
> > > address-families. It is also useful for turning ON/OFF 
> > other capabilities
> > > (like ORF, etc)  without much disruption. 
> > 
> > Just to clarify, in the presence of graceful restart the disruption
> > you refer to above is just the disruption in the routing peering
> > between the two speakers. There is no disruption in the 
> > data/forwarding
> > plane and no disruption in any other routing peering. Correct ?
> > 
> > Yakov.
> > 
> 


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id OAA23427 for <idr-archive@nic.merit.edu>; Thu, 11 Jul 2002 14:28:25 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 035A591366; Thu, 11 Jul 2002 14:27:47 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id C844591367; Thu, 11 Jul 2002 14:27:46 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id A68E591366 for <idr@trapdoor.merit.edu>; Thu, 11 Jul 2002 14:27:45 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 8677C5DE05; Thu, 11 Jul 2002 14:27:45 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from dmz2.procket.com (dmz2.procket.com [65.174.124.37]) by segue.merit.edu (Postfix) with ESMTP id 598B75DDAD for <idr@merit.edu>; Thu, 11 Jul 2002 14:27:45 -0400 (EDT)
Received: from miata.procket.com (zeus-d-1.procket.com [65.174.124.60]) by dmz2.procket.com (Postfix) with ESMTP id 152A53442B; Thu, 11 Jul 2002 11:34:06 -0700 (PDT)
Received: from localhost (srihari@localhost) by miata.procket.com (8.12.1/8.12.1) with ESMTP id g6BIRikK003149; Thu, 11 Jul 2002 11:27:44 -0700 (PDT)
Date: Thu, 11 Jul 2002 11:27:44 -0700 (PDT)
From: "Srihari R. Sangli" <srihari@procket.com>
To: Stephen Waters <Stephen.Waters@riverstonenet.com>
Cc: "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>, <idr@merit.edu>
Subject: RE: Re>Are dynamic capabilities really necessary if peers support gracef ul restart?
In-Reply-To: <D2ECDCD8A528BF43A01EECCBEB191DD108F332@rs-eu-exc1.rs.riverstonenet.com>
Message-ID: <Pine.GSO.4.40.0207111123500.192-100000@miata.procket.com>
MIME-Version: 1.0
Content-Type: TEXT/PLAIN; charset=US-ASCII
Sender: owner-idr@merit.edu
Precedence: bulk

> A network manager wanting to change the state or extent of a
> capability would appreciate doing that within a bgp session, wouldn't
> they?
>

The network manager can turn off other features (or capabilities) and thus
communicate such an event to the remote peers via this message. To keep it
simple, the draft does not allow turning OFF "Dynamic-capability" via the
capability message.

srihari...
p.s. If I've misinterpreted your message, I apologise.



Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id OAA23222 for <idr-archive@nic.merit.edu>; Thu, 11 Jul 2002 14:23:10 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id BC81F91365; Thu, 11 Jul 2002 14:22:48 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 8F94E91366; Thu, 11 Jul 2002 14:22:48 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 058CA91365 for <idr@trapdoor.merit.edu>; Thu, 11 Jul 2002 14:22:46 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id E75AA5DE05; Thu, 11 Jul 2002 14:22:46 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from tenornetworks.com (rtu.tenornetworks.com [63.77.213.2]) by segue.merit.edu (Postfix) with ESMTP id 74E2D5DDAD for <idr@merit.edu>; Thu, 11 Jul 2002 14:22:46 -0400 (EDT)
Received: from tenornet.com (newman [192.168.0.185]) by tenornetworks.com (Switch-2.1.3/Switch-2.1.0) with ESMTP id g6BIMid07327; Thu, 11 Jul 2002 14:22:44 -0400 (EDT)
Received: from 192.168.0.185 by tenornet.com; THU, 11 Jul 2002 14:18:07 -0700
Received: by newman.tenornet.com with Internet Mail Service (5.5.2650.21) id <N1HXZZA7>; Thu, 11 Jul 2002 14:18:07 -0400
Message-ID: <6B190B34070BD411ACA000B0D0214E560165B80F@newman.tenornet.com>
From: "Tsillas, Jim" <jtsillas@tenornetworks.com>
To: "'Srihari R. Sangli'" <srihari@procket.com>, "Tsillas, Jim" <jtsillas@tenornetworks.com>
Cc: "'idr@merit.edu'" <idr@merit.edu>
Subject: RE: Are dynamic capabilities really necessary if peers support gr acef ul restart?
Date: Thu, 11 Jul 2002 14:18:06 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2650.21)
Content-Type: text/plain; charset="iso-8859-1"
Sender: owner-idr@merit.edu
Precedence: bulk

I agree we need to see a description for how dynamic
capabilities should work for each capability currently
in use and for each newly introduced capability.

My point on graceful restart was that since it can hide
session resets, couldn't we simply use that to reconfigure
capabilities rather than introduce a new concept. Ben
and Steve's point on being able to turn something on and
off without worrying about affecting the session is well
taken and I now agree with them.

-Jim.

-----Original Message-----
From: Srihari R. Sangli [mailto:srihari@procket.com]
Sent: Thursday, July 11, 2002 12:43 PM
To: Tsillas, Jim
Cc: 'idr@merit.edu'
Subject: Re: Are dynamic capabilities really necessary if peers support
gracef ul restart?



> It seems we are entering major discussion on dynamic
> capabilities and the can of worms is open. Can anyone
> think of a scenario where a dynamic capabilities
> negotiation is still needed if we have graceful restart?

Talking about address-families exchanged between BGP speakers, the
dynamic capability is useful for enabling a new address-family or
disabling a address-family w/o disrupting the session and/or other
address-families. It is also useful for turning ON/OFF other capabilities
(like ORF, etc)  without much disruption. Note that Graceful restart has
its own limitations for control plane changes.

I understand that the draft needs to be clear on the race condition
aspects that was mentioned earlier. Any other issues ?

srihari...



Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id OAA23201 for <idr-archive@nic.merit.edu>; Thu, 11 Jul 2002 14:22:39 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 850F291364; Thu, 11 Jul 2002 14:22:16 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 5212091365; Thu, 11 Jul 2002 14:22:16 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 242DE91364 for <idr@trapdoor.merit.edu>; Thu, 11 Jul 2002 14:22:15 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 0C1625DE05; Thu, 11 Jul 2002 14:22:15 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from dmz1.procket.com (dmz1.procket.com [65.174.124.36]) by segue.merit.edu (Postfix) with ESMTP id D68805DDAD for <idr@merit.edu>; Thu, 11 Jul 2002 14:22:14 -0400 (EDT)
Received: from miata.procket.com (zeus-d-1.procket.com [65.174.124.60]) by dmz1.procket.com (Postfix) with ESMTP id 864EF23C33; Thu, 11 Jul 2002 11:35:53 -0700 (PDT)
Received: from localhost (srihari@localhost) by miata.procket.com (8.12.1/8.12.1) with ESMTP id g6BIM9Jk002697; Thu, 11 Jul 2002 11:22:10 -0700 (PDT)
Date: Thu, 11 Jul 2002 11:22:09 -0700 (PDT)
From: "Srihari R. Sangli" <srihari@procket.com>
To: "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
Cc: "'Tsillas, Jim'" <jtsillas@tenornetworks.com>, "'idr@merit.edu'" <idr@merit.edu>, "'enke@redback.com'" <enke@redback.com>
Subject: RE: Re>Are dynamic capabilities really necessary if peers support  gracef ul restart?
In-Reply-To: <39469E08BD83D411A3D900204840EC5582278C@vie-msgusr-01.dc.fore.com>
Message-ID: <Pine.GSO.4.40.0207111111560.192-100000@miata.procket.com>
MIME-Version: 1.0
Content-Type: TEXT/PLAIN; charset=US-ASCII
Sender: owner-idr@merit.edu
Precedence: bulk

> Jim and Enke:
>
> I think the issue with dynamic capabilities is that the current draft is not
> clear on what to do in the case of race conditions with the capability
> messages
> between two routers. The can of worms you speak of will bite us if we do not
> clearify this issue. Each vendor will choose to take a different action
> when this happens, I would prefer that the draft state a way to handle race
> conditions and thus avoid the whole mess of either reseting the session or
> recovering the CAPABILITY state.


Handling the race condition will be clarified in the draft in the next
revision.

>
> I have a question on the current draft-ietf-idr-dynamic-cap-02.txt, section
> 7 - Operation, 3rd and 4th paragraph:
>
>     "The Dynamic Capability itself can not be revised dynamically via a
>      CAPABILITY message. The lifetime of the Dynamic Capability is the
>      duration of the BGP session in which the capability is advertised.

This is referring to the "Dynamic-Capability" itself.  This capability
can ONLY appear in the OPEN message.

>      Upon receiving a CAPABILITY message from its peer, the BGP speaker
>      shall update the capabilities previously received from that peer
>      based on the "Action" in the message, and then function in accordance
>      with the revised capabilities for the peer. Any unrecognized
>      capabilities in the message should be ignored."

This is referring to capabilities other than "Dynamic-Capability".

>
> Sounds like the second paragraph contradicts the first one.

No, it does not.

> So I cannot
> tell if I can renegotiate and replace the CAPABILITY or not during this
> session?

How about now ?

srihari...



Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id OAA22800 for <idr-archive@nic.merit.edu>; Thu, 11 Jul 2002 14:10:04 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id DC92C91363; Thu, 11 Jul 2002 14:09:42 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id A176C91364; Thu, 11 Jul 2002 14:09:42 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 9C29491363 for <idr@trapdoor.merit.edu>; Thu, 11 Jul 2002 14:09:41 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 56CE85DE47; Thu, 11 Jul 2002 14:09:41 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from mailgate.pit.comms.marconi.com (mailgate.pit.comms.marconi.com [169.144.68.6]) by segue.merit.edu (Postfix) with ESMTP id B00745DE2F for <idr@merit.edu>; Thu, 11 Jul 2002 14:09:40 -0400 (EDT)
Received: from mailman.pit.comms.marconi.com (mailman.pit.comms.marconi.com [169.144.2.12]) by mailgate.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id OAA15762; Thu, 11 Jul 2002 14:09:37 -0400 (EDT)
Received: from whq-msgrtr-01.pit.comms.marconi.com (whq-msgrtr-01.pit.comms.marconi.com [169.144.2.221]) by mailman.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id OAA17511; Thu, 11 Jul 2002 14:09:38 -0400 (EDT)
Received: by whq-msgrtr-01.pit.comms.marconi.com with Internet Mail Service (5.5.2650.21) id <3W98S2WH>; Thu, 11 Jul 2002 14:09:37 -0400
Message-ID: <39469E08BD83D411A3D900204840EC5582278F@vie-msgusr-01.dc.fore.com>
From: "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
To: "'Stephen Waters'" <Stephen.Waters@riverstonenet.com>, "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
Cc: idr@merit.edu
Subject: RE: Re>Are dynamic capabilities really necessary if peers support gracef ul restart?
Date: Thu, 11 Jul 2002 14:09:32 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2650.21)
Content-Type: text/plain; charset="iso-8859-1"
Sender: owner-idr@merit.edu
Precedence: bulk

Absolutely, Agreed.  

 
Ben

> -----Original Message-----
> From: Stephen Waters [mailto:Stephen.Waters@riverstonenet.com]
> Sent: Thursday, July 11, 2002 11:57 AM
> To: Abarbanel, Benjamin
> Cc: idr@merit.edu
> Subject: RE: Re>Are dynamic capabilities really necessary if peers
> support gracef ul restart?
> 
> 
> 
> I don't see much point in dynamic capability negotiation if 
> you can't turn things on and off,
> (or off and on) to your heart's content. 
> 
> A network manager wanting to change the state or extent of a 
> capability would appreciate doing that within a bgp session, 
> wouldn't they?
> 
> Steve.  
> 
> > -----Original Message-----
> > From: Abarbanel, Benjamin [mailto:Benjamin.Abarbanel@Marconi.com]
> > Sent: Thursday, July 11, 2002 4:15 PM
> > To: 'Tsillas, Jim'; 'idr@merit.edu'
> > Cc: 'enke@redback.com'
> > Subject: RE: Re>Are dynamic capabilities really necessary if peers
> > support gracef ul restart?
> > 
> > 
> > Jim and Enke:
> > 
> > I think the issue with dynamic capabilities is that the 
> > current draft is not
> > clear on what to do in the case of race conditions with the 
> capability
> > messages 
> > between two routers. The can of worms you speak of will bite 
> > us if we do not
> > clearify this issue. Each vendor will choose to take a 
> > different action 
> > when this happens, I would prefer that the draft state a way 
> > to handle race
> > conditions and thus avoid the whole mess of either reseting 
> > the session or
> > recovering the CAPABILITY state.
> > 
> > I have a question on the current 
> > draft-ietf-idr-dynamic-cap-02.txt, section
> > 7 - Operation, 3rd and 4th paragraph:
> > 
> >     "The Dynamic Capability itself can not be revised 
> > dynamically via a
> >      CAPABILITY message. The lifetime of the Dynamic 
> Capability is the
> >      duration of the BGP session in which the capability is 
> > advertised.
> > 
> >      Upon receiving a CAPABILITY message from its peer, the 
> > BGP speaker
> >      shall update the capabilities previously received from 
> that peer
> >      based on the "Action" in the message, and then function 
> > in accordance
> >      with the revised capabilities for the peer. Any unrecognized
> >      capabilities in the message should be ignored."
> > 
> > Sounds like the second paragraph contradicts the first one. 
> > So I cannot 
> > tell if I can renegotiate and replace the CAPABILITY or not 
> > during this
> > session?
> > 
> > Ben
> > 
> > > -----Original Message-----
> > > From: Tsillas, Jim [mailto:jtsillas@tenornetworks.com]
> > > Sent: Thursday, July 11, 2002 9:52 AM
> > > To: 'idr@merit.edu'
> > > Subject: FW: Re>Are dynamic capabilities really necessary if peers
> > > support gracef ul restart?
> > > 
> > > 
> > > 
> > > Bruno,
> > > 
> > > if graceful restart is implemented for all supported
> > > address families and can be manually initiated it
> > > seems to me that much of the functionality of capabilities
> > > negotiation can be achieved by gracefully restarting the
> > > session. I'm trying to think of a scenario where this
> > > won't work. Granted that bouncing the session, even with
> > > graceful restart, will cause large numbers of messages
> > > to be sent between peers (which is something to avoid).
> > > 
> > > By can of worms I am implying unknown race conditions
> > > which can result by renegotiating certain capabilities.
> > > For some capabilities it may be harder than others to
> > > identify and handle all the race conditions.
> > > 
> > > -Jim.
> > > 
> > > -----Original Message-----
> > > From: Bruno Rijsman [mailto:bwarijsman@hotmail.com]
> > > Sent: Thursday, July 11, 2002 8:30 AM
> > > To: idr@merit.edu
> > > Cc: jtsillas@tenornetworks.com
> > > Subject: Re>Are dynamic capabilities really necessary if 
> > peers support
> > > gracef ul restart?
> > > 
> > > 
> > > Re> It seems we are entering major discussion on dynamic
> > > Re> capabilities and the can of worms is open. Can anyone
> > > Re> think of a scenario where a dynamic capabilities
> > > Re> negotiation is still needed if we have graceful restart?
> > > 
> > > One example is: it allows you to activate or de-activate an
> > > address family (e.g. ipv6 or vpnv4) on a session which is
> > > already established and exchanging some other address family
> > > (e.g. ipv4) without having to bounce that session.
> > > 
> > > More generally speaking, it allows you to activate or
> > > de-activate features on a session without needing to bounce the
> > > session.
> > > 
> > > Not needing to bounce a session a good thing.
> > > 
> > > It seems from your e-mail that you are implying that 
> > graceful restart
> > > removes the need or reduces the usefulness of dynamic capability
> > > negotiation. Could you please clarify that?
> > > 
> > > Having personally implemented dynamic capability negotiation
> > > I can tell you that the "can of worms" is not so bad: all you
> > > have to do is to be willing to accept and silently ignore certain
> > > messages after withdrawing a capability (e.g. you have to 
> be willing
> > > to accept and ignore vpnv4 updates if you recently withdrew the
> > > vpnv4 multi-protocol capability).
> > > 
> > > 
> > > 
> > > _________________________________________________________________
> > > Send and receive Hotmail on your mobile device: 
> http://mobile.msn.com
> > 
> 


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id OAA22735 for <idr-archive@nic.merit.edu>; Thu, 11 Jul 2002 14:08:13 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 456AB9125D; Thu, 11 Jul 2002 14:07:45 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 151B491363; Thu, 11 Jul 2002 14:07:45 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 00F5B9125D for <idr@trapdoor.merit.edu>; Thu, 11 Jul 2002 14:07:43 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id DC56A5DE05; Thu, 11 Jul 2002 14:07:43 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from tenornetworks.com (rtu.tenornetworks.com [63.77.213.2]) by segue.merit.edu (Postfix) with ESMTP id 57C4C5DDAD for <idr@merit.edu>; Thu, 11 Jul 2002 14:07:43 -0400 (EDT)
Received: from tenornet.com (newman [192.168.0.185]) by tenornetworks.com (Switch-2.1.3/Switch-2.1.0) with ESMTP id g6BI7dd06595; Thu, 11 Jul 2002 14:07:39 -0400 (EDT)
Received: from 192.168.0.185 by tenornet.com; THU, 11 Jul 2002 14:02:59 -0700
Received: by newman.tenornet.com with Internet Mail Service (5.5.2650.21) id <N1HXZZAB>; Thu, 11 Jul 2002 14:02:58 -0400
Message-ID: <6B190B34070BD411ACA000B0D0214E560165B80E@newman.tenornet.com>
From: "Tsillas, Jim" <jtsillas@tenornetworks.com>
To: "'Abarbanel, Benjamin'" <Benjamin.Abarbanel@marconi.com>, "'Yakov Rekhter'" <yakov@juniper.net>, "Srihari R. Sangli" <srihari@procket.com>
Cc: "Tsillas, Jim" <jtsillas@tenornetworks.com>, "'idr@merit.edu'" <idr@merit.edu>
Subject: RE: Are dynamic capabilities really necessary if peers support gr acef ul restart? 
Date: Thu, 11 Jul 2002 14:02:57 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2650.21)
Content-Type: text/plain; charset="iso-8859-1"
Sender: owner-idr@merit.edu
Precedence: bulk

Ben,

graceful restart can also be useful when the control plane
is implemented separately from the forwarding plane. When
this is true, the control place can reset (or fail-over)
while the forwarding plane is unaffected. The reset, can
be made transparent to the forwarding plane. Graceful
restart can also serve to limit the scope of the reset to
just the given speaker and its neighbors. In other words,
the reset state doesn't propagate through the network.

The case which you point out (where TCP is broken) is actually
not covered since a broken TCP session implies a broken
forwarding plane.

-Jim.

-----Original Message-----
From: Abarbanel, Benjamin [mailto:Benjamin.Abarbanel@marconi.com]
Sent: Thursday, July 11, 2002 1:05 PM
To: 'Yakov Rekhter'; Srihari R. Sangli
Cc: Tsillas, Jim; 'idr@merit.edu'
Subject: RE: Are dynamic capabilities really necessary if peers support
gr acef ul restart? 


Yakov:
  The only time I think gracefull restart is usefull is if two peers are
operating
fine, but the TCP connection is broken by the network. In the case that
either peer drops the session or either peer reboot/crash, I dont see much
value in gracefull restart.

Did I miss something?

Ben

> -----Original Message-----
> From: Yakov Rekhter [mailto:yakov@juniper.net]
> Sent: Thursday, July 11, 2002 12:55 PM
> To: Srihari R. Sangli
> Cc: Tsillas, Jim; 'idr@merit.edu'
> Subject: Re: Are dynamic capabilities really necessary if 
> peers support
> gracef ul restart? 
> 
> 
> Srihari,
> 
> > > It seems we are entering major discussion on dynamic
> > > capabilities and the can of worms is open. Can anyone
> > > think of a scenario where a dynamic capabilities
> > > negotiation is still needed if we have graceful restart?
> > 
> > Talking about address-families exchanged between BGP speakers, the
> > dynamic capability is useful for enabling a new address-family or
> > disabling a address-family w/o disrupting the session and/or other
> > address-families. It is also useful for turning ON/OFF 
> other capabilities
> > (like ORF, etc)  without much disruption. 
> 
> Just to clarify, in the presence of graceful restart the disruption
> you refer to above is just the disruption in the routing peering
> between the two speakers. There is no disruption in the 
> data/forwarding
> plane and no disruption in any other routing peering. Correct ?
> 
> Yakov.
> 


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id NAA22303 for <idr-archive@nic.merit.edu>; Thu, 11 Jul 2002 13:55:55 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 1A36391362; Thu, 11 Jul 2002 13:55:28 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id D128091363; Thu, 11 Jul 2002 13:55:27 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 9B2D491362 for <idr@trapdoor.merit.edu>; Thu, 11 Jul 2002 13:55:26 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 8081E5DE2F; Thu, 11 Jul 2002 13:55:26 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from dmz2.procket.com (dmz2.procket.com [65.174.124.37]) by segue.merit.edu (Postfix) with ESMTP id 58C8E5DE05 for <idr@merit.edu>; Thu, 11 Jul 2002 13:55:26 -0400 (EDT)
Received: from miata.procket.com (zeus-d-1.procket.com [65.174.124.60]) by dmz2.procket.com (Postfix) with ESMTP id 65C633442D; Thu, 11 Jul 2002 11:01:42 -0700 (PDT)
Received: from localhost (srihari@localhost) by miata.procket.com (8.12.1/8.12.1) with ESMTP id g6BHtKRS000759; Thu, 11 Jul 2002 10:55:20 -0700 (PDT)
Date: Thu, 11 Jul 2002 10:55:20 -0700 (PDT)
From: "Srihari R. Sangli" <srihari@procket.com>
To: Yakov Rekhter <yakov@juniper.net>
Cc: "Tsillas, Jim" <jtsillas@tenornetworks.com>, "'idr@merit.edu'" <idr@merit.edu>
Subject: Re: Are dynamic capabilities really necessary if peers support gracef ul restart? 
In-Reply-To: <200207111655.g6BGtGm64434@merlot.juniper.net>
Message-ID: <Pine.GSO.4.40.0207111049180.192-100000@miata.procket.com>
MIME-Version: 1.0
Content-Type: TEXT/PLAIN; charset=US-ASCII
Sender: owner-idr@merit.edu
Precedence: bulk

Yakov,

> Just to clarify, in the presence of graceful restart the disruption
> you refer to above is just the disruption in the routing peering
> between the two speakers. There is no disruption in the data/forwarding
> plane and no disruption in any other routing peering. Correct ?

I was referring to the control plane changes after the BGP speaker restart
event until the point its up and has relearned them.

srihari...



Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id NAA20654 for <idr-archive@nic.merit.edu>; Thu, 11 Jul 2002 13:06:15 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id F1CF19125C; Thu, 11 Jul 2002 13:05:36 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id BD67D9135A; Thu, 11 Jul 2002 13:05:35 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id C3AC09125C for <idr@trapdoor.merit.edu>; Thu, 11 Jul 2002 13:05:34 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id AB3B35DE10; Thu, 11 Jul 2002 13:05:34 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from mailgate.pit.comms.marconi.com (mailgate.pit.comms.marconi.com [169.144.68.6]) by segue.merit.edu (Postfix) with ESMTP id 2AF215DDFE for <idr@merit.edu>; Thu, 11 Jul 2002 13:05:34 -0400 (EDT)
Received: from mailman.pit.comms.marconi.com (mailman.pit.comms.marconi.com [169.144.2.12]) by mailgate.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id NAA11127; Thu, 11 Jul 2002 13:05:31 -0400 (EDT)
Received: from whq-msgrtr-01.pit.comms.marconi.com (whq-msgrtr-01.pit.comms.marconi.com [169.144.2.221]) by mailman.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id NAA02095; Thu, 11 Jul 2002 13:05:32 -0400 (EDT)
Received: by whq-msgrtr-01.pit.comms.marconi.com with Internet Mail Service (5.5.2650.21) id <3W98S1FW>; Thu, 11 Jul 2002 13:05:32 -0400
Message-ID: <39469E08BD83D411A3D900204840EC5582278D@vie-msgusr-01.dc.fore.com>
From: "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
To: "'Yakov Rekhter'" <yakov@juniper.net>, "Srihari R. Sangli" <srihari@procket.com>
Cc: "Tsillas, Jim" <jtsillas@tenornetworks.com>, "'idr@merit.edu'" <idr@merit.edu>
Subject: RE: Are dynamic capabilities really necessary if peers support gr acef ul restart? 
Date: Thu, 11 Jul 2002 13:05:20 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2650.21)
Content-Type: text/plain; charset="iso-8859-1"
Sender: owner-idr@merit.edu
Precedence: bulk

Yakov:
  The only time I think gracefull restart is usefull is if two peers are
operating
fine, but the TCP connection is broken by the network. In the case that
either peer drops the session or either peer reboot/crash, I dont see much
value in gracefull restart.

Did I miss something?

Ben

> -----Original Message-----
> From: Yakov Rekhter [mailto:yakov@juniper.net]
> Sent: Thursday, July 11, 2002 12:55 PM
> To: Srihari R. Sangli
> Cc: Tsillas, Jim; 'idr@merit.edu'
> Subject: Re: Are dynamic capabilities really necessary if 
> peers support
> gracef ul restart? 
> 
> 
> Srihari,
> 
> > > It seems we are entering major discussion on dynamic
> > > capabilities and the can of worms is open. Can anyone
> > > think of a scenario where a dynamic capabilities
> > > negotiation is still needed if we have graceful restart?
> > 
> > Talking about address-families exchanged between BGP speakers, the
> > dynamic capability is useful for enabling a new address-family or
> > disabling a address-family w/o disrupting the session and/or other
> > address-families. It is also useful for turning ON/OFF 
> other capabilities
> > (like ORF, etc)  without much disruption. 
> 
> Just to clarify, in the presence of graceful restart the disruption
> you refer to above is just the disruption in the routing peering
> between the two speakers. There is no disruption in the 
> data/forwarding
> plane and no disruption in any other routing peering. Correct ?
> 
> Yakov.
> 


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id MAA20325 for <idr-archive@nic.merit.edu>; Thu, 11 Jul 2002 12:55:54 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id F3D0E9125B; Thu, 11 Jul 2002 12:55:32 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id BD5A79125C; Thu, 11 Jul 2002 12:55:31 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 96FD49125B for <idr@trapdoor.merit.edu>; Thu, 11 Jul 2002 12:55:30 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 802E75DE10; Thu, 11 Jul 2002 12:55:30 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from merlot.juniper.net (natint.juniper.net [207.17.136.129]) by segue.merit.edu (Postfix) with ESMTP id 2D7775DDFE for <idr@merit.edu>; Thu, 11 Jul 2002 12:55:30 -0400 (EDT)
Received: from juniper.net (garnet.juniper.net [172.17.28.17]) by merlot.juniper.net (8.11.3/8.11.3) with ESMTP id g6BGtGm64434; Thu, 11 Jul 2002 09:55:16 -0700 (PDT) (envelope-from yakov@juniper.net)
Message-Id: <200207111655.g6BGtGm64434@merlot.juniper.net>
To: "Srihari R. Sangli" <srihari@procket.com>
Cc: "Tsillas, Jim" <jtsillas@tenornetworks.com>, "'idr@merit.edu'" <idr@merit.edu>
Subject: Re: Are dynamic capabilities really necessary if peers support gracef ul restart? 
In-Reply-To: Your message of "Thu, 11 Jul 2002 09:43:21 PDT." <Pine.GSO.4.40.0207110925100.26039-100000@miata.procket.com> 
MIME-Version: 1.0
Content-Type: text/plain; charset="us-ascii"
Content-ID: <24470.1026406516.1@juniper.net>
Date: Thu, 11 Jul 2002 09:55:16 -0700
From: Yakov Rekhter <yakov@juniper.net>
Sender: owner-idr@merit.edu
Precedence: bulk

Srihari,

> > It seems we are entering major discussion on dynamic
> > capabilities and the can of worms is open. Can anyone
> > think of a scenario where a dynamic capabilities
> > negotiation is still needed if we have graceful restart?
> 
> Talking about address-families exchanged between BGP speakers, the
> dynamic capability is useful for enabling a new address-family or
> disabling a address-family w/o disrupting the session and/or other
> address-families. It is also useful for turning ON/OFF other capabilities
> (like ORF, etc)  without much disruption. 

Just to clarify, in the presence of graceful restart the disruption
you refer to above is just the disruption in the routing peering
between the two speakers. There is no disruption in the data/forwarding
plane and no disruption in any other routing peering. Correct ?

Yakov.


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id MAA19900 for <idr-archive@nic.merit.edu>; Thu, 11 Jul 2002 12:43:56 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 5112991259; Thu, 11 Jul 2002 12:43:29 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 1CB2F91362; Thu, 11 Jul 2002 12:43:29 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id B838A91259 for <idr@trapdoor.merit.edu>; Thu, 11 Jul 2002 12:43:26 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 9E4125DDED; Thu, 11 Jul 2002 12:43:26 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from dmz1.procket.com (dmz1.procket.com [65.174.124.36]) by segue.merit.edu (Postfix) with ESMTP id 77CB65DDAD for <idr@merit.edu>; Thu, 11 Jul 2002 12:43:26 -0400 (EDT)
Received: from miata.procket.com (zeus-d-1.procket.com [65.174.124.60]) by dmz1.procket.com (Postfix) with ESMTP id 8786323C32; Thu, 11 Jul 2002 09:57:01 -0700 (PDT)
Received: from localhost (srihari@localhost) by miata.procket.com (8.12.1/8.12.1) with ESMTP id g6BGhLvI025437; Thu, 11 Jul 2002 09:43:22 -0700 (PDT)
Date: Thu, 11 Jul 2002 09:43:21 -0700 (PDT)
From: "Srihari R. Sangli" <srihari@procket.com>
To: "Tsillas, Jim" <jtsillas@tenornetworks.com>
Cc: "'idr@merit.edu'" <idr@merit.edu>
Subject: Re: Are dynamic capabilities really necessary if peers support gracef ul restart?
In-Reply-To: <6B190B34070BD411ACA000B0D0214E560165B80B@newman.tenornet.com>
Message-ID: <Pine.GSO.4.40.0207110925100.26039-100000@miata.procket.com>
MIME-Version: 1.0
Content-Type: TEXT/PLAIN; charset=US-ASCII
Sender: owner-idr@merit.edu
Precedence: bulk

> It seems we are entering major discussion on dynamic
> capabilities and the can of worms is open. Can anyone
> think of a scenario where a dynamic capabilities
> negotiation is still needed if we have graceful restart?

Talking about address-families exchanged between BGP speakers, the
dynamic capability is useful for enabling a new address-family or
disabling a address-family w/o disrupting the session and/or other
address-families. It is also useful for turning ON/OFF other capabilities
(like ORF, etc)  without much disruption. Note that Graceful restart has
its own limitations for control plane changes.

I understand that the draft needs to be clear on the race condition
aspects that was mentioned earlier. Any other issues ?

srihari...



Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id LAA18266 for <idr-archive@nic.merit.edu>; Thu, 11 Jul 2002 11:57:11 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id A0B8091360; Thu, 11 Jul 2002 11:56:51 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 658CE91361; Thu, 11 Jul 2002 11:56:51 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 06F9C91360 for <idr@trapdoor.merit.edu>; Thu, 11 Jul 2002 11:56:49 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id E3EC05DE1B; Thu, 11 Jul 2002 11:56:49 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from rs-eu-exc1.rs.riverstonenet.com (unknown [193.128.15.220]) by segue.merit.edu (Postfix) with ESMTP id 7610F5DDFE for <idr@merit.edu>; Thu, 11 Jul 2002 11:56:49 -0400 (EDT)
content-class: urn:content-classes:message
MIME-Version: 1.0
Content-Type: text/plain; charset="iso-8859-1"
Subject: RE: Re>Are dynamic capabilities really necessary if peers support gracef ul restart?
X-MimeOLE: Produced By Microsoft Exchange V6.0.5762.3
Date: Thu, 11 Jul 2002 16:56:44 +0100
Message-ID: <D2ECDCD8A528BF43A01EECCBEB191DD108F332@rs-eu-exc1.rs.riverstonenet.com>
X-MS-Has-Attach: 
X-MS-TNEF-Correlator: 
Thread-Topic: Re>Are dynamic capabilities really necessary if peers support gracef ul restart?
Thread-Index: AcIo7girqiORE4w8R7O614lN4LkeHAABRuPg
From: "Stephen Waters" <Stephen.Waters@riverstonenet.com>
To: "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
Cc: <idr@merit.edu>
Sender: owner-idr@merit.edu
Precedence: bulk
Content-Transfer-Encoding: 8bit
X-MIME-Autoconverted: from quoted-printable to 8bit by nic.merit.edu id LAA18266

I don't see much point in dynamic capability negotiation if you can't turn things on and off,
(or off and on) to your heart's content. 

A network manager wanting to change the state or extent of a capability would appreciate doing that within a bgp session, wouldn't they?

Steve.  

> -----Original Message-----
> From: Abarbanel, Benjamin [mailto:Benjamin.Abarbanel@Marconi.com]
> Sent: Thursday, July 11, 2002 4:15 PM
> To: 'Tsillas, Jim'; 'idr@merit.edu'
> Cc: 'enke@redback.com'
> Subject: RE: Re>Are dynamic capabilities really necessary if peers
> support gracef ul restart?
> 
> 
> Jim and Enke:
> 
> I think the issue with dynamic capabilities is that the 
> current draft is not
> clear on what to do in the case of race conditions with the capability
> messages 
> between two routers. The can of worms you speak of will bite 
> us if we do not
> clearify this issue. Each vendor will choose to take a 
> different action 
> when this happens, I would prefer that the draft state a way 
> to handle race
> conditions and thus avoid the whole mess of either reseting 
> the session or
> recovering the CAPABILITY state.
> 
> I have a question on the current 
> draft-ietf-idr-dynamic-cap-02.txt, section
> 7 - Operation, 3rd and 4th paragraph:
> 
>     "The Dynamic Capability itself can not be revised 
> dynamically via a
>      CAPABILITY message. The lifetime of the Dynamic Capability is the
>      duration of the BGP session in which the capability is 
> advertised.
> 
>      Upon receiving a CAPABILITY message from its peer, the 
> BGP speaker
>      shall update the capabilities previously received from that peer
>      based on the "Action" in the message, and then function 
> in accordance
>      with the revised capabilities for the peer. Any unrecognized
>      capabilities in the message should be ignored."
> 
> Sounds like the second paragraph contradicts the first one. 
> So I cannot 
> tell if I can renegotiate and replace the CAPABILITY or not 
> during this
> session?
> 
> Ben
> 
> > -----Original Message-----
> > From: Tsillas, Jim [mailto:jtsillas@tenornetworks.com]
> > Sent: Thursday, July 11, 2002 9:52 AM
> > To: 'idr@merit.edu'
> > Subject: FW: Re>Are dynamic capabilities really necessary if peers
> > support gracef ul restart?
> > 
> > 
> > 
> > Bruno,
> > 
> > if graceful restart is implemented for all supported
> > address families and can be manually initiated it
> > seems to me that much of the functionality of capabilities
> > negotiation can be achieved by gracefully restarting the
> > session. I'm trying to think of a scenario where this
> > won't work. Granted that bouncing the session, even with
> > graceful restart, will cause large numbers of messages
> > to be sent between peers (which is something to avoid).
> > 
> > By can of worms I am implying unknown race conditions
> > which can result by renegotiating certain capabilities.
> > For some capabilities it may be harder than others to
> > identify and handle all the race conditions.
> > 
> > -Jim.
> > 
> > -----Original Message-----
> > From: Bruno Rijsman [mailto:bwarijsman@hotmail.com]
> > Sent: Thursday, July 11, 2002 8:30 AM
> > To: idr@merit.edu
> > Cc: jtsillas@tenornetworks.com
> > Subject: Re>Are dynamic capabilities really necessary if 
> peers support
> > gracef ul restart?
> > 
> > 
> > Re> It seems we are entering major discussion on dynamic
> > Re> capabilities and the can of worms is open. Can anyone
> > Re> think of a scenario where a dynamic capabilities
> > Re> negotiation is still needed if we have graceful restart?
> > 
> > One example is: it allows you to activate or de-activate an
> > address family (e.g. ipv6 or vpnv4) on a session which is
> > already established and exchanging some other address family
> > (e.g. ipv4) without having to bounce that session.
> > 
> > More generally speaking, it allows you to activate or
> > de-activate features on a session without needing to bounce the
> > session.
> > 
> > Not needing to bounce a session a good thing.
> > 
> > It seems from your e-mail that you are implying that 
> graceful restart
> > removes the need or reduces the usefulness of dynamic capability
> > negotiation. Could you please clarify that?
> > 
> > Having personally implemented dynamic capability negotiation
> > I can tell you that the "can of worms" is not so bad: all you
> > have to do is to be willing to accept and silently ignore certain
> > messages after withdrawing a capability (e.g. you have to be willing
> > to accept and ignore vpnv4 updates if you recently withdrew the
> > vpnv4 multi-protocol capability).
> > 
> > 
> > 
> > _________________________________________________________________
> > Send and receive Hotmail on your mobile device: 
http://mobile.msn.com
> 


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id LAA16916 for <idr-archive@nic.merit.edu>; Thu, 11 Jul 2002 11:17:14 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 49F699125A; Thu, 11 Jul 2002 11:15:39 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 179EB9135E; Thu, 11 Jul 2002 11:15:39 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 016F99125A for <idr@trapdoor.merit.edu>; Thu, 11 Jul 2002 11:15:37 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id D36BD5DE29; Thu, 11 Jul 2002 11:15:37 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from mailgate.pit.comms.marconi.com (mailgate.pit.comms.marconi.com [169.144.68.6]) by segue.merit.edu (Postfix) with ESMTP id 5C89C5DE04 for <idr@merit.edu>; Thu, 11 Jul 2002 11:15:37 -0400 (EDT)
Received: from mailman.pit.comms.marconi.com (mailman.pit.comms.marconi.com [169.144.2.12]) by mailgate.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id LAA03212; Thu, 11 Jul 2002 11:15:25 -0400 (EDT)
Received: from whq-msgrtr-01.pit.comms.marconi.com (whq-msgrtr-01.pit.comms.marconi.com [169.144.2.221]) by mailman.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id LAA05958; Thu, 11 Jul 2002 11:15:25 -0400 (EDT)
Received: by whq-msgrtr-01.pit.comms.marconi.com with Internet Mail Service (5.5.2650.21) id <3W98R75Y>; Thu, 11 Jul 2002 11:15:25 -0400
Message-ID: <39469E08BD83D411A3D900204840EC5582278C@vie-msgusr-01.dc.fore.com>
From: "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
To: "'Tsillas, Jim'" <jtsillas@tenornetworks.com>, "'idr@merit.edu'" <idr@merit.edu>
Cc: "'enke@redback.com'" <enke@redback.com>
Subject: RE: Re>Are dynamic capabilities really necessary if peers support gracef ul restart?
Date: Thu, 11 Jul 2002 11:15:15 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2650.21)
Content-Type: text/plain; charset="iso-8859-1"
Sender: owner-idr@merit.edu
Precedence: bulk

Jim and Enke:

I think the issue with dynamic capabilities is that the current draft is not
clear on what to do in the case of race conditions with the capability
messages 
between two routers. The can of worms you speak of will bite us if we do not
clearify this issue. Each vendor will choose to take a different action 
when this happens, I would prefer that the draft state a way to handle race
conditions and thus avoid the whole mess of either reseting the session or
recovering the CAPABILITY state.

I have a question on the current draft-ietf-idr-dynamic-cap-02.txt, section
7 - Operation, 3rd and 4th paragraph:

    "The Dynamic Capability itself can not be revised dynamically via a
     CAPABILITY message. The lifetime of the Dynamic Capability is the
     duration of the BGP session in which the capability is advertised.

     Upon receiving a CAPABILITY message from its peer, the BGP speaker
     shall update the capabilities previously received from that peer
     based on the "Action" in the message, and then function in accordance
     with the revised capabilities for the peer. Any unrecognized
     capabilities in the message should be ignored."

Sounds like the second paragraph contradicts the first one. So I cannot 
tell if I can renegotiate and replace the CAPABILITY or not during this
session?

Ben

> -----Original Message-----
> From: Tsillas, Jim [mailto:jtsillas@tenornetworks.com]
> Sent: Thursday, July 11, 2002 9:52 AM
> To: 'idr@merit.edu'
> Subject: FW: Re>Are dynamic capabilities really necessary if peers
> support gracef ul restart?
> 
> 
> 
> Bruno,
> 
> if graceful restart is implemented for all supported
> address families and can be manually initiated it
> seems to me that much of the functionality of capabilities
> negotiation can be achieved by gracefully restarting the
> session. I'm trying to think of a scenario where this
> won't work. Granted that bouncing the session, even with
> graceful restart, will cause large numbers of messages
> to be sent between peers (which is something to avoid).
> 
> By can of worms I am implying unknown race conditions
> which can result by renegotiating certain capabilities.
> For some capabilities it may be harder than others to
> identify and handle all the race conditions.
> 
> -Jim.
> 
> -----Original Message-----
> From: Bruno Rijsman [mailto:bwarijsman@hotmail.com]
> Sent: Thursday, July 11, 2002 8:30 AM
> To: idr@merit.edu
> Cc: jtsillas@tenornetworks.com
> Subject: Re>Are dynamic capabilities really necessary if peers support
> gracef ul restart?
> 
> 
> Re> It seems we are entering major discussion on dynamic
> Re> capabilities and the can of worms is open. Can anyone
> Re> think of a scenario where a dynamic capabilities
> Re> negotiation is still needed if we have graceful restart?
> 
> One example is: it allows you to activate or de-activate an
> address family (e.g. ipv6 or vpnv4) on a session which is
> already established and exchanging some other address family
> (e.g. ipv4) without having to bounce that session.
> 
> More generally speaking, it allows you to activate or
> de-activate features on a session without needing to bounce the
> session.
> 
> Not needing to bounce a session a good thing.
> 
> It seems from your e-mail that you are implying that graceful restart
> removes the need or reduces the usefulness of dynamic capability
> negotiation. Could you please clarify that?
> 
> Having personally implemented dynamic capability negotiation
> I can tell you that the "can of worms" is not so bad: all you
> have to do is to be willing to accept and silently ignore certain
> messages after withdrawing a capability (e.g. you have to be willing
> to accept and ignore vpnv4 updates if you recently withdrew the
> vpnv4 multi-protocol capability).
> 
> 
> 
> _________________________________________________________________
> Send and receive Hotmail on your mobile device: http://mobile.msn.com
> 


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id KAA16419 for <idr-archive@nic.merit.edu>; Thu, 11 Jul 2002 10:47:12 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 5512B9135D; Thu, 11 Jul 2002 10:45:50 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 243379135E; Thu, 11 Jul 2002 10:45:50 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id EE7029135D for <idr@trapdoor.merit.edu>; Thu, 11 Jul 2002 10:45:48 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id CEB105DE13; Thu, 11 Jul 2002 10:45:48 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from titanium.zebra.org (sdsl-64-139-11-202.dsl.sca.megapath.net [64.139.11.202]) by segue.merit.edu (Postfix) with ESMTP id E2BBB5DE04 for <idr@merit.edu>; Thu, 11 Jul 2002 10:45:47 -0400 (EDT)
Received: from titanium.zebra.org (IDENT:kunihiro@titanium [127.0.0.1]) by titanium.zebra.org (8.9.3/8.9.3) with ESMTP id KAA00786; Thu, 11 Jul 2002 10:47:09 -0400
Date: Thu, 11 Jul 2002 07:47:09 -0700
Message-ID: <m27kk21fv6.wl@titanium.zebra.org>
From: Kunihiro Ishiguro <kunihiro@zebra.org>
To: "Bruno Rijsman" <bwarijsman@hotmail.com>
Cc: idr@merit.edu, Pawel.Fryczke@intel.com
Subject: Re: Aggregation of the overlapping routes and ATOMIC_AGGREGATE
In-Reply-To: <F127Aabut81LU0DxJHN0000ea48@hotmail.com>
References: <F127Aabut81LU0DxJHN0000ea48@hotmail.com>
User-Agent: Wanderlust/2.8.1 (Something) SEMI/1.14.3 (Ushinoya) FLIM/1.14.2 (Yagi-Nishiguchi) APEL/10.3 Emacs/21.2.50 (powerpc-unknown-linux-gnu) MULE/5.0 (SAKAKI)
MIME-Version: 1.0 (generated by SEMI 1.14.3 - "Ushinoya")
Content-Type: text/plain; charset=US-ASCII
Sender: owner-idr@merit.edu
Precedence: bulk

>Re> Consider scenario:
>Re> BGP Router is configured to aggregate routes 192.0.0.0/8
>Re> This router receives 2 NLRIs: 192.0.0.0/8 and 192.1.0.0/16
>Re> Received NLRIs are installed in RIB, but are suppressed by
>Re> aggregation and are not advertised. As a result of aggregation
>Re> the router sends update that contains only one NLRI: 192.0.0.0/8
>Re> Does this update should contain ATOMIC_AGGREGATE attached?
>
>I my experience this is what the major deployed implementations do:
>
>1) If the aggregate is an "as-set" aggregate (i.e. the as-path and
>   other path attributes of the aggregate are constructed by
>   "summarizing" the attributes of the covered routes) then the
>   ATOMIC_AGGREGATE attribute is not attached.
>
>2) If the aggregate is not an "as-set" aggregate (i.e. the as-path
>   and other path attributes are constructed as if the aggregate
>   route is a locally originated route) then the ATOMIC_AGGREGATE
>   attribute is attached.

Adding to that, when "as-set" is not specified, created aggregated
route has origin IGP.  For example RFC1771 and the latest draft said

      ORIGIN attribute: If at least one route among routes that are
      aggregated has ORIGIN with the value INCOMPLETE, then the
      aggregated route must have the ORIGIN attribute with the value
      INCOMPLETE. Otherwise, if at least one route among routes that are
      aggregated has ORIGIN with the value EGP, then the aggregated
      route must have the origin attribute with the value EGP. In all
      other case the value of the ORIGIN attribute of the aggregated
      route is INTERNAL.

When "as-set" is specifed above rule is applied but when "as-set" is
not specified, the aggregate route always has orgin IGP like self
originated route.  This is little bit implementation specific but some
people may consider.
-- 
Kunihiro Ishiguro


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id KAA16271 for <idr-archive@nic.merit.edu>; Thu, 11 Jul 2002 10:29:33 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id F38A89135C; Thu, 11 Jul 2002 10:28:55 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id BE8629135D; Thu, 11 Jul 2002 10:28:54 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 9CD639135C for <idr@trapdoor.merit.edu>; Thu, 11 Jul 2002 10:28:53 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 6843E5DE10; Thu, 11 Jul 2002 10:28:53 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from mailgate.pit.comms.marconi.com (mailgate.pit.comms.marconi.com [169.144.68.6]) by segue.merit.edu (Postfix) with ESMTP id E365F5DD91 for <idr@merit.edu>; Thu, 11 Jul 2002 10:28:52 -0400 (EDT)
Received: from mailman.pit.comms.marconi.com (mailman.pit.comms.marconi.com [169.144.2.12]) by mailgate.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id KAA29063; Thu, 11 Jul 2002 10:28:50 -0400 (EDT)
Received: from whq-msgrtr-01.pit.comms.marconi.com (whq-msgrtr-01.pit.comms.marconi.com [169.144.2.221]) by mailman.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id KAA24830; Thu, 11 Jul 2002 10:28:50 -0400 (EDT)
Received: by whq-msgrtr-01.pit.comms.marconi.com with Internet Mail Service (5.5.2650.21) id <N5N1NT0D>; Thu, 11 Jul 2002 10:28:50 -0400
Message-ID: <39469E08BD83D411A3D900204840EC5582278A@vie-msgusr-01.dc.fore.com>
From: "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
To: "'Chandrashekhar Appanna'" <achandra@cisco.com>, "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
Cc: "'Bruno Rijsman'" <bwarijsman@hotmail.com>, idr@merit.edu
Subject: RE: INFORM proposal and MP-BGP (RFC 2858)
Date: Thu, 11 Jul 2002 10:28:42 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2650.21)
Content-Type: text/plain; charset="iso-8859-1"
Sender: owner-idr@merit.edu
Precedence: bulk

 
> 
> You can be tolerant in your implementation today if you choose
> and if your customers will accept it. I do not see how you hope
> that a new INFORM message will let an implementation correct itself 
> when the implementation failed to do the right thing in the first 
> place.
> 

The case I am refering to is the "ships in the night" condition where the
messages cross on the wire, thus causing improper processing. With the use
of the INFORM like message, the two routers can get synchronized on what 
capabilities to honor without a session restart. Thus preserving the state
of all their routes.

Ben

> -
> Chandra A.
> 
> 
> On Wed, Jul 10, 2002 at 02:49:12PM -0400, Abarbanel, Benjamin wrote:
> > I am in favour of providing a tolerance counter that will 
> drop the session
> > only after a max number of bad update messages are 
> received. Thus if the
> > offending
> > peer does not correct or adjust itself in terms of 
> capabilities, after a
> > number
> > of INFORM responses are sent to it, then and only then 
> should the receiving
> > router drop the session. This will cover the case, where 
> the INFORM is also
> > mis-processed in the "ships in the night" scenario.
> > 
> > Ben
> > 
> > 
> > > -----Original Message-----
> > > From: Chandrashekhar Appanna [mailto:achandra@cisco.com]
> > > Sent: Wednesday, July 10, 2002 2:09 PM
> > > To: Abarbanel, Benjamin
> > > Cc: 'Bruno Rijsman'; idr@merit.edu
> > > Subject: Re: INFORM proposal and MP-BGP (RFC 2858)
> > > 
> > > 
> > > 
> > > But that and the ability to INFORM is not the same. In the
> > > example you provide, the INFORM is not providing any extra 
> > > advantage. If the peer did not process/rcv the withdraw 
> > > why do you hope that it will rcv/process the INFORM?
> > > 
> > > I think the behavior of the withdraw itself is the problem here,
> > > It has to handle the case of a message cross that you describe, 
> > > in a well defined manner. 
> > > 
> > > -
> > > Chandra A.
> > > 
> > > On Wed, Jul 10, 2002 at 11:50:32AM -0400, Abarbanel, 
> Benjamin wrote:
> > > > Correct. The protocol has to be elastic, since it carries 
> > > so much weight in 
> > > > controlling stability in the Internet.
> > > > 
> > > > Ben
> > > > 
> > > > > -----Original Message-----
> > > > > From: Bruno Rijsman [mailto:bwarijsman@hotmail.com]
> > > > > Sent: Wednesday, July 10, 2002 11:47 AM
> > > > > To: idr@merit.edu
> > > > > Cc: Benjamin.Abarbanel@Marconi.com
> > > > > Subject: RE: INFORM proposal and MP-BGP (RFC 2858)
> > > > > 
> > > > > 
> > > > > Re> Just because you receive an Update message with 
> > > capabilities that
> > > > > Re> were not negotiated at the OPEN message, should not 
> > > require the
> > > > > Re> peering session to terminate. The affect of all that 
> > > peer routes
> > > > > Re> going down will destabilize the network.
> > > > > 
> > > > > Also, keep in mind that in the case of dynamic capability 
> > > negotiation
> > > > > is possible and perfectly legal to receive a message 
> > > which relies on
> > > > > a capability which was recently withdrawn. Consider for 
> > > example the
> > > > > case where a route-refresh message and a capability 
> message which
> > > > > withdraws the route-refresh capability cross each other.
> > > > > 
> > > > > 
> > > > > 
> > > > > 
> _________________________________________________________________
> > > > > Send and receive Hotmail on your mobile device: 
> > http://mobile.msn.com
> > > > 
> 


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id JAA15799 for <idr-archive@nic.merit.edu>; Thu, 11 Jul 2002 09:57:38 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 8421991359; Thu, 11 Jul 2002 09:57:10 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 531D39135B; Thu, 11 Jul 2002 09:57:10 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 49D3A91359 for <idr@trapdoor.merit.edu>; Thu, 11 Jul 2002 09:57:09 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 384945DE05; Thu, 11 Jul 2002 09:57:09 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from tenornetworks.com (rtu.tenornetworks.com [63.77.213.2]) by segue.merit.edu (Postfix) with ESMTP id BA1A65DD91 for <idr@merit.edu>; Thu, 11 Jul 2002 09:57:08 -0400 (EDT)
Received: from tenornet.com (newman [192.168.0.185]) by tenornetworks.com (Switch-2.1.3/Switch-2.1.0) with ESMTP id g6BDv7d25126 for <idr@merit.edu>; Thu, 11 Jul 2002 09:57:07 -0400 (EDT)
Received: from 192.168.0.185 by tenornet.com; THU, 11 Jul 2002 09:52:28 -0700
Received: by newman.tenornet.com with Internet Mail Service (5.5.2650.21) id <N1HXZYQX>; Thu, 11 Jul 2002 09:52:27 -0400
Message-ID: <6B190B34070BD411ACA000B0D0214E560165B80D@newman.tenornet.com>
From: "Tsillas, Jim" <jtsillas@tenornetworks.com>
To: "'idr@merit.edu'" <idr@merit.edu>
Subject: FW: Re>Are dynamic capabilities really necessary if peers support gracef ul restart?
Date: Thu, 11 Jul 2002 09:52:27 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2650.21)
Content-Type: text/plain; charset="iso-8859-1"
Sender: owner-idr@merit.edu
Precedence: bulk

Bruno,

if graceful restart is implemented for all supported
address families and can be manually initiated it
seems to me that much of the functionality of capabilities
negotiation can be achieved by gracefully restarting the
session. I'm trying to think of a scenario where this
won't work. Granted that bouncing the session, even with
graceful restart, will cause large numbers of messages
to be sent between peers (which is something to avoid).

By can of worms I am implying unknown race conditions
which can result by renegotiating certain capabilities.
For some capabilities it may be harder than others to
identify and handle all the race conditions.

-Jim.

-----Original Message-----
From: Bruno Rijsman [mailto:bwarijsman@hotmail.com]
Sent: Thursday, July 11, 2002 8:30 AM
To: idr@merit.edu
Cc: jtsillas@tenornetworks.com
Subject: Re>Are dynamic capabilities really necessary if peers support
gracef ul restart?


Re> It seems we are entering major discussion on dynamic
Re> capabilities and the can of worms is open. Can anyone
Re> think of a scenario where a dynamic capabilities
Re> negotiation is still needed if we have graceful restart?

One example is: it allows you to activate or de-activate an
address family (e.g. ipv6 or vpnv4) on a session which is
already established and exchanging some other address family
(e.g. ipv4) without having to bounce that session.

More generally speaking, it allows you to activate or
de-activate features on a session without needing to bounce the
session.

Not needing to bounce a session a good thing.

It seems from your e-mail that you are implying that graceful restart
removes the need or reduces the usefulness of dynamic capability
negotiation. Could you please clarify that?

Having personally implemented dynamic capability negotiation
I can tell you that the "can of worms" is not so bad: all you
have to do is to be willing to accept and silently ignore certain
messages after withdrawing a capability (e.g. you have to be willing
to accept and ignore vpnv4 updates if you recently withdrew the
vpnv4 multi-protocol capability).



_________________________________________________________________
Send and receive Hotmail on your mobile device: http://mobile.msn.com



Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id IAA14955 for <idr-archive@nic.merit.edu>; Thu, 11 Jul 2002 08:30:51 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 8A74291356; Thu, 11 Jul 2002 08:30:11 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 1D8AF91358; Thu, 11 Jul 2002 08:30:10 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id BC00C91356 for <idr@trapdoor.merit.edu>; Thu, 11 Jul 2002 08:30:08 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 0F5E05DDD9; Thu, 11 Jul 2002 08:30:07 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from hotmail.com (f98.law4.hotmail.com [216.33.149.98]) by segue.merit.edu (Postfix) with ESMTP id 5D1005DDD6 for <idr@merit.edu>; Thu, 11 Jul 2002 08:30:06 -0400 (EDT)
Received: from mail pickup service by hotmail.com with Microsoft SMTPSVC; Thu, 11 Jul 2002 05:30:05 -0700
Received: from 65.194.140.2 by lw4fd.law4.hotmail.msn.com with HTTP; Thu, 11 Jul 2002 12:30:05 GMT
X-Originating-IP: [65.194.140.2]
From: "Bruno Rijsman" <bwarijsman@hotmail.com>
To: idr@merit.edu
Cc: jtsillas@tenornetworks.com
Subject: Re>Are dynamic capabilities really necessary if peers support gracef ul restart?
Date: Thu, 11 Jul 2002 08:30:05 -0400
Mime-Version: 1.0
Content-Type: text/plain; format=flowed
Message-ID: <F98FydQPW6Ft5BzMF5w0000ead8@hotmail.com>
X-OriginalArrivalTime: 11 Jul 2002 12:30:05.0923 (UTC) FILETIME=[AFE99B30:01C228D6]
Sender: owner-idr@merit.edu
Precedence: bulk

Re> It seems we are entering major discussion on dynamic
Re> capabilities and the can of worms is open. Can anyone
Re> think of a scenario where a dynamic capabilities
Re> negotiation is still needed if we have graceful restart?

One example is: it allows you to activate or de-activate an
address family (e.g. ipv6 or vpnv4) on a session which is
already established and exchanging some other address family
(e.g. ipv4) without having to bounce that session.

More generally speaking, it allows you to activate or
de-activate features on a session without needing to bounce the
session.

Not needing to bounce a session a good thing.

It seems from your e-mail that you are implying that graceful restart
removes the need or reduces the usefulness of dynamic capability
negotiation. Could you please clarify that?

Having personally implemented dynamic capability negotiation
I can tell you that the "can of worms" is not so bad: all you
have to do is to be willing to accept and silently ignore certain
messages after withdrawing a capability (e.g. you have to be willing
to accept and ignore vpnv4 updates if you recently withdrew the
vpnv4 multi-protocol capability).



_________________________________________________________________
Send and receive Hotmail on your mobile device: http://mobile.msn.com



Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id IAA14821 for <idr-archive@nic.merit.edu>; Thu, 11 Jul 2002 08:21:12 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 7AEB591214; Thu, 11 Jul 2002 08:20:44 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 48B909131B; Thu, 11 Jul 2002 08:20:44 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 0DDA291214 for <idr@trapdoor.merit.edu>; Thu, 11 Jul 2002 08:20:43 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id EBC7D5DDD6; Thu, 11 Jul 2002 08:20:42 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from tenornetworks.com (rtu.tenornetworks.com [63.77.213.2]) by segue.merit.edu (Postfix) with ESMTP id 7D6BB5DD91 for <idr@merit.edu>; Thu, 11 Jul 2002 08:20:42 -0400 (EDT)
Received: from tenornet.com (newman [192.168.0.185]) by tenornetworks.com (Switch-2.1.3/Switch-2.1.0) with ESMTP id g6BCKed20352 for <idr@merit.edu>; Thu, 11 Jul 2002 08:20:40 -0400 (EDT)
Received: from 192.168.0.185 by tenornet.com; THU, 11 Jul 2002 08:16:02 -0700
Received: by newman.tenornet.com with Internet Mail Service (5.5.2650.21) id <N1HXZYJ9>; Thu, 11 Jul 2002 08:16:02 -0400
Message-ID: <6B190B34070BD411ACA000B0D0214E560165B80B@newman.tenornet.com>
From: "Tsillas, Jim" <jtsillas@tenornetworks.com>
To: "'idr@merit.edu'" <idr@merit.edu>
Subject: Are dynamic capabilities really necessary if peers support gracef ul restart?
Date: Thu, 11 Jul 2002 08:16:02 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2650.21)
Content-Type: text/plain; charset="iso-8859-1"
Sender: owner-idr@merit.edu
Precedence: bulk

It seems we are entering major discussion on dynamic
capabilities and the can of worms is open. Can anyone
think of a scenario where a dynamic capabilities
negotiation is still needed if we have graceful restart?

thanks,
-Jim.


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id GAA11938 for <idr-archive@nic.merit.edu>; Thu, 11 Jul 2002 06:48:03 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id CE30991258; Thu, 11 Jul 2002 06:47:28 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 99D7D9131B; Thu, 11 Jul 2002 06:47:28 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 3C96391258 for <idr@trapdoor.merit.edu>; Thu, 11 Jul 2002 06:47:27 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 1594B5DDAD; Thu, 11 Jul 2002 06:47:27 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from hotmail.com (f127.law4.hotmail.com [216.33.149.127]) by segue.merit.edu (Postfix) with ESMTP id BA71F5DDA5 for <idr@merit.edu>; Thu, 11 Jul 2002 06:47:26 -0400 (EDT)
Received: from mail pickup service by hotmail.com with Microsoft SMTPSVC; Thu, 11 Jul 2002 03:47:26 -0700
Received: from 65.194.140.2 by lw4fd.law4.hotmail.msn.com with HTTP; Thu, 11 Jul 2002 10:47:26 GMT
X-Originating-IP: [65.194.140.2]
From: "Bruno Rijsman" <bwarijsman@hotmail.com>
To: idr@merit.edu
Cc: Pawel.Fryczke@intel.com
Subject: Aggregation of the overlapping routes and ATOMIC_AGGREGATE
Date: Thu, 11 Jul 2002 06:47:26 -0400
Mime-Version: 1.0
Content-Type: text/plain; format=flowed
Message-ID: <F127Aabut81LU0DxJHN0000ea48@hotmail.com>
X-OriginalArrivalTime: 11 Jul 2002 10:47:26.0393 (UTC) FILETIME=[588BEA90:01C228C8]
Sender: owner-idr@merit.edu
Precedence: bulk

Re> Consider scenario:
Re> BGP Router is configured to aggregate routes 192.0.0.0/8
Re> This router receives 2 NLRIs: 192.0.0.0/8 and 192.1.0.0/16
Re> Received NLRIs are installed in RIB, but are suppressed by
Re> aggregation and are not advertised. As a result of aggregation
Re> the router sends update that contains only one NLRI: 192.0.0.0/8
Re> Does this update should contain ATOMIC_AGGREGATE attached?

I my experience this is what the major deployed implementations do:

1) If the aggregate is an "as-set" aggregate (i.e. the as-path and
   other path attributes of the aggregate are constructed by
   "summarizing" the attributes of the covered routes) then the
   ATOMIC_AGGREGATE attribute is not attached.

2) If the aggregate is not an "as-set" aggregate (i.e. the as-path
   and other path attributes are constructed as if the aggregate
   route is a locally originated route) then the ATOMIC_AGGREGATE
   attribute is attached.

I have seen nothing in RFC1771 or the current draft that describes
this behavior but -as far as I know- that is what happens in real
life.


_________________________________________________________________
Chat with friends online, try MSN Messenger: http://messenger.msn.com



Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id GAA11516 for <idr-archive@nic.merit.edu>; Thu, 11 Jul 2002 06:35:06 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 35FD691257; Thu, 11 Jul 2002 06:34:28 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 0196491258; Thu, 11 Jul 2002 06:34:27 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id B0E2291257 for <idr@trapdoor.merit.edu>; Thu, 11 Jul 2002 06:34:26 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 9A79C5DDA5; Thu, 11 Jul 2002 06:34:26 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from mail1.hd.intel.com (hdfdns01.hd.intel.com [192.52.58.10]) by segue.merit.edu (Postfix) with ESMTP id 2B95E5DD91 for <idr@merit.edu>; Thu, 11 Jul 2002 06:34:26 -0400 (EDT)
Received: from fmsmsxvs040.fm.intel.com (fmsmsxvs040.fm.intel.com [132.233.42.124]) by mail1.hd.intel.com (8.11.6/8.11.6/d: solo.mc,v 1.42 2002/05/23 22:21:11 root Exp $) with SMTP id g6BAYP019129 for <idr@merit.edu>; Thu, 11 Jul 2002 10:34:25 GMT
Received: from alpha.igk.intel.com ([172.28.168.68]) by fmsmsxvs040.fm.intel.com (NAVGW 2.5.2.11) with SMTP id M2002071103334005579 for <idr@merit.edu>; Thu, 11 Jul 2002 03:33:40 -0700
Received: by alpha.igk.intel.com with Internet Mail Service (5.5.2653.19) id <L7B07S76>; Thu, 11 Jul 2002 12:32:15 +0200
Message-ID: <413FBB0BA5AED1119122000083234B1A015A942E@alpha.igk.intel.com>
From: "Fryczke, Pawel" <Pawel.Fryczke@intel.com>
To: idr@merit.edu
Subject: Aggregation of the overlapping routes and ATOMIC_AGGREGATE
Date: Thu, 11 Jul 2002 12:32:14 +0200
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
Content-Type: text/plain; charset="ISO-8859-2"
Sender: owner-idr@merit.edu
Precedence: bulk

Latest BGP draft states:
"If a BGP speaker receives overlapping routes, the Decision Process
MUST consider both routes based on the configured acceptance policy.
If both a less and a more specific route are accepted, then the
Decision Process MUST either install both the less and the more
specific routes or it MUST aggregate the two routes and install the
aggregated route, provided that both routes have the same value of
the NEXT_HOP attribute.
If a BGP speaker chooses to aggregate, then it MUST add
ATOMIC_AGGREGATE attribute to the route."

What does happen if BGP router installs both routes in RIB and next
aggregates them ?
Should ATOMIC_AGGREGATE be added to aggregate or not ?

Consider scenario:
BGP Router is configured to aggregate routes 192.0.0.0/8
This router receives 2 NLRIs: 192.0.0.0/8 and 192.1.0.0/16
Received NLRIs are installed in RIB, but are suppressed by aggregation and
are not advertised.
As a result of aggregation the router sends update that contains only one
NLRI: 192.0.0.0/8
Does this update should contain ATOMIC_AGGREGATE attached ?

Why does RFC assume that aggregation is done during decision process?
It is possible to aggregate during update process or some other process,
isn't it?

Pawel


Received: from trapdoor.merit.edu (trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id SAA17725 for <idr-archive@nic.merit.edu>; Wed, 10 Jul 2002 18:35:33 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id CB6909134E; Wed, 10 Jul 2002 18:32:44 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id CC4059134F; Wed, 10 Jul 2002 18:32:25 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 93EFE9134E for <idr@trapdoor.merit.edu>; Wed, 10 Jul 2002 18:31:37 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 7C8CE5DDD5; Wed, 10 Jul 2002 18:31:37 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from celox-ma1-ems1.celoxnetworks.com (celox-ma1-imap1.celoxnetworks.com [12.40.60.233]) by segue.merit.edu (Postfix) with ESMTP id 4132A5DD94 for <idr@merit.edu>; Wed, 10 Jul 2002 18:31:37 -0400 (EDT)
Received: by celox-ma1-ems1.celoxnetworks.com with Internet Mail Service (5.5.2653.19) id <3T6KDRZB>; Wed, 10 Jul 2002 18:31:36 -0400
Message-ID: <1117F7D44159934FB116E36F4ABF221B020AF857@celox-ma1-ems1.celoxnetworks.com>
From: "Natale, Jonathan" <JNatale@celoxnetworks.com>
To: "'Gargi Nalawade'" <gargi@cisco.com>, idr@merit.edu
Subject: RE: INFORM proposal and MP-BGP (RFC 2858)
Date: Wed, 10 Jul 2002 18:31:36 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
Content-Type: text/plain
Sender: owner-idr@merit.edu
Precedence: bulk

Good example.

I had also suggested a new NOTIFICATION for the case when it IS configured
to kill the con:



-----Original Message-----
From: Natale, Jonathan 
Sent: Monday, April 29, 2002 4:13 PM
To: 'Russ White'; Pedro Roque Marques
Cc: idr@merit.edu
Subject: RE: BGP-17 Draft Comments...

I would like to suggest another UPDATE Message Error subcode be introduced
~= "Maximum Number of Allowed Prefixes Exceeded" (if this has been done,
then please direct me to where it is documented).  Currently ( 12.1(5)T4 ),
a "Malformed Attribute List" notification is sent when the user configured
maximum number of allowed prefixes is exceeded.  Alternatively, this current
(or the "correct" behavior) should be specified in the draft-ietf-idr-bgp4
draft which recently added:

   "A BGP speaker may support the ability to impose an (locally
   configured) upper bound on the number of address prefixes the speaker
   is willing to accept from a neighbor. When the upper bound is
   reached, the speaker (under control of local configuration) may
   either (a) discard new address prefixes from the neighbor, or (b)
   terminate the BGP peering with the neighbor. If the BGP speaker
   decides to terminate its peering with a neighbor because the number
   of address prefixes received from the neighbor exceeds the locally
   configured upper bound, then the speaker must send to the neighbor a
   NOTIFICATION message with the Error Code Cease."


-$0.02


From: Gargi Nalawade [mailto:gargi@cisco.com] 
Sent: Wednesday, July 10, 2002 5:12 PM
To: idr@merit.edu
Subject: Re: INFORM proposal and MP-BGP (RFC 2858)

Hi All,

A few clarifications about the INFORM message & its motivations. This is
supposed to be purely an informational message. Does not propose to affect
the functionality of the protocol. Rather, it proposes to give flexibility
to implementations in that; they dont need to reset the sessions, just so
they can inform the peer of a specific noteworthy event happening. 

Take the example of max-prefix limit. If the limit is hit, the receiving
peer would just generate a local warning (if so configured) and continue
with the session. There is no mechanism of informing the other peer that
it has exceeded the prefix-limit value, other than churning through the log,
noticing the warnin & then picking up the telephone (or oops - writing an
email) & informing the peer's administrator.

This is just one such hypothetical example where the INFORM would be useful.
But the INFORM message gives us the capability to do just that. There are 
several other such scenarios seen in deployment, that necessitate the
existence of just such a mechanism.

-Gargi


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id SAA17718 for <idr-archive@nic.merit.edu>; Wed, 10 Jul 2002 18:35:32 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 245A59134F; Wed, 10 Jul 2002 18:34:14 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id DF5FA9135A; Wed, 10 Jul 2002 18:34:13 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 817C59134F for <idr@trapdoor.merit.edu>; Wed, 10 Jul 2002 18:34:07 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 6E60B5DD97; Wed, 10 Jul 2002 18:34:07 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from celox-ma1-ems1.celoxnetworks.com (celox-ma1-imap1.celoxnetworks.com [12.40.60.233]) by segue.merit.edu (Postfix) with ESMTP id 33E075DD94 for <idr@merit.edu>; Wed, 10 Jul 2002 18:34:07 -0400 (EDT)
Received: by celox-ma1-ems1.celoxnetworks.com with Internet Mail Service (5.5.2653.19) id <3T6KDRZM>; Wed, 10 Jul 2002 18:34:06 -0400
Message-ID: <1117F7D44159934FB116E36F4ABF221B020AF858@celox-ma1-ems1.celoxnetworks.com>
From: "Natale, Jonathan" <JNatale@celoxnetworks.com>
To: "'Abarbanel, Benjamin'" <Benjamin.Abarbanel@Marconi.com>, "'Gargi Nalawade'" <gargi@cisco.com>, idr@merit.edu
Subject: RE: INFORM proposal and MP-BGP (RFC 2858)
Date: Wed, 10 Jul 2002 18:34:06 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
Content-Type: text/plain
Sender: owner-idr@merit.edu
Precedence: bulk

Rxing mcast nlri, when only doing unicast
The draft added this clarification--"ignore them" (seems like a job for the
INFORM???)

-$0.03

-----Original Message-----
From: Abarbanel, Benjamin [mailto:Benjamin.Abarbanel@Marconi.com] 
Sent: Wednesday, July 10, 2002 5:39 PM
To: 'Gargi Nalawade'; idr@merit.edu
Subject: RE: INFORM proposal and MP-BGP (RFC 2858)

Thanks for the clarification on the original intent.

Clearly I think there are many cases, were we want to send potential
operational affecting information to a neighboring peer without reseting
the session. INFORM type message could provide that communication.

I can see the following type of data it would convey:
  1. CPU Utilization levels in an effort to slow down the rate of update.
  2. Memory utilization in an effort to control the amount of routing
     information sent, possibly forbid any optional capabilities while
     router is in a memory depleted state.
  3. Reaching maximum limits on configured or defaulted resources. Number
     of prefixes, attributes, capabilities, etc.
  4. Protocol related WARNING conditions that do not cause a session 
     reset. Repair race conditions, identify FSM state, etc.  

Thanks,
Ben 


> -----Original Message-----
> From: Gargi Nalawade [mailto:gargi@cisco.com]
> Sent: Wednesday, July 10, 2002 5:12 PM
> To: idr@merit.edu
> Subject: Re: INFORM proposal and MP-BGP (RFC 2858)
> 
> 
> Hi All,
> 
> A few clarifications about the INFORM message & its 
> motivations. This is
> supposed to be purely an informational message. Does not 
> propose to affect
> the functionality of the protocol. Rather, it proposes to 
> give flexibility
> to implementations in that; they dont need to reset the 
> sessions, just so
> they can inform the peer of a specific noteworthy event happening. 
> 
> Take the example of max-prefix limit. If the limit is hit, 
> the receiving
> peer would just generate a local warning (if so configured) 
> and continue
> with the session. There is no mechanism of informing the 
> other peer that
> it has exceeded the prefix-limit value, other than churning 
> through the log,
> noticing the warnin & then picking up the telephone (or oops 
> - writing an
> email) & informing the peer's administrator.
> 
> This is just one such hypothetical example where the INFORM 
> would be useful.
> But the INFORM message gives us the capability to do just 
> that. There are 
> several other such scenarios seen in deployment, that necessitate the
> existence of just such a mechanism.
> 
> -Gargi
> 


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id SAA17302 for <idr-archive@nic.merit.edu>; Wed, 10 Jul 2002 18:22:51 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 9F7939134C; Wed, 10 Jul 2002 18:21:27 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 586DE9134F; Wed, 10 Jul 2002 18:21:27 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id E29A89134C for <idr@trapdoor.merit.edu>; Wed, 10 Jul 2002 18:21:23 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id C9E315DE07; Wed, 10 Jul 2002 18:21:23 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from cisco.com (cypher.cisco.com [171.69.11.18]) by segue.merit.edu (Postfix) with ESMTP id 464C25DDD5 for <idr@merit.edu>; Wed, 10 Jul 2002 18:21:23 -0400 (EDT)
Received: (from achandra@localhost) by cisco.com (8.8.8/2.6/Cisco List Logging/8.8.8) id PAA27127; Wed, 10 Jul 2002 15:21:21 -0700 (PDT)
Date: Wed, 10 Jul 2002 15:21:20 -0700
From: Chandrashekhar Appanna <achandra@cisco.com>
To: "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
Cc: "'Chandrashekhar Appanna'" <achandra@cisco.com>, "'Bruno Rijsman'" <bwarijsman@hotmail.com>, idr@merit.edu
Subject: Re: INFORM proposal and MP-BGP (RFC 2858)
Message-ID: <20020710152120.E18943@cypher.cisco.com>
References: <39469E08BD83D411A3D900204840EC55822782@vie-msgusr-01.dc.fore.com>
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
User-Agent: Mutt/1.2.5.1i
In-Reply-To: <39469E08BD83D411A3D900204840EC55822782@vie-msgusr-01.dc.fore.com>; from Benjamin.Abarbanel@Marconi.com on Wed, Jul 10, 2002 at 02:49:12PM -0400
Sender: owner-idr@merit.edu
Precedence: bulk

You can be tolerant in your implementation today if you choose
and if your customers will accept it. I do not see how you hope
that a new INFORM message will let an implementation correct itself 
when the implementation failed to do the right thing in the first 
place.

-
Chandra A.


On Wed, Jul 10, 2002 at 02:49:12PM -0400, Abarbanel, Benjamin wrote:
> I am in favour of providing a tolerance counter that will drop the session
> only after a max number of bad update messages are received. Thus if the
> offending
> peer does not correct or adjust itself in terms of capabilities, after a
> number
> of INFORM responses are sent to it, then and only then should the receiving
> router drop the session. This will cover the case, where the INFORM is also
> mis-processed in the "ships in the night" scenario.
> 
> Ben
> 
> 
> > -----Original Message-----
> > From: Chandrashekhar Appanna [mailto:achandra@cisco.com]
> > Sent: Wednesday, July 10, 2002 2:09 PM
> > To: Abarbanel, Benjamin
> > Cc: 'Bruno Rijsman'; idr@merit.edu
> > Subject: Re: INFORM proposal and MP-BGP (RFC 2858)
> > 
> > 
> > 
> > But that and the ability to INFORM is not the same. In the
> > example you provide, the INFORM is not providing any extra 
> > advantage. If the peer did not process/rcv the withdraw 
> > why do you hope that it will rcv/process the INFORM?
> > 
> > I think the behavior of the withdraw itself is the problem here,
> > It has to handle the case of a message cross that you describe, 
> > in a well defined manner. 
> > 
> > -
> > Chandra A.
> > 
> > On Wed, Jul 10, 2002 at 11:50:32AM -0400, Abarbanel, Benjamin wrote:
> > > Correct. The protocol has to be elastic, since it carries 
> > so much weight in 
> > > controlling stability in the Internet.
> > > 
> > > Ben
> > > 
> > > > -----Original Message-----
> > > > From: Bruno Rijsman [mailto:bwarijsman@hotmail.com]
> > > > Sent: Wednesday, July 10, 2002 11:47 AM
> > > > To: idr@merit.edu
> > > > Cc: Benjamin.Abarbanel@Marconi.com
> > > > Subject: RE: INFORM proposal and MP-BGP (RFC 2858)
> > > > 
> > > > 
> > > > Re> Just because you receive an Update message with 
> > capabilities that
> > > > Re> were not negotiated at the OPEN message, should not 
> > require the
> > > > Re> peering session to terminate. The affect of all that 
> > peer routes
> > > > Re> going down will destabilize the network.
> > > > 
> > > > Also, keep in mind that in the case of dynamic capability 
> > negotiation
> > > > is possible and perfectly legal to receive a message 
> > which relies on
> > > > a capability which was recently withdrawn. Consider for 
> > example the
> > > > case where a route-refresh message and a capability message which
> > > > withdraws the route-refresh capability cross each other.
> > > > 
> > > > 
> > > > 
> > > > _________________________________________________________________
> > > > Send and receive Hotmail on your mobile device: 
> http://mobile.msn.com
> > > 


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id SAA16685 for <idr-archive@nic.merit.edu>; Wed, 10 Jul 2002 18:04:41 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id AF40D9134A; Wed, 10 Jul 2002 18:04:04 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 804B59134B; Wed, 10 Jul 2002 18:04:04 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 997D69134A for <idr@trapdoor.merit.edu>; Wed, 10 Jul 2002 18:04:03 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 7E7735DDFD; Wed, 10 Jul 2002 18:04:03 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from mailgate.pit.comms.marconi.com (mailgate.pit.comms.marconi.com [169.144.68.6]) by segue.merit.edu (Postfix) with ESMTP id 086D45DDD5 for <idr@merit.edu>; Wed, 10 Jul 2002 18:04:03 -0400 (EDT)
Received: from mailman.pit.comms.marconi.com (mailman.pit.comms.marconi.com [169.144.2.12]) by mailgate.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id SAA00299; Wed, 10 Jul 2002 18:04:00 -0400 (EDT)
Received: from whq-msgrtr-01.pit.comms.marconi.com (whq-msgrtr-01.pit.comms.marconi.com [169.144.2.221]) by mailman.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id SAA22160; Wed, 10 Jul 2002 18:04:02 -0400 (EDT)
Received: by whq-msgrtr-01.pit.comms.marconi.com with Internet Mail Service (5.5.2650.21) id <N5N1M7WV>; Wed, 10 Jul 2002 18:04:01 -0400
Message-ID: <39469E08BD83D411A3D900204840EC55822789@vie-msgusr-01.dc.fore.com>
From: "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
To: "'John G. Scudder'" <jgs@cisco.com>
Cc: "'Pedro Roque Marques'" <roque@juniper.net>, "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>, "'idr@merit.edu'" <idr@merit.edu>
Subject: RE: INFORM proposal and MP-BGP (RFC 2858)
Date: Wed, 10 Jul 2002 18:03:53 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2650.21)
Content-Type: text/plain; charset="iso-8859-1"
Sender: owner-idr@merit.edu
Precedence: bulk

John:

What if the dynamic capabilities was controlled by an operator's script
that was dynamic and automatic and configured from NMS outside the router. 
The script would add/delete capabilities based on the current state of the
router. Lets say this situation occured continuosly.

Obviously, it would not be sensible to run a live network this way. But this
is one hypthetical condition in which we could re-enter the race condition
after
the session restarted. I guess maybe, one has to judge how relient you want
the router to be. 

In practice, if you can create it with a script, there is a (maybe small)
chance it would happen in a live network.

Ben

> -----Original Message-----
> From: John G. Scudder [mailto:jgs@cisco.com]
> Sent: Wednesday, July 10, 2002 5:53 PM
> To: Abarbanel, Benjamin
> Cc: 'Pedro Roque Marques'; Abarbanel, Benjamin; 'idr@merit.edu'
> Subject: RE: INFORM proposal and MP-BGP (RFC 2858)
> 
> 
> At 5:46 PM -0400 7/10/02, Abarbanel, Benjamin wrote:
> >I was not saying that two different TCP sessions will cross 
> each other and
> >convey each other messages in a mixed up way. I was saying that the
> >potential
> >for the "ships crossing in the night" that caused the first 
> race condition
> >with
> >the capabilities could be there again, after the new session 
> is established.
> >If some combination of routes or capabilities or just router 
> execution
> >timing
> >caused it, it could happen again with similar environment 
> and thus cause
> >the condition. Murphy's law always strikes the same way if 
> the conditions
> >are right. Simply reseting a session I would believe could replay the
> >condition
> >that cause the first race condition and thus throw the two 
> routers into a
> >perpetual session restart loop.
> 
> Since the hypothetical problem relates to dynamic capability 
> advertisement, why would it recur?  Surely any sensible 
> implementation will only dynamically advertise a capability as a 
> result of a configuration change.  The changed configuration will be 
> used to produce the capabilities in the initial OPEN of the new 
> session.  Thus, no need to use dynamic capability on the new session 
> (until and unless the configuration changes again).
> 
> --John
> 


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id RAA16273 for <idr-archive@nic.merit.edu>; Wed, 10 Jul 2002 17:53:17 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id A516D91349; Wed, 10 Jul 2002 17:52:39 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 7209E9134A; Wed, 10 Jul 2002 17:52:39 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 64ADB91349 for <idr@trapdoor.merit.edu>; Wed, 10 Jul 2002 17:52:38 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 5077D5DDEE; Wed, 10 Jul 2002 17:52:38 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from cisco.com (router.cisco.com [171.69.182.20]) by segue.merit.edu (Postfix) with ESMTP id 852F65DDD5 for <idr@merit.edu>; Wed, 10 Jul 2002 17:52:37 -0400 (EDT)
Received: from [171.69.182.142] (dhcp-171-69-182-142.cisco.com [171.69.182.142]) by cisco.com (8.8.8/2.6/Cisco List Logging/8.8.8) with ESMTP id RAA05567; Wed, 10 Jul 2002 17:52:34 -0400 (EDT)
Mime-Version: 1.0
X-Sender: jgs@router
Message-Id: <p05111a14b9525c7ba73d@[171.69.182.142]>
In-Reply-To:  <39469E08BD83D411A3D900204840EC55822788@vie-msgusr-01.dc.fore.com>
References:  <39469E08BD83D411A3D900204840EC55822788@vie-msgusr-01.dc.fore.com>
Date: Wed, 10 Jul 2002 17:52:33 -0400
To: "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
From: "John G. Scudder" <jgs@cisco.com>
Subject: RE: INFORM proposal and MP-BGP (RFC 2858)
Cc: "'Pedro Roque Marques'" <roque@juniper.net>, "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>, "'idr@merit.edu'" <idr@merit.edu>
Content-Type: text/plain; charset="us-ascii" ; format="flowed"
Sender: owner-idr@merit.edu
Precedence: bulk

At 5:46 PM -0400 7/10/02, Abarbanel, Benjamin wrote:
>I was not saying that two different TCP sessions will cross each other and
>convey each other messages in a mixed up way. I was saying that the
>potential
>for the "ships crossing in the night" that caused the first race condition
>with
>the capabilities could be there again, after the new session is established.
>If some combination of routes or capabilities or just router execution
>timing
>caused it, it could happen again with similar environment and thus cause
>the condition. Murphy's law always strikes the same way if the conditions
>are right. Simply reseting a session I would believe could replay the
>condition
>that cause the first race condition and thus throw the two routers into a
>perpetual session restart loop.

Since the hypothetical problem relates to dynamic capability 
advertisement, why would it recur?  Surely any sensible 
implementation will only dynamically advertise a capability as a 
result of a configuration change.  The changed configuration will be 
used to produce the capabilities in the initial OPEN of the new 
session.  Thus, no need to use dynamic capability on the new session 
(until and unless the configuration changes again).

--John


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id RAA16097 for <idr-archive@nic.merit.edu>; Wed, 10 Jul 2002 17:47:13 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 637E691348; Wed, 10 Jul 2002 17:46:35 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 1100791349; Wed, 10 Jul 2002 17:46:33 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 1B26191348 for <idr@trapdoor.merit.edu>; Wed, 10 Jul 2002 17:46:30 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 01BD85DDD5; Wed, 10 Jul 2002 17:46:30 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from mailgate.pit.comms.marconi.com (mailgate.pit.comms.marconi.com [169.144.68.6]) by segue.merit.edu (Postfix) with ESMTP id 7FDF15DD96 for <idr@merit.edu>; Wed, 10 Jul 2002 17:46:29 -0400 (EDT)
Received: from mailman.pit.comms.marconi.com (mailman.pit.comms.marconi.com [169.144.2.12]) by mailgate.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id RAA29501; Wed, 10 Jul 2002 17:46:27 -0400 (EDT)
Received: from whq-msgrtr-01.pit.comms.marconi.com (whq-msgrtr-01.pit.comms.marconi.com [169.144.2.221]) by mailman.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id RAA20171; Wed, 10 Jul 2002 17:46:28 -0400 (EDT)
Received: by whq-msgrtr-01.pit.comms.marconi.com with Internet Mail Service (5.5.2650.21) id <N5N1M7JB>; Wed, 10 Jul 2002 17:46:28 -0400
Message-ID: <39469E08BD83D411A3D900204840EC55822788@vie-msgusr-01.dc.fore.com>
From: "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
To: "'Pedro Roque Marques'" <roque@juniper.net>, "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
Cc: "'idr@merit.edu'" <idr@merit.edu>
Subject: RE: INFORM proposal and MP-BGP (RFC 2858)
Date: Wed, 10 Jul 2002 17:46:25 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2650.21)
Content-Type: text/plain; charset="iso-8859-1"
Sender: owner-idr@merit.edu
Precedence: bulk

Pedro:
  
See below

Ben

> -----Original Message-----
> From: Pedro Roque Marques [mailto:roque@juniper.net]
> Sent: Wednesday, July 10, 2002 5:24 PM
> To: Abarbanel, Benjamin
> Cc: 'idr@merit.edu'
> Subject: RE: INFORM proposal and MP-BGP (RFC 2858)
> 
> 
> Abarbanel, Benjamin writes:
> 
> >> 
> > But the potential for the race condition with the new session is
> > still there.
> 
> I'm sorry but i really disagree with your statement. It would also
> appear to me that if that was the truth we would have bigger 
> issues than
> the one we are discussing...
> 
> Would you please detail, under what scenario(s) you can have data from
> an old BGP session being accepted as valid in a new BGP session ?
> 

I was not saying that two different TCP sessions will cross each other and
convey each other messages in a mixed up way. I was saying that the
potential
for the "ships crossing in the night" that caused the first race condition
with
the capabilities could be there again, after the new session is established.
If some combination of routes or capabilities or just router execution
timing
caused it, it could happen again with similar environment and thus cause
the condition. Murphy's law always strikes the same way if the conditions
are right. Simply reseting a session I would believe could replay the
condition
that cause the first race condition and thus throw the two routers into a 
perpetual session restart loop.

Ben 


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id RAA15840 for <idr-archive@nic.merit.edu>; Wed, 10 Jul 2002 17:40:00 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id A4AF191347; Wed, 10 Jul 2002 17:39:31 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 75B9A91348; Wed, 10 Jul 2002 17:39:31 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 80B1E91347 for <idr@trapdoor.merit.edu>; Wed, 10 Jul 2002 17:39:30 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 6E4EA5DDD5; Wed, 10 Jul 2002 17:39:30 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from mailgate.pit.comms.marconi.com (mailgate.pit.comms.marconi.com [169.144.68.6]) by segue.merit.edu (Postfix) with ESMTP id EA9865DD96 for <idr@merit.edu>; Wed, 10 Jul 2002 17:39:29 -0400 (EDT)
Received: from mailman.pit.comms.marconi.com (mailman.pit.comms.marconi.com [169.144.2.12]) by mailgate.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id RAA29243; Wed, 10 Jul 2002 17:39:27 -0400 (EDT)
Received: from whq-msgrtr-01.pit.comms.marconi.com (whq-msgrtr-01.pit.comms.marconi.com [169.144.2.221]) by mailman.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id RAA19216; Wed, 10 Jul 2002 17:39:28 -0400 (EDT)
Received: by whq-msgrtr-01.pit.comms.marconi.com with Internet Mail Service (5.5.2650.21) id <N5N1M7A8>; Wed, 10 Jul 2002 17:39:27 -0400
Message-ID: <39469E08BD83D411A3D900204840EC55822787@vie-msgusr-01.dc.fore.com>
From: "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
To: "'Gargi Nalawade'" <gargi@cisco.com>, idr@merit.edu
Subject: RE: INFORM proposal and MP-BGP (RFC 2858)
Date: Wed, 10 Jul 2002 17:39:17 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2650.21)
Content-Type: text/plain; charset="iso-8859-1"
Sender: owner-idr@merit.edu
Precedence: bulk

Thanks for the clarification on the original intent.

Clearly I think there are many cases, were we want to send potential
operational affecting information to a neighboring peer without reseting
the session. INFORM type message could provide that communication.

I can see the following type of data it would convey:
  1. CPU Utilization levels in an effort to slow down the rate of update.
  2. Memory utilization in an effort to control the amount of routing
     information sent, possibly forbid any optional capabilities while
     router is in a memory depleted state.
  3. Reaching maximum limits on configured or defaulted resources. Number
     of prefixes, attributes, capabilities, etc.
  4. Protocol related WARNING conditions that do not cause a session 
     reset. Repair race conditions, identify FSM state, etc.  

Thanks,
Ben 


> -----Original Message-----
> From: Gargi Nalawade [mailto:gargi@cisco.com]
> Sent: Wednesday, July 10, 2002 5:12 PM
> To: idr@merit.edu
> Subject: Re: INFORM proposal and MP-BGP (RFC 2858)
> 
> 
> Hi All,
> 
> A few clarifications about the INFORM message & its 
> motivations. This is
> supposed to be purely an informational message. Does not 
> propose to affect
> the functionality of the protocol. Rather, it proposes to 
> give flexibility
> to implementations in that; they dont need to reset the 
> sessions, just so
> they can inform the peer of a specific noteworthy event happening. 
> 
> Take the example of max-prefix limit. If the limit is hit, 
> the receiving
> peer would just generate a local warning (if so configured) 
> and continue
> with the session. There is no mechanism of informing the 
> other peer that
> it has exceeded the prefix-limit value, other than churning 
> through the log,
> noticing the warnin & then picking up the telephone (or oops 
> - writing an
> email) & informing the peer's administrator.
> 
> This is just one such hypothetical example where the INFORM 
> would be useful.
> But the INFORM message gives us the capability to do just 
> that. There are 
> several other such scenarios seen in deployment, that necessitate the
> existence of just such a mechanism.
> 
> -Gargi
> 


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id RAA15290 for <idr-archive@nic.merit.edu>; Wed, 10 Jul 2002 17:24:43 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 3CFC091345; Wed, 10 Jul 2002 17:24:06 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 0E06191346; Wed, 10 Jul 2002 17:24:05 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id C615391345 for <idr@trapdoor.merit.edu>; Wed, 10 Jul 2002 17:24:04 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id AD1F55DDCA; Wed, 10 Jul 2002 17:24:04 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from merlot.juniper.net (natint.juniper.net [207.17.136.129]) by segue.merit.edu (Postfix) with ESMTP id 59EEF5DD96 for <idr@merit.edu>; Wed, 10 Jul 2002 17:24:04 -0400 (EDT)
Received: from roque-bsd.juniper.net (roque-bsd.juniper.net [172.17.12.183]) by merlot.juniper.net (8.11.3/8.11.3) with ESMTP id g6ALO4m03260; Wed, 10 Jul 2002 14:24:04 -0700 (PDT) (envelope-from roque@juniper.net)
Received: (from roque@localhost) by roque-bsd.juniper.net (8.11.6/8.9.3) id g6ALO4r55279; Wed, 10 Jul 2002 14:24:04 -0700 (PDT) (envelope-from roque)
Date: Wed, 10 Jul 2002 14:24:04 -0700 (PDT)
Message-Id: <200207102124.g6ALO4r55279@roque-bsd.juniper.net>
From: Pedro Roque Marques <roque@juniper.net>
MIME-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Transfer-Encoding: 7bit
To: "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
Cc: "'idr@merit.edu'" <idr@merit.edu>
Subject: RE: INFORM proposal and MP-BGP (RFC 2858)
In-Reply-To: <39469E08BD83D411A3D900204840EC55822786@vie-msgusr-01.dc.fore.com>
References: <39469E08BD83D411A3D900204840EC55822786@vie-msgusr-01.dc.fore.com>
X-Mailer: VM 6.34 under 19.16 "Lille" XEmacs Lucid
Sender: owner-idr@merit.edu
Precedence: bulk

Abarbanel, Benjamin writes:

>> 
> But the potential for the race condition with the new session is
> still there.

I'm sorry but i really disagree with your statement. It would also
appear to me that if that was the truth we would have bigger issues than
the one we are discussing...

Would you please detail, under what scenario(s) you can have data from
an old BGP session being accepted as valid in a new BGP session ?

Due to the fact that these are 2 different TCP sessions, that would
require that TCP itself accepts the packet of an old session... to my
knowledge it is not commonly known for TCP to exhibit such a problem. TCP
in fact as a series of mechanisms designed to address this issue.

Of course, it could always be that i'm misinterpreting your statement,
case in which i would apologize and ask for a clarification.

regards,
  Pedro.


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id RAA14820 for <idr-archive@nic.merit.edu>; Wed, 10 Jul 2002 17:13:15 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id B3D6991343; Wed, 10 Jul 2002 17:12:37 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 7EC1F91345; Wed, 10 Jul 2002 17:12:37 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 7365991343 for <idr@trapdoor.merit.edu>; Wed, 10 Jul 2002 17:12:36 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 61E0D5DDEE; Wed, 10 Jul 2002 17:12:36 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from sj-msg-core-3.cisco.com (sj-msg-core-3.cisco.com [171.70.157.152]) by segue.merit.edu (Postfix) with ESMTP id 1D1445DDCA for <idr@merit.edu>; Wed, 10 Jul 2002 17:12:36 -0400 (EDT)
Received: from sj-msg-av-3.cisco.com (sj-msg-av-3.cisco.com [171.69.17.42]) by sj-msg-core-3.cisco.com (8.12.2/8.12.2) with ESMTP id g6ALCXGs011780 for <idr@merit.edu>; Wed, 10 Jul 2002 14:12:33 -0700 (PDT)
Received: from cisco.com (localhost [127.0.0.1]) by sj-msg-av-3.cisco.com (8.12.2/8.12.2) with ESMTP id g6ALCT58018369 for <idr@merit.edu>; Wed, 10 Jul 2002 14:12:35 -0700 (PDT)
Message-ID: <3D2CA33A.D6E305A2@cisco.com>
Date: Wed, 10 Jul 2002 14:12:26 -0700
From: Gargi Nalawade <gargi@cisco.com>
X-Mailer: Mozilla 4.76 [en]C-CCK-MCD   (Windows NT 5.0; U)
X-Accept-Language: en
MIME-Version: 1.0
To: idr@merit.edu
Subject: Re: INFORM proposal and MP-BGP (RFC 2858)
References: <39469E08BD83D411A3D900204840EC55822785@vie-msgusr-01.dc.fore.com> <200207102055.g6AKthG55232@roque-bsd.juniper.net>
Content-Type: text/plain; charset=us-ascii
Content-Transfer-Encoding: 7bit
Sender: owner-idr@merit.edu
Precedence: bulk

Hi All,

A few clarifications about the INFORM message & its motivations. This is
supposed to be purely an informational message. Does not propose to affect
the functionality of the protocol. Rather, it proposes to give flexibility
to implementations in that; they dont need to reset the sessions, just so
they can inform the peer of a specific noteworthy event happening. 

Take the example of max-prefix limit. If the limit is hit, the receiving
peer would just generate a local warning (if so configured) and continue
with the session. There is no mechanism of informing the other peer that
it has exceeded the prefix-limit value, other than churning through the log,
noticing the warnin & then picking up the telephone (or oops - writing an
email) & informing the peer's administrator.

This is just one such hypothetical example where the INFORM would be useful.
But the INFORM message gives us the capability to do just that. There are 
several other such scenarios seen in deployment, that necessitate the
existence of just such a mechanism.

-Gargi


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id RAA14695 for <idr-archive@nic.merit.edu>; Wed, 10 Jul 2002 17:10:22 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id F3D5D91344; Wed, 10 Jul 2002 17:05:31 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id A5A5C91356; Wed, 10 Jul 2002 17:05:05 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 6535391358 for <idr@trapdoor.merit.edu>; Wed, 10 Jul 2002 17:04:58 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 3FEB55DD9C; Wed, 10 Jul 2002 17:04:58 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from mailgate.pit.comms.marconi.com (mailgate.pit.comms.marconi.com [169.144.68.6]) by segue.merit.edu (Postfix) with ESMTP id BC2D45DD92 for <idr@merit.edu>; Wed, 10 Jul 2002 17:04:57 -0400 (EDT)
Received: from mailman.pit.comms.marconi.com (mailman.pit.comms.marconi.com [169.144.2.12]) by mailgate.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id RAA27568; Wed, 10 Jul 2002 17:04:54 -0400 (EDT)
Received: from whq-msgrtr-01.pit.comms.marconi.com (whq-msgrtr-01.pit.comms.marconi.com [169.144.2.221]) by mailman.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id RAA11555; Wed, 10 Jul 2002 17:04:56 -0400 (EDT)
Received: by whq-msgrtr-01.pit.comms.marconi.com with Internet Mail Service (5.5.2650.21) id <N5N1MZVM>; Wed, 10 Jul 2002 17:04:55 -0400
Message-ID: <39469E08BD83D411A3D900204840EC55822786@vie-msgusr-01.dc.fore.com>
From: "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
To: "'Pedro Roque Marques'" <roque@juniper.net>
Cc: "'idr@merit.edu'" <idr@merit.edu>
Subject: RE: INFORM proposal and MP-BGP (RFC 2858)
Date: Wed, 10 Jul 2002 17:04:50 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2650.21)
Content-Type: text/plain; charset="iso-8859-1"
Sender: owner-idr@merit.edu
Precedence: bulk

> >>  btw: If a BGP speaker, receives an inform that claims that it is
> >> sending an NLRI that in fact it is not configured to send what is
> >> the suggested behaviour ? I'd be tempted by default to reset the
> >> session since the peer clearly doesn't know what it is talking
> >> about.
>  
> > Could we not fall into this kind of race condition after session
> > resets?
> 
> It would not be of the same session, correct... ? session implies a
> TCP socket. If you do have a close, there is an implicit sequencing
> event and no confusion with previous messages.
> 
But the potential for the race condition with the new session is still 
there. If its not a fluky timing problem it might reappear, if it was fluky
it will go away. My only point is, if we were gracefull the first time it 
happened we would have told the offending peer without dropping his routes
and
perhaps it could correct the error and renegotiate the capabilities and
stabilize the problem without a network outage.

Ben



Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id QAA14216 for <idr-archive@nic.merit.edu>; Wed, 10 Jul 2002 16:56:15 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id DCC0A91340; Wed, 10 Jul 2002 16:55:45 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id A1A5A91341; Wed, 10 Jul 2002 16:55:45 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 6791691340 for <idr@trapdoor.merit.edu>; Wed, 10 Jul 2002 16:55:44 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 495325DD9C; Wed, 10 Jul 2002 16:55:44 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from merlot.juniper.net (natint.juniper.net [207.17.136.129]) by segue.merit.edu (Postfix) with ESMTP id E49A25DD95 for <idr@merit.edu>; Wed, 10 Jul 2002 16:55:43 -0400 (EDT)
Received: from roque-bsd.juniper.net (roque-bsd.juniper.net [172.17.12.183]) by merlot.juniper.net (8.11.3/8.11.3) with ESMTP id g6AKthm01023; Wed, 10 Jul 2002 13:55:43 -0700 (PDT) (envelope-from roque@juniper.net)
Received: (from roque@localhost) by roque-bsd.juniper.net (8.11.6/8.9.3) id g6AKthG55232; Wed, 10 Jul 2002 13:55:43 -0700 (PDT) (envelope-from roque)
Date: Wed, 10 Jul 2002 13:55:43 -0700 (PDT)
Message-Id: <200207102055.g6AKthG55232@roque-bsd.juniper.net>
From: Pedro Roque Marques <roque@juniper.net>
MIME-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Transfer-Encoding: 7bit
To: "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
Cc: "'idr@merit.edu'" <idr@merit.edu>
Subject: RE: INFORM proposal and MP-BGP (RFC 2858)
In-Reply-To: <39469E08BD83D411A3D900204840EC55822785@vie-msgusr-01.dc.fore.com>
References: <39469E08BD83D411A3D900204840EC55822785@vie-msgusr-01.dc.fore.com>
X-Mailer: VM 6.34 under 19.16 "Lille" XEmacs Lucid
Sender: owner-idr@merit.edu
Precedence: bulk

Abarbanel, Benjamin writes:

> How will the offending peer know it is doing something wrong if the
> receiving peer does not send some indication of the problem?

There is no problem in this scenario... just messages that cross
themselfs in transit. I see no benifit in informing the sender of
this.

>>  btw: If a BGP speaker, receives an inform that claims that it is
>> sending an NLRI that in fact it is not configured to send what is
>> the suggested behaviour ? I'd be tempted by default to reset the
>> session since the peer clearly doesn't know what it is talking
>> about.
 
> Could we not fall into this kind of race condition after session
> resets?

It would not be of the same session, correct... ? session implies a
TCP socket. If you do have a close, there is an implicit sequencing
event and no confusion with previous messages.

  Pedro.


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id PAA11723 for <idr-archive@nic.merit.edu>; Wed, 10 Jul 2002 15:40:22 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 0CDBA91351; Wed, 10 Jul 2002 15:38:05 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id D254791350; Wed, 10 Jul 2002 15:38:04 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id D669691335 for <idr@trapdoor.merit.edu>; Wed, 10 Jul 2002 15:38:00 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id C1DB95DDD5; Wed, 10 Jul 2002 15:38:00 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from mailgate.pit.comms.marconi.com (mailgate.pit.comms.marconi.com [169.144.68.6]) by segue.merit.edu (Postfix) with ESMTP id 3B0AC5DD96 for <idr@merit.edu>; Wed, 10 Jul 2002 15:38:00 -0400 (EDT)
Received: from mailman.pit.comms.marconi.com (mailman.pit.comms.marconi.com [169.144.2.12]) by mailgate.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id PAA21849; Wed, 10 Jul 2002 15:37:56 -0400 (EDT)
Received: from whq-msgrtr-01.pit.comms.marconi.com (whq-msgrtr-01.pit.comms.marconi.com [169.144.2.221]) by mailman.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id PAA22466; Wed, 10 Jul 2002 15:37:58 -0400 (EDT)
Received: by whq-msgrtr-01.pit.comms.marconi.com with Internet Mail Service (5.5.2650.21) id <N5N1M4D2>; Wed, 10 Jul 2002 15:37:57 -0400
Message-ID: <39469E08BD83D411A3D900204840EC55822785@vie-msgusr-01.dc.fore.com>
From: "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
To: "'Pedro Roque Marques'" <roque@juniper.net>, "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
Cc: "'idr@merit.edu'" <idr@merit.edu>
Subject: RE: INFORM proposal and MP-BGP (RFC 2858)
Date: Wed, 10 Jul 2002 15:37:48 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2650.21)
Content-Type: text/plain; charset="iso-8859-1"
Sender: owner-idr@merit.edu
Precedence: bulk

> -----Original Message-----
> From: Pedro Roque Marques [mailto:roque@juniper.net]
> Sent: Wednesday, July 10, 2002 3:06 PM
> To: Abarbanel, Benjamin
> Cc: 'idr@merit.edu'
> Subject: RE: INFORM proposal and MP-BGP (RFC 2858)
> 
> 
> Abarbanel, Benjamin writes:
> 
> > pedro: As was mentioned in earlier email:
> 
> > " Also, keep in mind that in the case of dynamic capability
> > negotiation is possible and perfectly legal to receive a message
> > which relies on a capability which was recently withdrawn. Consider
> > for example the case where a route-refresh message and a capability
> > message which withdraws the route-refresh capability cross each
> > other."
> 
> > In this case there was no corruption in the payload of the Update
> > message and yet the state machine was in a inconsistent state for a
> > particular capability due to the race condition.
> 
> > In your approach
> > you would have considered that peer's routes invalid and terminated
> > the session.
> 
> No. It has been a while since i read that particular proposal but i
> seem to rememeber and i believe that the following algotihm is
> recomended there or has been discussed...
> 
> At an nlri withrawl:
> - Keep track of nlri which where enabled until that event
> - Start a timer (something like 30 secs)
> - For the duration of the timer ignore updates that match the
> remebered nlri.
> - When the timer kicks, cleanup the 'old nlri'...
> 
> No additional spec changes... implementation details that afect the
> behaviour during a shortest interval of time.
> 
> > In my, approach I would not, but yet send back an
> > INFORM to the inconsistent peer and expect it to take the proper
> > corrective action.
> 
> imho, that isn't the correct behaviour on a 'race condition' kind of
> scenario... you want to discard those messages silently without
> generating new messages in response.

How will the offending peer know it is doing something wrong if the
receiving 
peer does not send some indication of the problem? 

I would agree sending more than a few consecutive INFORM messages could lead
to a very big burst of INFORM messages if the offending peer does not stop
sending the invalid update messages. 

> 
> btw: If a BGP speaker, receives an inform that claims that it is
> sending an NLRI that in fact it is not configured to send what is the
> suggested behaviour ? I'd be tempted by default to reset the session
> since the peer clearly doesn't know what it is talking about.
>
 
Could we not fall into this kind of race condition after session resets? 


Ben


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id PAA10793 for <idr-archive@nic.merit.edu>; Wed, 10 Jul 2002 15:13:37 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 9EF8591327; Wed, 10 Jul 2002 15:13:00 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 63F7C9132B; Wed, 10 Jul 2002 15:13:00 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 629FC91327 for <idr@trapdoor.merit.edu>; Wed, 10 Jul 2002 15:12:59 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 4966B5DE74; Wed, 10 Jul 2002 15:12:59 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from mailgate.pit.comms.marconi.com (mailgate.pit.comms.marconi.com [169.144.68.6]) by segue.merit.edu (Postfix) with ESMTP id 50AAA5DD96 for <idr@merit.edu>; Wed, 10 Jul 2002 15:12:58 -0400 (EDT)
Received: from mailman.pit.comms.marconi.com (mailman.pit.comms.marconi.com [169.144.2.12]) by mailgate.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id PAA19627; Wed, 10 Jul 2002 15:12:55 -0400 (EDT)
Received: from whq-msgrtr-01.pit.comms.marconi.com (whq-msgrtr-01.pit.comms.marconi.com [169.144.2.221]) by mailman.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id PAA16262; Wed, 10 Jul 2002 15:12:56 -0400 (EDT)
Received: by whq-msgrtr-01.pit.comms.marconi.com with Internet Mail Service (5.5.2650.21) id <N5N1MSMA>; Wed, 10 Jul 2002 15:12:55 -0400
Message-ID: <39469E08BD83D411A3D900204840EC55822784@vie-msgusr-01.dc.fore.com>
From: "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
To: "'Jeffrey Haas'" <jhaas@nexthop.com>, Pedro Roque Marques <roque@juniper.net>
Cc: "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>, idr@merit.edu
Subject: RE: INFORM proposal and MP-BGP (RFC 2858)
Date: Wed, 10 Jul 2002 15:12:45 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2650.21)
Content-Type: text/plain; charset="iso-8859-1"
Sender: owner-idr@merit.edu
Precedence: bulk

> -----Original Message-----
> From: Jeffrey Haas [mailto:jhaas@nexthop.com]
> Sent: Wednesday, July 10, 2002 3:08 PM
> To: Pedro Roque Marques
> Cc: Abarbanel, Benjamin; idr@merit.edu
> Subject: Re: INFORM proposal and MP-BGP (RFC 2858)
> 
> 
> I want to post something more thoughtful on this thread, but have
> to do it later due to time constraints.
> 
> I couldn't let this pass though.
> 
> On Wed, Jul 10, 2002 at 11:54:41AM -0700, Pedro Roque Marques wrote:
> > If the stability of the Internet is the concern, then i believe that
> > one should avoid any changes to the protocol unless they are
> > absolutely necessary.
> 
> This made me laugh extremely loudly - but not at you.
> 
> I count 9 RFC extensions past RFC1771 base functionality.
> I count at least 10 protocol extensions in ID form in just
> my personal watch list of stuff that we may implement or have
> implemented.  This list is by no means complete.
> 
> When talking to people, I often refer to BGP as the "Border 
> Glue Protocol".
> People keep adding stuff.
> 
> > By default the answer should be 'no'. Don't change the protocol, add
> > flexibility that you desire, and is fact required operationally, in
> > the implementations. There is plenty of space to compete there...
> 
> The trick is to do it in an interoperable way.

I totally agree, I also think we should add things in a way that is
phaseable
into existing networks. Thus causing little if any network disburbances
between 
the old routers and the new.


Ben


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id PAA10543 for <idr-archive@nic.merit.edu>; Wed, 10 Jul 2002 15:08:18 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 469DB9132E; Wed, 10 Jul 2002 15:07:59 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 0FBA69132F; Wed, 10 Jul 2002 15:07:58 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id E40439132E for <idr@trapdoor.merit.edu>; Wed, 10 Jul 2002 15:07:57 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id CA75F5DDD5; Wed, 10 Jul 2002 15:07:57 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from presque.djinesys.com (dns.nexthop.com [64.211.218.216]) by segue.merit.edu (Postfix) with ESMTP id 76A325DD96 for <idr@merit.edu>; Wed, 10 Jul 2002 15:07:57 -0400 (EDT)
Received: (from root@localhost) by presque.djinesys.com (8.11.3/8.11.1) id g6AJ7h889619; Wed, 10 Jul 2002 15:07:43 -0400 (EDT) (envelope-from jhaas@jhaas.nexthop.com)
Received: from jhaas.nexthop.com (jhaas.nexthop.com [64.211.218.31]) by presque.djinesys.com (8.11.3/8.11.1) with ESMTP id g6AJ7e189612; Wed, 10 Jul 2002 15:07:40 -0400 (EDT) (envelope-from jhaas@jhaas.nexthop.com)
Received: (from jhaas@localhost) by jhaas.nexthop.com (8.11.3nb1/8.11.3) id g6AJ7ex18969; Wed, 10 Jul 2002 15:07:40 -0400 (EDT)
Date: Wed, 10 Jul 2002 15:07:40 -0400
From: Jeffrey Haas <jhaas@nexthop.com>
To: Pedro Roque Marques <roque@juniper.net>
Cc: "Abarbanel, Benjamin" <Benjamin.Abarbanel@marconi.com>, idr@merit.edu
Subject: Re: INFORM proposal and MP-BGP (RFC 2858)
Message-ID: <20020710150740.D18651@nexthop.com>
References: <39469E08BD83D411A3D900204840EC5582277F@vie-msgusr-01.dc.fore.com> <200207101854.g6AIsfV55063@roque-bsd.juniper.net>
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
User-Agent: Mutt/1.2.5i
In-Reply-To: <200207101854.g6AIsfV55063@roque-bsd.juniper.net>; from roque@juniper.net on Wed, Jul 10, 2002 at 11:54:41AM -0700
X-Virus-Scanned: by AMaViS perl-11
Sender: owner-idr@merit.edu
Precedence: bulk

I want to post something more thoughtful on this thread, but have
to do it later due to time constraints.

I couldn't let this pass though.

On Wed, Jul 10, 2002 at 11:54:41AM -0700, Pedro Roque Marques wrote:
> If the stability of the Internet is the concern, then i believe that
> one should avoid any changes to the protocol unless they are
> absolutely necessary.

This made me laugh extremely loudly - but not at you.

I count 9 RFC extensions past RFC1771 base functionality.
I count at least 10 protocol extensions in ID form in just
my personal watch list of stuff that we may implement or have
implemented.  This list is by no means complete.

When talking to people, I often refer to BGP as the "Border Glue Protocol".
People keep adding stuff.

> By default the answer should be 'no'. Don't change the protocol, add
> flexibility that you desire, and is fact required operationally, in
> the implementations. There is plenty of space to compete there...

The trick is to do it in an interoperable way.

>   Pedro.

-- 
Jeff Haas 
NextHop Technologies


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id PAA10502 for <idr-archive@nic.merit.edu>; Wed, 10 Jul 2002 15:06:55 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id DB17C9132D; Wed, 10 Jul 2002 15:06:26 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id AA4049132E; Wed, 10 Jul 2002 15:06:26 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 69F649132D for <idr@trapdoor.merit.edu>; Wed, 10 Jul 2002 15:06:25 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 4B4715DE74; Wed, 10 Jul 2002 15:06:25 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from merlot.juniper.net (natint.juniper.net [207.17.136.129]) by segue.merit.edu (Postfix) with ESMTP id AD16D5DD96 for <idr@merit.edu>; Wed, 10 Jul 2002 15:06:24 -0400 (EDT)
Received: from roque-bsd.juniper.net (roque-bsd.juniper.net [172.17.12.183]) by merlot.juniper.net (8.11.3/8.11.3) with ESMTP id g6AJ6Om93614; Wed, 10 Jul 2002 12:06:24 -0700 (PDT) (envelope-from roque@juniper.net)
Received: (from roque@localhost) by roque-bsd.juniper.net (8.11.6/8.9.3) id g6AJ6Of55087; Wed, 10 Jul 2002 12:06:24 -0700 (PDT) (envelope-from roque)
Date: Wed, 10 Jul 2002 12:06:24 -0700 (PDT)
Message-Id: <200207101906.g6AJ6Of55087@roque-bsd.juniper.net>
From: Pedro Roque Marques <roque@juniper.net>
MIME-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Transfer-Encoding: 7bit
To: "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
Cc: "'idr@merit.edu'" <idr@merit.edu>
Subject: RE: INFORM proposal and MP-BGP (RFC 2858)
In-Reply-To: <39469E08BD83D411A3D900204840EC55822781@vie-msgusr-01.dc.fore.com>
References: <39469E08BD83D411A3D900204840EC55822781@vie-msgusr-01.dc.fore.com>
X-Mailer: VM 6.34 under 19.16 "Lille" XEmacs Lucid
Sender: owner-idr@merit.edu
Precedence: bulk

Abarbanel, Benjamin writes:

> pedro: As was mentioned in earlier email:

> " Also, keep in mind that in the case of dynamic capability
> negotiation is possible and perfectly legal to receive a message
> which relies on a capability which was recently withdrawn. Consider
> for example the case where a route-refresh message and a capability
> message which withdraws the route-refresh capability cross each
> other."

> In this case there was no corruption in the payload of the Update
> message and yet the state machine was in a inconsistent state for a
> particular capability due to the race condition.

> In your approach
> you would have considered that peer's routes invalid and terminated
> the session.

No. It has been a while since i read that particular proposal but i
seem to rememeber and i believe that the following algotihm is
recomended there or has been discussed...

At an nlri withrawl:
- Keep track of nlri which where enabled until that event
- Start a timer (something like 30 secs)
- For the duration of the timer ignore updates that match the
remebered nlri.
- When the timer kicks, cleanup the 'old nlri'...

No additional spec changes... implementation details that afect the
behaviour during a shortest interval of time.

> In my, approach I would not, but yet send back an
> INFORM to the inconsistent peer and expect it to take the proper
> corrective action.

imho, that isn't the correct behaviour on a 'race condition' kind of
scenario... you want to discard those messages silently without
generating new messages in response.

btw: If a BGP speaker, receives an inform that claims that it is
sending an NLRI that in fact it is not configured to send what is the
suggested behaviour ? I'd be tempted by default to reset the session
since the peer clearly doesn't know what it is talking about.

> My way, gives a little room for the protocol to
> tolerate temporary inconsistency.  Yours wont.

I believe it is a weak argument...
Temporary, triggered inconsistencies, should not be handled at the
expense of ignoring the problem of inconsistencies that happen due to
failure in a more generic case...

btw: i've omitted my usual rant on how i believe that dynamic capability
negociation for NLRI is a questionable enhancement from a cost/benefit
point of view. Shouldn't be monopolizing the soap box...

  Pedro.


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id PAA10450 for <idr-archive@nic.merit.edu>; Wed, 10 Jul 2002 15:05:20 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 5C4C89132A; Wed, 10 Jul 2002 15:03:56 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 214A391331; Wed, 10 Jul 2002 15:03:56 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id F13229132A for <idr@trapdoor.merit.edu>; Wed, 10 Jul 2002 15:03:51 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id DD4425DE74; Wed, 10 Jul 2002 15:03:51 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from mailgate.pit.comms.marconi.com (mailgate.pit.comms.marconi.com [169.144.68.6]) by segue.merit.edu (Postfix) with ESMTP id 967D55DDD5 for <idr@merit.edu>; Wed, 10 Jul 2002 15:03:51 -0400 (EDT)
Received: from mailman.pit.comms.marconi.com (mailman.pit.comms.marconi.com [169.144.2.12]) by mailgate.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id PAA18987; Wed, 10 Jul 2002 15:03:48 -0400 (EDT)
Received: from whq-msgrtr-01.pit.comms.marconi.com (whq-msgrtr-01.pit.comms.marconi.com [169.144.2.221]) by mailman.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id PAA12451; Wed, 10 Jul 2002 15:03:50 -0400 (EDT)
Received: by whq-msgrtr-01.pit.comms.marconi.com with Internet Mail Service (5.5.2650.21) id <N5N1MQ0Y>; Wed, 10 Jul 2002 15:03:48 -0400
Message-ID: <39469E08BD83D411A3D900204840EC55822783@vie-msgusr-01.dc.fore.com>
From: "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
To: "'Pedro Roque Marques'" <roque@juniper.net>, "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
Cc: idr@merit.edu
Subject: RE: INFORM proposal and MP-BGP (RFC 2858)
Date: Wed, 10 Jul 2002 15:03:43 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2650.21)
Content-Type: text/plain; charset="iso-8859-1"
Sender: owner-idr@merit.edu
Precedence: bulk

I think we are getting very philosophical here, but as we all know no
protocol
that is heavily used stands still. BGP is one that is constantly in a state
of 
change because of the various users it has and will continue to add on.
To say one cannot change it and improve it, is to say we dont make progress.

We know there are many issues with the way it works and I am sure many
experts
will tell you what they are. So I wont get into it. All I and others here
are 
trying to do is add a little elasticity/gracefullness/resiliency, you name
it what you want.

Ben

> -----Original Message-----
> From: Pedro Roque Marques [mailto:roque@juniper.net]
> Sent: Wednesday, July 10, 2002 2:55 PM
> To: Abarbanel, Benjamin
> Cc: idr@merit.edu
> Subject: RE: INFORM proposal and MP-BGP (RFC 2858)
> 
> 
> Abarbanel, Benjamin writes:
> 
> > Correct. The protocol has to be elastic, since it carries so much
> > weight in controlling stability in the Internet.
> 
> <meta>
> 
> If the stability of the Internet is the concern, then i believe that
> one should avoid any changes to the protocol unless they are
> absolutely necessary.
> 
> If we where talking about a protocol in early development then adding
> extra 'bells and whistles' would probably be ok (although i can think
> of quite a large number of new protocols which deployment has been
> very negativly impacted by constant churn).
> 
> But since this an essential protocol to the internet, the burden of
> proof required to make a modification to the protocol should be much
> higher and should be on the side of the proponents.
> 
> By default the answer should be 'no'. Don't change the protocol, add
> flexibility that you desire, and is fact required operationally, in
> the implementations. There is plenty of space to compete there...
> 
> </meta>
> 
>   Pedro.
> 


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id OAA10131 for <idr-archive@nic.merit.edu>; Wed, 10 Jul 2002 14:55:10 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id D189991324; Wed, 10 Jul 2002 14:54:43 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 9CAE991326; Wed, 10 Jul 2002 14:54:43 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 6887C91324 for <idr@trapdoor.merit.edu>; Wed, 10 Jul 2002 14:54:42 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 4EA1B5DDD5; Wed, 10 Jul 2002 14:54:42 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from merlot.juniper.net (natint.juniper.net [207.17.136.129]) by segue.merit.edu (Postfix) with ESMTP id EFAF45DD96 for <idr@merit.edu>; Wed, 10 Jul 2002 14:54:41 -0400 (EDT)
Received: from roque-bsd.juniper.net (roque-bsd.juniper.net [172.17.12.183]) by merlot.juniper.net (8.11.3/8.11.3) with ESMTP id g6AIsfm92759; Wed, 10 Jul 2002 11:54:41 -0700 (PDT) (envelope-from roque@juniper.net)
Received: (from roque@localhost) by roque-bsd.juniper.net (8.11.6/8.9.3) id g6AIsfV55063; Wed, 10 Jul 2002 11:54:41 -0700 (PDT) (envelope-from roque)
Date: Wed, 10 Jul 2002 11:54:41 -0700 (PDT)
Message-Id: <200207101854.g6AIsfV55063@roque-bsd.juniper.net>
From: Pedro Roque Marques <roque@juniper.net>
MIME-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Transfer-Encoding: 7bit
To: "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
Cc: idr@merit.edu
Subject: RE: INFORM proposal and MP-BGP (RFC 2858)
In-Reply-To: <39469E08BD83D411A3D900204840EC5582277F@vie-msgusr-01.dc.fore.com>
References: <39469E08BD83D411A3D900204840EC5582277F@vie-msgusr-01.dc.fore.com>
X-Mailer: VM 6.34 under 19.16 "Lille" XEmacs Lucid
Sender: owner-idr@merit.edu
Precedence: bulk

Abarbanel, Benjamin writes:

> Correct. The protocol has to be elastic, since it carries so much
> weight in controlling stability in the Internet.

<meta>

If the stability of the Internet is the concern, then i believe that
one should avoid any changes to the protocol unless they are
absolutely necessary.

If we where talking about a protocol in early development then adding
extra 'bells and whistles' would probably be ok (although i can think
of quite a large number of new protocols which deployment has been
very negativly impacted by constant churn).

But since this an essential protocol to the internet, the burden of
proof required to make a modification to the protocol should be much
higher and should be on the side of the proponents.

By default the answer should be 'no'. Don't change the protocol, add
flexibility that you desire, and is fact required operationally, in
the implementations. There is plenty of space to compete there...

</meta>

  Pedro.


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id OAA09916 for <idr-archive@nic.merit.edu>; Wed, 10 Jul 2002 14:49:52 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 444D19130E; Wed, 10 Jul 2002 14:49:20 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 0B61F91321; Wed, 10 Jul 2002 14:49:19 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id DC74A9130E for <idr@trapdoor.merit.edu>; Wed, 10 Jul 2002 14:49:18 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id BF1935DEE4; Wed, 10 Jul 2002 14:49:18 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from mailgate.pit.comms.marconi.com (mailgate.pit.comms.marconi.com [169.144.68.6]) by segue.merit.edu (Postfix) with ESMTP id 428485DD96 for <idr@merit.edu>; Wed, 10 Jul 2002 14:49:18 -0400 (EDT)
Received: from mailman.pit.comms.marconi.com (mailman.pit.comms.marconi.com [169.144.2.12]) by mailgate.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id OAA17996; Wed, 10 Jul 2002 14:49:15 -0400 (EDT)
Received: from whq-msgrtr-01.pit.comms.marconi.com (whq-msgrtr-01.pit.comms.marconi.com [169.144.2.221]) by mailman.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id OAA09679; Wed, 10 Jul 2002 14:49:16 -0400 (EDT)
Received: by whq-msgrtr-01.pit.comms.marconi.com with Internet Mail Service (5.5.2650.21) id <N5N1MQHH>; Wed, 10 Jul 2002 14:49:15 -0400
Message-ID: <39469E08BD83D411A3D900204840EC55822782@vie-msgusr-01.dc.fore.com>
From: "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
To: "'Chandrashekhar Appanna'" <achandra@cisco.com>
Cc: "'Bruno Rijsman'" <bwarijsman@hotmail.com>, idr@merit.edu
Subject: RE: INFORM proposal and MP-BGP (RFC 2858)
Date: Wed, 10 Jul 2002 14:49:12 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2650.21)
Content-Type: text/plain; charset="iso-8859-1"
Sender: owner-idr@merit.edu
Precedence: bulk

I am in favour of providing a tolerance counter that will drop the session
only after a max number of bad update messages are received. Thus if the
offending
peer does not correct or adjust itself in terms of capabilities, after a
number
of INFORM responses are sent to it, then and only then should the receiving
router drop the session. This will cover the case, where the INFORM is also
mis-processed in the "ships in the night" scenario.

Ben


> -----Original Message-----
> From: Chandrashekhar Appanna [mailto:achandra@cisco.com]
> Sent: Wednesday, July 10, 2002 2:09 PM
> To: Abarbanel, Benjamin
> Cc: 'Bruno Rijsman'; idr@merit.edu
> Subject: Re: INFORM proposal and MP-BGP (RFC 2858)
> 
> 
> 
> But that and the ability to INFORM is not the same. In the
> example you provide, the INFORM is not providing any extra 
> advantage. If the peer did not process/rcv the withdraw 
> why do you hope that it will rcv/process the INFORM?
> 
> I think the behavior of the withdraw itself is the problem here,
> It has to handle the case of a message cross that you describe, 
> in a well defined manner. 
> 
> -
> Chandra A.
> 
> On Wed, Jul 10, 2002 at 11:50:32AM -0400, Abarbanel, Benjamin wrote:
> > Correct. The protocol has to be elastic, since it carries 
> so much weight in 
> > controlling stability in the Internet.
> > 
> > Ben
> > 
> > > -----Original Message-----
> > > From: Bruno Rijsman [mailto:bwarijsman@hotmail.com]
> > > Sent: Wednesday, July 10, 2002 11:47 AM
> > > To: idr@merit.edu
> > > Cc: Benjamin.Abarbanel@Marconi.com
> > > Subject: RE: INFORM proposal and MP-BGP (RFC 2858)
> > > 
> > > 
> > > Re> Just because you receive an Update message with 
> capabilities that
> > > Re> were not negotiated at the OPEN message, should not 
> require the
> > > Re> peering session to terminate. The affect of all that 
> peer routes
> > > Re> going down will destabilize the network.
> > > 
> > > Also, keep in mind that in the case of dynamic capability 
> negotiation
> > > is possible and perfectly legal to receive a message 
> which relies on
> > > a capability which was recently withdrawn. Consider for 
> example the
> > > case where a route-refresh message and a capability message which
> > > withdraws the route-refresh capability cross each other.
> > > 
> > > 
> > > 
> > > _________________________________________________________________
> > > Send and receive Hotmail on your mobile device: 
http://mobile.msn.com
> > 


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id OAA09819 for <idr-archive@nic.merit.edu>; Wed, 10 Jul 2002 14:46:37 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id DDEE991328; Wed, 10 Jul 2002 14:44:30 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id A71BE91326; Wed, 10 Jul 2002 14:44:30 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id A950491321 for <idr@trapdoor.merit.edu>; Wed, 10 Jul 2002 14:44:26 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 989315DEE0; Wed, 10 Jul 2002 14:44:26 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from mailgate.pit.comms.marconi.com (mailgate.pit.comms.marconi.com [169.144.68.6]) by segue.merit.edu (Postfix) with ESMTP id 222195DD96 for <idr@merit.edu>; Wed, 10 Jul 2002 14:44:26 -0400 (EDT)
Received: from mailman.pit.comms.marconi.com (mailman.pit.comms.marconi.com [169.144.2.12]) by mailgate.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id OAA17639; Wed, 10 Jul 2002 14:44:23 -0400 (EDT)
Received: from whq-msgrtr-01.pit.comms.marconi.com (whq-msgrtr-01.pit.comms.marconi.com [169.144.2.221]) by mailman.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id OAA08707; Wed, 10 Jul 2002 14:44:24 -0400 (EDT)
Received: by whq-msgrtr-01.pit.comms.marconi.com with Internet Mail Service (5.5.2650.21) id <N5N1MP9M>; Wed, 10 Jul 2002 14:44:24 -0400
Message-ID: <39469E08BD83D411A3D900204840EC55822781@vie-msgusr-01.dc.fore.com>
From: "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
To: "'Pedro Roque Marques'" <roque@juniper.net>
Cc: "'idr@merit.edu'" <idr@merit.edu>
Subject: RE: INFORM proposal and MP-BGP (RFC 2858)
Date: Wed, 10 Jul 2002 14:44:15 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2650.21)
Content-Type: text/plain; charset="iso-8859-1"
Sender: owner-idr@merit.edu
Precedence: bulk

pedro:

As was mentioned in earlier email:

" Also, keep in mind that in the case of dynamic capability negotiation
  is possible and perfectly legal to receive a message which relies on
  a capability which was recently withdrawn. Consider for example the
  case where a route-refresh message and a capability message which
  withdraws the route-refresh capability cross each other."

In this case there was no corruption in the payload of the Update message
and yet the state machine was in a inconsistent state for a particular
capability due to the race condition. In your approach you would have 
considered that peer's routes invalid and terminated the session. In my,
approach I would not, but yet send back an INFORM to the inconsistent 
peer and expect it to take the proper corrective action. My way, gives a 
little room for the protocol to tolerate temporary inconsistency. 
Yours wont. 

> 
> As i've detailed in previous e-mails, the receipt of an UPDATE message
> for an NLRI that has not been negociated, can, imho, only be cause by
> either corruption of the state machine on your peer or of the
> message. When detecting such a scenario, you want to reset all
> information received from the corrupted peer as fast as possible,
> since poison information can be just about the most disturbing thing
> to a network.
> 
> 
> > I think a more gracefull mechanism suggested by others on this list
> > will help inform the originating error'd peer of this case and allow
> > it to either
> > remove the unapproved capability from future updates or self
> > terminate the peering session.
> 
> 
> I'm sure we must have very different experiences when it comes to
> UPDATE messages that include invalid NLRI...

I am in favour of ignoring the entire update message if it has invalid
capabilities. Thus previous updates are still maintained. I am also in
favour of defining a tolerance counter for invalid update messages such
that after a number of invalid updates, only then should the receiving peer
drop the session.

> 
> All the instances i've seen either in the lab or in the wild have been
> cases in which either one of the lenght fields have been incorrectly
> calculated or the message has been inadvertily trimed by the
> sender. You see an invalid NLRI not because the peer actually intended
> to send such NLRI but because, typically, you are reading garbage.
> 
> I fail to see the wisdom of adding new procedures to the protocol that
> concern itself with how to deal with semantically incorrect
> updates. In my experience, quite often you are not lucky enought that
> the message just becomes corrupted at the byte deailing the NLRI...
>
If we are talking of gracefull ways of handling invalid updates, then an
immediate session drop is unacceptable. If we are talking of non-gracefull
way, then you are right. 

The problem is, in my experiences taking abrupt action most of the time is
not necessary and usually leads to unstable and unconverged network. Cause
if you drop the peer, and the race condition occurs again and again after
a session restarts, you will be in a perpetual loop and you will experience
oscillations in routes and then that opens the topic to our previous thread
of converstation here on the list. Also if the offending router has a
software
bug that causes this condition, what makes you think if you drop the session
the offending router will magically correct its code?
 
> In most of the cases you have already received and accepted corrupted
> data that fail to trigger any validation checks.
> 
 Can you explain in detail the scenario you are talking about here?

> All scenarios that i've seen lead me to conclude that the desired
> outcome is to:
> 1) Make sure that you do not propagate junk received in the UPDATE
> that you detected the error.

Agree on this point.

> 2) Drop all information from the peer ASAP...

Disagree on this point. Why all previous good updates from this peer bad?

> 
> RFC 2858, probably because it was drafted before capability
> negociation was in place, is incorrect in specifying that one should
> ignore an update with a missing NLRI. It might have been the best
> aproach at the time it was drafted but it isn't now. It is just
> another instance where deployment experience advises one against
> following the RFCs literally... which is a concept that should be
> quite familiar to people dealing with BGP.
> 

I sure hope the people who drafted this, have deployment experience and are 
saying this for the majority of the cases.

Can we get some voices on this from the authors of this draft?

>   Pedro.
> 

Thanks,
Ben


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id OAA08758 for <idr-archive@nic.merit.edu>; Wed, 10 Jul 2002 14:14:04 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 4E2DB9130B; Wed, 10 Jul 2002 14:13:44 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 1F61291312; Wed, 10 Jul 2002 14:13:44 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 2D51C9130B for <idr@trapdoor.merit.edu>; Wed, 10 Jul 2002 14:13:41 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 162BF5DEDF; Wed, 10 Jul 2002 14:13:41 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from merlot.juniper.net (natint.juniper.net [207.17.136.129]) by segue.merit.edu (Postfix) with ESMTP id 70A5F5DDE4 for <idr@merit.edu>; Wed, 10 Jul 2002 14:13:40 -0400 (EDT)
Received: from roque-bsd.juniper.net (roque-bsd.juniper.net [172.17.12.183]) by merlot.juniper.net (8.11.3/8.11.3) with ESMTP id g6AIDdm89640; Wed, 10 Jul 2002 11:13:39 -0700 (PDT) (envelope-from roque@juniper.net)
Received: (from roque@localhost) by roque-bsd.juniper.net (8.11.6/8.9.3) id g6AIDd754981; Wed, 10 Jul 2002 11:13:39 -0700 (PDT) (envelope-from roque)
Date: Wed, 10 Jul 2002 11:13:39 -0700 (PDT)
Message-Id: <200207101813.g6AIDd754981@roque-bsd.juniper.net>
From: Pedro Roque Marques <roque@juniper.net>
MIME-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Transfer-Encoding: 7bit
To: "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
Cc: "'idr@merit.edu'" <idr@merit.edu>
Subject: RE: INFORM proposal and MP-BGP (RFC 2858)
In-Reply-To: <39469E08BD83D411A3D900204840EC5582277E@vie-msgusr-01.dc.fore.com>
References: <39469E08BD83D411A3D900204840EC5582277E@vie-msgusr-01.dc.fore.com>
X-Mailer: VM 6.34 under 19.16 "Lille" XEmacs Lucid
Sender: owner-idr@merit.edu
Precedence: bulk

Abarbanel, Benjamin writes:

> Just because you receive an Update message with capabilities that
> were not negotiated at the OPEN message, should not require the
> peering session to terminate. The affect of all that peer routes
> going down will destabilize the network.

As i've detailed in previous e-mails, the receipt of an UPDATE message
for an NLRI that has not been negociated, can, imho, only be cause by
either corruption of the state machine on your peer or of the
message. When detecting such a scenario, you want to reset all
information received from the corrupted peer as fast as possible,
since poison information can be just about the most disturbing thing
to a network.

I believe that proponents of this mechanism do need to demonstrate how
it would be useful...

> I think a more gracefull mechanism suggested by others on this list
> will help inform the originating error'd peer of this case and allow
> it to either
> remove the unapproved capability from future updates or self
> terminate the peering session.


I'm sure we must have very different experiences when it comes to
UPDATE messages that include invalid NLRI...

All the instances i've seen either in the lab or in the wild have been
cases in which either one of the lenght fields have been incorrectly
calculated or the message has been inadvertily trimed by the
sender. You see an invalid NLRI not because the peer actually intended
to send such NLRI but because, typically, you are reading garbage.

I fail to see the wisdom of adding new procedures to the protocol that
concern itself with how to deal with semantically incorrect
updates. In my experience, quite often you are not lucky enought that
the message just becomes corrupted at the byte deailing the NLRI...

In most of the cases you have already received and accepted corrupted
data that fail to trigger any validation checks.

All scenarios that i've seen lead me to conclude that the desired
outcome is to:
1) Make sure that you do not propagate junk received in the UPDATE
that you detected the error.
2) Drop all information from the peer ASAP...

RFC 2858, probably because it was drafted before capability
negociation was in place, is incorrect in specifying that one should
ignore an update with a missing NLRI. It might have been the best
aproach at the time it was drafted but it isn't now. It is just
another instance where deployment experience advises one against
following the RFCs literally... which is a concept that should be
quite familiar to people dealing with BGP.

  Pedro.


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id OAA08624 for <idr-archive@nic.merit.edu>; Wed, 10 Jul 2002 14:10:00 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 4047D9133C; Wed, 10 Jul 2002 14:08:51 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id E9EFF9133B; Wed, 10 Jul 2002 14:08:50 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 5F84291312 for <idr@trapdoor.merit.edu>; Wed, 10 Jul 2002 14:08:47 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 48AE85DDEB; Wed, 10 Jul 2002 14:08:47 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from cisco.com (cypher.cisco.com [171.69.11.18]) by segue.merit.edu (Postfix) with ESMTP id B39115DD90 for <idr@merit.edu>; Wed, 10 Jul 2002 14:08:46 -0400 (EDT)
Received: (from achandra@localhost) by cisco.com (8.8.8/2.6/Cisco List Logging/8.8.8) id LAA09267; Wed, 10 Jul 2002 11:08:41 -0700 (PDT)
Date: Wed, 10 Jul 2002 11:08:41 -0700
From: Chandrashekhar Appanna <achandra@cisco.com>
To: "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
Cc: "'Bruno Rijsman'" <bwarijsman@hotmail.com>, idr@merit.edu
Subject: Re: INFORM proposal and MP-BGP (RFC 2858)
Message-ID: <20020710110841.B18943@cypher.cisco.com>
References: <39469E08BD83D411A3D900204840EC5582277F@vie-msgusr-01.dc.fore.com>
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
User-Agent: Mutt/1.2.5.1i
In-Reply-To: <39469E08BD83D411A3D900204840EC5582277F@vie-msgusr-01.dc.fore.com>; from Benjamin.Abarbanel@Marconi.com on Wed, Jul 10, 2002 at 11:50:32AM -0400
Sender: owner-idr@merit.edu
Precedence: bulk

But that and the ability to INFORM is not the same. In the
example you provide, the INFORM is not providing any extra 
advantage. If the peer did not process/rcv the withdraw 
why do you hope that it will rcv/process the INFORM?

I think the behavior of the withdraw itself is the problem here,
It has to handle the case of a message cross that you describe, 
in a well defined manner. 

-
Chandra A.

On Wed, Jul 10, 2002 at 11:50:32AM -0400, Abarbanel, Benjamin wrote:
> Correct. The protocol has to be elastic, since it carries so much weight in 
> controlling stability in the Internet.
> 
> Ben
> 
> > -----Original Message-----
> > From: Bruno Rijsman [mailto:bwarijsman@hotmail.com]
> > Sent: Wednesday, July 10, 2002 11:47 AM
> > To: idr@merit.edu
> > Cc: Benjamin.Abarbanel@Marconi.com
> > Subject: RE: INFORM proposal and MP-BGP (RFC 2858)
> > 
> > 
> > Re> Just because you receive an Update message with capabilities that
> > Re> were not negotiated at the OPEN message, should not require the
> > Re> peering session to terminate. The affect of all that peer routes
> > Re> going down will destabilize the network.
> > 
> > Also, keep in mind that in the case of dynamic capability negotiation
> > is possible and perfectly legal to receive a message which relies on
> > a capability which was recently withdrawn. Consider for example the
> > case where a route-refresh message and a capability message which
> > withdraws the route-refresh capability cross each other.
> > 
> > 
> > 
> > _________________________________________________________________
> > Send and receive Hotmail on your mobile device: http://mobile.msn.com
> > 


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id LAA03968 for <idr-archive@nic.merit.edu>; Wed, 10 Jul 2002 11:51:22 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 7755391316; Wed, 10 Jul 2002 11:50:45 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 467AA91317; Wed, 10 Jul 2002 11:50:45 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 5B7B891316 for <idr@trapdoor.merit.edu>; Wed, 10 Jul 2002 11:50:44 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 470F75DEB1; Wed, 10 Jul 2002 11:50:44 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from mailgate.pit.comms.marconi.com (mailgate.pit.comms.marconi.com [169.144.68.6]) by segue.merit.edu (Postfix) with ESMTP id C417F5DDA3 for <idr@merit.edu>; Wed, 10 Jul 2002 11:50:43 -0400 (EDT)
Received: from mailman.pit.comms.marconi.com (mailman.pit.comms.marconi.com [169.144.2.12]) by mailgate.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id LAA06808; Wed, 10 Jul 2002 11:50:41 -0400 (EDT)
Received: from whq-msgrtr-01.pit.comms.marconi.com (whq-msgrtr-01.pit.comms.marconi.com [169.144.2.221]) by mailman.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id LAA06239; Wed, 10 Jul 2002 11:50:42 -0400 (EDT)
Received: by whq-msgrtr-01.pit.comms.marconi.com with Internet Mail Service (5.5.2650.21) id <N5N1MHDT>; Wed, 10 Jul 2002 11:50:41 -0400
Message-ID: <39469E08BD83D411A3D900204840EC5582277F@vie-msgusr-01.dc.fore.com>
From: "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
To: "'Bruno Rijsman'" <bwarijsman@hotmail.com>, idr@merit.edu
Cc: "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
Subject: RE: INFORM proposal and MP-BGP (RFC 2858)
Date: Wed, 10 Jul 2002 11:50:32 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2650.21)
Content-Type: text/plain; charset="iso-8859-1"
Sender: owner-idr@merit.edu
Precedence: bulk

Correct. The protocol has to be elastic, since it carries so much weight in 
controlling stability in the Internet.

Ben

> -----Original Message-----
> From: Bruno Rijsman [mailto:bwarijsman@hotmail.com]
> Sent: Wednesday, July 10, 2002 11:47 AM
> To: idr@merit.edu
> Cc: Benjamin.Abarbanel@Marconi.com
> Subject: RE: INFORM proposal and MP-BGP (RFC 2858)
> 
> 
> Re> Just because you receive an Update message with capabilities that
> Re> were not negotiated at the OPEN message, should not require the
> Re> peering session to terminate. The affect of all that peer routes
> Re> going down will destabilize the network.
> 
> Also, keep in mind that in the case of dynamic capability negotiation
> is possible and perfectly legal to receive a message which relies on
> a capability which was recently withdrawn. Consider for example the
> case where a route-refresh message and a capability message which
> withdraws the route-refresh capability cross each other.
> 
> 
> 
> _________________________________________________________________
> Send and receive Hotmail on your mobile device: http://mobile.msn.com
> 


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id LAA03821 for <idr-archive@nic.merit.edu>; Wed, 10 Jul 2002 11:47:45 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 0A8409130C; Wed, 10 Jul 2002 11:47:20 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id C5C0A91316; Wed, 10 Jul 2002 11:47:19 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 8B7FB9130C for <idr@trapdoor.merit.edu>; Wed, 10 Jul 2002 11:47:18 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 6FFB45DDA3; Wed, 10 Jul 2002 11:47:18 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from hotmail.com (f41.law4.hotmail.com [216.33.149.41]) by segue.merit.edu (Postfix) with ESMTP id 2F08A5DEB1 for <idr@merit.edu>; Wed, 10 Jul 2002 11:47:18 -0400 (EDT)
Received: from mail pickup service by hotmail.com with Microsoft SMTPSVC; Wed, 10 Jul 2002 08:47:17 -0700
Received: from 65.194.140.2 by lw4fd.law4.hotmail.msn.com with HTTP; Wed, 10 Jul 2002 15:47:17 GMT
X-Originating-IP: [65.194.140.2]
From: "Bruno Rijsman" <bwarijsman@hotmail.com>
To: idr@merit.edu
Cc: Benjamin.Abarbanel@Marconi.com
Subject: RE: INFORM proposal and MP-BGP (RFC 2858)
Date: Wed, 10 Jul 2002 11:47:17 -0400
Mime-Version: 1.0
Content-Type: text/plain; format=flowed
Message-ID: <F41Luwi23mZaendQxwZ0000d8b4@hotmail.com>
X-OriginalArrivalTime: 10 Jul 2002 15:47:17.0759 (UTC) FILETIME=[11D2C0F0:01C22829]
Sender: owner-idr@merit.edu
Precedence: bulk

Re> Just because you receive an Update message with capabilities that
Re> were not negotiated at the OPEN message, should not require the
Re> peering session to terminate. The affect of all that peer routes
Re> going down will destabilize the network.

Also, keep in mind that in the case of dynamic capability negotiation
is possible and perfectly legal to receive a message which relies on
a capability which was recently withdrawn. Consider for example the
case where a route-refresh message and a capability message which
withdraws the route-refresh capability cross each other.



_________________________________________________________________
Send and receive Hotmail on your mobile device: http://mobile.msn.com



Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id LAA03608 for <idr-archive@nic.merit.edu>; Wed, 10 Jul 2002 11:40:54 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 477AB91210; Wed, 10 Jul 2002 11:40:26 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 14BCE91320; Wed, 10 Jul 2002 11:40:26 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 55CDC91210 for <idr@trapdoor.merit.edu>; Wed, 10 Jul 2002 11:40:21 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 428DC5DEDD; Wed, 10 Jul 2002 11:40:21 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from mailgate.pit.comms.marconi.com (mailgate.pit.comms.marconi.com [169.144.68.6]) by segue.merit.edu (Postfix) with ESMTP id BFF5F5DDA3 for <idr@merit.edu>; Wed, 10 Jul 2002 11:40:20 -0400 (EDT)
Received: from mailman.pit.comms.marconi.com (mailman.pit.comms.marconi.com [169.144.2.12]) by mailgate.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id LAA06038; Wed, 10 Jul 2002 11:40:18 -0400 (EDT)
Received: from whq-msgrtr-01.pit.comms.marconi.com (whq-msgrtr-01.pit.comms.marconi.com [169.144.2.221]) by mailman.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id LAA03683; Wed, 10 Jul 2002 11:40:18 -0400 (EDT)
Received: by whq-msgrtr-01.pit.comms.marconi.com with Internet Mail Service (5.5.2650.21) id <N5N1MGK0>; Wed, 10 Jul 2002 11:40:17 -0400
Message-ID: <39469E08BD83D411A3D900204840EC5582277E@vie-msgusr-01.dc.fore.com>
From: "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
To: "'Pedro Roque Marques'" <roque@juniper.net>, Jeffrey Haas <jhaas@nexthop.com>
Cc: "'idr@merit.edu'" <idr@merit.edu>
Subject: RE: INFORM proposal and MP-BGP (RFC 2858)
Date: Wed, 10 Jul 2002 11:40:08 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2650.21)
Content-Type: text/plain; charset="iso-8859-1"
Sender: owner-idr@merit.edu
Precedence: bulk

Pedro and Jeff:


> -----Original Message-----
> From: Pedro Roque Marques [mailto:roque@juniper.net]
> Sent: Tuesday, July 09, 2002 1:45 PM
> To: Jeffrey Haas
> Cc: 'idr@merit.edu'
> Subject: Re: INFORM proposal and MP-BGP (RFC 2858)
> 
> 
> Jeffrey Haas writes:
> 
> > The operational community continues to demand resiliency from the
> > BGP protocol.
> 
> I believe you get less resiliency, not more, from accepting corrupted
> messages. If you have a peer that is sending you a given NLRI and it
> didn't negociate it then either:
> 1) The peers state machine is completly out of wack, case in which you
> want to reset it.

Just because you receive an Update message with capabilities that were not
negotiated at the OPEN message, should not require the peering session to
terminate. The affect of all that peer routes going down will destabilize
the network. 

I think a more gracefull mechanism suggested by others on this list will 
help inform the originating error'd peer of this case and allow it to either

remove the unapproved capability from future updates or self terminate the 
peering session. It is more gracefull for the originating peer to decide if
its 
publicized routes are compromised. The assumption being is that the
advertised 
routes are most likely stable and reachable and do not constitute any harm
to 
the topology. 

The receiving peer should ignore all subsequent capabilities not informed
during 
the session OPEN exchange. We would need a NON session terminating warning
message like the INFORM suggested by others to do this:

  "INFORM would at least make this non-silent.  However, additional pushback
   might be warranted.  Upon receipt of an INFORM (or similar) message
   the receiving peer may wish to choose one or more of the following
actions:

   o Just log it per INFORM
   o Stop sending this AFI/SAFI
   o Reset the peering session (contrary to the wishes of the peer?)
  "

> 2) It is a corrupted message, case in which you wish to reset it.
> 
> >  If we can avoid sending a NOTIFICATION and gracefully
> > deal with someone elses buggy code, thats a Good Thing.

It is very difficult to determine if the offending peer has buggy code or a
minor bug. Just taking down all its routes on the first sign of trouble
would not be wise.

Also, multiprotocol capabilities are new and optional things they should not
destablize the network if there is doubt on their intentions.

> 
> > Sending an INFORM would be nice here, if such a thing existed.
> 
> Personally i believe that this message is completly unnecessary.
> 
>   Pedro.
>

I think BGP needs many mechanisms for gracefullness and tolerance of
imprecise
information in order to combat the Global route instability that is caused
when
BGP routers just tear down their peering sessions at the first sign of
trouble.

Ben 


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id NAA15520 for <idr-archive@nic.merit.edu>; Tue, 9 Jul 2002 13:48:32 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id B4550912C6; Tue,  9 Jul 2002 13:45:31 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 09D2E912C9; Tue,  9 Jul 2002 13:45:31 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 873BE912C6 for <idr@trapdoor.merit.edu>; Tue,  9 Jul 2002 13:45:06 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 14EEE5DE9C; Tue,  9 Jul 2002 13:45:06 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from merlot.juniper.net (natint.juniper.net [207.17.136.129]) by segue.merit.edu (Postfix) with ESMTP id 554AE5DE97 for <idr@merit.edu>; Tue,  9 Jul 2002 13:45:05 -0400 (EDT)
Received: from roque-bsd.juniper.net (roque-bsd.juniper.net [172.17.12.183]) by merlot.juniper.net (8.11.3/8.11.3) with ESMTP id g69Hj4m18600; Tue, 9 Jul 2002 10:45:04 -0700 (PDT) (envelope-from roque@juniper.net)
Received: (from roque@localhost) by roque-bsd.juniper.net (8.11.6/8.9.3) id g69Hj4U52488; Tue, 9 Jul 2002 10:45:04 -0700 (PDT) (envelope-from roque)
Date: Tue, 9 Jul 2002 10:45:04 -0700 (PDT)
Message-Id: <200207091745.g69Hj4U52488@roque-bsd.juniper.net>
From: Pedro Roque Marques <roque@juniper.net>
MIME-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Transfer-Encoding: 7bit
To: Jeffrey Haas <jhaas@nexthop.com>
Cc: "'idr@merit.edu'" <idr@merit.edu>
Subject: Re: INFORM proposal and MP-BGP (RFC 2858)
In-Reply-To: <20020709122542.C22703@nexthop.com>
References: <1117F7D44159934FB116E36F4ABF221B020AF846@celox-ma1-ems1.celoxnetworks.com> <20020709122542.C22703@nexthop.com>
X-Mailer: VM 6.34 under 19.16 "Lille" XEmacs Lucid
Sender: owner-idr@merit.edu
Precedence: bulk

Jeffrey Haas writes:

> The operational community continues to demand resiliency from the
> BGP protocol.

I believe you get less resiliency, not more, from accepting corrupted
messages. If you have a peer that is sending you a given NLRI and it
didn't negociate it then either:
1) The peers state machine is completly out of wack, case in which you
want to reset it.
2) It is a corrupted message, case in which you wish to reset it.

>  If we can avoid sending a NOTIFICATION and gracefully
> deal with someone elses buggy code, thats a Good Thing.

> Sending an INFORM would be nice here, if such a thing existed.

Personally i believe that this message is completly unnecessary.

  Pedro.


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id MAA13604 for <idr-archive@nic.merit.edu>; Tue, 9 Jul 2002 12:54:57 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id B49AC912B9; Tue,  9 Jul 2002 12:54:36 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 8627D912BA; Tue,  9 Jul 2002 12:54:36 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id A2E61912B9 for <idr@trapdoor.merit.edu>; Tue,  9 Jul 2002 12:54:35 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 8C6985DE1F; Tue,  9 Jul 2002 12:54:35 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from mailgate.pit.comms.marconi.com (mailgate.pit.comms.marconi.com [169.144.68.6]) by segue.merit.edu (Postfix) with ESMTP id 45FBA5DDA6 for <idr@merit.edu>; Tue,  9 Jul 2002 12:54:35 -0400 (EDT)
Received: from mailman.pit.comms.marconi.com (mailman.pit.comms.marconi.com [169.144.2.12]) by mailgate.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id MAA11339; Tue, 9 Jul 2002 12:54:33 -0400 (EDT)
Received: from whq-msgrtr-01.pit.comms.marconi.com (whq-msgrtr-01.pit.comms.marconi.com [169.144.2.221]) by mailman.pit.comms.marconi.com (8.9.3/8.9.3) with ESMTP id MAA20370; Tue, 9 Jul 2002 12:54:33 -0400 (EDT)
Received: by whq-msgrtr-01.pit.comms.marconi.com with Internet Mail Service (5.5.2650.21) id <N5N1K651>; Tue, 9 Jul 2002 12:54:33 -0400
Message-ID: <39469E08BD83D411A3D900204840EC5582277B@vie-msgusr-01.dc.fore.com>
From: "Abarbanel, Benjamin" <Benjamin.Abarbanel@Marconi.com>
To: "'Natale, Jonathan'" <JNatale@celoxnetworks.com>, "'Jeffrey Haas'" <jhaas@nexthop.com>, "'idr@merit.edu'" <idr@merit.edu>
Subject: RE: INFORM proposal and MP-BGP (RFC 2858)
Date: Tue, 9 Jul 2002 12:54:32 -0400 
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2650.21)
Content-Type: text/plain; charset="iso-8859-1"
Sender: owner-idr@merit.edu
Precedence: bulk

> Also:
> "If a BGP speaker that supports a certain capability determines that
>    its peer doesn't support this capability, the speaker MAY send a
>    NOTIFICATION message to the peer, and terminate peering 
> (see Section
>    "Extensions to Error Handling" for more details). The Error Subcode
>    in the message is set to Unsupported Capability. The message SHOULD
>    contain the capability (capabilities) that causes the 
> speaker to send
>    the message.  The decision to send the message and 
> terminate peering
>    is local to the speaker. If terminated, such peering SHOULD NOT be
>    re-established automatically. --draft-ietf-idr-rfc2842bis-02.txt,
> "Capabilities Advertisement with BGP-4"
> 
> Seems harsh, I would think that the preferred behavior is to 
> re-open w/o the
> offending capability.  A major vendor does not and they have 
> a back door cmd
> to work around it.  They also have a bug on it but based on history
> (AS_CONFED_SET <-> AS_CONFED_SEQUENCE, deterministic med, etc...) the
> rfc/draft seem to follow them NOT the other way around. Any 
> thoughts???

For network deployment phasing reasons and backward compatibility reasons.
We should not drop a peer just because we recieved a message that contained
unrecognized capability(s). I vote the spec be changed to accomodate an
evolving network. The lesser capable BGP router should silently ignore the
unrecognized capability(s) or the entire message if there is nothing else
recognizable in it. The more advanced BGP capable router upon detecting a
peer that does not understand certain advanced capability(s) should merely
not send any more advanced capability(s) to this peer. The two routers
should also log the event for their operators to review.

Also, if we drop peers do to unrecognized messages or capabilities, we make
it easier for hacker attacks causing network outages, or the topology to get
unstable.  

Right?

Ben


 
 


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id MAA12974 for <idr-archive@nic.merit.edu>; Tue, 9 Jul 2002 12:35:43 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id D7515912B8; Tue,  9 Jul 2002 12:35:23 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 4E1AC912B9; Tue,  9 Jul 2002 12:35:22 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id E7298912B8 for <idr@trapdoor.merit.edu>; Tue,  9 Jul 2002 12:35:20 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id D62D25DE97; Tue,  9 Jul 2002 12:35:20 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from celox-ma1-ems1.celoxnetworks.com (celox-ma1-imap1.celoxnetworks.com [12.40.60.233]) by segue.merit.edu (Postfix) with ESMTP id 9CB765DDA6 for <idr@merit.edu>; Tue,  9 Jul 2002 12:35:20 -0400 (EDT)
Received: by celox-ma1-ems1.celoxnetworks.com with Internet Mail Service (5.5.2653.19) id <L9JQDNBG>; Tue, 9 Jul 2002 12:35:20 -0400
Message-ID: <1117F7D44159934FB116E36F4ABF221B020AF848@celox-ma1-ems1.celoxnetworks.com>
From: "Natale, Jonathan" <JNatale@celoxnetworks.com>
To: "'Jeffrey Haas'" <jhaas@nexthop.com>, "'idr@merit.edu'" <idr@merit.edu>
Subject: RE: INFORM proposal and MP-BGP (RFC 2858)
Date: Tue, 9 Jul 2002 12:35:19 -0400 
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
Content-Type: text/plain
Sender: owner-idr@merit.edu
Precedence: bulk

Thank you very much for the prompt reply.  I agree completely.

My question is this: If you receive a route refresh when you did not
advertise that capability at all (and/or your code does not support it at
all) then you should reset the peer with a 1/3 (Message Header Error/Bad
Message Type) notification. Right?

Also:
"If a BGP speaker that supports a certain capability determines that
   its peer doesn't support this capability, the speaker MAY send a
   NOTIFICATION message to the peer, and terminate peering (see Section
   "Extensions to Error Handling" for more details). The Error Subcode
   in the message is set to Unsupported Capability. The message SHOULD
   contain the capability (capabilities) that causes the speaker to send
   the message.  The decision to send the message and terminate peering
   is local to the speaker. If terminated, such peering SHOULD NOT be
   re-established automatically. --draft-ietf-idr-rfc2842bis-02.txt,
"Capabilities Advertisement with BGP-4"

Seems harsh, I would think that the preferred behavior is to re-open w/o the
offending capability.  A major vendor does not and they have a back door cmd
to work around it.  They also have a bug on it but based on history
(AS_CONFED_SET <-> AS_CONFED_SEQUENCE, deterministic med, etc...) the
rfc/draft seem to follow them NOT the other way around. Any thoughts???

-----Original Message-----
From: Jeffrey Haas [mailto:jhaas@nexthop.com] 
Sent: Tuesday, July 09, 2002 12:26 PM
To: Natale, Jonathan
Cc: 'idr@merit.edu'
Subject: Re: INFORM proposal and MP-BGP (RFC 2858)

On Tue, Jul 09, 2002 at 12:19:23PM -0400, Natale, Jonathan wrote:
> : "If a BGP speaker receives from its peer a ROUTE-REFRESH message with
>    the <AFI, SAFI> that the speaker didn't advertise to the peer at the
>    session establishment time via capability advertisement, the speaker
>    shall ignore such a message." - RFC2918, "Route Refresh Capability for
> BGP-4"
> 
> shouldn't this be an error/notification???

The operational community continues to demand resiliency from the BGP
protocol.  If we can avoid sending a NOTIFICATION and gracefully
deal with someone elses buggy code, thats a Good Thing.

Sending an INFORM would be nice here, if such a thing existed.

-- 
Jeff Haas 
NextHop Technologies


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id MAA12802 for <idr-archive@nic.merit.edu>; Tue, 9 Jul 2002 12:31:48 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id D0EE1912B7; Tue,  9 Jul 2002 12:30:38 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 98549912B8; Tue,  9 Jul 2002 12:30:38 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 885EE912B7 for <idr@trapdoor.merit.edu>; Tue,  9 Jul 2002 12:30:37 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 5FA155DDB1; Tue,  9 Jul 2002 12:30:37 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from presque.djinesys.com (dns.nexthop.com [64.211.218.216]) by segue.merit.edu (Postfix) with ESMTP id EFE5F5DDA6 for <idr@merit.edu>; Tue,  9 Jul 2002 12:30:36 -0400 (EDT)
Received: (from root@localhost) by presque.djinesys.com (8.11.3/8.11.1) id g69GUYl39648; Tue, 9 Jul 2002 12:30:34 -0400 (EDT) (envelope-from jhaas@jhaas.nexthop.com)
Received: from jhaas.nexthop.com (jhaas.nexthop.com [64.211.218.31]) by presque.djinesys.com (8.11.3/8.11.1) with ESMTP id g69GUV139640; Tue, 9 Jul 2002 12:30:31 -0400 (EDT) (envelope-from jhaas@jhaas.nexthop.com)
Received: (from jhaas@localhost) by jhaas.nexthop.com (8.11.3nb1/8.11.3) id g69GUVg22945; Tue, 9 Jul 2002 12:30:31 -0400 (EDT)
Date: Tue, 9 Jul 2002 12:30:31 -0400
From: Jeffrey Haas <jhaas@nexthop.com>
To: "Natale, Jonathan" <JNatale@celoxnetworks.com>
Cc: "'Fryczke, Pawel'" <Pawel.Fryczke@intel.com>, idr@merit.edu
Subject: Re: Aggregation and MULTI_EXIT_DISC, NEXT_HOP
Message-ID: <20020709123031.D22703@nexthop.com>
References: <1117F7D44159934FB116E36F4ABF221B020AF843@celox-ma1-ems1.celoxnetworks.com>
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
User-Agent: Mutt/1.2.5i
In-Reply-To: <1117F7D44159934FB116E36F4ABF221B020AF843@celox-ma1-ems1.celoxnetworks.com>; from JNatale@celoxnetworks.com on Tue, Jul 09, 2002 at 10:48:04AM -0400
X-Virus-Scanned: by AMaViS perl-11
Sender: owner-idr@merit.edu
Precedence: bulk

On Tue, Jul 09, 2002 at 10:48:04AM -0400, Natale, Jonathan wrote:
> I raised this issue more than two years ago.  What is the story on bgp?  Why
> is the draft/rfc so far off base?  I would be happy to provide a laundry
> list of things, if that will help.

That list would be good.  At the very least, it lets us adjust the
spec to "should" versus "shall" if we decide that it makes sense to
do so.

> I think the MED was fixed in a major vendor's code with the ~= deterministic
> med cmd (or so I was told then, but now the story on that cmd seem
> different).

deterministic med fixes the MED election to follow the spec.  I
don't believe it has anything to do with aggregation.

> The next hop was never "fixed".  The only answer for why not to aggregate if
> nxhp not the same was that it has something to do w/ mcast RPF... but I
> never fully digested this.

Once Upon a Time (-12?) the active route was "echoed" back to
the external peer which it was learned from to facilitate multicast
RPF checks.  No one appeared to use this, so it was removed from the
spec.  If the only reason for the nexthop to be the same in aggregation
was for a similar reason, it may be worth removing the restriction.

-- 
Jeff Haas 
NextHop Technologies


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id MAA12638 for <idr-archive@nic.merit.edu>; Tue, 9 Jul 2002 12:26:27 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 3FA27912B6; Tue,  9 Jul 2002 12:25:48 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 08CEF912B7; Tue,  9 Jul 2002 12:25:47 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id F1139912B6 for <idr@trapdoor.merit.edu>; Tue,  9 Jul 2002 12:25:46 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id DF4A55DE02; Tue,  9 Jul 2002 12:25:46 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from presque.djinesys.com (dns.nexthop.com [64.211.218.216]) by segue.merit.edu (Postfix) with ESMTP id 58F3D5DE8F for <idr@merit.edu>; Tue,  9 Jul 2002 12:25:46 -0400 (EDT)
Received: (from root@localhost) by presque.djinesys.com (8.11.3/8.11.1) id g69GPjl39385; Tue, 9 Jul 2002 12:25:45 -0400 (EDT) (envelope-from jhaas@jhaas.nexthop.com)
Received: from jhaas.nexthop.com (jhaas.nexthop.com [64.211.218.31]) by presque.djinesys.com (8.11.3/8.11.1) with ESMTP id g69GPg139378; Tue, 9 Jul 2002 12:25:42 -0400 (EDT) (envelope-from jhaas@jhaas.nexthop.com)
Received: (from jhaas@localhost) by jhaas.nexthop.com (8.11.3nb1/8.11.3) id g69GPgs22921; Tue, 9 Jul 2002 12:25:42 -0400 (EDT)
Date: Tue, 9 Jul 2002 12:25:42 -0400
From: Jeffrey Haas <jhaas@nexthop.com>
To: "Natale, Jonathan" <JNatale@celoxnetworks.com>
Cc: "'idr@merit.edu'" <idr@merit.edu>
Subject: Re: INFORM proposal and MP-BGP (RFC 2858)
Message-ID: <20020709122542.C22703@nexthop.com>
References: <1117F7D44159934FB116E36F4ABF221B020AF846@celox-ma1-ems1.celoxnetworks.com>
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
User-Agent: Mutt/1.2.5i
In-Reply-To: <1117F7D44159934FB116E36F4ABF221B020AF846@celox-ma1-ems1.celoxnetworks.com>; from JNatale@celoxnetworks.com on Tue, Jul 09, 2002 at 12:19:23PM -0400
X-Virus-Scanned: by AMaViS perl-11
Sender: owner-idr@merit.edu
Precedence: bulk

On Tue, Jul 09, 2002 at 12:19:23PM -0400, Natale, Jonathan wrote:
> : "If a BGP speaker receives from its peer a ROUTE-REFRESH message with
>    the <AFI, SAFI> that the speaker didn't advertise to the peer at the
>    session establishment time via capability advertisement, the speaker
>    shall ignore such a message." - RFC2918, "Route Refresh Capability for
> BGP-4"
> 
> shouldn't this be an error/notification???

The operational community continues to demand resiliency from the BGP
protocol.  If we can avoid sending a NOTIFICATION and gracefully
deal with someone elses buggy code, thats a Good Thing.

Sending an INFORM would be nice here, if such a thing existed.

-- 
Jeff Haas 
NextHop Technologies


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id MAA12540 for <idr-archive@nic.merit.edu>; Tue, 9 Jul 2002 12:23:21 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 14777912B5; Tue,  9 Jul 2002 12:23:02 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id D1D69912B6; Tue,  9 Jul 2002 12:23:01 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id D45EB912B5 for <idr@trapdoor.merit.edu>; Tue,  9 Jul 2002 12:23:00 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id B8A1D5DE8F; Tue,  9 Jul 2002 12:23:00 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from presque.djinesys.com (dns.nexthop.com [64.211.218.216]) by segue.merit.edu (Postfix) with ESMTP id 35A915DDA6 for <idr@merit.edu>; Tue,  9 Jul 2002 12:23:00 -0400 (EDT)
Received: (from root@localhost) by presque.djinesys.com (8.11.3/8.11.1) id g69GMwG39256; Tue, 9 Jul 2002 12:22:58 -0400 (EDT) (envelope-from jhaas@jhaas.nexthop.com)
Received: from jhaas.nexthop.com (jhaas.nexthop.com [64.211.218.31]) by presque.djinesys.com (8.11.3/8.11.1) with ESMTP id g69GMt139249; Tue, 9 Jul 2002 12:22:55 -0400 (EDT) (envelope-from jhaas@jhaas.nexthop.com)
Received: (from jhaas@localhost) by jhaas.nexthop.com (8.11.3nb1/8.11.3) id g69GMts22908; Tue, 9 Jul 2002 12:22:55 -0400 (EDT)
Date: Tue, 9 Jul 2002 12:22:55 -0400
From: Jeffrey Haas <jhaas@nexthop.com>
To: "Natale, Jonathan" <JNatale@celoxnetworks.com>
Cc: idr@merit.edu
Subject: Re: INFORM proposal and MP-BGP (RFC 2858)
Message-ID: <20020709122255.B22703@nexthop.com>
References: <1117F7D44159934FB116E36F4ABF221B020AF845@celox-ma1-ems1.celoxnetworks.com>
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
User-Agent: Mutt/1.2.5i
In-Reply-To: <1117F7D44159934FB116E36F4ABF221B020AF845@celox-ma1-ems1.celoxnetworks.com>; from JNatale@celoxnetworks.com on Tue, Jul 09, 2002 at 12:17:28PM -0400
X-Virus-Scanned: by AMaViS perl-11
Sender: owner-idr@merit.edu
Precedence: bulk

On Tue, Jul 09, 2002 at 12:17:28PM -0400, Natale, Jonathan wrote:
> Where is this proposal? URL???

:         Title           : BGPv4 INFORM message
:         Author(s)       : G. Nalawade et al.
:         Filename        : draft-nalawade-bgp-inform-00.txt
:         Pages           : 10
:         Date            : 24-Jun-02
:
: This document defines a new message type, the BGP INFORM
: message that communicates Informational data and operational
: warnings without resetting the peering session.
:
: A URL for this Internet-Draft is:
: http://www.ietf.org/internet-drafts/draft-nalawade-bgp-inform-00.txt

-- 
Jeff Haas 
NextHop Technologies


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id MAA12358 for <idr-archive@nic.merit.edu>; Tue, 9 Jul 2002 12:19:36 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 75571912B4; Tue,  9 Jul 2002 12:19:27 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 4AF98912B5; Tue,  9 Jul 2002 12:19:27 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id D45A1912B4 for <idr@trapdoor.merit.edu>; Tue,  9 Jul 2002 12:19:24 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id BE69F5DE02; Tue,  9 Jul 2002 12:19:24 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from celox-ma1-ems1.celoxnetworks.com (celox-ma1-imap1.celoxnetworks.com [12.40.60.233]) by segue.merit.edu (Postfix) with ESMTP id 779275DDA6 for <idr@merit.edu>; Tue,  9 Jul 2002 12:19:24 -0400 (EDT)
Received: by celox-ma1-ems1.celoxnetworks.com with Internet Mail Service (5.5.2653.19) id <L9JQDM0P>; Tue, 9 Jul 2002 12:19:24 -0400
Message-ID: <1117F7D44159934FB116E36F4ABF221B020AF846@celox-ma1-ems1.celoxnetworks.com>
From: "Natale, Jonathan" <JNatale@celoxnetworks.com>
To: "'idr@merit.edu'" <idr@merit.edu>
Subject: RE: INFORM proposal and MP-BGP (RFC 2858)
Date: Tue, 9 Jul 2002 12:19:23 -0400 
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
Content-Type: text/plain
Sender: owner-idr@merit.edu
Precedence: bulk

: "If a BGP speaker receives from its peer a ROUTE-REFRESH message with
   the <AFI, SAFI> that the speaker didn't advertise to the peer at the
   session establishment time via capability advertisement, the speaker
   shall ignore such a message." - RFC2918, "Route Refresh Capability for
BGP-4"

shouldn't this be an error/notification???




Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id MAA12347 for <idr-archive@nic.merit.edu>; Tue, 9 Jul 2002 12:19:31 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 055B3912B2; Tue,  9 Jul 2002 12:19:09 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id C2FCE912B4; Tue,  9 Jul 2002 12:19:08 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id CD9F5912B2 for <idr@trapdoor.merit.edu>; Tue,  9 Jul 2002 12:17:29 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id A7B555DE02; Tue,  9 Jul 2002 12:17:29 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from celox-ma1-ems1.celoxnetworks.com (celox-ma1-imap1.celoxnetworks.com [12.40.60.233]) by segue.merit.edu (Postfix) with ESMTP id 701EA5DDA6 for <idr@merit.edu>; Tue,  9 Jul 2002 12:17:29 -0400 (EDT)
Received: by celox-ma1-ems1.celoxnetworks.com with Internet Mail Service (5.5.2653.19) id <L9JQDM02>; Tue, 9 Jul 2002 12:17:29 -0400
Message-ID: <1117F7D44159934FB116E36F4ABF221B020AF845@celox-ma1-ems1.celoxnetworks.com>
From: "Natale, Jonathan" <JNatale@celoxnetworks.com>
To: "'Jeffrey Haas'" <jhaas@nexthop.com>, idr@merit.edu
Subject: RE: INFORM proposal and MP-BGP (RFC 2858)
Date: Tue, 9 Jul 2002 12:17:28 -0400 
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
Content-Type: text/plain
Sender: owner-idr@merit.edu
Precedence: bulk

Where is this proposal? URL???



-----Original Message-----
From: Jeffrey Haas [mailto:jhaas@nexthop.com] 
Sent: Monday, July 08, 2002 2:43 PM
To: idr@merit.edu
Subject: INFORM proposal and MP-BGP (RFC 2858)

The proposal for the INFORM message gives the opportunity to redress
an open issue in the MP-BGP specification:

: 6. Error Handling
: 
:    If a BGP speaker receives from a neighbor an Update message that
:    contains the MP_REACH_NLRI or MP_UNREACH_NLRI attribute, and the
:    speaker determines that the attribute is incorrect, the speaker must
:    delete all the BGP routes received from that neighbor whose AFI/SAFI
:    is the same as the one carried in the incorrect MP_REACH_NLRI or
:    MP_UNREACH_NLRI attribute. For the duration of the BGP session over
:    which the Update message was received, the speaker then should ignore
:    all the subsequent routes with that AFI/SAFI received over that
:    session.

This amounts to silently ignoring reachability.

INFORM would at least make this non-silent.  However, additional pushback
might be warranted.  Upon receipt of an INFORM (or similar) message
the receiving peer may wish to choose one or more of the following actions:

o Just log it per INFORM
o Stop sending this AFI/SAFI
o Reset the peering session (contrary to the wishes of the peer?)

Thoughts?

-- 
Jeff Haas 
NextHop Technologies


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id MAA12154 for <idr-archive@nic.merit.edu>; Tue, 9 Jul 2002 12:15:45 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 0595C912B1; Tue,  9 Jul 2002 12:15:11 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 6C8E1912B2; Tue,  9 Jul 2002 12:15:10 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id C3E14912B1 for <idr@trapdoor.merit.edu>; Tue,  9 Jul 2002 12:15:07 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 8572F5DDA6; Tue,  9 Jul 2002 12:15:07 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from presque.djinesys.com (dns.nexthop.com [64.211.218.216]) by segue.merit.edu (Postfix) with ESMTP id 4DDA25DE99 for <idr@merit.edu>; Tue,  9 Jul 2002 12:15:06 -0400 (EDT)
Received: (from root@localhost) by presque.djinesys.com (8.11.3/8.11.1) id g69GF1238368; Tue, 9 Jul 2002 12:15:01 -0400 (EDT) (envelope-from jhaas@jhaas.nexthop.com)
Received: from jhaas.nexthop.com (jhaas.nexthop.com [64.211.218.31]) by presque.djinesys.com (8.11.3/8.11.1) with ESMTP id g69GEw138358; Tue, 9 Jul 2002 12:14:58 -0400 (EDT) (envelope-from jhaas@jhaas.nexthop.com)
Received: (from jhaas@localhost) by jhaas.nexthop.com (8.11.3nb1/8.11.3) id g69GEvq22818; Tue, 9 Jul 2002 12:14:57 -0400 (EDT)
Date: Tue, 9 Jul 2002 12:14:57 -0400
From: Jeffrey Haas <jhaas@nexthop.com>
To: Manav Bhatia <manav@samsung.com>
Cc: idr@merit.edu
Subject: Re: NLRI in both update and withdraw
Message-ID: <20020709121457.A22703@nexthop.com>
References: <0a8b01c22731$0bb30e80$b4036c6b@sisodomain.com>
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
User-Agent: Mutt/1.2.5i
In-Reply-To: <0a8b01c22731$0bb30e80$b4036c6b@sisodomain.com>; from manav@samsung.com on Tue, Jul 09, 2002 at 03:41:50PM +0530
X-Virus-Scanned: by AMaViS perl-11
Sender: owner-idr@merit.edu
Precedence: bulk

>From -17:

   An UPDATE message should not include the same address prefix in the
   WITHDRAWN ROUTES and Network Layer Reachability Information fields,
   however a BGP speaker MUST be able to process UPDATE messages in this
   form. A BGP speaker should treat an UPDATE message of this form as if
   the WITHDRAWN ROUTES doesn't contain the address prefix.

On Tue, Jul 09, 2002 at 03:41:50PM +0530, Manav Bhatia wrote:
> Hi,
> I want to know as to what the current implementations out there in the
> field do and what is the appropriate and the correct method for dealing
> with UPDATE messages  having the same NLRI listed in both the withdrawn and
> the feasible routes.
> 
> Regards,
> Manav

-- 
Jeff Haas 
NextHop Technologies


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id KAA09043 for <idr-archive@nic.merit.edu>; Tue, 9 Jul 2002 10:49:02 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 1F8E6912A3; Tue,  9 Jul 2002 10:48:43 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id E34CB912A1; Tue,  9 Jul 2002 10:48:42 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 93183912A3 for <idr@trapdoor.merit.edu>; Tue,  9 Jul 2002 10:48:05 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 76C3E5DE85; Tue,  9 Jul 2002 10:48:05 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from celox-ma1-ems1.celoxnetworks.com (celox-ma1-imap1.celoxnetworks.com [12.40.60.233]) by segue.merit.edu (Postfix) with ESMTP id 413045DE60 for <idr@merit.edu>; Tue,  9 Jul 2002 10:48:05 -0400 (EDT)
Received: by celox-ma1-ems1.celoxnetworks.com with Internet Mail Service (5.5.2653.19) id <L9JQDMWA>; Tue, 9 Jul 2002 10:48:05 -0400
Message-ID: <1117F7D44159934FB116E36F4ABF221B020AF843@celox-ma1-ems1.celoxnetworks.com>
From: "Natale, Jonathan" <JNatale@celoxnetworks.com>
To: "'Fryczke, Pawel'" <Pawel.Fryczke@intel.com>, idr@merit.edu
Subject: RE: Aggregation and MULTI_EXIT_DISC, NEXT_HOP
Date: Tue, 9 Jul 2002 10:48:04 -0400 
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
Content-Type: text/plain
Sender: owner-idr@merit.edu
Precedence: bulk

I raised this issue more than two years ago.  What is the story on bgp?  Why
is the draft/rfc so far off base?  I would be happy to provide a laundry
list of things, if that will help.

I think the MED was fixed in a major vendor's code with the ~= deterministic
med cmd (or so I was told then, but now the story on that cmd seem
different).

The next hop was never "fixed".  The only answer for why not to aggregate if
nxhp not the same was that it has something to do w/ mcast RPF... but I
never fully digested this.

-$0.02

-----Original Message-----
From: Fryczke, Pawel [mailto:Pawel.Fryczke@intel.com] 
Sent: Thursday, July 04, 2002 8:08 AM
To: idr@merit.edu
Subject: Aggregation and MULTI_EXIT_DISC, NEXT_HOP

The latest draft, draft-ietf-idr-bgp4-17.txt states:
Routes that have the following attributes shall not be aggregated
unless the corresponding attributes of each route are identical:
MULTI_EXIT_DISC, NEXT_HOP.

Many vendors (e.g. Cisco/Juniper) does not follow this requirement. Why?
What is the purpose of this requirement?
Pawel


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id KAA08868 for <idr-archive@nic.merit.edu>; Tue, 9 Jul 2002 10:45:08 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 90F32912A7; Tue,  9 Jul 2002 10:42:41 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 52179912A1; Tue,  9 Jul 2002 10:42:41 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id BF824912A7 for <idr@trapdoor.merit.edu>; Tue,  9 Jul 2002 10:42:35 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id A62995DE02; Tue,  9 Jul 2002 10:42:35 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from celox-ma1-ems1.celoxnetworks.com (celox-ma1-imap1.celoxnetworks.com [12.40.60.233]) by segue.merit.edu (Postfix) with ESMTP id 5B44B5DE92 for <idr@merit.edu>; Tue,  9 Jul 2002 10:42:35 -0400 (EDT)
Received: by celox-ma1-ems1.celoxnetworks.com with Internet Mail Service (5.5.2653.19) id <L9JQDMVT>; Tue, 9 Jul 2002 10:42:34 -0400
Message-ID: <1117F7D44159934FB116E36F4ABF221B020AF842@celox-ma1-ems1.celoxnetworks.com>
From: "Natale, Jonathan" <JNatale@celoxnetworks.com>
To: "'hans.de_vleeschouwer@alcatel.be'" <hans.de_vleeschouwer@alcatel.be>, BGP mailing list <idr@merit.edu>
Cc: Gilles Geerts <gilles.geerts@alcatel.be>
Subject: RE: TCP state machine doubts
Date: Tue, 9 Jul 2002 10:42:33 -0400 
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
Content-Type: text/plain
Sender: owner-idr@merit.edu
Precedence: bulk

Increasing the time on each try is not a good idea and is not currently
implemented.  Correct me if I am wrong...

-----Original Message-----
From: hans.de_vleeschouwer@alcatel.be
[mailto:hans.de_vleeschouwer@alcatel.be] 
Sent: Friday, July 05, 2002 6:22 AM
To: BGP mailing list
Cc: Gilles Geerts
Subject: TCP state machine doubts

all,

i have a few questions related to the new state machine proposed in the
latest BGP draft.

1. related to terminology related to the connect retry counter.
I find back in the state descriptions following statement:
- "clears the ConnectRetry timer" (e.g. page 35, state connect)
- "Set connect retry timer to zero" (e.g. page 37, state Open sent)
- "Connect retry timer = 0," (e.g. page 38, state Open sent)
 I assume that the meaning in all of the above cases is : "the
ConnectRetry timer is stopped".  If so, would this not be a better
wording for the action?

2. related to the terminology for the holdtimer.
In state Connect on page 34 I find the following text: "set Hold timer
to a large value". I expect that this means "The Hold timer value is set
to a large value, and the Hold timer is started".

3. related to the ConnectRetryCnt.
- Should the value of this counter not be initialsed in the Idle state
as well?

4. General doubt  related to the TCP backoff strategy
Suppose I have a BGP peer that goes down say once every 2hours. With the
current sheme the ConnectRetryCnt keeps on increasing, and as a
consequence the connection takes increasinly more time to be
re-establsihed, even if actually it only fails like every 2 hours.

The mechanism only gives like 'bad points' , but never 'good points'. In
this way a peer never gets rid of his flapping history.
Is this really the intented behavior?

Thanks,
  Hans De Vleeschouwer.


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id KAA07472 for <idr-archive@nic.merit.edu>; Tue, 9 Jul 2002 10:09:42 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 0DEF991208; Tue,  9 Jul 2002 10:09:04 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id CFE1C9120E; Tue,  9 Jul 2002 10:09:03 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 970C691208 for <idr@trapdoor.merit.edu>; Tue,  9 Jul 2002 10:09:02 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 7A5675DE85; Tue,  9 Jul 2002 10:09:02 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from crufty.research.bell-labs.com (ns2.research.bell-labs.com [204.178.16.49]) by segue.merit.edu (Postfix) with ESMTP id 1B6925DE84 for <idr@merit.edu>; Tue,  9 Jul 2002 10:09:02 -0400 (EDT)
Received: from scummy.research.bell-labs.com (H-135-104-2-10.research.bell-labs.com [135.104.2.10]) by crufty.research.bell-labs.com (8.12.2/8.12.2) with ESMTP id g69E8uCT042782; Tue, 9 Jul 2002 10:08:57 -0400 (EDT)
Received: from nslocum.cs.bell-labs.com (nslocum.cs.bell-labs.com [135.104.8.38]) by scummy.research.bell-labs.com (8.11.6/8.11.6) with ESMTP id g69E8ok45983; Tue, 9 Jul 2002 10:08:50 -0400 (EDT)
Received: from research.bell-labs.com (basu-t23.ddns.bell-labs.com [135.104.52.191]) by nslocum.cs.bell-labs.com (8.12.2/8.12.2) with ESMTP id g69E8njq79783026; Tue, 9 Jul 2002 10:08:50 -0400 (EDT)
Message-ID: <3D2AEE71.15547C32@research.bell-labs.com>
Date: Tue, 09 Jul 2002 10:08:49 -0400
From: Anindya Basu <basu@research.bell-labs.com>
X-Mailer: Mozilla 4.76 [en]C-CCK-MCD   (Windows NT 5.0; U)
X-Accept-Language: en
MIME-Version: 1.0
To: Susan Hares <skh@nexthop.com>, idr@merit.edu
Cc: kmosley@nexthop.com, yakov Rekhter <yakov@juniper.net>
Subject: Re: IDR agenda
References: <5.0.0.25.0.20020709062010.035bab38@mail.nexthop.com>
Content-Type: text/plain; charset=us-ascii
Content-Transfer-Encoding: 7bit
Sender: owner-idr@merit.edu
Precedence: bulk

Dear WG Chairs,

I would like a 10-15 minute slot for presenting the work in
draft-wilfong-ibgp-oscillations-00.txt that addresses the route
oscillation problem identified in
draft-ietf-idr-route-oscillation-00.txt. 

Thanks for your consideration,
-Anindya Basu
 MTS, Bell Laboratories
 Lucent Technologies

Susan Hares wrote:
> 
> Please send any requests for the agenda to the list.
> The final agenda will be sent on Sunday.
> (I am traveling to Japan until that date.)
> 
> Sue Hares


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id GAA29583 for <idr-archive@nic.merit.edu>; Tue, 9 Jul 2002 06:22:40 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 3E4B49129B; Tue,  9 Jul 2002 06:22:13 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 0977B9129C; Tue,  9 Jul 2002 06:22:12 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id EBF0C9129B for <idr@trapdoor.merit.edu>; Tue,  9 Jul 2002 06:22:11 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id D45625DE67; Tue,  9 Jul 2002 06:22:11 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from presque.djinesys.com (dns.nexthop.com [64.211.218.216]) by segue.merit.edu (Postfix) with ESMTP id 555FF5DD9B for <idr@merit.edu>; Tue,  9 Jul 2002 06:22:11 -0400 (EDT)
Received: (from root@localhost) by presque.djinesys.com (8.11.3/8.11.1) id g69AM9D25679; Tue, 9 Jul 2002 06:22:09 -0400 (EDT) (envelope-from skh@nexthop.com)
Received: from SKH.nexthop.com (d14.as0.flnt.mi.voyager.net [216.93.93.15]) by presque.djinesys.com (8.11.3/8.11.1) with ESMTP id g69AM0125655; Tue, 9 Jul 2002 06:22:00 -0400 (EDT) (envelope-from skh@nexthop.com)
Message-Id: <5.0.0.25.0.20020709062010.035bab38@mail.nexthop.com>
X-Sender: skh@mail.nexthop.com
X-Mailer: QUALCOMM Windows Eudora Version 5.0
Date: Tue, 09 Jul 2002 06:22:06 -0400
To: idr@merit.edu
From: Susan Hares <skh@nexthop.com>
Subject: IDR agenda 
Cc: kmosley@nexthop.com, yakov Rekhter <yakov@juniper.net>
Mime-Version: 1.0
Content-Type: text/plain; charset="us-ascii"; format=flowed
X-Virus-Scanned: by AMaViS perl-11
Sender: owner-idr@merit.edu
Precedence: bulk

Please send any requests for the agenda to the list.
The final agenda will be sent on Sunday.
(I am traveling to Japan until that date.)

Sue Hares



Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id GAA29398 for <idr-archive@nic.merit.edu>; Tue, 9 Jul 2002 06:17:02 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id A9A279129A; Tue,  9 Jul 2002 06:15:47 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 7B2ED9129C; Tue,  9 Jul 2002 06:15:47 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 48BF79129A for <idr@trapdoor.merit.edu>; Tue,  9 Jul 2002 06:15:46 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 352615DE02; Tue,  9 Jul 2002 06:15:46 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from ep_mailout1 (unknown [203.254.224.24]) by segue.merit.edu (Postfix) with ESMTP id CD77F5DDA9 for <idr@merit.edu>; Tue,  9 Jul 2002 06:15:45 -0400 (EDT)
Received: from custom-daemon.mailout1.samsung.com by mailout1.samsung.com (iPlanet Messaging Server 5.1 (built Sep  5 2001)) id <0GYZ00J018KR1K@mailout1.samsung.com> for idr@merit.edu; Tue, 09 Jul 2002 19:17:15 +0900 (KST)
Received: from ep_mmp2 (localhost [127.0.0.1]) by mailout1.samsung.com (iPlanet Messaging Server 5.1 (built Sep  5 2001)) with ESMTP id <0GYZ00I8L8KQJW@mailout1.samsung.com> for idr@merit.edu; Tue, 09 Jul 2002 19:17:15 +0900 (KST)
Received: from Manav ([107.108.3.180]) by mmp2.samsung.com (iPlanet Messaging Server 5.1 (built Sep  5 2001)) with ESMTPA id <0GYZ00CBJ8KSQA@mmp2.samsung.com> for idr@merit.edu; Tue, 09 Jul 2002 19:17:18 +0900 (KST)
Date: Tue, 09 Jul 2002 15:41:50 +0530
From: Manav Bhatia <manav@samsung.com>
Subject: NLRI in both update and withdraw
To: idr@merit.edu
Reply-To: Manav Bhatia <manav@samsung.com>
Message-id: <0a8b01c22731$0bb30e80$b4036c6b@sisodomain.com>
MIME-version: 1.0
X-MIMEOLE: Produced By Microsoft MimeOLE V6.00.2600.0000
X-Mailer: Microsoft Outlook Express 6.00.2600.0000
Content-type: text/plain; charset=iso-8859-1
Content-transfer-encoding: 7BIT
X-Priority: 3
X-MSMail-priority: Normal
Sender: owner-idr@merit.edu
Precedence: bulk

Hi,
I want to know as to what the current implementations out there in the
field do and what is the appropriate and the correct method for dealing
with UPDATE messages  having the same NLRI listed in both the withdrawn and
the feasible routes.

Regards,
Manav

----
"When you are courting a nice girl an hour seems like a second. When you
sit on a red-hot cinder a second seems like an hour. That's relativity."

-Albert Einstein, on relativity




Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id DAA23494 for <idr-archive@nic.merit.edu>; Tue, 9 Jul 2002 03:31:49 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 2DC139123F; Tue,  9 Jul 2002 03:31:28 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id A508391298; Tue,  9 Jul 2002 03:31:14 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 9CB019123F for <idr@trapdoor.merit.edu>; Tue,  9 Jul 2002 03:31:04 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 7B3DF5DDB8; Tue,  9 Jul 2002 03:31:04 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from psg.com (psg.com [147.28.0.62]) by segue.merit.edu (Postfix) with ESMTP id 537D05DDA9 for <idr@merit.edu>; Tue,  9 Jul 2002 03:31:04 -0400 (EDT)
Received: from 12-234-206-11.client.attbi.com ([12.234.206.11] helo=AGNS3K11) by psg.com with esmtp (Exim 3.36 #1) id 17RpSg-0006Aw-00; Tue, 09 Jul 2002 00:31:02 -0700
Date: Tue, 9 Jul 2002 00:30:28 -0700
From: Alex Zinin <zinin@psg.com>
X-Mailer: The Bat! (v1.51) Personal
Reply-To: Alex Zinin <zinin@psg.com>
X-Priority: 3 (Normal)
Message-ID: <9227284012.20020709003028@psg.com>
To: Jeffrey Haas <jhaas@nexthop.com>
Cc: Mathew Richardson <mrr@nexthop.com>, idr@merit.edu
Subject: Re: Aggregation and MULTI_EXIT_DISC, NEXT_HOP
In-Reply-To: <20020708135838.B11658@nexthop.com>
References: <413FBB0BA5AED1119122000083234B1A015A9420@alpha.igk.intel.com> <m23cuxsf7l.wl@titanium.zebra.org> <20020706020554.G15590@nexthop.com> <20020708135838.B11658@nexthop.com>
MIME-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Transfer-Encoding: 7bit
Sender: owner-idr@merit.edu
Precedence: bulk

> It would be nice to know why the restriction was placed in the
> specification in the first place before we suggest it be removed.

If we perform aggregation as part of the Decision process (as the
document says), it would mean that we're installing aggregated route
and not installing any specifics in our local RIB. The restriction
in question is to make sure it is safe to do so.

The question is whether anyone actually does this instead of
aggregating during the Update process...

Alex




Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id SAA04066 for <idr-archive@nic.merit.edu>; Mon, 8 Jul 2002 18:24:09 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id EE2CC91282; Mon,  8 Jul 2002 18:23:50 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id BFC8491283; Mon,  8 Jul 2002 18:23:49 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 873D991282 for <idr@trapdoor.merit.edu>; Mon,  8 Jul 2002 18:23:48 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 684125DE46; Mon,  8 Jul 2002 18:23:48 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from psg.com (psg.com [147.28.0.62]) by segue.merit.edu (Postfix) with ESMTP id 3C06D5DD8C for <idr@merit.edu>; Mon,  8 Jul 2002 18:23:47 -0400 (EDT)
Received: from 12-234-206-11.client.attbi.com ([12.234.206.11] helo=AGNS3K11) by psg.com with esmtp (Exim 3.36 #1) id 17Rgv3-000FsL-00; Mon, 08 Jul 2002 15:23:45 -0700
Date: Mon, 8 Jul 2002 15:22:59 -0700
From: Alex Zinin <zinin@psg.com>
X-Mailer: The Bat! (v1.51) Personal
Reply-To: Alex Zinin <zinin@psg.com>
X-Priority: 3 (Normal)
Message-ID: <13164442886.20020708152259@psg.com>
To: Jeffrey Haas <jhaas@nexthop.com>
Cc: idr@merit.edu
Subject: Re: INFORM proposal and MP-BGP (RFC 2858)
In-Reply-To: <20020708144248.C11658@nexthop.com>
References: <20020708144248.C11658@nexthop.com>
MIME-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Transfer-Encoding: 7bit
Sender: owner-idr@merit.edu
Precedence: bulk

Jeff,

 BTW, a more generic question: if we can't trust an attribute
 as a whole (since it's invalid), how can we trust specific fields
 (AFI/SAFI) inside, i.e., if an MP_* attribute is malformed, it is
 unsafe to assume that AFI/SAFI fields are correct... so, we may
 very well end up deleting and ignoring a wrong (but valid) AFI/SAFI.

 [I thought I brought up this issue already, but I couldn't find
 any evidence]

-- 
Alex

Monday, July 08, 2002, 11:42:48 AM, Jeffrey Haas wrote:
> The proposal for the INFORM message gives the opportunity to redress
> an open issue in the MP-BGP specification:

> : 6. Error Handling
> : 
> :    If a BGP speaker receives from a neighbor an Update message that
> :    contains the MP_REACH_NLRI or MP_UNREACH_NLRI attribute, and the
> :    speaker determines that the attribute is incorrect, the speaker must
> :    delete all the BGP routes received from that neighbor whose AFI/SAFI
> :    is the same as the one carried in the incorrect MP_REACH_NLRI or
> :    MP_UNREACH_NLRI attribute. For the duration of the BGP session over
> :    which the Update message was received, the speaker then should ignore
> :    all the subsequent routes with that AFI/SAFI received over that
> :    session.

> This amounts to silently ignoring reachability.

> INFORM would at least make this non-silent.  However, additional pushback
> might be warranted.  Upon receipt of an INFORM (or similar) message
> the receiving peer may wish to choose one or more of the following actions:

> o Just log it per INFORM
> o Stop sending this AFI/SAFI
> o Reset the peering session (contrary to the wishes of the peer?)

> Thoughts?




Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id RAA01542 for <idr-archive@nic.merit.edu>; Mon, 8 Jul 2002 17:12:39 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id EC09E91280; Mon,  8 Jul 2002 17:12:15 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id B965391281; Mon,  8 Jul 2002 17:12:15 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 3C07F91280 for <idr@trapdoor.merit.edu>; Mon,  8 Jul 2002 17:12:14 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 22F825DE12; Mon,  8 Jul 2002 17:12:14 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from cisco.com (uzura.cisco.com [64.102.17.77]) by segue.merit.edu (Postfix) with ESMTP id A4BA75DD8C for <idr@merit.edu>; Mon,  8 Jul 2002 17:12:13 -0400 (EDT)
Received: from ruwhite-u10.cisco.com (ruwhite-u10.cisco.com [64.102.48.251]) by cisco.com (8.8.8/2.6/Cisco List Logging/8.8.8) with ESMTP id RAA17049; Mon, 8 Jul 2002 17:12:12 -0400 (EDT)
Date: Mon, 8 Jul 2002 17:12:12 -0400 (EDT)
From: Russ White <ruwhite@cisco.com>
Reply-To: Russ White <riw@cisco.com>
To: Jeffrey Haas <jhaas@nexthop.com>
Cc: idr@merit.edu
Subject: Re: INFORM proposal and MP-BGP (RFC 2858)
In-Reply-To: <20020708144248.C11658@nexthop.com>
Message-ID: <Pine.GSO.4.21.0207081710480.1440-100000@ruwhite-u10.cisco.com>
MIME-Version: 1.0
Content-Type: TEXT/PLAIN; charset=US-ASCII
Sender: owner-idr@merit.edu
Precedence: bulk

When we originally conceived of the INFORM, we had included an
"action code" as well. This turned out to be rather cumbersome,
defining all of them. Probably the best thing to do is to make
certain this code is included in the spec, and perhaps loosen up
the text as to what may be done when receiving an INFORM to
include user configurable options, such as dropping the session
or just the impacted updates, etc.

:-)

Russ

On Mon, 8 Jul 2002, Jeffrey Haas wrote:

> The proposal for the INFORM message gives the opportunity to redress
> an open issue in the MP-BGP specification:
> 
> : 6. Error Handling
> : 
> :    If a BGP speaker receives from a neighbor an Update message that
> :    contains the MP_REACH_NLRI or MP_UNREACH_NLRI attribute, and the
> :    speaker determines that the attribute is incorrect, the speaker must
> :    delete all the BGP routes received from that neighbor whose AFI/SAFI
> :    is the same as the one carried in the incorrect MP_REACH_NLRI or
> :    MP_UNREACH_NLRI attribute. For the duration of the BGP session over
> :    which the Update message was received, the speaker then should ignore
> :    all the subsequent routes with that AFI/SAFI received over that
> :    session.
> 
> This amounts to silently ignoring reachability.
> 
> INFORM would at least make this non-silent.  However, additional pushback
> might be warranted.  Upon receipt of an INFORM (or similar) message
> the receiving peer may wish to choose one or more of the following actions:
> 
> o Just log it per INFORM
> o Stop sending this AFI/SAFI
> o Reset the peering session (contrary to the wishes of the peer?)
> 
> Thoughts?
> 
> -- 
> Jeff Haas 
> NextHop Technologies
> 

__________________________________
riw@cisco.com CCIE <>< Grace Alone




Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id OAA26162 for <idr-archive@nic.merit.edu>; Mon, 8 Jul 2002 14:43:14 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id A4A6291241; Mon,  8 Jul 2002 14:42:54 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 6C2F49127C; Mon,  8 Jul 2002 14:42:54 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 4FF5291241 for <idr@trapdoor.merit.edu>; Mon,  8 Jul 2002 14:42:53 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 2DBB55DFD5; Mon,  8 Jul 2002 14:42:53 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from presque.djinesys.com (dns.nexthop.com [64.211.218.216]) by segue.merit.edu (Postfix) with ESMTP id C793F5DFCC for <idr@merit.edu>; Mon,  8 Jul 2002 14:42:52 -0400 (EDT)
Received: (from root@localhost) by presque.djinesys.com (8.11.3/8.11.1) id g68Igqu00141 for idr@merit.edu; Mon, 8 Jul 2002 14:42:52 -0400 (EDT) (envelope-from jhaas@jhaas.nexthop.com)
Received: from jhaas.nexthop.com (jhaas.nexthop.com [64.211.218.31]) by presque.djinesys.com (8.11.3/8.11.1) with ESMTP id g68Ign100131 for <idr@merit.edu>; Mon, 8 Jul 2002 14:42:49 -0400 (EDT) (envelope-from jhaas@jhaas.nexthop.com)
Received: (from jhaas@localhost) by jhaas.nexthop.com (8.11.3nb1/8.11.3) id g68Ign413058 for idr@merit.edu; Mon, 8 Jul 2002 14:42:49 -0400 (EDT)
Date: Mon, 8 Jul 2002 14:42:48 -0400
From: Jeffrey Haas <jhaas@nexthop.com>
To: idr@merit.edu
Subject: INFORM proposal and MP-BGP (RFC 2858)
Message-ID: <20020708144248.C11658@nexthop.com>
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
User-Agent: Mutt/1.2.5i
X-Virus-Scanned: by AMaViS perl-11
Sender: owner-idr@merit.edu
Precedence: bulk

The proposal for the INFORM message gives the opportunity to redress
an open issue in the MP-BGP specification:

: 6. Error Handling
: 
:    If a BGP speaker receives from a neighbor an Update message that
:    contains the MP_REACH_NLRI or MP_UNREACH_NLRI attribute, and the
:    speaker determines that the attribute is incorrect, the speaker must
:    delete all the BGP routes received from that neighbor whose AFI/SAFI
:    is the same as the one carried in the incorrect MP_REACH_NLRI or
:    MP_UNREACH_NLRI attribute. For the duration of the BGP session over
:    which the Update message was received, the speaker then should ignore
:    all the subsequent routes with that AFI/SAFI received over that
:    session.

This amounts to silently ignoring reachability.

INFORM would at least make this non-silent.  However, additional pushback
might be warranted.  Upon receipt of an INFORM (or similar) message
the receiving peer may wish to choose one or more of the following actions:

o Just log it per INFORM
o Stop sending this AFI/SAFI
o Reset the peering session (contrary to the wishes of the peer?)

Thoughts?

-- 
Jeff Haas 
NextHop Technologies


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id NAA24621 for <idr-archive@nic.merit.edu>; Mon, 8 Jul 2002 13:59:14 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id B6B4A9123E; Mon,  8 Jul 2002 13:58:46 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 888299127B; Mon,  8 Jul 2002 13:58:46 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 8681F9123E for <idr@trapdoor.merit.edu>; Mon,  8 Jul 2002 13:58:45 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 73A835DE62; Mon,  8 Jul 2002 13:58:45 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from presque.djinesys.com (unknown [64.211.218.216]) by segue.merit.edu (Postfix) with ESMTP id E06D85DDA0 for <idr@merit.edu>; Mon,  8 Jul 2002 13:58:44 -0400 (EDT)
Received: (from root@localhost) by presque.djinesys.com (8.11.3/8.11.1) id g68Hwh498425; Mon, 8 Jul 2002 13:58:43 -0400 (EDT) (envelope-from jhaas@jhaas.nexthop.com)
Received: from jhaas.nexthop.com (jhaas.nexthop.com [64.211.218.31]) by presque.djinesys.com (8.11.3/8.11.1) with ESMTP id g68Hwd198408; Mon, 8 Jul 2002 13:58:39 -0400 (EDT) (envelope-from jhaas@jhaas.nexthop.com)
Received: (from jhaas@localhost) by jhaas.nexthop.com (8.11.3nb1/8.11.3) id g68HwcM12845; Mon, 8 Jul 2002 13:58:38 -0400 (EDT)
Date: Mon, 8 Jul 2002 13:58:38 -0400
From: Jeffrey Haas <jhaas@nexthop.com>
To: Mathew Richardson <mrr@nexthop.com>
Cc: idr@merit.edu
Subject: Re: Aggregation and MULTI_EXIT_DISC, NEXT_HOP
Message-ID: <20020708135838.B11658@nexthop.com>
References: <413FBB0BA5AED1119122000083234B1A015A9420@alpha.igk.intel.com> <m23cuxsf7l.wl@titanium.zebra.org> <20020706020554.G15590@nexthop.com>
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
User-Agent: Mutt/1.2.5i
In-Reply-To: <20020706020554.G15590@nexthop.com>; from mrr@nexthop.com on Sat, Jul 06, 2002 at 02:05:54AM -0400
X-Virus-Scanned: by AMaViS perl-11
Sender: owner-idr@merit.edu
Precedence: bulk

On Sat, Jul 06, 2002 at 02:05:54AM -0400, Mathew Richardson wrote:
> The current mandate for the BGP spec. is to document current
> implementations.  If vendors are currently aggregating routes
> with different MEDs, or NEXT_HOPs, then this restriction should
> be made a suggestion ("It is recommended that routes that have
> differing MULTI_EXIT_DISC, or NEXT_HOP, attributes not be
> aggregated"), or removed entirely.

It would be nice to know why the restriction was placed in the
specification in the first place before we suggest it be removed.

> mrr

-- 
Jeff Haas 
NextHop Technologies


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id NAA24334 for <idr-archive@nic.merit.edu>; Mon, 8 Jul 2002 13:48:47 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id D30FF9127A; Mon,  8 Jul 2002 13:47:06 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id A2C059127B; Mon,  8 Jul 2002 13:47:06 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 85BA89127A for <idr@trapdoor.merit.edu>; Mon,  8 Jul 2002 13:46:59 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 68A745DE55; Mon,  8 Jul 2002 13:46:59 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from tenornetworks.com (rtu.tenornetworks.com [63.77.213.2]) by segue.merit.edu (Postfix) with ESMTP id DA2BC5DDA0 for <idr@merit.edu>; Mon,  8 Jul 2002 13:46:58 -0400 (EDT)
Received: from tenornet.com (newman [192.168.0.185]) by tenornetworks.com (Switch-2.1.3/Switch-2.1.0) with ESMTP id g68Hksd28848; Mon, 8 Jul 2002 13:46:54 -0400 (EDT)
Received: from 192.168.0.185 by tenornet.com; MON, 8 Jul 2002 13:42:24 -0700
Received: by newman.tenornet.com with Internet Mail Service (5.5.2650.21) id <N1HXZ4HL>; Mon, 8 Jul 2002 13:42:24 -0400
Message-ID: <6B190B34070BD411ACA000B0D0214E560165B808@newman.tenornet.com>
From: "Tsillas, Jim" <jtsillas@tenornetworks.com>
To: "'Daniel Kushner'" <dkushner99@yahoo.com>, idr@merit.edu
Cc: dkushner@timetra.com
Subject: RE: Route Reflector and client interaction questions
Date: Mon, 8 Jul 2002 13:42:23 -0400 
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2650.21)
Content-Type: text/plain; charset="iso-8859-1"
Sender: owner-idr@merit.edu
Precedence: bulk

Daniel,

what you are describing is a "poison reverse" strategy
which would have the benefit of reducing the amount of
state stored in the network. In your example, when R1
sends a withdrawl back to R2, R2 can free the otherwise
useless state since R1's decision process has determined
that R2's route is better. This is usually done between
external peers, but could also be done between RRs.
Not sending the withdraw is not incorrect since the
topology is not affected.

On the topic of prohibiting attribute modification: each
vendor will decide for themselves, but is there really
any value to limiting this function? If I buy a hammer
I can read the (fictional) warning about using it to relieve
headaches and decide if I want to use it in that manner.
Should that hammer then be redesigned to make it impossible
for me to bang my head?

regards,
-Jim.



-----Original Message-----
From: Daniel Kushner [mailto:dkushner99@yahoo.com]
Sent: Monday, July 08, 2002 1:18 PM
To: idr@merit.edu
Cc: dkushner@timetra.com
Subject: Route Reflector and client interaction questions


I have some questions concerning Juniper Route 
Reflector operation where Juniper and Cisco 
differ in operation.

Given:
R1 = Juniper route reflector (RR)
R2 = RR client to Juniper

- I noticed that if a client peer (R2) advertises a 
   route (route A) to a Juniper route reflector (R1), 
   that Juniper route reflector (R1) sends a 
   withdrawal of the route (route A) back to the 
   client (R2) that advertised the route to the 
   Juniper route reflector. Cisco does not do this. 

My questions are:
1) Does Juniper withdraw the route from the client 
     sender on purpose, or is it a Juniper bug?
2) If it is not a bug, what condition/topology exactly

     does sending the withdrawal back to the client 
     protect against?
3) From the provider standpoint, if it is not a bug, 
     is withdrawing the route from the client sender 
     preferred behavior?

- I also noticed that neither Cisco or Juniper 
   prohibits modification to the AS_PATH, 
   NEXT_HOP, LOCAL_PREF, or MED for 
   routes reflected, as required by the RFC. 
   I'm not sure if this is important when 
   considering the above questions.

4) From the provider standpoint, is allowing 
    attribute modification for reflected routes
    preferred behavior?

Thanks,
- Daniel Kushner


__________________________________________________
Do You Yahoo!?
Sign up for SBC Yahoo! Dial - First Month Free
http://sbc.yahoo.com


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id NAA23369 for <idr-archive@nic.merit.edu>; Mon, 8 Jul 2002 13:18:56 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 0FF5691279; Mon,  8 Jul 2002 13:18:42 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id CB9279127A; Mon,  8 Jul 2002 13:18:41 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 1CE4D91279 for <idr@trapdoor.merit.edu>; Mon,  8 Jul 2002 13:18:00 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 00AB15DFB0; Mon,  8 Jul 2002 13:17:59 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from web21105.mail.yahoo.com (web21105.mail.yahoo.com [216.136.227.107]) by segue.merit.edu (Postfix) with SMTP id 589F05DFAB for <idr@merit.edu>; Mon,  8 Jul 2002 13:17:59 -0400 (EDT)
Message-ID: <20020708171758.69447.qmail@web21105.mail.yahoo.com>
Received: from [66.125.93.189] by web21105.mail.yahoo.com via HTTP; Mon, 08 Jul 2002 10:17:58 PDT
Date: Mon, 8 Jul 2002 10:17:58 -0700 (PDT)
From: Daniel Kushner <dkushner99@yahoo.com>
Subject: Route Reflector and client interaction questions
To: idr@merit.edu
Cc: dkushner@timetra.com
MIME-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Sender: owner-idr@merit.edu
Precedence: bulk

I have some questions concerning Juniper Route 
Reflector operation where Juniper and Cisco 
differ in operation.

Given:
R1 = Juniper route reflector (RR)
R2 = RR client to Juniper

- I noticed that if a client peer (R2) advertises a 
   route (route A) to a Juniper route reflector (R1), 
   that Juniper route reflector (R1) sends a 
   withdrawal of the route (route A) back to the 
   client (R2) that advertised the route to the 
   Juniper route reflector. Cisco does not do this. 

My questions are:
1) Does Juniper withdraw the route from the client 
     sender on purpose, or is it a Juniper bug?
2) If it is not a bug, what condition/topology exactly

     does sending the withdrawal back to the client 
     protect against?
3) From the provider standpoint, if it is not a bug, 
     is withdrawing the route from the client sender 
     preferred behavior?

- I also noticed that neither Cisco or Juniper 
   prohibits modification to the AS_PATH, 
   NEXT_HOP, LOCAL_PREF, or MED for 
   routes reflected, as required by the RFC. 
   I'm not sure if this is important when 
   considering the above questions.

4) From the provider standpoint, is allowing 
    attribute modification for reflected routes
    preferred behavior?

Thanks,
- Daniel Kushner


__________________________________________________
Do You Yahoo!?
Sign up for SBC Yahoo! Dial - First Month Free
http://sbc.yahoo.com


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id HAA12227 for <idr-archive@nic.merit.edu>; Sat, 6 Jul 2002 07:31:16 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 40F8C91214; Sat,  6 Jul 2002 07:30:16 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id C77EA91238; Sat,  6 Jul 2002 07:30:15 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id A4B9491214 for <idr@trapdoor.merit.edu>; Sat,  6 Jul 2002 07:30:13 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 7180D5DF58; Sat,  6 Jul 2002 07:30:12 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from titanium.zebra.org (sdsl-64-139-11-202.dsl.sca.megapath.net [64.139.11.202]) by segue.merit.edu (Postfix) with ESMTP id BB79E5DE48 for <idr@merit.edu>; Sat,  6 Jul 2002 07:30:10 -0400 (EDT)
Received: from titanium.zebra.org (IDENT:kunihiro@titanium [127.0.0.1]) by titanium.zebra.org (8.9.3/8.9.3) with ESMTP id HAA06350; Sat, 6 Jul 2002 07:31:22 -0400
Date: Sat, 06 Jul 2002 04:31:22 -0700
Message-ID: <m2ofdl854l.wl@titanium.zebra.org>
From: Kunihiro Ishiguro <kunihiro@zebra.org>
To: Mathew Richardson <mrr@nexthop.com>
Cc: Kunihiro Ishiguro <kunihiro@ipinfusion.com>, "Fryczke, Pawel" <Pawel.Fryczke@intel.com>, idr@merit.edu
Subject: Re: Aggregation and MULTI_EXIT_DISC, NEXT_HOP
In-Reply-To: <20020706020554.G15590@nexthop.com>
References: <413FBB0BA5AED1119122000083234B1A015A9420@alpha.igk.intel.com> <m23cuxsf7l.wl@titanium.zebra.org>
User-Agent: Wanderlust/2.8.1 (Something) SEMI/1.14.3 (Ushinoya) FLIM/1.14.2 (Yagi-Nishiguchi) APEL/10.3 Emacs/21.2.50 (powerpc-unknown-linux-gnu) MULE/5.0 (SAKAKI)
MIME-Version: 1.0 (generated by SEMI 1.14.3 - "Ushinoya")
Content-Type: text/plain; charset=US-ASCII
Sender: owner-idr@merit.edu
Precedence: bulk

>The current mandate for the BGP spec. is to document current
>implementations.  If vendors are currently aggregating routes
>with different MEDs, or NEXT_HOPs, then this restriction should
>be made a suggestion ("It is recommended that routes that have
>differing MULTI_EXIT_DISC, or NEXT_HOP, attributes not be
>aggregated"), or removed entirely.

I think so too.
-- 
Kunihiro Ishiguro


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id CAA09443 for <idr-archive@nic.merit.edu>; Sat, 6 Jul 2002 02:07:33 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 45AF69120C; Sat,  6 Jul 2002 02:07:10 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 179AB91259; Sat,  6 Jul 2002 02:07:10 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id F35219120C for <idr@trapdoor.merit.edu>; Sat,  6 Jul 2002 02:07:08 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id E25F15DDDA; Sat,  6 Jul 2002 02:07:08 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from presque.djinesys.com (unknown [64.211.218.216]) by segue.merit.edu (Postfix) with ESMTP id 59B3E5DDAE for <idr@merit.edu>; Sat,  6 Jul 2002 02:07:08 -0400 (EDT)
Received: (from root@localhost) by presque.djinesys.com (8.11.3/8.11.1) id g6665wU25807; Sat, 6 Jul 2002 02:05:58 -0400 (EDT) (envelope-from mrr@mrichardson.nexthop.com)
Received: from paradigm.nexthop.com (mrichardson.nexthop.com [64.211.218.45]) by presque.djinesys.com (8.11.3/8.11.1) with ESMTP id g6665t125800; Sat, 6 Jul 2002 02:05:55 -0400 (EDT) (envelope-from mrr@mrichardson.nexthop.com)
Received: (from mrr@localhost) by paradigm.nexthop.com (8.11.6/8.11.6) id g6665s822962; Sat, 6 Jul 2002 02:05:54 -0400 (EDT)
Date: Sat, 6 Jul 2002 02:05:54 -0400
From: Mathew Richardson <mrr@nexthop.com>
To: Kunihiro Ishiguro <kunihiro@zebra.org>
Cc: "Fryczke, Pawel" <Pawel.Fryczke@intel.com>, idr@merit.edu
Subject: Re: Aggregation and MULTI_EXIT_DISC, NEXT_HOP
Message-ID: <20020706020554.G15590@nexthop.com>
References: <413FBB0BA5AED1119122000083234B1A015A9420@alpha.igk.intel.com> <m23cuxsf7l.wl@titanium.zebra.org>
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
User-Agent: Mutt/1.2.5.1i
In-Reply-To: <m23cuxsf7l.wl@titanium.zebra.org>; from kunihiro@zebra.org on Fri, Jul 05, 2002 at 08:33:18PM -0700
X-Virus-Scanned: by AMaViS perl-11
Sender: owner-idr@merit.edu
Precedence: bulk

> Kunihiro Ishiguro <kunihiro@zebra.org> [Fri, Jul 05, 2002 at 08:33:18PM -0700]:
>>The latest draft, draft-ietf-idr-bgp4-17.txt states:
>>Routes that have the following attributes shall not be aggregated
>>unless the corresponding attributes of each route are identical:
>>MULTI_EXIT_DISC, NEXT_HOP.
>
>Well, this comes from RFC1771 and we didn't change it.
>
>9.2.4.2 Aggregating Routing Information
>
>   Aggregation is the process of combining the characteristics of
>   several different routes in such a way that a single route can be
>   advertised.  Aggregation can occur as part of the decision  process
>   to reduce the amount of routing information that will be placed in
>   the Adj-RIBs-Out.
>
>   Aggregation reduces the amount of information that a BGP speaker must
>   store and exchange with other BGP speakers. Routes can be aggregated
>   by applying the following procedure separately to path attributes of
>   like type and to the Network Layer Reachability Information.
>
>   Routes that have the following attributes shall not be aggregated
>   unless the corresponding attributes of each route are identical:
>   MULTI_EXIT_DISC, NEXT_HOP.
>
>IHMO, the intention of this part is "BGP should aggregate the routes
>which have same network reachability".  Diffrent MULTI_EXIT_DISC or
>NEXT_HOP means different network reachability.  So the specification
>said "shall not be aggregate".

<snip>

The current mandate for the BGP spec. is to document current
implementations.  If vendors are currently aggregating routes
with different MEDs, or NEXT_HOPs, then this restriction should
be made a suggestion ("It is recommended that routes that have
differing MULTI_EXIT_DISC, or NEXT_HOP, attributes not be
aggregated"), or removed entirely.

mrr


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id XAA08059 for <idr-archive@nic.merit.edu>; Fri, 5 Jul 2002 23:49:04 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id A336F91257; Fri,  5 Jul 2002 23:48:46 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 1782391258; Fri,  5 Jul 2002 23:48:46 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 8EC4591257 for <idr@trapdoor.merit.edu>; Fri,  5 Jul 2002 23:48:18 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 6C8575DE13; Fri,  5 Jul 2002 23:48:18 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from titanium.zebra.org (sdsl-64-139-11-202.dsl.sca.megapath.net [64.139.11.202]) by segue.merit.edu (Postfix) with ESMTP id 8B5A55DDB9 for <idr@merit.edu>; Fri,  5 Jul 2002 23:48:17 -0400 (EDT)
Received: from titanium.zebra.org (IDENT:kunihiro@titanium [127.0.0.1]) by titanium.zebra.org (8.9.3/8.9.3) with ESMTP id XAA00794; Fri, 5 Jul 2002 23:49:18 -0400
Date: Fri, 05 Jul 2002 20:49:18 -0700
Message-ID: <m2znx5qzwh.wl@titanium.zebra.org>
From: Kunihiro Ishiguro <kunihiro@zebra.org>
To: hans.de_vleeschouwer@alcatel.be
Cc: BGP mailing list <idr@merit.edu>, Gilles Geerts <gilles.geerts@alcatel.be>
Subject: Re: TCP state machine doubts
In-Reply-To: <3D25733C.B5F1D90@alcatel.be>
References: <3D25733C.B5F1D90@alcatel.be>
User-Agent: Wanderlust/2.8.1 (Something) SEMI/1.14.3 (Ushinoya) FLIM/1.14.2 (Yagi-Nishiguchi) APEL/10.3 Emacs/21.2.50 (powerpc-unknown-linux-gnu) MULE/5.0 (SAKAKI)
MIME-Version: 1.0 (generated by SEMI 1.14.3 - "Ushinoya")
Content-Type: text/plain; charset=US-ASCII
Sender: owner-idr@merit.edu
Precedence: bulk

>i have a few questions related to the new state machine proposed in the
>latest BGP draft.

Current draft-17 has some known problems.  I think Sue aleady prepared
draft of draft-18.  So best way is updating the draft to draft-18 then
process IDR working group review to it.

>1. related to terminology related to the connect retry counter.
>I find back in the state descriptions following statement:
>- "clears the ConnectRetry timer" (e.g. page 35, state connect)
>- "Set connect retry timer to zero" (e.g. page 37, state Open sent)
>- "Connect retry timer = 0," (e.g. page 38, state Open sent)
> I assume that the meaning in all of the above cases is : "the
>ConnectRetry timer is stopped".  If so, would this not be a better
>wording for the action?
>
>2. related to the terminology for the holdtimer.
>In state Connect on page 34 I find the following text: "set Hold timer
>to a large value". I expect that this means "The Hold timer value is set
>to a large value, and the Hold timer is started".
>
>3. related to the ConnectRetryCnt.
>- Should the value of this counter not be initialsed in the Idle state
>as well?

These are right.  I believes all of these are fixed in the draft of
draft-18.

>4. General doubt  related to the TCP backoff strategy
>Suppose I have a BGP peer that goes down say once every 2hours. With the
>current sheme the ConnectRetryCnt keeps on increasing, and as a
>consequence the connection takes increasinly more time to be
>re-establsihed, even if actually it only fails like every 2 hours.

My understanding is once BGP peer is establieshed, the ConnectRetryCnt
is cleared.
-- 
Kunihiro Ishiguro


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id XAA07946 for <idr-archive@nic.merit.edu>; Fri, 5 Jul 2002 23:34:25 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 108E491256; Fri,  5 Jul 2002 23:33:52 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id D477F91257; Fri,  5 Jul 2002 23:33:51 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 5A26891256 for <idr@trapdoor.merit.edu>; Fri,  5 Jul 2002 23:32:33 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 396435DE11; Fri,  5 Jul 2002 23:32:33 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from titanium.zebra.org (12-236-237-176.client.attbi.com [12.236.237.176]) by segue.merit.edu (Postfix) with ESMTP id 8A0A15DDB9 for <idr@merit.edu>; Fri,  5 Jul 2002 23:32:32 -0400 (EDT)
Received: from titanium.zebra.org (IDENT:kunihiro@titanium [127.0.0.1]) by titanium.zebra.org (8.9.3/8.9.3) with ESMTP id XAA00731; Fri, 5 Jul 2002 23:33:19 -0400
Date: Fri, 05 Jul 2002 20:33:18 -0700
Message-ID: <m23cuxsf7l.wl@titanium.zebra.org>
From: Kunihiro Ishiguro <kunihiro@zebra.org>
To: "Fryczke, Pawel" <Pawel.Fryczke@intel.com>
Cc: idr@merit.edu
Subject: Re: Aggregation and MULTI_EXIT_DISC, NEXT_HOP
In-Reply-To: <413FBB0BA5AED1119122000083234B1A015A9420@alpha.igk.intel.com>
References: <413FBB0BA5AED1119122000083234B1A015A9420@alpha.igk.intel.com>
User-Agent: Wanderlust/2.8.1 (Something) SEMI/1.14.3 (Ushinoya) FLIM/1.14.2 (Yagi-Nishiguchi) APEL/10.3 Emacs/21.2.50 (powerpc-unknown-linux-gnu) MULE/5.0 (SAKAKI)
MIME-Version: 1.0 (generated by SEMI 1.14.3 - "Ushinoya")
Content-Type: text/plain; charset=US-ASCII
Sender: owner-idr@merit.edu
Precedence: bulk

>The latest draft, draft-ietf-idr-bgp4-17.txt states:
>Routes that have the following attributes shall not be aggregated
>unless the corresponding attributes of each route are identical:
>MULTI_EXIT_DISC, NEXT_HOP.

Well, this comes from RFC1771 and we didn't change it.

9.2.4.2 Aggregating Routing Information

   Aggregation is the process of combining the characteristics of
   several different routes in such a way that a single route can be
   advertised.  Aggregation can occur as part of the decision  process
   to reduce the amount of routing information that will be placed in
   the Adj-RIBs-Out.

   Aggregation reduces the amount of information that a BGP speaker must
   store and exchange with other BGP speakers. Routes can be aggregated
   by applying the following procedure separately to path attributes of
   like type and to the Network Layer Reachability Information.

   Routes that have the following attributes shall not be aggregated
   unless the corresponding attributes of each route are identical:
   MULTI_EXIT_DISC, NEXT_HOP.

IHMO, the intention of this part is "BGP should aggregate the routes
which have same network reachability".  Diffrent MULTI_EXIT_DISC or
NEXT_HOP means different network reachability.  So the specification
said "shall not be aggregate".

>Many vendors (e.g. Cisco/Juniper) does not follow this requirement. Why?
>What is the purpose of this requirement?

Today's network operator take control the use of aggregation.  So even
if we aggregate diffrent network rechability routes, that should be
ok.
-- 
Kunihiro Ishiguro


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id GAA20747 for <idr-archive@nic.merit.edu>; Fri, 5 Jul 2002 06:23:00 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 3FC6691203; Fri,  5 Jul 2002 06:22:32 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 097D391244; Fri,  5 Jul 2002 06:22:31 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id F384891203 for <idr@trapdoor.merit.edu>; Fri,  5 Jul 2002 06:22:30 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id D9B065DF05; Fri,  5 Jul 2002 06:22:30 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from mail.alcatel.be (alc250.alcatel.be [195.207.101.250]) by segue.merit.edu (Postfix) with ESMTP id D321E5DF04 for <idr@merit.edu>; Fri,  5 Jul 2002 06:22:29 -0400 (EDT)
Received: from Bemail06.net.alcatel.be (relay3 [127.0.0.1]) by mail.alcatel.be (8.11.0/8.11.4) with ESMTP id g65AIfG05979 for <idr@merit.edu>; Fri, 5 Jul 2002 12:18:41 +0200
Received: from alcatel.be ([138.203.142.10]) by Bemail06.net.alcatel.be (Lotus Domino Release 5.0.8) with ESMTP id 2002070512221705:3271 ; Fri, 5 Jul 2002 12:22:17 +0200 
Message-ID: <3D25733C.B5F1D90@alcatel.be>
Date: Fri, 05 Jul 2002 12:21:48 +0200
From: hans.de_vleeschouwer@alcatel.be
X-Mailer: Mozilla 4.72 [en] (WinNT; I)
X-Accept-Language: en
MIME-Version: 1.0
To: BGP mailing list <idr@merit.edu>
Cc: Gilles Geerts <gilles.geerts@alcatel.be>
Subject: TCP state machine doubts
X-MIMETrack: Itemize by SMTP Server on BEMAIL06/BE/ALCATEL(Release 5.0.8 |June 18, 2001) at 07/05/2002 12:22:17, Serialize by Router on BEMAIL06/BE/ALCATEL(Release 5.0.8 |June 18, 2001) at 07/05/2002 12:22:18, Serialize complete at 07/05/2002 12:22:18
Content-Transfer-Encoding: 7bit
Content-Type: text/plain; charset=us-ascii
Sender: owner-idr@merit.edu
Precedence: bulk

all,

i have a few questions related to the new state machine proposed in the
latest BGP draft.

1. related to terminology related to the connect retry counter.
I find back in the state descriptions following statement:
- "clears the ConnectRetry timer" (e.g. page 35, state connect)
- "Set connect retry timer to zero" (e.g. page 37, state Open sent)
- "Connect retry timer = 0," (e.g. page 38, state Open sent)
 I assume that the meaning in all of the above cases is : "the
ConnectRetry timer is stopped".  If so, would this not be a better
wording for the action?

2. related to the terminology for the holdtimer.
In state Connect on page 34 I find the following text: "set Hold timer
to a large value". I expect that this means "The Hold timer value is set
to a large value, and the Hold timer is started".

3. related to the ConnectRetryCnt.
- Should the value of this counter not be initialsed in the Idle state
as well?

4. General doubt  related to the TCP backoff strategy
Suppose I have a BGP peer that goes down say once every 2hours. With the
current sheme the ConnectRetryCnt keeps on increasing, and as a
consequence the connection takes increasinly more time to be
re-establsihed, even if actually it only fails like every 2 hours.

The mechanism only gives like 'bad points' , but never 'good points'. In
this way a peer never gets rid of his flapping history.
Is this really the intented behavior?

Thanks,
  Hans De Vleeschouwer.



Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id IAA06620 for <idr-archive@nic.merit.edu>; Thu, 4 Jul 2002 08:10:56 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id BF95B91205; Thu,  4 Jul 2002 08:10:17 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 8F74691211; Thu,  4 Jul 2002 08:10:17 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id 4A6DC91205 for <idr@trapdoor.merit.edu>; Thu,  4 Jul 2002 08:10:16 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 345C75DEBD; Thu,  4 Jul 2002 08:10:16 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from mail1.hd.intel.com (hdfdns01.hd.intel.com [192.52.58.10]) by segue.merit.edu (Postfix) with ESMTP id BBF215DD8F for <idr@merit.edu>; Thu,  4 Jul 2002 08:10:15 -0400 (EDT)
Received: from fmsmsxvs040.fm.intel.com (fmsmsxvs040.fm.intel.com [132.233.42.124]) by mail1.hd.intel.com (8.11.6/8.11.6/d: solo.mc,v 1.42 2002/05/23 22:21:11 root Exp $) with SMTP id g64CAEe24183 for <idr@merit.edu>; Thu, 4 Jul 2002 12:10:14 GMT
Received: from alpha.igk.intel.com ([172.28.168.68]) by fmsmsxvs040.fm.intel.com (NAVGW 2.5.2.11) with SMTP id M2002070405092912858 for <idr@merit.edu>; Thu, 04 Jul 2002 05:09:30 -0700
Received: by alpha.igk.intel.com with Internet Mail Service (5.5.2653.19) id <L7B07H6J>; Thu, 4 Jul 2002 14:08:04 +0200
Message-ID: <413FBB0BA5AED1119122000083234B1A015A9420@alpha.igk.intel.com>
From: "Fryczke, Pawel" <Pawel.Fryczke@intel.com>
To: idr@merit.edu
Subject: Aggregation and MULTI_EXIT_DISC, NEXT_HOP
Date: Thu, 4 Jul 2002 14:08:02 +0200 
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
Content-Type: text/plain; charset="ISO-8859-2"
Sender: owner-idr@merit.edu
Precedence: bulk

The latest draft, draft-ietf-idr-bgp4-17.txt states:
Routes that have the following attributes shall not be aggregated
unless the corresponding attributes of each route are identical:
MULTI_EXIT_DISC, NEXT_HOP.

Many vendors (e.g. Cisco/Juniper) does not follow this requirement. Why?
What is the purpose of this requirement?
Pawel


Received: from trapdoor.merit.edu (postfix@trapdoor.merit.edu [198.108.1.26]) by nic.merit.edu (8.9.3/8.9.1) with ESMTP id WAA17330 for <idr-archive@nic.merit.edu>; Wed, 3 Jul 2002 22:33:51 -0400 (EDT)
Received: by trapdoor.merit.edu (Postfix) id 8286991201; Wed,  3 Jul 2002 22:31:09 -0400 (EDT)
Delivered-To: idr-outgoing@trapdoor.merit.edu
Received: by trapdoor.merit.edu (Postfix, from userid 56) id 1B8E49122C; Wed,  3 Jul 2002 22:31:00 -0400 (EDT)
Delivered-To: idr@trapdoor.merit.edu
Received: from segue.merit.edu (segue.merit.edu [198.108.1.41]) by trapdoor.merit.edu (Postfix) with ESMTP id C8F4791201 for <idr@trapdoor.merit.edu>; Wed,  3 Jul 2002 22:30:59 -0400 (EDT)
Received: by segue.merit.edu (Postfix) id 9FF8E5DDDD; Wed,  3 Jul 2002 22:30:59 -0400 (EDT)
Delivered-To: idr@merit.edu
Received: from titanium.zebra.org (unknown [65.223.109.2]) by segue.merit.edu (Postfix) with ESMTP id 0C3455DDBF; Wed,  3 Jul 2002 22:30:59 -0400 (EDT)
Received: from titanium.zebra.org (IDENT:kunihiro@titanium [127.0.0.1]) by titanium.zebra.org (8.9.3/8.9.3) with ESMTP id WAA18054; Wed, 3 Jul 2002 22:32:13 -0400
Date: Wed, 03 Jul 2002 19:32:13 -0700
Message-ID: <m2adp8xlxu.wl@titanium.zebra.org>
From: Kunihiro Ishiguro <kunihiro@zebra.org>
To: Susan Harris <srh@merit.edu>
Cc: idr@merit.edu
Subject: BGP4-draft-18
User-Agent: Wanderlust/2.8.1 (Something) SEMI/1.14.3 (Ushinoya) FLIM/1.14.2 (Yagi-Nishiguchi) APEL/10.3 Emacs/21.2.50 (powerpc-unknown-linux-gnu) MULE/5.0 (SAKAKI)
MIME-Version: 1.0 (generated by SEMI 1.14.3 - "Ushinoya")
Content-Type: text/plain; charset=US-ASCII
Sender: owner-idr@merit.edu
Precedence: bulk

Hi, this is Kunihiro

Sue, when do we publish draft-18 of BGP4?  Well, some people start
using draft-17 with IdleHold status and claim that all of other
current implementation is wrong...  If you need any help or review or
some other things, please let me know.  I really want to update the
draft.
--
Kunihiro Ishiguro