RE: discussion on fast notification work

András Császár <Andras.Csaszar@ericsson.com> Fri, 08 July 2011 09:09 UTC

Return-Path: <Andras.Csaszar@ericsson.com>
X-Original-To: rtgwg@ietfa.amsl.com
Delivered-To: rtgwg@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id EC90221F88C0 for <rtgwg@ietfa.amsl.com>; Fri, 8 Jul 2011 02:09:36 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -6.299
X-Spam-Level:
X-Spam-Status: No, score=-6.299 tagged_above=-999 required=5 tests=[BAYES_00=-2.599, MIME_8BIT_HEADER=0.3, RCVD_IN_DNSWL_MED=-4]
Received: from mail.ietf.org ([64.170.98.30]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id LyrXFYBlXudl for <rtgwg@ietfa.amsl.com>; Fri, 8 Jul 2011 02:09:36 -0700 (PDT)
Received: from mailgw10.se.ericsson.net (mailgw10.se.ericsson.net [193.180.251.61]) by ietfa.amsl.com (Postfix) with ESMTP id 9F45E21F877A for <rtgwg@ietf.org>; Fri, 8 Jul 2011 02:09:35 -0700 (PDT)
X-AuditID: c1b4fb3d-b7c17ae00000262e-81-4e16c94e7d1f
Received: from esessmw0184.eemea.ericsson.se (Unknown_Domain [153.88.253.124]) by mailgw10.se.ericsson.net (Symantec Mail Security) with SMTP id AC.EC.09774.E49C61E4; Fri, 8 Jul 2011 11:09:34 +0200 (CEST)
Received: from ESESSCMS0363.eemea.ericsson.se ([169.254.1.174]) by esessmw0184.eemea.ericsson.se ([153.88.115.81]) with mapi; Fri, 8 Jul 2011 11:09:34 +0200
From: András Császár <Andras.Csaszar@ericsson.com>
To: "curtis@occnc.com" <curtis@occnc.com>
Date: Fri, 08 Jul 2011 11:09:32 +0200
Subject: RE: discussion on fast notification work
Thread-Topic: discussion on fast notification work
Thread-Index: Acw8/FAHIL0adQsyTMGWjmZrv+blmgAUL9tw
Message-ID: <8DCD771BDA4A394E9BCBA8932E8392973216EA6463@ESESSCMS0363.eemea.ericsson.se>
References: Your message of "Wed, 06 Jul 2011 19:04:51 EDT." <0ED867EB33AB2B45AAB470D5A64CDBF60CCA025055@EUSAACMS0701.eamcs.ericsson.se> <201107072319.p67NJ9pD035115@harbor.orleans.occnc.com>
In-Reply-To: <201107072319.p67NJ9pD035115@harbor.orleans.occnc.com>
Accept-Language: hu-HU, en-US
Content-Language: en-US
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
acceptlanguage: hu-HU, en-US
Content-Type: text/plain; charset="iso-8859-1"
Content-Transfer-Encoding: quoted-printable
MIME-Version: 1.0
X-Brightmail-Tracker: AAAAAA==
Cc: "rtgwg@ietf.org" <rtgwg@ietf.org>
X-BeenThere: rtgwg@ietf.org
X-Mailman-Version: 2.1.12
Precedence: list
List-Id: Routing Area Working Group <rtgwg.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/rtgwg>, <mailto:rtgwg-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/rtgwg>
List-Post: <mailto:rtgwg@ietf.org>
List-Help: <mailto:rtgwg-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/rtgwg>, <mailto:rtgwg-request@ietf.org?subject=subscribe>
X-List-Received-Date: Fri, 08 Jul 2011 09:09:37 -0000

Hello Curtis, 

> We don't want a
> protocol fix that addresses a bad implementation of a perfectly good
> protocol.

Please note we are not trying to develop a protocol fix.
Currently there exist not standard protocol for
- IPFRR
- with 100% failure coverage

There are proposals only. So there is nothing to fix.

Note that IPFRR-FN has no intention to fix/improve OSPF/ISIS routing convergence. Instead, we acknowledge that there are practical reasons why IGPs typically do not provide fail-over times below a few hundred milliseconds. But if sub-50ms fail-over is really needed (which could actually be another thread for debate IMHO), then something else is needed.

We are almost finishing the new drafts, but it is unlikely we can submit them much earlier before Monday's cut-off.

András


> -----Original Message-----
> From: rtgwg-bounces@ietf.org [mailto:rtgwg-bounces@ietf.org] 
> On Behalf Of Curtis Villamizar
> Sent: 2011. július 8. 1:19
> To: Jeff Tantsura
> Cc: rtgwg@ietf.org
> Subject: Re: discussion on fast notification work 
> 
> 
> I agree with John that there doesn't seem to be a problem to solve.
> 
> To add to that.
> 
> In this case there seem to be consensus that this was a bad idea.  If
> the authors want to change that consensus, a draft should come well
> ahead of the meeting with discussion on the list, and only then if
> there is interest, should be spend meeting time on this.
> 
> For example, on the list we can discuss what vendor equipment was used
> in the networks that were prototyped/measured.  We don't want a
> protocol fix that addresses a bad implementation of a perfectly good
> protocol.  I prefer draft first, well in advance of the meeting, then
> consider a time slot allocation after discussion on list.
> 
> [ begin aside
> 
> The issue is whether it is possible to keep the flooding latency low
> relative to geographic delay and detection time.  A somewhat worst
> case flooding delay in a good implementation consists of 1) receive
> packet on line card, check if it is a known change, 2) send it to
> another processor to verify if it is known, 3) receive it agains and
> recheck, 4) send it again to line cards, 5) receive it and send it on
> the line interface.  Some implementations multicast from the receiving
> node, yielding only one internal hop, for two context switches total.
> 
> Even absent of geographic delay, detection time is order 10 msec.  If
> we assume a context switch time of 1 msec even receiving a packet
> three times within the same node adds on the order of 3 msec per hop.
> 
> In a good implementiation, this processing is done on the line cards
> which have little else to do other than react to low effort near real
> time requirements such as protection and flooding and installing new
> state at a slightly lower priority (and with good CPU starvation
> prevention in the scheduling).  If the line card processor is doing
> real time work and there is a desire to reduce latency, most cards
> today can support a change to the clock interrupt from a 1000 Hz clock
> to something a bit faster.  The down side is increasing the amount of
> time spent in the RTC interrupt code.
> 
> In a bad implementation there is a chance that a CPU delays flooding
> while waiting for an SPF or for some high CPU user link BGP to finish
> up some increment of work.  Then the delays can be quite long.
> 
> One advantage of flooding is that after a fault the flooding does not
> rely on assumptions about the remaining topology.  Multiple simultaous
> faults due to failure of a shared resource can and often do occur.
> Any mechanism that relys on forwarding or relies on source routing
> through a best guess at what links remain will fail to deliver
> notifications in such a case.  In that case we are back to the
> flooding delay anyway, so might as well work on keeping that short.
> 
> Maybe on the list the authors can describe the motivation, the target
> flooding or notification delay, and how the shortcomings discussed
> previously are addressed.
> 
> end aside ]
> 
> Curtis
> 
> In message 
> <0ED867EB33AB2B45AAB470D5A64CDBF60CCA025055@EUSAACMS0701.eamcs
> .ericsson.se>
> Jeff Tantsura writes:
> >  
> > Hi Alia/John,
> >  
> > Thanks for your comments.
> >  
> > We have been listening very carefully and have taken all the critic
> > rather seriously.
> >  
> > We have changed the way we distribute FNs, added authentication and
> > prototyped/measured convergence times in networks of different
> > diameters.  We have also added some additional clients (protocols to
> > use FN)
> >  
> > Updated draft will be published soon.
> >  
> > We would really like to present our work.
> >  
> > Thanks! 
> >  
> > Regards,
> > Jeff  
> >  
> > -----Original Message-----
> > From: rtgwg-bounces@ietf.org 
> [mailto:rtgwg-bounces@ietf.org] On Behalf Of John E Drake
> > Sent: Wednesday, July 06, 2011 14:54
> > To: Alia Atlas; rtgwg@ietf.org
> > Subject: RE: discussion on fast notification work
> >  
> > Alia,
> >  
> > Is it okay for me to say that I think that this is a really bad idea
> > and that I was glad that interest in it had waned?
> >  
> > Thanks,
> >  
> > John
> >  
> > Sent from my iPhone
> >  
> >  
> > > -----Original Message-----
> > > From: rtgwg-bounces@ietf.org 
> [mailto:rtgwg-bounces@ietf.org] On Behalf
> > > Of Alia Atlas
> > > Sent: Wednesday, July 06, 2011 1:57 PM
> > > To: rtgwg@ietf.org
> > > Subject: discussion on fast notification work
> > > 
> > > The last 2 IETFs, we have had discussions about the idea of fast
> > > notification, as described in
> > > draft-lu-fast-notification-framework, 
> draft-lu-fn-transport-00, and
> > > draft-csaszar-ipfrr-fn-00.
> > > 
> > > Since then, I have not seen substantial discussion or 
> interest on the
> > > mailing list.  If you are
> > > interested in this work, have questions about it, or 
> would like to see
> > > RTGWG continue to discuss it,
> > > please send email to this mailing list.  I'd like to see this
> > > conversation happening here before IETF.
> > > 
> > > Thanks,
> > > Alia
> _______________________________________________
> rtgwg mailing list
> rtgwg@ietf.org
> https://www.ietf.org/mailman/listinfo/rtgwg
>