Re: OSPF WG Charter Proposal

"Ash, Gerald R (Jerry), ALASO" <gash@ATT.COM> Wed, 06 November 2002 20:38 UTC

Received: from cherry.ease.lsoft.com (cherry.ease.lsoft.com [209.119.0.109]) by ietf.org (8.9.1a/8.9.1a) with ESMTP id PAA11321 for <ospf-archive@LISTS.IETF.ORG>; Wed, 6 Nov 2002 15:38:57 -0500 (EST)
Received: from walnut (209.119.0.61) by cherry.ease.lsoft.com (LSMTP for Digital Unix v1.1b) with SMTP id <20.007B4AA5@cherry.ease.lsoft.com>; Wed, 6 Nov 2002 15:41:25 -0500
Received: from DISCUSS.MICROSOFT.COM by DISCUSS.MICROSOFT.COM (LISTSERV-TCP/IP release 1.8e) with spool id 326414 for OSPF@DISCUSS.MICROSOFT.COM; Wed, 6 Nov 2002 15:41:25 -0500
Received: from 192.128.166.71 by WALNUT.EASE.LSOFT.COM (SMTPL release 1.0f) with TCP; Wed, 6 Nov 2002 15:41:24 -0500
Received: from attrh3i.attrh.att.com ([135.71.62.12]) by almso2.proxy.att.com (AT&T IPNS/MSO-4.0) with ESMTP id gA6JwaDw011872 for <OSPF@DISCUSS.MICROSOFT.COM>; Wed, 6 Nov 2002 15:41:23 -0500 (EST)
Received: from occlust04evs1.ugd.att.com (135.71.164.13) by attrh3i.attrh.att.com (6.5.019) id 3D8B560500FA1764; Wed, 6 Nov 2002 15:41:23 -0500
X-MimeOLE: Produced By Microsoft Exchange V6.0.5762.3
content-class: urn:content-classes:message
MIME-Version: 1.0
Content-Type: text/plain; charset="iso-8859-1"
Thread-Topic: OSPF WG Charter Proposal
Thread-Index: AcJ/fClyA4rkasndTdG7re9FGZq6nAGUHbAQ
Message-ID: <28F05913385EAC43AF019413F674A01701B65690@OCCLUST04EVS1.ugd.att.com>
Date: Wed, 06 Nov 2002 15:41:20 -0500
Reply-To: Mailing List <OSPF@DISCUSS.MICROSOFT.COM>
Sender: Mailing List <OSPF@DISCUSS.MICROSOFT.COM>
From: "Ash, Gerald R (Jerry), ALASO" <gash@ATT.COM>
Subject: Re: OSPF WG Charter Proposal
Comments: To: rohit@xebeo.com, Acee Lindem <acee@redback.com>
To: OSPF@DISCUSS.MICROSOFT.COM
Precedence: list
Content-Transfer-Encoding: 8bit
X-MIME-Autoconverted: from quoted-printable to 8bit by ietf.org id PAA11321

Rohit, Acee,

From the (consolidated) thread so far:

Rohit/Acee> Here is a strawman of the Charter. Please have a look at and send
Rohit/Acee> any comments you may have to me or Acee or to the list.

Venkata> Can you consider "Flooding Optimizations" into charter?
Venkata> If not, can you please explain the reason, why we should not?
Venkata> This is field of its own - we might expect lot of drafts.

Rohit> We considered this but would like to see some operational requirements
Rohit> before including this in the charter. Is anybody running OSPF networks
Rohit> hitting this problem with no workarounds in sight?
Rohit> It would be a candidate for a future version of the charter once some
Rohit> of the current items clear.

Vishwas> I do think it can be helpful in a few cases. A recent paper by Aman 
Vishwas> Shaikh/Albert Greenberg et.al. about analyzing OSPF for Enterprise 
Vishwas> is one such case. 
Vishwas> One of our drafts "Congestion Avoidance and Control for OSPF networks" 
Vishwas> talks about problems that have been caused by flooding overload in 
Vishwas> production networks.

Rohit> I am familiar with Aman's work. The problem identified there was
Rohit> that of (a) sub-optimal network/ospf configuration and (b) 
Rohit> broken router.
Rohit> This does not of course justify building new protocol mechanisms, 
Rohit> thereby complicating the protocol.
Rohit> I will look at the congestion again to see if/why it concludes otherwise.

We would like to see OSPF 'congestion control' included into the OSPF charter, where 'flooding optimizations' is a key component as discussed in our draft http://www.ietf.org/internet-drafts/draft-ash-manral-ospf-congestion-control-00.txt.  We can propose a workplan/goals/milestones if you'd like.

Several other drafts identify needs related to OSPF congestion control:
http://ietf.org/internet-drafts/draft-ietf-ospf-scalability-01.txt
http://www.ietf.org/internet-drafts/draft-choudhury-manral-flooding-simulation-00.txt
http://www.ietf.org/internet-drafts/draft-dovolsky-ccamp-ospf-limited-flooding-00.txt

AT&T has suffered a few massive failures of operational networks due to control overloads of link-state (LS) protocols ('OSPF', 'PNNI', etc.).  These outages are documented and referenced in our draft (and in http://search.ietf.org/internet-drafts/draft-ash-ospf-isis-congestion-control-02.txt).

In the instances cited, the link-state protocol overwhelmed the network with a control load 'storm' ('LSA overload'), which brought the network down, and then prevented its recovery.  Fortunately such failures are very rare; however, 'rare' for such events is unacceptable, 'never' is the goal.  Other service providers have experienced similar outages caused by similar problems.

Such failures are not the fault of the service provider operation or the vendor/equipment implementation.  They are due to shortcomings in the link-state protocols themselves -- thus the need for the enhancements proposed in the draft.

The proposals in http://www.ietf.org/internet-drafts/draft-ash-manral-ospf-congestion-control-00.txt will prevent such events from being triggered, and/or provide recovery mechanisms in case such events occur.

The problem of control overload is becoming even more acute as LS protocols are enhanced to support new capabilities, such as:

MPLS traffic engineering http://search.ietf.org/internet-drafts/draft-katz-yeung-ospf-traffic-09.txt,
GMPLS http://www.ietf.org/internet-drafts/draft-ietf-ccamp-ospf-gmpls-extensions-08.txt,
multi-area TE http://www.ietf.org/internet-drafts/draft-kompella-mpls-multiarea-te-03.txt,
MPLS/DiffServ TE http://search.ietf.org/internet-drafts/draft-ietf-tewg-diff-te-reqts-06.txt,
etc.

With this ever advancing complexity, the need keeps increasing to address the stated problem, soon.  Thus the need for the charter item on OSPF congestion control.

Thanks,
Jerry Ash