Re: [Lsr] Prefix Unreachable Announcement Use Cases

Aijun Wang <wangaijun@tsinghua.org.cn> Wed, 18 November 2020 08:06 UTC

Return-Path: <wangaijun@tsinghua.org.cn>
X-Original-To: lsr@ietfa.amsl.com
Delivered-To: lsr@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id C9DCF3A102C for <lsr@ietfa.amsl.com>; Wed, 18 Nov 2020 00:06:25 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.896
X-Spam-Level:
X-Spam-Status: No, score=-1.896 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, HTML_MESSAGE=0.001, RCVD_IN_MSPIKE_H3=0.001, RCVD_IN_MSPIKE_WL=0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=unavailable autolearn_force=no
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id hRSb9WzG7-rA for <lsr@ietfa.amsl.com>; Wed, 18 Nov 2020 00:06:21 -0800 (PST)
Received: from mail-m127101.qiye.163.com (mail-m127101.qiye.163.com [115.236.127.101]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 4D62C3A15E4 for <lsr@ietf.org>; Wed, 18 Nov 2020 00:05:36 -0800 (PST)
Received: from DESKTOP2IOH5QC (unknown [219.142.69.75]) by mail-m127101.qiye.163.com (Hmail) with ESMTPA id AE0A24627B; Wed, 18 Nov 2020 16:05:30 +0800 (CST)
From: Aijun Wang <wangaijun@tsinghua.org.cn>
To: 'Robert Raszuk' <robert@raszuk.net>, 'Jeff Tantsura' <jefftant.ietf@gmail.com>
Cc: 'Gyan Mishra' <hayabusagsm@gmail.com>, "'Acee Lindem (acee)'" <acee@cisco.com>, 'lsr' <lsr@ietf.org>, "'Acee Lindem (acee)'" <acee=40cisco.com@dmarc.ietf.org>
References: <CAOj+MMH7zRaXNJTRC0ua7ohasUpo0MmeqgzcU9BdpcD7wD+Yrg@mail.gmail.com> <D477846E-1086-46A8-B2D6-E552623E2643@gmail.com> <016b01d6bca9$cf908c20$6eb1a460$@tsinghua.org.cn> <CAOj+MMEKbBU1mymU2RzWzwi6Se8ZwQ9OsCBn4NUiX3YAceLdoQ@mail.gmail.com> <CABNhwV1yS1KdPe0hYGOUhDBpqbNqZCaO=xNEr_LaRg35b=f55g@mail.gmail.com> <CAOj+MMGnRkYrTcC45QEy+F5HNCoFn75r=1gn-+OT89Q53D_pYA@mail.gmail.com> <CABNhwV1pK5JX5sDcPyRKuR67eAkAq-q3wRmYqbsfCwOj0wWjSw@mail.gmail.com> <32DFCE3A-D41C-48CA-928A-37011D158AEF@cisco.com> <c646fecb-2d45-4ece-adc1-eb0635a58c3c@Spark> <CAOj+MMGrZz3pJfmP1gh+4YO6XfKr_NWe+QOy8mfjyqUxqub5kw@mail.gmail.com>
In-Reply-To: <CAOj+MMGrZz3pJfmP1gh+4YO6XfKr_NWe+QOy8mfjyqUxqub5kw@mail.gmail.com>
Date: Wed, 18 Nov 2020 16:05:30 +0800
Message-ID: <019901d6bd81$9565b5b0$c0312110$@tsinghua.org.cn>
MIME-Version: 1.0
Content-Type: multipart/alternative; boundary="----=_NextPart_000_019A_01D6BDC4.A38ACA70"
X-Mailer: Microsoft Outlook 16.0
Thread-Index: AQILbNEP0SRic90ZNEbpRjygmkrFZQJx5udEAhaZmkECROyKowKGLtmXAYkEwwQB6y6oOAGrbUjJAsuFR0kBjd7z5KjNwu/w
Content-Language: zh-cn
X-HM-Spam-Status: e1kfGhgUHx5ZQUtXWQgYFAkeWUFZS1VLWVdZKFlBSkxLS0o3V1ktWUFJV1 kPCRoVCBIfWUFZTUNOHR9DT00YShhKVkpNS05NQ01MSEpLSUpVEwETFhoSFyQUDg9ZV1kWGg8SFR 0UWUFZT0tIVUpKS09ISFVLWQY+
X-HM-Sender-Digest: e1kMHhlZQR0aFwgeV1kSHx4VD1lBWUc6ODY6DDo4KT8ZEQ9JTBIJMSw6 OhAaC0xVSlVKTUtOTUNNTEhKTkhJVTMWGhIXVQwaFRwaEhEOFTsPCBIVHBMOGlUUCRxVGBVFWVdZ EgtZQVlJSkJVSk9JVU1CVUxOWVdZCAFZQU9LSk9NNwY+
X-HM-Tid: 0a75da63209e9865kuuuae0a24627b
Archived-At: <https://mailarchive.ietf.org/arch/msg/lsr/3xM3wWseSW8VEMZ_-sBBDxMM2xU>
Subject: Re: [Lsr] Prefix Unreachable Announcement Use Cases
X-BeenThere: lsr@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: Link State Routing Working Group <lsr.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/lsr>, <mailto:lsr-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/lsr/>
List-Post: <mailto:lsr@ietf.org>
List-Help: <mailto:lsr-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/lsr>, <mailto:lsr-request@ietf.org?subject=subscribe>
X-List-Received-Date: Wed, 18 Nov 2020 08:06:26 -0000

Hi, Robert:

 

The trigger and propagation of PUA info can be standardized, the actions based on the PUA can be different in different situation. 

We can discuss and describe the actions based on different scenarios after its WG adoption?

 

 

Best Regards

 

Aijun Wang

China Telecom

 

From: Robert Raszuk <robert@raszuk.net> 
Sent: Wednesday, November 18, 2020 3:49 PM
To: Jeff Tantsura <jefftant.ietf@gmail.com>
Cc: Gyan Mishra <hayabusagsm@gmail.com>; Acee Lindem (acee) <acee@cisco.com>; lsr <lsr@ietf.org>; Aijun Wang <wangaijun@tsinghua.org.cn>; Acee Lindem (acee) <acee=40cisco.com@dmarc.ietf.org>
Subject: Re: [Lsr] Prefix Unreachable Announcement Use Cases

 

Jeff,

 

Please notice that WAN is not an IX. 

 

While you can have full mesh of BFD sessions among all IXP participants each bombarding each over over TB fabric every 100 ms or so to map the same over global WAN is a different game. If nothing else RTT between IXP participants in healthy IX is around 1 ms while RTT between PEs distributed globally is often 100-200 ms. 

 

Just imagine 1000 PEs in 10 areas distributed all over the world. That means that in worst case scenario (say same mgmt VPN present on each PE) you will establish 1000*999 BFD sessions. Now for this to make sense timer needs to be 100 ms or so with 3x or 5x multiplier. Anything slower will defeat the purpose as BGP withdraw will be faster. 

 

Then we go into queuing issues. If BFD packets are queued at any interface meltdowns may occur which can be far worse in consequences then waiting for BGP service route removal. Such meltdowns often result in cascading effects to the applications itself. 

 

So this is not at all about autodiscovery with which address to setup the BFD session. It is much more about operational aspects of going that direction. 

 

With that I am supportive of this work even if we label it as experimental for some time. As each network is different what is optimal solution for one design and deployment may not be optimal for the other. 

 

Many thx,

Robert

 

 

On Wed, Nov 18, 2020 at 4:34 AM Jeff Tantsura <jefftant.ietf@gmail.com <mailto:jefftant.ietf@gmail.com> > wrote:

We have been discussing for quite some time and in different wg's (there’s IX with RS use case) BFD verification based on next-hop extraction, Robert - you should know. (also built a well working prototype in previous life). 

Very simple logic:

Upon route import (BGP update received and imported), extract next-hop, walk BFD session table, if no match (no existing session) - establish (S)BFD session (Discriminators distribution is a solved problem) to the next-hop, associate fate of all routes received from it, keep timers reasonable to prevent false positives.

State is limited to PE’s importing each others routes (sharing a service) only
High degree of automation
No IGP pollution 

 

Cheers, 

Jeff

On Nov 17, 2020, 6:43 AM -0800, Acee Lindem (acee) <acee@cisco.com <mailto:acee@cisco.com> >, wrote:



Speaking as WG member:

 

I think it would be good to hone in on the BGP PE failure convergence use case as suggested by Robert. It seems there is some interest here although I’m not convinced the IGP is the right place to solve this problem.

 

Thanks,

Acee

 

From: Lsr <lsr-bounces@ietf.org <mailto:lsr-bounces@ietf.org> > on behalf of Gyan Mishra <hayabusagsm@gmail.com <mailto:hayabusagsm@gmail.com> >
Date: Tuesday, November 17, 2020 at 4:02 AM
To: Robert Raszuk <robert@raszuk.net <mailto:robert@raszuk.net> >
Cc: lsr <lsr@ietf.org <mailto:lsr@ietf.org> >, Jeff Tantsura <jefftant.ietf@gmail.com <mailto:jefftant.ietf@gmail.com> >, Aijun Wang <wangaijun@tsinghua.org.cn <mailto:wangaijun@tsinghua.org.cn> >, "Acee Lindem (acee)" <acee=40cisco.com@dmarc.ietf.org <mailto:40cisco.com@dmarc.ietf.org> >
Subject: Re: [Lsr] Prefix Unreachable Announcement Use Cases

 

 

 

On Tue, Nov 17, 2020 at 3:36 AM Robert Raszuk <robert@raszuk.net <mailto:robert@raszuk.net> > wrote:

 

 

   Robert, I believe the original intention was related to having the data plane converge quickly when summarization is used and flip so traffic converges from the Active ABR to the Backup ABR. 

 

I do not buy this use case. Flooding within the area is fast such that both ABRs will get the same info. As mentioned before there is no practical use of PUA for making any routing or fwd decision on which ABR to use. If your ABRs are not connected with min redundancy this draft is a worst patch ever to work around such a design. 

 

   Gyan> Agreed.  The point of PUA in ABR use case is the ability to track the component prefixes and in case where component is down and traffic is still forwarded to the ABR and dropped.  The other more important use case is when links are down within the area and the area is partitioned and so one ABR has all component prefixes however other ABR is missing half the component prefixes.  So since the ABR will by default advertise the summary as long as their is one component UP the summary is still advertised.  So this use case is severely impacting as now you have an ECMP path to the other area for the summary via the two ABRs and you drop half your traffic.  So now with PUA the problem is fixed and the PUA is sent and now traffic is only sent to the ABR that has the component prefixes.

 

Please present us a picture indicating before and after ABRs behaviour. 

 

     Gyan> will do 

 

   However PUA can be used in the absence of area segmentation within a single area when a link or node fails to converge the data plane quickly by sending PUA for the backup path so the active path. 

 

If there is no area segmentation then there is no summaries. So what are we missing in the first place ? 

 

    Gyan> Sorry I am stating that PUA feature can also be used intra area where if a link or node goes down to improve data plane convergence.

 

 

With the IGP tuned with BFD fast detection on ISIS or OSPF links and LFA & RLFA for MPLS or TI-LFA for SR local protection - with those tweaks the convergence is well into sub second.  So for Intra area convergence with all the optimizations mentioned I am not sure how much faster the data plane will converge with PUA.

 

Even without any of the above listed chain of acronymous things will generally work well intra-area without PUAs. 

 

    Gyan> Agreed which is why I mentioned the BGP next hop self use case if I could figure out how PUA could help there that would be a major benefit of PUA.

 

Thx,
R.

 

 

--

 <http://www.verizon.com/> <>

Gyan Mishra

Network Solutions Architect 

M 301 502-1347
13101 Columbia Pike 
Silver Spring, MD