Re: [Lsr] New Version Notification for draft-wang-lsr-prefix-unreachable-annoucement-03.txt

Aijun Wang <wangaijun@tsinghua.org.cn> Mon, 24 August 2020 09:11 UTC

Return-Path: <wangaijun@tsinghua.org.cn>
X-Original-To: lsr@ietfa.amsl.com
Delivered-To: lsr@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id D1E023A0BA1 for <lsr@ietfa.amsl.com>; Mon, 24 Aug 2020 02:11:48 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: 0.013
X-Spam-Level:
X-Spam-Status: No, score=0.013 tagged_above=-999 required=5 tests=[HTML_MESSAGE=0.001, NORMAL_HTTP_TO_IP=0.001, NUMERIC_HTTP_ADDR=0.001, RCVD_IN_MSPIKE_H2=-0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, T_REMOTE_IMAGE=0.01, URIBL_BLOCKED=0.001] autolearn=ham autolearn_force=no
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id OwQFN5avrkOQ for <lsr@ietfa.amsl.com>; Mon, 24 Aug 2020 02:11:44 -0700 (PDT)
Received: from mail-m127101.qiye.163.com (mail-m127101.qiye.163.com [115.236.127.101]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id CBD353A0B9E for <lsr@ietf.org>; Mon, 24 Aug 2020 02:11:43 -0700 (PDT)
Received: from DESKTOP2IOH5QC (unknown [219.142.69.75]) by mail-m127101.qiye.163.com (Hmail) with ESMTPA id 4D97C43C0A; Mon, 24 Aug 2020 17:11:39 +0800 (CST)
From: Aijun Wang <wangaijun@tsinghua.org.cn>
To: 'Gyan Mishra' <hayabusagsm@gmail.com>
Cc: 'Peter Psenak' <ppsenak=40cisco.com@dmarc.ietf.org>, 'Robert Raszuk' <robert@raszuk.net>, 'Huzhibo' <huzhibo@huawei.com>, 'Aijun Wang' <wangaj3@chinatelecom.cn>, 'lsr' <lsr@ietf.org>, "'Acee Lindem (acee)'" <acee@cisco.com>, 'Xiaoyaqun' <xiaoyaqun@huawei.com>
References: <CAOj+MMGgpcnRMnPxQqcZofgJNH67QYUQOxWsTU5Xp-Km0D2DDg@mail.gmail.com> <A202F6E1-AD83-46E8-A1D2-E156FB35DF57@chinatelecom.cn> <CAOj+MMHd1WZNCWr6KihxzDf=G53A8FBUBbqHpZGNwvF4hsuzMA@mail.gmail.com> <059e01d66ad7$ffda2e50$ff8e8af0$@tsinghua.org.cn> <CABNhwV2oXBBNKOdUA59sLF+b5srWHi3KF2Q6H1Tg-dK+gA9Lgw@mail.gmail.com>
In-Reply-To: <CABNhwV2oXBBNKOdUA59sLF+b5srWHi3KF2Q6H1Tg-dK+gA9Lgw@mail.gmail.com>
Date: Mon, 24 Aug 2020 17:11:37 +0800
Message-ID: <013301d679f6$92da3ce0$b88eb6a0$@tsinghua.org.cn>
MIME-Version: 1.0
Content-Type: multipart/alternative; boundary="----=_NextPart_000_0134_01D67A39.A0FF9FC0"
X-Mailer: Microsoft Outlook 16.0
Content-Language: zh-cn
Thread-Index: AQKwsgRl8uq4ZZAYYiwm8gqgwsmPuwLfhjxOAXwBCl8B/jVB7AHXWAwnp1D5U6A=
X-HM-Spam-Status: e1kfGhgUHx5ZQUpXWQgYFAkeWUFZS1VLWVdZKFlBSkxLS0o3V1ktWUFJV1 kPCRoVCBIfWUFZH0hPHkgdQh1LT00aVkpOQkNJTUtJQkJNTUNVEwETFhoSFyQUDg9ZV1kWGg8SFR 0UWUFZT0tIVUpKS09ISFVKS0tZBg++
X-HM-Sender-Digest: e1kMHhlZQR0aFwgeV1kSHx4VD1lBWUc6Kyo6DTo6Mj8tFjAILAw8EQ06 LyMwCgtVSlVKTkJDSU1LSEtLSEtKVTMWGhIXVQwaFRwaEhEOFTsPCBIVHBMOGlUUCRxVGBVFWVdZ EgtZQVlJSkJVSk9JVU1CVUxOWVdZCAFZQUhDSUhNNwY+
X-HM-Tid: 0a741fbcc7389865kuuu4d97c43c0a
Archived-At: <https://mailarchive.ietf.org/arch/msg/lsr/J79WVnZ54TKX3kgQnXfj8FaWFnw>
Subject: Re: [Lsr] New Version Notification for draft-wang-lsr-prefix-unreachable-annoucement-03.txt
X-BeenThere: lsr@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: Link State Routing Working Group <lsr.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/lsr>, <mailto:lsr-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/lsr/>
List-Post: <mailto:lsr@ietf.org>
List-Help: <mailto:lsr-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/lsr>, <mailto:lsr-request@ietf.org?subject=subscribe>
X-List-Received-Date: Mon, 24 Aug 2020 09:11:49 -0000

Hi, Gyan:

 

Sorry for replying you so late.

You are right about the summary address behavior, but the summary address is configured by manually, and if one of the specific prefix within this summary range is down, the black hole(route to this specific prefix) will be formed.  PUA mechanism just want to amend this.

Glad to know Rift has also noticed such issues.  In OSPF/ISIS, such problem needs also be solved.

 

If you are interested this topic, welcome to join us to the solution.

 

 

Best Regards

 

Aijun Wang

China Telecom

 

From: lsr-bounces@ietf.org [mailto:lsr-bounces@ietf.org] On Behalf Of Gyan Mishra
Sent: Thursday, August 6, 2020 4:44 PM
To: Aijun Wang <wangaijun@tsinghua.org.cn>
Cc: Peter Psenak <ppsenak=40cisco.com@dmarc.ietf.org>; Robert Raszuk <robert@raszuk.net>; Huzhibo <huzhibo@huawei.com>; Aijun Wang <wangaj3@chinatelecom.cn>; lsr <lsr@ietf.org>; Acee Lindem (acee) <acee@cisco.com>; Xiaoyaqun <xiaoyaqun@huawei.com>
Subject: Re: [Lsr] New Version Notification for draft-wang-lsr-prefix-unreachable-annoucement-03.txt

 

Hi Aijun and authors

 

I am catching up with this thread after reading through this draft.

 

I understand the concept that we are trying to send a PUA advertisement which sounds similar to Rift Negative Disaggregation prefix now with a  null setting when a longer match component prefix that is part of a summary range is down to detect prefix down conditions with longer match component prefixes that are part of a summary.  

 

Basically how summarization works with both OSPF and ISIS is that at minimum a single longer match within the defined IA summary range must exist for the IA summary to be sent.  So the summary is sent conditionally similar to a BGP conditional advertisement or even a ospf default originate which requires a default in the RIB to exist before the advertisement is sent.  A good example of non conditional analogy with BGP if you have a null0 static for a summary for an exact match BGP advertisement the prefix is always advertised no matter what even if no component prefixes exist.  This can result in black hole routing. BGP has conditional feature to conditionally advertisement based on existence of a route advertisement of downstream neighbor in the BGP RIB.  So with ospf and Isis the summary is in fact conditional similar to a BGP conditional advertisement and in the example given in the draft in section 3.1 when node T2 is down and pt2 becomes unreachable and let’s say that prefix is 1.1.1.1/32 <http://1.1.1.1/32>  and the summary is 1.1.1.0/30 <http://1.1.1.0/30>  and no other component prefix exists within the summary range the prefix will not get adversed.  So there will not be any black hole.  

 

The summary represents all prefixes within the range that would be suppressed with the summary when advertised into the backbone area.  However only at a minimum one prefix must exist in the range for the summary to be generated.  That is done by design as the summary represents all prefixes within the range.  So let’s say there are a 100 prefixes and let’s say a few devices are down and so now only 5 prefixes exist within the range.  By design it is OK for router to generate the summary for the 5 prefixes it is representing and that will not cause any routing loop or black hole.

 

 

I am trying to understand wage gap exists and what we are trying to solve related to summarization in the context of IPv6.  Both IPv4 and IPV6 summarization operates the similarly as far as the requirement of at minimum a single component route within the summary range must exist  as a condition to be present in the RIB before the summary can be advertised.

 

Kind Regards 

 

Gyan

 

On Tue, Aug 4, 2020 at 11:25 PM Aijun Wang <wangaijun@tsinghua.org.cn <mailto:wangaijun@tsinghua.org.cn> > wrote:

Hi, Robert:

 

From: lsr-bounces@ietf.org <mailto:lsr-bounces@ietf.org>  [mailto:lsr-bounces@ietf.org <mailto:lsr-bounces@ietf.org> ] On Behalf Of Robert Raszuk
Sent: Friday, July 31, 2020 6:21 PM
To: Aijun Wang <wangaj3@chinatelecom.cn <mailto:wangaj3@chinatelecom.cn> >
Cc: Peter Psenak <ppsenak=40cisco.com@dmarc.ietf.org <mailto:40cisco.com@dmarc.ietf.org> >; Huzhibo <huzhibo@huawei.com <mailto:huzhibo@huawei.com> >; Aijun Wang <wangaijun@tsinghua.org.cn <mailto:wangaijun@tsinghua.org.cn> >; lsr <lsr@ietf.org <mailto:lsr@ietf.org> >; Acee Lindem (acee) <acee@cisco.com <mailto:acee@cisco.com> >; Xiaoyaqun <xiaoyaqun@huawei.com <mailto:xiaoyaqun@huawei.com> >
Subject: Re: [Lsr] New Version Notification for draft-wang-lsr-prefix-unreachable-annoucement-03.txt

 

[WAJ] Such information is for underlay link state and should be flooded via IGP? The ambiguity arises from IGP summary behavior and should be solved by itself?

 

Well if we look at this problem from a distance while on surface it seems like an IGP issue (not to mention some which use BGP as IGP :) IMO it is only hurting when you have some service overlay on top utilizing the IGP. 

[WAJ] There are situations that the PUA mechanism apply when no service overlay running over IGP.  Scenarios described in  <https://datatracker.ietf.org/doc/html/draft-wang-lsr-prefix-unreachable-annoucement-03#section-3> draft-wang-lsr-prefix-unreachable-annoucement-03#section-3 are not tightly coupled with the overlay service.

 

So typically today if I am running any service with BGP I do count on BGP to remove routes which are no longer reachable. IGP just tells me how to get to the next hop, which direction to go and not if the endpoint (service CPE or PE connected to given CE) is up or down. 

 

So today smart BGP implementations in good network design can use RD based withdraws to very fast (milliseconds) remove the affected service routes. When I said should we do it in BGP I meant to ask WG if this is good enough to quickly remove service routes. If not maybe we should send such affected next hop in BGP to even faster invalidate all routes with such next hop as failing PE. 

 

Bottom line if you think the problem is IGP then I think Acee's comments apply. 

[WAJ] Which comment is not addressed yet?

 

Last - See today you are bringing the case to signal transition to DOWN ... but for some people and applications it may be not enough. In fact UP/DOWN they can get via BGP. But if you have two ABRs and one will due to topology changes in its area suddenly will be forced to reach atomic destination covered by summary over much higher metric path that for applications running above may be much more severe case and not acceptable one too. 

[WAJ] Or else, the application traffic will be broken.

 

And BGP will not remove service routes nor modify best path in any way as summary is masking the real metric to some next hops. So while in the network you may have alternate better native transit paths with a lot of free capacity if you only switch to a different bgp next hop (not talking about any TE at all) you are stuck offering much worse service to your customers. 

[WAJ] if there are other links to reach the affected prefix via the ABR, then this ABR will not send the PUA information.

 

Those cases are starting to be solved by performance routing both at the service itself or at BGP nh levels. Should IGP assist here ... I am not sure.

[WAJ] when node become down, it can only depend on other nodes within the same IGP to send such unreachability information. IGP can certainly help here J

 

 

Many thx,

R.

_______________________________________________
Lsr mailing list
Lsr@ietf.org <mailto:Lsr@ietf.org> 
https://www.ietf.org/mailman/listinfo/lsr

-- 

 <http://www.verizon.com/> 

Gyan Mishra

Network Solutions Architect 

M 301 502-1347
13101 Columbia Pike 
Silver Spring, MD