Re: [Lsr] Thoughts about PUAs - are we not over-engineering?

Aijun Wang <wangaijun@tsinghua.org.cn> Tue, 14 June 2022 10:26 UTC

Return-Path: <wangaijun@tsinghua.org.cn>
X-Original-To: lsr@ietfa.amsl.com
Delivered-To: lsr@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 7B1DFC1649FF; Tue, 14 Jun 2022 03:26:48 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.906
X-Spam-Level:
X-Spam-Status: No, score=-1.906 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, HTML_MESSAGE=0.001, MIME_QP_LONG_LINE=0.001, RCVD_IN_DNSWL_BLOCKED=0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, T_SCC_BODY_TEXT_LINE=-0.01, URIBL_BLOCKED=0.001] autolearn=ham autolearn_force=no
Received: from mail.ietf.org ([50.223.129.194]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id SEQOfVCQq-tx; Tue, 14 Jun 2022 03:26:44 -0700 (PDT)
Received: from mail-m17638.qiye.163.com (mail-m17638.qiye.163.com [59.111.176.38]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 057BBC1649FD; Tue, 14 Jun 2022 03:26:42 -0700 (PDT)
Received: from smtpclient.apple (unknown [221.223.97.163]) by mail-m17638.qiye.163.com (Hmail) with ESMTPA id 308831C008F; Tue, 14 Jun 2022 18:26:39 +0800 (CST)
Content-Type: multipart/alternative; boundary="Apple-Mail-CFCCF293-E2AF-4C00-89AC-5FEACAED1DFE"
Content-Transfer-Encoding: 7bit
From: Aijun Wang <wangaijun@tsinghua.org.cn>
Mime-Version: 1.0 (1.0)
Message-Id: <8850AB94-C2B9-4883-A796-2AE90BB4FB93@tsinghua.org.cn>
Date: Tue, 14 Jun 2022 18:26:38 +0800
Cc: lsr <lsr@ietf.org>, draft-ppsenak-lsr-igp-ureach-prefix-announce@ietf.org, draft-wang-lsr-prefix-unreachable-annoucement <draft-wang-lsr-prefix-unreachable-annoucement@ietf.org>
To: "Van De Velde, Gunter (Nokia - BE/Antwerp)" <gunter.van_de_velde@nokia.com>
X-Mailer: iPhone Mail (19F77)
X-HM-Spam-Status: e1kfGhgUHx5ZQUtXWQgPGg8OCBgUHx5ZQUlOS1dZCBgUCR5ZQVlLVUtZV1 kWDxoPAgseWUFZKDYvK1lXWShZQUpMS0tKN1dZLVlBSVdZDwkaFQgSH1lBWRoeSBlWGE9IGUIdSk 0eGktNVRMBExYaEhckFA4PWVdZFhoPEhUdFFlBWU9LSFVKSktITk9VS1kG
X-HM-Sender-Digest: e1kMHhlZQR0aFwgeV1kSHx4VD1lBWUc6PjY6OTo*Nj01MQMIIU4aCAwN Lj0wChdVSlVKTU5OSUtJSEJCTkJPVTMWGhIXVQwaFRwaEhEOFTsPCBIVHBMOGlUUCRxVGBVFWVdZ EgtZQVlJSUpVSUlIVUJMVUpNSFlXWQgBWUFKS05MTzcG
X-HM-Tid: 0a8161c043e1d993kuws308831c008f
Archived-At: <https://mailarchive.ietf.org/arch/msg/lsr/p4spGWUYCoZ3EZEy4TJLqfYuVaI>
Subject: Re: [Lsr] Thoughts about PUAs - are we not over-engineering?
X-BeenThere: lsr@ietf.org
X-Mailman-Version: 2.1.39
Precedence: list
List-Id: Link State Routing Working Group <lsr.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/lsr>, <mailto:lsr-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/lsr/>
List-Post: <mailto:lsr@ietf.org>
List-Help: <mailto:lsr-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/lsr>, <mailto:lsr-request@ietf.org?subject=subscribe>
X-List-Received-Date: Tue, 14 Jun 2022 10:26:48 -0000

Hi, Gunter:

Let me try to answer some of your concerns.

The reason that we prefer to the Summary+PUA/UPA solution is that the node failure(which is the main scenario that we focus now) is one rarely thing in the network. Then the unreachable event triggered mechanism is more efficient than advertising all of the node’s reachable address. This point has been discussed in the mail list in past.

In the https://datatracker.ietf.org/doc/html/draft-wang-lsr-prefix-unreachable-annoucement-09#section-8, we have illustrated how to control the advertisement of PUA message on the ABR. If this can’t settle your concerns, we can consider more policy on the ABR.

Regarding to the tracking and representation of PUA in RTM, we have proposed in the earlier version of this draft, that is to install one black hole route to the specified detailed prefix.

The reason that PUA requires routers within one area to be upgraded is that it want to avoid the situations when the router doesn’t recognize PUA message and misbehave. We are considering the convergence of PUA/UPA solutions, which may relax such requirements during deployment.


Aijun Wang
China Telecom

> On Jun 14, 2022, at 16:59, Van De Velde, Gunter (Nokia - BE/Antwerp) <gunter.van_de_velde@nokia.com> wrote:
> Hi All,
> 
> When reading both proposals about PUA's:
> * draft-ppsenak-lsr-igp-ureach-prefix-announce-00
> * draft-wang-lsr-prefix-unreachable-annoucement-09
> 
> The identified problem space seems a correct observation, and indeed summaries hide remote area network instabilities. It is one of the perceived benefits of using summaries. The place in the network where this hiding takes the most impact upon convergence is at service nodes (PE's for L3/L2/transport) where due to the summarization its difficult to detect that the transport tunnel end-point suddenly becomes unreachable. My concern however is if it really is a problem that is worthy for LSR WG to solve.
> 
> To me the "draft draft-wang-lsr-prefix-unreachable-annoucement-09" is not a preferred solution due to the expectation that all nodes in an area must be upgraded to support the IGP capability. From this operational perspective the draft "draft-ppsenak-lsr-igp-ureach-prefix-announce-00" is more elegant, as only the A(S)BR's and particular PEs must be upgraded to support PUA's. I do have concerns about the number of PUA advertisements in hierarchically summarized networks (/24 (site) -> /20 (region) -> /16 (core)). More specific, in the /16 backbone area, how many of these PUAs will be floating around creating LSP LSDB update churns? How to control the potentially exponential number of observed PUAs from floating everywhere? (will this lead to OSPF type NSSA areas where areas will be purged from these PUAs for scaling stability?)
> 
> Long story short, should we not take a step back and re-think this identified problem space? Is the proposed solution space not more evil as the problem space? We do summarization because it brings stability and reduce the number of link state updates within an area. And now with PUA we re-introduce additional link state updates (PUAs), we blow up the LSDB with information opaque to SPF best-path calculation. In addition there is suggestion of new state-machinery to track the igp reachability of 'protected' prefixes and there is maybe desire to contain or filter updates cross inter-area boundaries. And finally, how will we represent and track PUA in the RTM?
> 
> What is wrong with simply not doing summaries and forget about these PUAs to pinch holes in the summary prefixes? this worked very well during last two decennia. Are we not over-engineering with PUAs?
> 
> G/
> 
> _______________________________________________
> Lsr mailing list
> Lsr@ietf.org
> https://www.ietf.org/mailman/listinfo/lsr