Re: [Lsr] Thoughts about PUAs - are we not over-engineering?

Peter Psenak <ppsenak@cisco.com> Wed, 15 June 2022 11:18 UTC

Return-Path: <ppsenak@cisco.com>
X-Original-To: lsr@ietfa.amsl.com
Delivered-To: lsr@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 38972C15BEC8 for <lsr@ietfa.amsl.com>; Wed, 15 Jun 2022 04:18:15 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -12.229
X-Spam-Level:
X-Spam-Status: No, score=-12.229 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIMWL_WL_HIGH=-0.745, DKIMWL_WL_MED=-0.001, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, NICE_REPLY_A=-1.876, RCVD_IN_DNSWL_BLOCKED=0.001, SPF_NONE=0.001, T_SCC_BODY_TEXT_LINE=-0.01, URIBL_BLOCKED=0.001, USER_IN_DEF_DKIM_WL=-7.5] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (1024-bit key) header.d=cisco.com
Received: from mail.ietf.org ([50.223.129.194]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id rYq8JGdqH-MQ for <lsr@ietfa.amsl.com>; Wed, 15 Jun 2022 04:18:11 -0700 (PDT)
Received: from aer-iport-4.cisco.com (aer-iport-4.cisco.com [173.38.203.54]) (using TLSv1.2 with cipher DHE-RSA-SEED-SHA (128/128 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 1877FC159486 for <lsr@ietf.org>; Wed, 15 Jun 2022 04:18:08 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=cisco.com; i=@cisco.com; l=7629; q=dns/txt; s=iport; t=1655291891; x=1656501491; h=message-id:date:mime-version:subject:to:cc:references: from:in-reply-to:content-transfer-encoding; bh=jKgOm48eS9GLmc8gLC+pSq003vhgydh1TVP/zhpslms=; b=EFx9bTpTYtYk7xaMXHL3OLelDMnIlzfCMauVG5umP6teI8ibYM19r64E SeuOpBH+QGfp/RYKylqu2KLGyz+sQFVlO7s4YqCbMmNSOjpsM7LkQspht 3pYk6KMazCsJOufNfjoPPKtSNLptEVIntE2rIZFdL8lbOJHHjDAjtx1+s g=;
X-IPAS-Result: A0AFAACfv6lilxbLJq1aGwEBAQEBAQEBBQEBARIBAQEDAwEBAUCBPQQBAQELAYF7gSlVLBJEhE6JAIgMA5xrgXwLAQEBDywNCQQBAYUDAoVJJjYHDgECBAEBAQEDAgMBAQEBAQEDAQEFAQEBAgEHBBQBAQEBAQEBAQkbBgwFEDWFaA2GQgEBAQECAQEBIQQLAQU2CwULCxEEAQEBAgIjAwICJx8JCAYNBgIBAReCClgBgnUjAw+sTHp/MoEBg2VBg3OBXwaBESwBhh9ch3dDgUlEgRQBJwyCRzA+gmIBAQIBgTQaNYMwgmUEjmSFCYUCJgQPAxotNBKBIXEBCAYDAwcKBTIGAgwYFAQCExJTHQISBQcKHA4UHCQZDA8DEgMRAQcCCxIIFSwIAwIDCAMCAyMLAgMXCQcKAx0IChwSEBQCBBMeCwgDGR8sCQIEDgNDCAsKAxEEAxMYCxYIEAQGAwkvDSgLAxQNAQYDBgIFBQEDIAMUAwUnBwMhBwsmDQ0EHAcdAwMFJgMCAhsHAgIDAgYXBgICbwomDQgECAQcHSQQBQIHMQUELwIeBAUGEQkCFgIGBAUCBAQWAgISCAIIJxsHFhkdGQEFXQYLCSEcCh8LBgUGFgMjcwUKPg8pNTY8LyEbCoEgBiIBGwKVIIQ4ERVUD00EHTQCTwELIApAEGgFkiksLa0bgReDWIQZhwaRWoMFBg8ELYN1jEGGMZF5hyKPSoJLimKUSwiFM4FoCIIGMxoIGxU7gmhRGQ+OLA0JiGSFTEIxAjkCBgEKAQEDCYw5AiaCIAEB
IronPort-Data: A9a23:LN1QyKsqcn7jCzBjTtO/jkzDoOfnVIleMUV32f8akzHdYApBsoF/q tZmKWmObPvcM2LwLtp0b4iw90kEu8Lcz4I1Sldl/3gzFXwXgMeUXt7xwmUckM+xwmwvaGo9s q3yv/GZdJhcokf0/0vrav67xZVF/fngqoDUUIYoAQgsA149IMsdoUg7wbRh3Nc12YHR7z6l4 LseneWOYDdJ5BYsWo4kw/rrRMRH5amaVJsw5zTSVNgT1LPsvyB94KE3ecldG0DFrrx8RYZWc QpsIIaRpQs19z91Yj+sfy2SnkciGtY+NiDW4pZatjTLbhVq/kQPPqgH2PU0ak5bsxHWjY1Iw 5ZvlrOXFykZZf3TsbFIO/VYO3kW0axu8bLdZHO4q8HWkwvNcmDnxLNlC0Re0Y8wo7ksRzoes 6ZAc3ZXNHhvhMruqF6/YvF0ncklJcrDN4IEsXYmxjbcZRojacqfHv+Uv48wMDEYj+luDMfUS s4gYisyPUX7XAYeHBBPMcdr9AuvriCvL2IHwL6PnoI7+WHd0Elu2b7idd7OYJmSX8hbn1bdu 2Tc8m3yAhgQMNG3yDeZ/DSrnOCntS/hUYwOUaWg7fNsiV7WwncJTRwZTR6mu/SwgU77Q8hSJ U0E0isjsaZ081akJvH5RAeQoXOYsFgbQdU4O/c94gyX1oLO6h2LG2tCSDNdAPQpqckqSCZsy FaPqN/kMiJitLGYYXSa9bad6zi1PEA9MmYEaiYFTxYDy8X4oYc8jg/GCNF5H+i+iLXI9SrYy j2Q6SkmgK8PyMgCy+Ow/EvMhHSnoZ2hohMJChv/XWup1icmdtOcbM+Kynf8s9hwd9aGZwzU1 JQboPS24OcLBJCLsSWCRuQRAb2kj8q43C3gbU1HRMZ+qmz8k5K3VcUBv2EkfRYB3tMsIGexO CfuVRVtCIi/1UZGjJObgartW6zGLoC5S7wJs8w4ifIUO/BMmPevpn0GWKJp9zmFfLIQua8+I 4yHVs2nEGwXD69qpBLvGbpAi+RymnpvlDyILXwe8/hB+efDDJJyYepbWGZikshihE95iFyPq o0GZ5fiJ+t3CbalOEE7DrL/3XhTfSRkWvgaWuRcd/WIJUJ9CXo9BvrKqY7NiKQ795m5Ytzgp ynnMmcBkQKXrSSeeW2iNyAyAJuyDM0XhS9qYkQR0aOAhiFLjXCHt/xEKfPavNAPqYRe8BKDZ 6ZcJZraWK4eFlwqOV01NPHAkWCrTzzz7SrmAsZvSGFXk0JIL+ARxuLZQw==
IronPort-HdrOrdr: A9a23:6AU+qK5iSCylNJQSXgPXwPnXdLJyesId70hD6qm+c203TiXqra GTdZMgpHnJYVcqKRYdcL+7VZVoLUmskKKdpLNhWYtKPzOLhILLFutfBOLZqlWKJ8S9zJ8+6U 4KScZD4bPLbWSSwfyU3OF9eOxQuOVuN8uT9J7j80s=
X-IronPort-Anti-Spam-Filtered: true
X-IronPort-AV: E=Sophos;i="5.91,302,1647302400"; d="scan'208";a="2452628"
Received: from aer-iport-nat.cisco.com (HELO aer-core-4.cisco.com) ([173.38.203.22]) by aer-iport-4.cisco.com with ESMTP/TLS/DHE-RSA-SEED-SHA; 15 Jun 2022 11:18:06 +0000
Received: from [10.147.24.42] ([10.147.24.42]) by aer-core-4.cisco.com (8.15.2/8.15.2) with ESMTP id 25FBI5Cp004969; Wed, 15 Jun 2022 11:18:06 GMT
Message-ID: <a71b7df5-4f15-a3ca-6783-3304dacd945b@cisco.com>
Date: Wed, 15 Jun 2022 13:18:05 +0200
MIME-Version: 1.0
User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.15; rv:91.0) Gecko/20100101 Thunderbird/91.9.1
Content-Language: en-US
To: Aijun Wang <wangaijun@tsinghua.org.cn>
Cc: "Van De Velde, Gunter (Nokia - BE/Antwerp)" <gunter.van_de_velde@nokia.com>, Robert Raszuk <robert@raszuk.net>, lsr <lsr@ietf.org>, draft-ppsenak-lsr-igp-ureach-prefix@ietf.org, draft-wang-lsr-prefix-unreachable@ietf.org
References: <41ba8fd6-ab16-6278-ba22-91a6a632ed33@cisco.com> <B8F7E718-28A3-4F97-A171-72774F8F1ACF@tsinghua.org.cn>
From: Peter Psenak <ppsenak@cisco.com>
In-Reply-To: <B8F7E718-28A3-4F97-A171-72774F8F1ACF@tsinghua.org.cn>
Content-Type: text/plain; charset="UTF-8"; format="flowed"
Content-Transfer-Encoding: 8bit
X-Outbound-SMTP-Client: 10.147.24.42, [10.147.24.42]
X-Outbound-Node: aer-core-4.cisco.com
Archived-At: <https://mailarchive.ietf.org/arch/msg/lsr/8hfaZCAZAJiODcsjWzXq0UixS_s>
Subject: Re: [Lsr] Thoughts about PUAs - are we not over-engineering?
X-BeenThere: lsr@ietf.org
X-Mailman-Version: 2.1.39
Precedence: list
List-Id: Link State Routing Working Group <lsr.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/lsr>, <mailto:lsr-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/lsr/>
List-Post: <mailto:lsr@ietf.org>
List-Help: <mailto:lsr-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/lsr>, <mailto:lsr-request@ietf.org?subject=subscribe>
X-List-Received-Date: Wed, 15 Jun 2022 11:18:15 -0000

Aijun,

On 15/06/2022 12:12, Aijun Wang wrote:
> Hi, Peter:
> 
> If you use LSInfinity as the indicator of the prefixes unreachable, then 
> how about you solve the situations that described in 
> https://datatracker.ietf.org/doc/html/draft-ietf-lsr-ip-flexalgo#section-6.4 
> <https://datatracker.ietf.org/doc/html/draft-ietf-lsr-ip-flexalgo#section-6.4>, 
> in which the the metric in parent TLV MUST be set to LSInfinity?

if the IP Algorithm Prefix Reachability Sub-TLV is present the metric 
from that Sub-TLV is used instead. There is no problem.

> Will you consider all such prefixes unreachable? This is certainly not 
> the aim of the IP FlexAlgo document.
> 
> In conclusion, the prefixes unreachable information should be indicated 
> explicitly by other means, as that introduced in the PUA draft.

the meaning of LSInfinity has been defined decades ago. No matter how 
much you may not like it, but it means unreachable.

thanks,
Peter

> 
> Aijun Wang
> China Telecom
> 
>> On Jun 15, 2022, at 17:09, Peter Psenak 
>> <ppsenak=40cisco.com@dmarc.ietf.org> wrote:
>>
>> Hi Gunter,
>>
>> On 15/06/2022 11:02, Van De Velde, Gunter (Nokia - BE/Antwerp) wrote:
>>> Hi Robert,
>>> I agree with you that the operator problem space is not limited to 
>>> multi-area/levels with IGP summarisation.
>>> With the PUA/UPA proposals I get the feeling that LSR WG is jumping 
>>> into the deep-end and is re-vectoring the IGP to carry opaque 
>>> information not used for SPF/cSPF.
>>> I believe we should be conservative for such and if LSR WG progresses 
>>> with such decision.
>>
>> please note that UPA draft builds on existing protocol specification 
>> defined in RFC5305 and RFC5308 that allow the metric larger then 
>> MAX_PATH_METRIC to be used "for purposes other than building the 
>> normal IP routing table". We are just documenting one of them.
>>
>> thanks,
>> Peter
>>
>>
>>> It could very well be that re-vectoring is the best solution, but I 
>>> guess we need to agree first on understanding the operator problem space.
>>> G/
>>> *From:*Robert Raszuk <robert@raszuk.net>
>>> *Sent:* Tuesday, June 14, 2022 11:51 AM
>>> *To:* Van De Velde, Gunter (Nokia - BE/Antwerp) 
>>> <gunter.van_de_velde@nokia.com>
>>> *Cc:* lsr <lsr@ietf.org>; 
>>> draft-ppsenak-lsr-igp-ureach-prefix-announce@ietf.org; 
>>> draft-wang-lsr-prefix-unreachable-annoucement 
>>> <draft-wang-lsr-prefix-unreachable-annoucement@ietf.org>
>>> *Subject:* Re: [Lsr] Thoughts about PUAs - are we not over-engineering?
>>> Hello Gunter,
>>> I agree with pretty much all you said except the conclusion - do 
>>> nothing :).
>>> To me if you need to accelerate connectivity restoration upon an 
>>> unlikely event like a complete PE failure the right vehicle to signal 
>>> this is within the service layer itself. Let's keep in mind that 
>>> links do fail a lot in the networks - routers do not (or they do it 
>>> is multiple orders of magnitude less frequent event). Especially 
>>> links on the PE-CE boundaries do fail a lot.
>>> Removal of next hop reachability can be done with BGP and based on 
>>> BGP native recursion will have the exact same effect as presented 
>>> ideas. Moreover it will be stateful for the endpoints which again to 
>>> me is a feature not a bug.
>>> Some suggested to define a new extension in BGP to signal it even 
>>> without using double recursion - well one of them has been proposed 
>>> in the past - 
>>> https://www.ietf.org/archive/id/draft-raszuk-aggr-withdraw-00.txt 
>>> <https://www.ietf.org/archive/id/draft-raszuk-aggr-withdraw-00.txt> 
>>> At that time the feedback received was that native BGP withdraws are 
>>> fast enough so no need to bother. Well those native withdrawals are 
>>> working today as well as some claim that specific implementations can 
>>> withdraw RD:* when PE hosting such RDs fail and RDs are allocated in 
>>> a unique per VRF fashion.
>>> Then we have the DROID proposal which again may look like overkill 
>>> for this very problem, but if you consider the bigger picture of what 
>>> networks control plane pub-sub signalling needs, it establishes the 
>>> foundation for such.
>>> Many thanks,
>>> Robert
>>> On Tue, Jun 14, 2022 at 10:59 AM Van De Velde, Gunter (Nokia - 
>>> BE/Antwerp) <gunter.van_de_velde@nokia.com 
>>> <mailto:gunter.van_de_velde@nokia.com>> wrote:
>>>    Hi All,
>>>    When reading both proposals about PUA's:
>>>    * draft-ppsenak-lsr-igp-ureach-prefix-announce-00
>>>    * draft-wang-lsr-prefix-unreachable-annoucement-09
>>>    The identified problem space seems a correct observation, and indeed
>>>    summaries hide remote area network instabilities. It is one of the
>>>    perceived benefits of using summaries. The place in the network
>>>    where this hiding takes the most impact upon convergence is at
>>>    service nodes (PE's for L3/L2/transport) where due to the
>>>    summarization its difficult to detect that the transport tunnel
>>>    end-point suddenly becomes unreachable. My concern however is if it
>>>    really is a problem that is worthy for LSR WG to solve.
>>>    To me the "draft draft-wang-lsr-prefix-unreachable-annoucement-09"
>>>    is not a preferred solution due to the expectation that all nodes in
>>>    an area must be upgraded to support the IGP capability. From this
>>>    operational perspective the draft
>>>    "draft-ppsenak-lsr-igp-ureach-prefix-announce-00" is more elegant,
>>>    as only the A(S)BR's and particular PEs must be upgraded to support
>>>    PUA's. I do have concerns about the number of PUA advertisements in
>>>    hierarchically summarized networks (/24 (site) -> /20 (region) ->
>>>    /16 (core)). More specific, in the /16 backbone area, how many of
>>>    these PUAs will be floating around creating LSP LSDB update churns?
>>>    How to control the potentially exponential number of observed PUAs
>>>    from floating everywhere? (will this lead to OSPF type NSSA areas
>>>    where areas will be purged from these PUAs for scaling stability?)
>>>    Long story short, should we not take a step back and re-think this
>>>    identified problem space? Is the proposed solution space not more
>>>    evil as the problem space? We do summarization because it brings
>>>    stability and reduce the number of link state updates within an
>>>    area. And now with PUA we re-introduce additional link state updates
>>>    (PUAs), we blow up the LSDB with information opaque to SPF best-path
>>>    calculation. In addition there is suggestion of new state-machinery
>>>    to track the igp reachability of 'protected' prefixes and there is
>>>    maybe desire to contain or filter updates cross inter-area
>>>    boundaries. And finally, how will we represent and track PUA in 
>>> the RTM?
>>>    What is wrong with simply not doing summaries and forget about these
>>>    PUAs to pinch holes in the summary prefixes? this worked very well
>>>    during last two decennia. Are we not over-engineering with PUAs?
>>>    G/
>>>    _______________________________________________
>>>    Lsr mailing list
>>>    Lsr@ietf.org <mailto:Lsr@ietf.org>
>>>    https://www.ietf.org/mailman/listinfo/lsr
>>>    <https://www.ietf.org/mailman/listinfo/lsr>
>>
>> _______________________________________________
>> Lsr mailing list
>> Lsr@ietf.org
>> https://www.ietf.org/mailman/listinfo/lsr