Re: [Lsr] Thoughts about PUAs - are we not over-engineering?

"Voyer, Daniel" <daniel.voyer@bell.ca> Wed, 15 June 2022 20:37 UTC

Return-Path: <prvs=158ad78cd=daniel.voyer@bell.ca>
X-Original-To: lsr@ietfa.amsl.com
Delivered-To: lsr@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 76448C15AAD5; Wed, 15 Jun 2022 13:37:49 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.106
X-Spam-Level:
X-Spam-Status: No, score=-2.106 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, RCVD_IN_DNSWL_BLOCKED=0.001, SPF_HELO_NONE=0.001, SPF_NONE=0.001, T_SCC_BODY_TEXT_LINE=-0.01, URIBL_BLOCKED=0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=bell.ca
Received: from mail.ietf.org ([50.223.129.194]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id s6j4thMSQKoL; Wed, 15 Jun 2022 13:37:45 -0700 (PDT)
Received: from ESA1-Wyn.bell.ca (esa1-wyn.bell.ca [67.69.243.161]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id D1E1DC15AAD6; Wed, 15 Jun 2022 13:37:44 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=bell.ca; i=@bell.ca; q=dns/txt; s=ESAcorp; t=1655325465; x=1686861465; h=from:to:cc:date:message-id:content-id: content-transfer-encoding:mime-version:subject; bh=OH0erBlNGSlZJd3fBgopVEvuBl3LdvNjM4mP8MgS8Ho=; b=RWDuY1jJS0FNjxVAiti2202bG65vYbx+bTGOpvU6KEOE4jEUOYqHjo98 6Mdv17ifKVtjjBNDbLgLlCwsa02i/oIzYSZMATBjCzlr66OAypZxQYG+e 5LrLG2TEwxjUzG/XGYsCVPWP6pFhmmQ4lJb5kv3lzRFKZE7v2zFWVeK7Y 10vnEsfU3oUdNfytbWLV0b60SVdkvWce5KWa9q7jugW9xPlSdLxDpQ94L cD4FIoxsl8g1lQfrtFizAClRcC6YOYQiS3xK98HlVOHBh79VJOrhDXZ5Z ws3B/2okCKgfGbdnGr4TQz5R2Ma3NK2IPSOV3S0o3oQ4Avkn+UUY38nbp Q==;
Received: from dm5cch-d00.bellca.int.bell.ca (HELO DG1MBX01-WYN.bell.corp.bce.ca) ([198.235.102.30]) by esa01corp-wyn.bell.corp.bce.ca with ESMTP; 15 Jun 2022 16:37:43 -0400
Received: from DG4MBX03-WYN.bell.corp.bce.ca (142.182.18.29) by DG1MBX01-WYN.bell.corp.bce.ca (142.182.18.11) with Microsoft SMTP Server (TLS) id 15.0.1497.32; Wed, 15 Jun 2022 16:37:42 -0400
Received: from DG4MBX01-WYN.bell.corp.bce.ca (142.182.18.27) by DG4MBX03-WYN.bell.corp.bce.ca (142.182.18.29) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256) id 15.1.2375.24; Wed, 15 Jun 2022 13:37:42 -0700
Received: from DG4MBX01-WYN.bell.corp.bce.ca ([fe80::cc16:e2a3:3b5b:a2cf]) by DG4MBX01-WYN.bell.corp.bce.ca ([fe80::cc16:e2a3:3b5b:a2cf%5]) with mapi id 15.01.2375.024; Wed, 15 Jun 2022 16:37:42 -0400
From: "Voyer, Daniel" <daniel.voyer@bell.ca>
To: "Van De Velde, Gunter (Nokia - BE/Antwerp)" <gunter.van_de_velde@nokia.com>, lsr <lsr@ietf.org>
CC: "draft-ppsenak-lsr-igp-ureach-prefix-announce@ietf.org" <draft-ppsenak-lsr-igp-ureach-prefix-announce@ietf.org>, draft-wang-lsr-prefix-unreachable-annoucement <draft-wang-lsr-prefix-unreachable-annoucement@ietf.org>
Thread-Topic: [EXT]Thoughts about PUAs - are we not over-engineering?
Thread-Index: AQHYgPfCRvMpvKeaUUKkVHosuUFhKg==
Date: Wed, 15 Jun 2022 20:37:42 +0000
Message-ID: <91993C73-1981-476C-9D79-E82128F0E092@bell.ca>
Accept-Language: en-US
Content-Language: en-US
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
user-agent: Microsoft-MacOutlook/16.61.22050700
x-originating-ip: [172.28.239.65]
Content-Type: text/plain; charset="utf-8"
Content-ID: <45C59057D6FBE14786FC1EF8B3A5637F@exchange.bell.ca>
Content-Transfer-Encoding: base64
MIME-Version: 1.0
X-CFilter-Loop: Reflected
Archived-At: <https://mailarchive.ietf.org/arch/msg/lsr/BZQukFCZvpct2rpckTHbw8FL5Fs>
Subject: Re: [Lsr] Thoughts about PUAs - are we not over-engineering?
X-BeenThere: lsr@ietf.org
X-Mailman-Version: 2.1.39
Precedence: list
List-Id: Link State Routing Working Group <lsr.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/lsr>, <mailto:lsr-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/lsr/>
List-Post: <mailto:lsr@ietf.org>
List-Help: <mailto:lsr-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/lsr>, <mailto:lsr-request@ietf.org?subject=subscribe>
X-List-Received-Date: Wed, 15 Jun 2022 20:37:49 -0000

Hi Gunter,

Thanks for your comments,

The idea, here, with summarization is to "reduce" the LSDB quite a lots and make a given backbone much more scalable / flexible and allow to simplify NNI's within that given backbones considerably. 
Summarization is "needed" for better scale and, in the context of IPv6, will help in preventing blowing up the IGP.  With the size of an IPv6 prefix range (ex. /64) allocated per domain - summarization will help to contain the LSDB to that domain.

What we are "highlighting" in draft-ppsenak-lsr-igp-ureach-prefix-announce-00, is an easy way to overcome the fact that PEs are hidden behind a summary route and need a fast way to notify other PEs when they become unreachable. 

I don't see "over-engineering" here, I see "optimal-engineering" instead. 

Thanks
Dan

On 2022-06-14, 4:59 AM, "Van De Velde, Gunter (Nokia - BE/Antwerp)" <gunter.van_de_velde@nokia.com> wrote:

    Hi All,

    When reading both proposals about PUA's:
    * draft-ppsenak-lsr-igp-ureach-prefix-announce-00
    * draft-wang-lsr-prefix-unreachable-annoucement-09

    The identified problem space seems a correct observation, and indeed summaries hide remote area network instabilities. It is one of the perceived benefits of using summaries. The place in the network where this hiding takes the most impact upon convergence is at service nodes (PE's for L3/L2/transport) where due to the summarization its difficult to detect that the transport tunnel end-point suddenly becomes unreachable. My concern however is if it really is a problem that is worthy for LSR WG to solve.

    To me the "draft draft-wang-lsr-prefix-unreachable-annoucement-09" is not a preferred solution due to the expectation that all nodes in an area must be upgraded to support the IGP capability. From this operational perspective the draft "draft-ppsenak-lsr-igp-ureach-prefix-announce-00" is more elegant, as only the A(S)BR's and particular PEs must be upgraded to support PUA's. I do have concerns about the number of PUA advertisements in hierarchically summarized networks (/24 (site) -> /20 (region) -> /16 (core)). More specific, in the /16 backbone area, how many of these PUAs will be floating around creating LSP LSDB update churns? How to control the potentially exponential number of observed PUAs from floating everywhere? (will this lead to OSPF type NSSA areas where areas will be purged from these PUAs for scaling stability?)

    Long story short, should we not take a step back and re-think this identified problem space? Is the proposed solution space not more evil as the problem space? We do summarization because it brings stability and reduce the number of link state updates within an area. And now with PUA we re-introduce additional link state updates (PUAs), we blow up the LSDB with information opaque to SPF best-path calculation. In addition there is suggestion of new state-machinery to track the igp reachability of 'protected' prefixes and there is maybe desire to contain or filter updates cross inter-area boundaries. And finally, how will we represent and track PUA in the RTM?

    What is wrong with simply not doing summaries and forget about these PUAs to pinch holes in the summary prefixes? this worked very well during last two decennia. Are we not over-engineering with PUAs?

    G/
    ------------------------------------------------------------------------------
    External Email: Please use caution when opening links and attachments / Courriel externe: Soyez prudent avec les liens et documents joints