Re: [Lsr] Thoughts about PUAs - are we not over-engineering?

"Van De Velde, Gunter (Nokia - BE/Antwerp)" <gunter.van_de_velde@nokia.com> Wed, 15 June 2022 08:38 UTC

Return-Path: <gunter.van_de_velde@nokia.com>
X-Original-To: lsr@ietfa.amsl.com
Delivered-To: lsr@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 395D0C15D862; Wed, 15 Jun 2022 01:38:37 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -7.655
X-Spam-Level:
X-Spam-Status: No, score=-7.655 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIMWL_WL_HIGH=-0.745, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, RCVD_IN_DNSWL_HI=-5, RCVD_IN_MSPIKE_H2=-0.001, SPF_NONE=0.001, T_SCC_BODY_TEXT_LINE=-0.01] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (1024-bit key) header.d=nokia.onmicrosoft.com
Received: from mail.ietf.org ([50.223.129.194]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 8wnRBK2eFny7; Wed, 15 Jun 2022 01:38:35 -0700 (PDT)
Received: from EUR05-VI1-obe.outbound.protection.outlook.com (mail-vi1eur05on2131.outbound.protection.outlook.com [40.107.21.131]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 94F3AC14F733; Wed, 15 Jun 2022 01:38:33 -0700 (PDT)
ARC-Seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=ZiP0ZMuXGK/wuLonpG2hIFCMcIH8M51ME7cAbow5TwQbuUt9EAxHGnD5vpyWSBrTJ6v9TPlYQ/HnNymUBXWkr/pI7ks/glr1LPwK25ETHaPUWtUOIn0Ur0qJ26jQmJZ3GDStMSVMNGIbIOfPMMGwLjC/PeozO6em5j2Mn7Kwoqe693pirccHW67qEqM0mksCaB2AOFBIe91C2iYCXW64D85BMf8VdvjpY1CeP3Q4yBX3bXOgzVTv5g/G9Rv/FlBPDp9loEivqoe06bUzapk78MYCU7lB/0zpxBJ7FDiGw9rnWp03zVZB9vfgpLUPAbZZIHQ9/8Oq1Pyj7OYZyOh5gA==
ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-AntiSpam-MessageData-ChunkCount:X-MS-Exchange-AntiSpam-MessageData-0:X-MS-Exchange-AntiSpam-MessageData-1; bh=YPHsH9xmBnanCQD7pQTky+zgJ7VWSwNLNEOlUfWfkXU=; b=dhg9yy1HOSmuMlMFk6e1IaT1ErwhwMxo5gaIClane7F8c0sm+ifLrk/0rHreBTEcncByidtDAH8DoxeGS75913qhUdnmOoDfSL0c1fRv+2+OqivjSSeIY0xSZeq+8ykfSAZ35mPB/7eWMbf/20gb7Ug006wTut3j05dqBfL6UY7Ip23UJ3NVPJ9vxp1PuAkGGyb/vwmoxIUrw00+U5OvThqxeR5sye9G3U3cw/0W0nQHJJRdwVCvBQE6YS9Y9Rtdb5JZxX2QOYC/jTRflPPeuQK3I9STWzvAaUGJnA20o57LXpMczna9Pljl3qtOH2oN1mRuKtdviEArYAeKAf/qqA==
ARC-Authentication-Results: i=1; mx.microsoft.com 1; spf=pass smtp.mailfrom=nokia.com; dmarc=pass action=none header.from=nokia.com; dkim=pass header.d=nokia.com; arc=none
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=nokia.onmicrosoft.com; s=selector1-nokia-onmicrosoft-com; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=YPHsH9xmBnanCQD7pQTky+zgJ7VWSwNLNEOlUfWfkXU=; b=eUAcHguU8VP1rfSQsdk7I+91tANyMaHVqx9piQjXuz3ZQWyTZH4mFgmBw/IH/JqM7sNz1vXDORbEgxrXB4yW8VUjr6V3vDBnGscglWV95Ipxg0env2UWF2ebdMftdj2HlxH3h1aUqGHPbR3++wcAvVgT0iWC18YxGn5Jnir7hQw=
Received: from AM0PR07MB6386.eurprd07.prod.outlook.com (2603:10a6:20b:144::23) by AM4PR07MB3332.eurprd07.prod.outlook.com (2603:10a6:205:a::13) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.5353.14; Wed, 15 Jun 2022 08:38:29 +0000
Received: from AM0PR07MB6386.eurprd07.prod.outlook.com ([fe80::8d52:6c9e:86f2:d305]) by AM0PR07MB6386.eurprd07.prod.outlook.com ([fe80::8d52:6c9e:86f2:d305%6]) with mapi id 15.20.5353.014; Wed, 15 Jun 2022 08:38:29 +0000
From: "Van De Velde, Gunter (Nokia - BE/Antwerp)" <gunter.van_de_velde@nokia.com>
To: Peter Psenak <ppsenak@cisco.com>, lsr <lsr@ietf.org>
CC: "draft-ppsenak-lsr-igp-ureach-prefix-announce@ietf.org" <draft-ppsenak-lsr-igp-ureach-prefix-announce@ietf.org>, "draft-wang-lsr-prefix-unreachable@ietf.org" <draft-wang-lsr-prefix-unreachable@ietf.org>
Thread-Topic: Thoughts about PUAs - are we not over-engineering?
Thread-Index: Adh/zPh4Y/SpJPjaRJe30Bd2YQgaiAAONZQAAB9DLxA=
Date: Wed, 15 Jun 2022 08:38:29 +0000
Message-ID: <AM0PR07MB6386AD4F6970AA87A9151E6DE0AD9@AM0PR07MB6386.eurprd07.prod.outlook.com>
References: <AM0PR07MB63863359D147F9EC0FF67689E0AA9@AM0PR07MB6386.eurprd07.prod.outlook.com> <16e06718-542f-e266-05fd-a1822bc4fd49@cisco.com>
In-Reply-To: <16e06718-542f-e266-05fd-a1822bc4fd49@cisco.com>
Accept-Language: en-US
Content-Language: en-US
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
authentication-results: dkim=none (message not signed) header.d=none;dmarc=none action=none header.from=nokia.com;
x-ms-publictraffictype: Email
x-ms-office365-filtering-correlation-id: 70abbc40-da86-425b-2f27-08da4eaa6c5d
x-ms-traffictypediagnostic: AM4PR07MB3332:EE_
x-microsoft-antispam-prvs: <AM4PR07MB33328E3208C68F42ED92CAACE0AD9@AM4PR07MB3332.eurprd07.prod.outlook.com>
x-ms-exchange-senderadcheck: 1
x-ms-exchange-antispam-relay: 0
x-microsoft-antispam: BCL:0;
x-microsoft-antispam-message-info: Q0zD8BRsk3+kT7aNRUH/bDXvQ8S3I7gHDwpBBitX8F+E/k/emDh9uqyjAB2peDDm9T7YLisvU0Yaq26HQdvCAz8Aj/pfIG4sJpy6jYMJun4R47IX0eaUJH/JbAlcvnudi9fDZGsJoIFC/roSV6i8HqLzWPjCO7qPz3NxBKwMD/k+Dbun91as12tvnL4xlXUYLtA2MgGd+BUo/TFnPL9+QRHHYYM6MvMHaj9lF/AefU4pTSXOrR7/6tITB3iSYcH7xkuRUJlD9bk4VwhEUO2Du4OcsVl9L+axE7PBiMUMENjYry4N6FZx7S0q30U+iaaDpPVoW0mnL366vMUg6BuRNEzEcrw4M6F6l/3ai5G9HNHVUm1XrqenPt2DXE9wK6N1zKCUHAKpR9soLcX47mUqe55qL9Tdi9Wf5WCL0KKQyX2FUw1+gh4+WIBjvE8KNztSaqFTw1kVMFglxZzIzSiXkYJ1q0ztMS9W+QHsmUHNwTsUHo7kkCraWHFWg9LZ/zrUjafms3rUsSj7fXCLoPwVIrqwr6Y6YwQIuhdd7w7n4YdZURxu3MaI7aSZu191siqfRj/5h/YO2tCuDflolAKjPbEhzvSAQosqZCX9JNGu2pszGyDe1jIBY7Vc7I/+X+DmPmhwWfDdxrTH1spstt19GMLGaHWCWGDzKP+d4xXbeP8po8TyhrVqudEpi7Vc3rJVPNEUTTRC+BoDM3iwNiPGQqUqqQQ3/cg0L+d7YszV/LniysYBhEx2QMrlGPD3+//N
x-forefront-antispam-report: CIP:255.255.255.255; CTRY:; LANG:en; SCL:1; SRV:; IPV:NLI; SFV:NSPM; H:AM0PR07MB6386.eurprd07.prod.outlook.com; PTR:; CAT:NONE; SFS:(13230016)(4636009)(366004)(186003)(82960400001)(7696005)(54906003)(53546011)(71200400001)(4326008)(110136005)(8676002)(8936002)(86362001)(6506007)(122000001)(52536014)(5660300002)(33656002)(508600001)(9686003)(66574015)(316002)(38070700005)(2906002)(64756008)(66446008)(38100700002)(55016003)(83380400001)(66946007)(66556008)(66476007)(76116006); DIR:OUT; SFP:1102;
x-ms-exchange-antispam-messagedata-chunkcount: 2
x-ms-exchange-antispam-messagedata-0: fMutoWUNKIeMT6K8Kf4/HXpjSXZWp0ug1jyv+QH9Ut4YsxmFKouAcU7BkOOjUWpUdY/arVPsBLuDdMs8qfqaNMrbSPCI0T9htSSqAI0PQBJBI0pJkygFn2slhWLUzbpGxQr33yrDsNap0dtD5h8gIn2UPnMDRL/+hRdyVGM6jq+NOeWjEzmtwCqbB1Kj8LoQQW9XUKGP3XIk+AhC3jD+2dp8wN3dHucv0JUr7UDmTlvCLqyzimnEQzBUtsPAJaw7CuXSdjg7h+91RUdixkI3a3GWDwK5jZ8ijj5FfqjNffuYO3gYhSjY35+jk3qqO7Og1sJIizTcQ8QlS591HOPhWyogUH0d78PxahfLVOjg8vAEUQ8Zn2pH+VML4ZeFFRXTdiyoGiqXqVa89aXXNTuQ2ZRoA0HsOkbZQgBj1+HCaj5guWneHcVcnGDFKbG7AjQWrLQ0OT/fRv1oI3Pq2H5khI6gj0Ycfg+TTFO5wUOpNmo5lYFPEJkw3GUJXa+yyMdcPG0dzL2n8bBcDuEd8X/2c8re8hpVMhsl4MLvs0gREVoFEvFB1uQ11+L+D5kilTZXQJVhvXB+6xx7MHTkldYKT8IJyCCkSDThCFZvGTBZGPOyouzSI3ye0SexW+vtChmeCwWVLN8FBAm1Q5H/6ydXhGV+9M+EVCYM3ml2WvB+7a6C71wBf2YTBsjfE3x+XO3QY47CBMSdhbsYpBa9DLCEbAoz6M/Y0jowWsk+fSVsCAv+SC9FDvmk0rXBt6VieCb8I4WTjt4BYxdOMLsO28TR/vaZr+GWhLhVnRP19lUhQYbmWsIjX7fZn1ooloFphJ6TNcdVm8RY4mrZsJerL14cWXFQKhFKDbgV4CQLuz2eJ5d/hrSRzBheKIggLnluMqX3Kati3Di6fDmLATp801FKxNUQeYKaGtgQMXidv9bC40O5gOTy3N4sWWRmwY7ODwJV/VvfHzk6ndtVlX+NsxatCzU62De+zLaj2GcGTr6orghjwudJVje5m9VTjbhp/6F6/88x1dUt9wWyfn6AdLh1n5QzzhYOCXzqMWw8TXd96BfXXlgAzN2KznKIUdf/m+MP2lLFAGQK8p6VvYgMgyYkCVmmKB5gvgrEG/FOA6K1Z8TXwpMmjOeDpTXL51rl8oDgop3/GQKOuextpKGbHb8FJ73cUdlR9yyN4vBDsk2aEbxAcuKIrUQnSRV7ciKokKJDWDvQfvc5mLBH0vo3TTOfHXdJ6WAAN56BCE7At4xz8tC8LjowJUiedZdrAo3GMfcVJHUZyUy2tT0dqqIcELdBV6Q6JKQo8E19rUriOvVvn6002HCbQBmQRYEVibeRiusrPi5qRAdYRgGUlzSV0pcp4tptSKv0hZyYhUWS2MYlrOlpw4rysH9X3sZdD27u1YHw49ipA6AyGOgJ2thDQ31P0dnEoktaV2m6b6wRyqcI7PhB3S918LIog2wG4Se/8xO3lMXXxupQyWzdrucHSJgqH95c/1q8DIZJhBFYkLFsnqxGKDLjXeRlRiArf7y9au87VtCQXYLdyhxX/Dx4FAZxVmLoElULu6qKXbt94HcfthB9ad82IEssc/4RPXW2nZKU4aW+laGD1imHXr+9UJBYe/tbG6isWvMiG0a772g49s4qgEYWxXTKpnyf+GW10rUN2WOYcabe/Ofq3BGJ/2mf/odnIZZmnncAKrWn252ITTonnkdz/7Fxuiaz4IKjFPDS5Mlopv/JilBIJ3M2mZdD2stc+I2fvKvd71yHvX1wqNC1HMF7L8eTCTO/dR4lpzTna8INWCAE
x-ms-exchange-antispam-messagedata-1: Yla4Eg4HC1NhUGXTXh9x3lhynD3PwG2wg58=
Content-Type: text/plain; charset="utf-8"
Content-Transfer-Encoding: base64
MIME-Version: 1.0
X-OriginatorOrg: nokia.com
X-MS-Exchange-CrossTenant-AuthAs: Internal
X-MS-Exchange-CrossTenant-AuthSource: AM0PR07MB6386.eurprd07.prod.outlook.com
X-MS-Exchange-CrossTenant-Network-Message-Id: 70abbc40-da86-425b-2f27-08da4eaa6c5d
X-MS-Exchange-CrossTenant-originalarrivaltime: 15 Jun 2022 08:38:29.5330 (UTC)
X-MS-Exchange-CrossTenant-fromentityheader: Hosted
X-MS-Exchange-CrossTenant-id: 5d471751-9675-428d-917b-70f44f9630b0
X-MS-Exchange-CrossTenant-mailboxtype: HOSTED
X-MS-Exchange-CrossTenant-userprincipalname: vdRdoKoEfThsZllYqlGYySF1uHF8HEKQKq5sXFFkuMiwRKX6XRudByPvnkQjXhSAiLCk6ds9qa3Ym+Ytui6i1PTnQUVU57m/QJj3pIq4wCg=
X-MS-Exchange-Transport-CrossTenantHeadersStamped: AM4PR07MB3332
Archived-At: <https://mailarchive.ietf.org/arch/msg/lsr/FzGzNCffy0vLtloqcH-WlWELDv8>
Subject: Re: [Lsr] Thoughts about PUAs - are we not over-engineering?
X-BeenThere: lsr@ietf.org
X-Mailman-Version: 2.1.39
Precedence: list
List-Id: Link State Routing Working Group <lsr.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/lsr>, <mailto:lsr-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/lsr/>
List-Post: <mailto:lsr@ietf.org>
List-Help: <mailto:lsr-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/lsr>, <mailto:lsr-request@ietf.org?subject=subscribe>
X-List-Received-Date: Wed, 15 Jun 2022 08:38:37 -0000

Hi Peter, All,

From a BGP perspective (PE service nodes) the event detection when transport tunnel end-point suddenly becomes unreachable is an operational problem. I think we all agree.
This problem exists in any multi-domain network, and is not limited to a multi-area/level IGP with summarization. Hence my doubts that simple encodings using the IGP as API for unreachability signaling is an optimal solution.  

Churning the LSDB for these things doesn't seem right.  Would this mean that we hack the IGP implementation so we don't trigger SPFs on rx of these updates?  
Another concern is how we hook into BGP sideways to update it. Typically a router just looks at RTM and tunnel-tables for reachability. Now it would have check all the time a separate bypass-list.  
What about the pseudo-state. On startup I would imagine we would have to originate this PUA until a certain point?

Some consideration about installing the PUA route as a blackhole route, it does not seem an option because resolution of BGP next-hops with blackhole /32 routes has to continue to mean “drop” matching traffic because of the widespread way this is used for DDOS protection. So there is need another “install” type for the “unreachable” IGP prefix which does not exist yet.

To make IGP based Prefix-unreachability-signal successful seems not a trivial task pe-to-pe, and involves more than simplistic dumping of opaque link-state messages into IGP and to re-vector interior routing as an API. I'm a bit tormented regarding the potential evil caused to IGP for signaling prefix-unreachability. It may not be worth the effort. Especially when realizing that the problem space is not limited to multi-area/level summarization but instead exists in any multi-domain network. 

Maybe IETF should consider looking at the bigger picture, at service level, and document a full service level solution framework instead of looking only at IGP in atomic fashion.

G/

-----Original Message-----
From: Peter Psenak <ppsenak@cisco.com> 
Sent: Tuesday, June 14, 2022 5:46 PM
To: Van De Velde, Gunter (Nokia - BE/Antwerp) <gunter.van_de_velde@nokia.com>; lsr <lsr@ietf.org>
Cc: draft-ppsenak-lsr-igp-ureach-prefix-announce@ietf.org; draft-wang-lsr-prefix-unreachable-annoucement <draft-wang-lsr-prefix-unreachable-annoucement@ietf.org>
Subject: Re: Thoughts about PUAs - are we not over-engineering?

Hi Gunter,

please see inline:

On 14/06/2022 10:59, Van De Velde, Gunter (Nokia - BE/Antwerp) wrote:
> Hi All,
> 
> When reading both proposals about PUA's:
> * draft-ppsenak-lsr-igp-ureach-prefix-announce-00
> * draft-wang-lsr-prefix-unreachable-annoucement-09
> 
> The identified problem space seems a correct observation, and indeed summaries hide remote area network instabilities. It is one of the perceived benefits of using summaries. The place in the network where this hiding takes the most impact upon convergence is at service nodes (PE's for L3/L2/transport) where due to the summarization its difficult to detect that the transport tunnel end-point suddenly becomes unreachable. My concern however is if it really is a problem that is worthy for LSR WG to solve.

the request to address the problem is coming from the field. The scale of the networks in the field is growing significantly and the summarization is being implemented to keep the prefix scale under control.


> 
> To me the "draft draft-wang-lsr-prefix-unreachable-annoucement-09" is 
> not a preferred solution due to the expectation that all nodes in an 
> area must be upgraded to support the IGP capability. From this 
> operational perspective the draft 
> "draft-ppsenak-lsr-igp-ureach-prefix-announce-00" is more elegant, as 
> only the A(S)BR's and particular PEs must be upgraded to support 
> PUA's. I do have concerns about the number of PUA advertisements in 
> hierarchically summarized networks (/24 (site) -> /20 (region) -> /16 
> (core)). More specific, in the /16 backbone area, how many of these 
> PUAs will be floating around creating LSP LSDB update churns? How to 
> control the potentially exponential number of observed PUAs from 
> floating everywhere? (will this lead to OSPF type NSSA areas where 
> areas will be purged from these PUAs for scaling stability?)

Node going down is a rare event. The expected number of UPAs at any given time is very small. Implementations can limit the number of UPAs on ABR/ASBR in case of a catastrophic events, in which case the UPAs would hardly help anyway.

> 
> Long story short, should we not take a step back and re-think this identified problem space? Is the proposed solution space not more evil as the problem space? We do summarization because it brings stability and reduce the number of link state updates within an area. And now with PUA we re-introduce additional link state updates (PUAs), we blow up the LSDB with information opaque to SPF best-path calculation. In addition there is suggestion of new state-machinery to track the igp reachability of 'protected' prefixes and there is maybe desire to contain or filter updates cross inter-area boundaries. And finally, how will we represent and track PUA in the RTM?

the problem space is valid, as conformed by the field. As described 
above, the number of UPAs will be low, so there is no danger of 
defeating the purpose of the summarization.

> 
> What is wrong with simply not doing summaries and forget about these PUAs to pinch holes in the summary prefixes? this worked very well during last two decennia. Are we not over-engineering with PUAs?

it's the scale of the current networks, which is growing exponentially, 
which demands the use of the summarization.


thanks,
Peter

> 
> G/
>