Re: [Idr] I-D Action: draft-ietf-idr-bgp-fwd-rr-02.txt

Igor Malyushkin <gmalyushkin@gmail.com> Mon, 18 March 2024 16:34 UTC

Return-Path: <gmalyushkin@gmail.com>
X-Original-To: idr@ietfa.amsl.com
Delivered-To: idr@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 88B8EC180B79 for <idr@ietfa.amsl.com>; Mon, 18 Mar 2024 09:34:03 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.107
X-Spam-Level:
X-Spam-Status: No, score=-2.107 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, FREEMAIL_FROM=0.001, HTML_MESSAGE=0.001, RCVD_IN_ZEN_BLOCKED_OPENDNS=0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, T_SCC_BODY_TEXT_LINE=-0.01] autolearn=unavailable autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com
Received: from mail.ietf.org ([50.223.129.194]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id Z4TGXxvEoGB7 for <idr@ietfa.amsl.com>; Mon, 18 Mar 2024 09:34:02 -0700 (PDT)
Received: from mail-ua1-x929.google.com (mail-ua1-x929.google.com [IPv6:2607:f8b0:4864:20::929]) (using TLSv1.3 with cipher TLS_AES_128_GCM_SHA256 (128/128 bits) key-exchange X25519 server-signature RSA-PSS (2048 bits) server-digest SHA256) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 4CFE3C151996 for <idr@ietf.org>; Mon, 18 Mar 2024 09:34:02 -0700 (PDT)
Received: by mail-ua1-x929.google.com with SMTP id a1e0cc1a2514c-7e074f8ac06so214001241.1 for <idr@ietf.org>; Mon, 18 Mar 2024 09:34:02 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20230601; t=1710779641; x=1711384441; darn=ietf.org; h=cc:to:subject:message-id:date:from:in-reply-to:references :mime-version:from:to:cc:subject:date:message-id:reply-to; bh=6wEXl9pdzfYLfva728cub1+oHcohoYwgcmjLafYjK4k=; b=HeWzk0zX2cMIWMiEtNiKpBKGt5pAZyavGdnR2v65OiI5OIgRBSBzN6/rmbw05o4AqU TO7lDqvPtrOmk+RhVPQATbD/bNpWQcVqiIXZNNAeV1/iGY2dPnH95hso+qIUPQUKCjyI 49VwOlQr8t2ZWpj9Y/Oewlf1Wh+EFPIAWeQ+QVvz7g4UxMHh55mfjGDKHf+R4KDacvfn mMFHK3W1kAycIkCHXjRpG71L7t5xgVa3Nf+fgMtSX/HK7PLZbWuIfKFyriI5ypWHmcLM xtg83YK4PMlTG2u2K1yBLYAMP/HdwN00YqmxNFPdJaoJwSuWJg6NQEgCm6F5AjX3+RTd IuUw==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1710779641; x=1711384441; h=cc:to:subject:message-id:date:from:in-reply-to:references :mime-version:x-gm-message-state:from:to:cc:subject:date:message-id :reply-to; bh=6wEXl9pdzfYLfva728cub1+oHcohoYwgcmjLafYjK4k=; b=jm+WNJq0eHQytt1RoaG9Ndwkl0XX4JdoaJ1fSSTLsi5RSnnQ/vM19RYICQoGiVctp2 FUqUlPIds045G6TezU3VscanxseP5+r8w6pgXvsWcSTBtraW3eb2nq+ZVb+8U/6r+jU8 9qRxe7EdSKVN3yyu4PEzL9QMpB5PwJwC2u7eyhvA8eaxMMhWQ7uu8/1utCXYWNVrQ5l2 D8sb7yMudCUwq5G82InUDl38DDKRPs7tvshBJ1ZIPOXyye8AHycj4DcWddC9IagQAXFE yaEjG34+EqwB5QNGva8LGpBP+8C9x4R6RoYWIj09w327qs1PyYwPWHNB+28ZJNvMHOuV WNSA==
X-Forwarded-Encrypted: i=1; AJvYcCW6UmWRH9BloiohtqF/V7KVHa2d/m0syAH20hxh4qGXlRKsYTI84fBk/DLl+YmzjT9qGXhTmPJkeE1R0e4=
X-Gm-Message-State: AOJu0Yy4nuoQfhHxKf+s13Cd+dVp6IQ8lcsaIrS/PUOLuTy20Q3b+S+B GHO+6+psMcq03FXsDayMrKtWx0ObB9Ti6nvHCoF+BtjsfFurOJbbHSRcDXcZguqultzSsdK21uz B2nS69t4eAwvtSUD+KGB/R6qeU240SSQrqn8=
X-Google-Smtp-Source: AGHT+IGj6x/Cn1pW3XALzo29Oih36xrsdy6XuDx2rJX2CE9Kdwq2Rkrbd+RWRWBuezxNxbo2qYPueN+2Otkz2Bp6MXA=
X-Received: by 2002:a05:6122:45a:b0:4c9:d0ee:bd67 with SMTP id f26-20020a056122045a00b004c9d0eebd67mr8078580vkk.5.1710779640689; Mon, 18 Mar 2024 09:34:00 -0700 (PDT)
MIME-Version: 1.0
References: <171065415177.59997.7631576612994148063@ietfa.amsl.com> <CAOj+MMEsp_UfuiHdc4U_Bv5o7xsYYK_RryusUZ88u+SH9xifSA@mail.gmail.com> <SJ0PR05MB86322B34D635E7F221C04C0FA22D2@SJ0PR05MB8632.namprd05.prod.outlook.com> <CAEfhRrxDVi_Yw2wtTWiGzjw4pQ-8TF-48UCY5AUxMpKdrbexZw@mail.gmail.com> <CAOj+MMHNAz741WP9Pf2UCSOQRj6YepFh=Q4tzedmwBCm6e289g@mail.gmail.com> <CAEfhRryMW9nyWfnDQdi+R5g-nypg5ppwFy_Gdf71pRMFmZysHA@mail.gmail.com> <CAOj+MMF2-oqZ29hSBgaO+gzYXXyvCRgJ0m-zW2K7CWattCpgrQ@mail.gmail.com> <CAEfhRrw=acXDgVtzUEhqxZcOPYbJwT0Ha36k-ADgZaiy863erg@mail.gmail.com> <CAOj+MMGnup=Qg7+XiyXtCJe0a2csmdFVVLtJj_TmoA4yysbghw@mail.gmail.com> <CAEfhRrxFBay_pctO32G9S9kL8Ry_Y2Cx+rQEknN3D6qU9XG2+Q@mail.gmail.com> <CAOj+MMFDUtDzSVWRyH6U2arCUWsn-hNTA-+_RXxan_hPLS=AAQ@mail.gmail.com> <CAEfhRrwxXgTNR0hWd96WnAi_wOvUqCsQkN=5bvbqycgxqMCkeg@mail.gmail.com> <CAOj+MMEpvT4f3W238RPr2bULuwyqTn0jwg6xRf01dLYp6H_6hg@mail.gmail.com> <CAEfhRrz+6ZP6DMgUj-MOwkGQDwj_Z-q0NUAo6VR2-V9fjk7KAw@mail.gmail.com> <CAOj+MMFmY4LrUc=-Nb+rgOQywcTAQ3JZKA071hsHc9SEo=dM9g@mail.gmail.com>
In-Reply-To: <CAOj+MMFmY4LrUc=-Nb+rgOQywcTAQ3JZKA071hsHc9SEo=dM9g@mail.gmail.com>
From: Igor Malyushkin <gmalyushkin@gmail.com>
Date: Mon, 18 Mar 2024 20:33:48 +0400
Message-ID: <CAEfhRrzb5GsFX_AQ4dY9+2azVbNRvNZDQda3g_rMRPmj_no1tw@mail.gmail.com>
To: Robert Raszuk <robert@raszuk.net>
Cc: Kaliraj Vairavakkalai <kaliraj=40juniper.net@dmarc.ietf.org>, "idr@ietf. org" <idr@ietf.org>
Content-Type: multipart/alternative; boundary="000000000000c5bb170613f1ecb3"
Archived-At: <https://mailarchive.ietf.org/arch/msg/idr/qU_eEMLQPbwXc7l5L0JyyUSQb88>
Subject: Re: [Idr] I-D Action: draft-ietf-idr-bgp-fwd-rr-02.txt
X-BeenThere: idr@ietf.org
X-Mailman-Version: 2.1.39
Precedence: list
List-Id: Inter-Domain Routing <idr.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/idr>, <mailto:idr-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/idr/>
List-Post: <mailto:idr@ietf.org>
List-Help: <mailto:idr-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/idr>, <mailto:idr-request@ietf.org?subject=subscribe>
X-List-Received-Date: Mon, 18 Mar 2024 16:34:03 -0000

пн, 18 мар. 2024 г. в 20:11, Robert Raszuk <robert@raszuk.net>:

> Hi Igor,
>
>
>> This is crucial to my point. The local "repair" action is really not a
>>> repair. When you loose your sessions on primary ABR1 to region-2, or you
>>> get withdraws etc ...) you need to re-run local (on said ABR1) best path as
>>> there is no other trigger to simply activate bulk switchover in data plane
>>> to suddenly go to ABR2. This is the key.
>>>
>> [IM] I see at least three such triggers.
>>
>
> Ahhh see the crux of the matter is that none of those listed below is
> applicable to support bulk data plane switchover to back ABR :(
>
[IM] Generally, all infrastructure routes have several possible next-hop
addresses. We monitor the availability of the next hops (and LSPs to them),
not prefixes. So, the "bulkiness" here is about our reaction to a next-hop
failure. This next-hop is a part of many NHFLEs. But modern gear has lots
of optimizations with pointers and so on to make it almost instant.

>
> An outgoing Region2-faced interface failure.
>>
>
> When you loose interface and ABR is (and should be) connected via at least
> two interfaces to each region your IBGP sessions are intact.
>
[IM] Yes, and it is always a good option to have the second interface as a
backup. But it is the same next-hop protection technique and another layer
in the hierarchy (upper). Modern devices have several layers of next-hops
and can react to different failures.

>
> And even if this is single interface and you invent new harmful knob to
> invalidate IBGP sessions when interface X goes down (this is common
> practice on directly connected EBGP sessions only) you still have zoo of
> prefixes received over those IBGP sessions and each may have been
> advertised with different label.
>

> So you still need to run full best path and pick new (now backup path) on
> a per prefix basis and install one by one in RIB/FIB/LFIB.
>

>
>> An IGP event in Region 2 with a remote ASBR failure.
>>
>
> Ok you can invalidate a remote next hop. Sure. But that means as you have
> observed already that each route may be incoming with different label so
> you need prefix by prefix pick alternative label and install it in
> RIB/FIB/LFIB locally.
>
 [IM] Actually, this is a parallel process, we can protect our labeled
traffic during the calculation of new bests.

>
>
>
>> An RSVP event of underlying Region 2's LSP failure.
>>
>
> Whow ... now we see a soft state protocol through into this mix.  And when
> an RSVP-TE LSP to remote next hop in region-2 goes down you want to
> consider this a trigger to invalidate given next hop ? Ok but we are back
> to #2 above. Still one by one ...
>
[IM] Sure! Most of the modern SP-ready devices react on an LSP failure.
Once I tested an IOS device and it didn't react on an LSP failure at all
because it had a route toward an NH. Horrible...

>
>
>> All of them describe some group of prefixes and can influence the
>> switchover to their backups.
>>
>
> That would be true for vanilla IPv4/\Pv6 SAFI 1. Not for SAFI 4 or SAFI
> 76.
>
[IM] I didn't get it, sorry.

>
> Cheers,
> R.
>
>