Re: [Idr] I-D Action: draft-ietf-idr-bgp-fwd-rr-02.txt

Igor Malyushkin <gmalyushkin@gmail.com> Mon, 18 March 2024 15:57 UTC

Return-Path: <gmalyushkin@gmail.com>
X-Original-To: idr@ietfa.amsl.com
Delivered-To: idr@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 5EDF7C18DBB9 for <idr@ietfa.amsl.com>; Mon, 18 Mar 2024 08:57:32 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.107
X-Spam-Level:
X-Spam-Status: No, score=-2.107 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, FREEMAIL_FROM=0.001, HTML_MESSAGE=0.001, RCVD_IN_DNSWL_NONE=-0.0001, RCVD_IN_ZEN_BLOCKED_OPENDNS=0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, T_SCC_BODY_TEXT_LINE=-0.01] autolearn=unavailable autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com
Received: from mail.ietf.org ([50.223.129.194]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id aIlh4o27dmFL for <idr@ietfa.amsl.com>; Mon, 18 Mar 2024 08:57:30 -0700 (PDT)
Received: from mail-vk1-xa30.google.com (mail-vk1-xa30.google.com [IPv6:2607:f8b0:4864:20::a30]) (using TLSv1.3 with cipher TLS_AES_128_GCM_SHA256 (128/128 bits) key-exchange X25519 server-signature RSA-PSS (2048 bits) server-digest SHA256) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 35A58C151090 for <idr@ietf.org>; Mon, 18 Mar 2024 08:57:30 -0700 (PDT)
Received: by mail-vk1-xa30.google.com with SMTP id 71dfb90a1353d-4affeacaff9so1193639e0c.3 for <idr@ietf.org>; Mon, 18 Mar 2024 08:57:30 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20230601; t=1710777449; x=1711382249; darn=ietf.org; h=cc:to:subject:message-id:date:from:in-reply-to:references :mime-version:from:to:cc:subject:date:message-id:reply-to; bh=siuDjSRxxsoXDJmKmUDkqwhIkk318FDnO5pvKGOPlKk=; b=hbg/QXfKcRLwpDuI4TabC0uDcxdpL7p3gJkMYoakpKyTfZU/q6amLWpHnojK05R4Zv kGPThXCiI7CvvXT4erdNPI4rA9YM/qsh75Jygffojl4UxVxiHHFVvo4tphZzPpuIcu2C TTN6NHG/udhiNlWIfb+OorEehdksqjMvxJeYejsA7k/jg67x9Tx0NinrMb1hjFfppP13 J5rfKi/EcHXPOz2Sr419+sRDDxUsTu94qXXhVP1wvL5ZiPk6w0voM++rzTBUMRUt/XFZ ST5jZ8ie3PDqmN1B7yfkX+edrDRXt/hXQZ6AjAUmMOA+H6AWgTn9S1VW9kBLc5FGNSk2 UMtg==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1710777449; x=1711382249; h=cc:to:subject:message-id:date:from:in-reply-to:references :mime-version:x-gm-message-state:from:to:cc:subject:date:message-id :reply-to; bh=siuDjSRxxsoXDJmKmUDkqwhIkk318FDnO5pvKGOPlKk=; b=bggNlyal4z/CV4UEfQqNS29xz6fPJdzswRxc71TH+zPwvEbd7yZRr87rIxYphRannH qrs+6omMTKa2t2580jETHM9xOS0sHGWxEKRjXI+oNlOzP17v2UzFhCr2TS/h5Vw/PhD7 28lsdhAcrPS+fyUZ8NVCOrB2qYFnJJ+Uq9dIt+0yZq6AJXS3o47IJy7iKkPQCY7U9KNS 9TvU6Q5DJRSw988GN9sxxTO4cQIV+Y1N+7CtL4mH3baWTKCITxUZyMM/NTI30RuLjJZD AETqlC0v1EGmx4qTRI2wk0WuO2eoqvmMr3ddNexz52rWe7mE4BpNnLX0P0Ghah9BmGyU 3+HA==
X-Forwarded-Encrypted: i=1; AJvYcCVXZYwYh2+tgrDxvWpnpF2O/amh4uuOvQq8kXoJbYGvC1M5fX3zGYHu4q3Zo72V/BeQvgHJof0uN6ZQrr4=
X-Gm-Message-State: AOJu0YwATxzq7Jh23urCBYJSl9MmewaMrb+2vLWLiqInxwcFoWf2eIcE Grlw8O40JxYg1siGbOi1hVhOS+b+kkvYdQ5/KBJq3wCyHAmWZRdBHejRb5r62JeFkG2TQ7EWJJZ +sburN6IAPWbmmTKx8SViHjYOG9H8fq+L
X-Google-Smtp-Source: AGHT+IEie9YI46NSZmbPxtX0zo3GkQUGW4Zy27Arm7MJZbIFvJ02m+i1NgaJ1hYnMd2NCJ64mD3p/wXQagfpUhpY5lw=
X-Received: by 2002:a05:6122:2013:b0:4d4:310f:9ddf with SMTP id l19-20020a056122201300b004d4310f9ddfmr8210630vkd.16.1710777449201; Mon, 18 Mar 2024 08:57:29 -0700 (PDT)
MIME-Version: 1.0
References: <171065415177.59997.7631576612994148063@ietfa.amsl.com> <CAOj+MMEsp_UfuiHdc4U_Bv5o7xsYYK_RryusUZ88u+SH9xifSA@mail.gmail.com> <SJ0PR05MB86322B34D635E7F221C04C0FA22D2@SJ0PR05MB8632.namprd05.prod.outlook.com> <CAEfhRrxDVi_Yw2wtTWiGzjw4pQ-8TF-48UCY5AUxMpKdrbexZw@mail.gmail.com> <CAOj+MMHNAz741WP9Pf2UCSOQRj6YepFh=Q4tzedmwBCm6e289g@mail.gmail.com> <CAEfhRryMW9nyWfnDQdi+R5g-nypg5ppwFy_Gdf71pRMFmZysHA@mail.gmail.com> <CAOj+MMF2-oqZ29hSBgaO+gzYXXyvCRgJ0m-zW2K7CWattCpgrQ@mail.gmail.com> <CAEfhRrw=acXDgVtzUEhqxZcOPYbJwT0Ha36k-ADgZaiy863erg@mail.gmail.com> <CAOj+MMGnup=Qg7+XiyXtCJe0a2csmdFVVLtJj_TmoA4yysbghw@mail.gmail.com> <CAEfhRrxFBay_pctO32G9S9kL8Ry_Y2Cx+rQEknN3D6qU9XG2+Q@mail.gmail.com> <CAOj+MMFDUtDzSVWRyH6U2arCUWsn-hNTA-+_RXxan_hPLS=AAQ@mail.gmail.com> <CAEfhRrwxXgTNR0hWd96WnAi_wOvUqCsQkN=5bvbqycgxqMCkeg@mail.gmail.com> <CAOj+MMEpvT4f3W238RPr2bULuwyqTn0jwg6xRf01dLYp6H_6hg@mail.gmail.com>
In-Reply-To: <CAOj+MMEpvT4f3W238RPr2bULuwyqTn0jwg6xRf01dLYp6H_6hg@mail.gmail.com>
From: Igor Malyushkin <gmalyushkin@gmail.com>
Date: Mon, 18 Mar 2024 19:57:17 +0400
Message-ID: <CAEfhRrz+6ZP6DMgUj-MOwkGQDwj_Z-q0NUAo6VR2-V9fjk7KAw@mail.gmail.com>
To: Robert Raszuk <robert@raszuk.net>
Cc: Kaliraj Vairavakkalai <kaliraj=40juniper.net@dmarc.ietf.org>, "idr@ietf. org" <idr@ietf.org>
Content-Type: multipart/alternative; boundary="0000000000002646590613f16a32"
Archived-At: <https://mailarchive.ietf.org/arch/msg/idr/NNMjNTyp1l-luBjrB6q0yW1jRqI>
Subject: Re: [Idr] I-D Action: draft-ietf-idr-bgp-fwd-rr-02.txt
X-BeenThere: idr@ietf.org
X-Mailman-Version: 2.1.39
Precedence: list
List-Id: Inter-Domain Routing <idr.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/idr>, <mailto:idr-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/idr/>
List-Post: <mailto:idr@ietf.org>
List-Help: <mailto:idr-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/idr>, <mailto:idr-request@ietf.org?subject=subscribe>
X-List-Received-Date: Mon, 18 Mar 2024 15:57:32 -0000

пн, 18 мар. 2024 г. в 19:30, Robert Raszuk <robert@raszuk.net>:

>
>
>> When you are doing next hop self you are setting it to a local loopback
>>> address. That is your connected route. That next hop will be accessible to
>>> peering ABR via implicit or explicit null.
>>>
>> [IM] It's not necessary to advertise our loopback address as a labeled
>> route or even by BGP (we can disseminate it via IGP). When we receive a BGP
>> LU route there must be an unicast route to resolve its NH and an LSP toward
>> this NH to resolve a label.
>>
>
> I did not say it will be advertised by BGP for resolution. I said this
> Next Hop is going to sit in BGP UPDATEs and will be use in best path
> selection when comparing metrics with other next hops.
>
[IM] Ok. I agree and I still think that this document is a try to enumerate
some of the problems of that type and offer some remedies.

>
> a) we do not know which prefixes and labels going to region-2 got affected
>>> till we run best path on them
>>>
>> [IM] I do not understand this point. With the per-prefix label allocation
>> mode, we know it exactly in advance. This mode's name implies that.
>>
>
> This is crucial to my point. The local "repair" action is really not a
> repair. When you loose your sessions on primary ABR1 to region-2, or you
> get withdraws etc ...) you need to re-run local (on said ABR1) best path as
> there is no other trigger to simply activate bulk switchover in data plane
> to suddenly go to ABR2. This is the key.
>
[IM] I see at least three such triggers. An outgoing Region2-faced
interface failure. An IGP event in Region 2 with a remote ASBR failure. An
RSVP event of underlying Region 2's LSP failure. All of them describe some
group of prefixes and can influence the switchover to their backups. I
mean, there is enough information for a node to act accordingly. The
problem here is that we are not always sure (without any special
intervention) that these backups produce a different path (not affected by
the event). But there is always a compromise.

It does not matter that you got those ABR2 backup routes/paths and labels
> ahead of time. At the time of failure of your IBGP peers to region-2 you
> still need local per prefix or at least per label.
>
[IM] As I said this is not the only type of failure.

>
> My point is that this entire sentence in the draft sort of warning users
> from configuring on subject ABRs same CLUSTER_ID is irrelevant considering
> that failure scenario of IBGP sessions to ABRs from region-2 going down may
> be detected in 180 seconds ... And some folks panic to run BFD on IBGP
> sessions.
>
[IM] I'm neither a big fan of BFD for iBGP or other peer-tracking
techniques. I agree with you that for some networks (maybe even for a lot
of them) the same CLUSTER_ID could fit well. Maybe there is a way to
restate this paragraph to include observation of current service providers'
practices, and also not to frighten possible readers at the same time.

>
>
>> b) for each prefix received from back ABR label may be different.
>>>
>> [IM] Sure, the per-prefix mode tends to be greedy in terms of resources.
>> It is a problem for Option B, but almost never a problem for Option C.
>>
>
> No one said anything about resources, but about time to do per prefix
> invalidation on ABR vs doing the fixup on ingress PEs and avoiding all the
> mess. My point is that we should do the latter.
>
>
>> So to reiterate I am saying that ABR doing local repair can not do any
>>> repair in prefix independent fashion as there is no single trigger from
>>> region-2 and there is no single switchover to backup ABR possible (each
>>> route may have a different per prefix label).
>>>
>> [IM] Well, I disagree.
>>
>
> Ok.
>
> Tx,
> R.
>