Re: [EXTERNAL] draft-ietf-rtgwg-segment-routing-ti-lfa : A simple pathological network fragment

Gyan Mishra <hayabusagsm@gmail.com> Thu, 02 November 2023 23:04 UTC

Return-Path: <hayabusagsm@gmail.com>
X-Original-To: rtgwg@ietfa.amsl.com
Delivered-To: rtgwg@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 271E9C151066; Thu, 2 Nov 2023 16:04:04 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.103
X-Spam-Level:
X-Spam-Status: No, score=-2.103 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, FREEMAIL_FROM=0.001, HTML_MESSAGE=0.001, RCVD_IN_DNSWL_BLOCKED=0.001, RCVD_IN_ZEN_BLOCKED_OPENDNS=0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, T_SCC_BODY_TEXT_LINE=-0.01, URIBL_BLOCKED=0.001, URIBL_DBL_BLOCKED_OPENDNS=0.001, URIBL_ZEN_BLOCKED_OPENDNS=0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com
Received: from mail.ietf.org ([50.223.129.194]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id C_09PybMv3xn; Thu, 2 Nov 2023 16:03:58 -0700 (PDT)
Received: from mail-qt1-x82b.google.com (mail-qt1-x82b.google.com [IPv6:2607:f8b0:4864:20::82b]) (using TLSv1.3 with cipher TLS_AES_128_GCM_SHA256 (128/128 bits) key-exchange X25519 server-signature RSA-PSS (2048 bits) server-digest SHA256) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 50AA7C14CF1F; Thu, 2 Nov 2023 16:03:58 -0700 (PDT)
Received: by mail-qt1-x82b.google.com with SMTP id d75a77b69052e-41cc44736f2so8393181cf.3; Thu, 02 Nov 2023 16:03:58 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20230601; t=1698966237; x=1699571037; darn=ietf.org; h=cc:to:subject:message-id:date:from:in-reply-to:references :mime-version:from:to:cc:subject:date:message-id:reply-to; bh=U9LJzlV0aS/WoMxgrregyWp+TRaR7Ahtie5P/ZVi2Ys=; b=Y6SLPq0mepJhj2juzs+bh8NWAGaNxZqeyzthBxlMmbYmsB+FQbA2ts1905nJ/NEvma 2trW5Fb6eDru/sScc+FmoFg7Zf745yQlkkiNuytm+8O/jAjJa6/mUo9Hpd2gn0NJaZr0 m+G2/r/avAmQOuS5jpTZDh1Wo7jUOABbiZEJ1fT3GWqq609B8WX0hhqturc6rXsJEYh9 9sMLAH2mdMHRstSQIK0aYvtr2cHtP15KXMMtsRAT6g/LUcs75hVtrbMAtK9oJ0r8+0pC 3K2Xwd24Wkph1JmZ1KupIvoujtlL0M41mKxtwcZ1oqIypggLNENeD6WSBcTgbjJ+m2bl WOpQ==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1698966237; x=1699571037; h=cc:to:subject:message-id:date:from:in-reply-to:references :mime-version:x-gm-message-state:from:to:cc:subject:date:message-id :reply-to; bh=U9LJzlV0aS/WoMxgrregyWp+TRaR7Ahtie5P/ZVi2Ys=; b=ePpOo18uGqpn7zPM3NVwR9NOkaCT2z3xaxVi2gxuohehzkB7TELOcQPVIvtYxkI7TY UKdVfZ3Ww5r/J8iALNRl44Y3VLoTMaKvLC86CHINNKrhz3yRYNcv9zfnpye/unwtnqDW TS9CdJgvUowh7UfANQgawkTfnR/q6MIRNgcP0+vGAyGOPZIJIgsxYeyLccI/XNcJADPL JGy0WScQX3JatDiNt5xi9VHZAbuTi1jeB8x7zKsv9GD4ttb7Hf+UPHiemXpKs8z3jUXd GnIwjBhSvQJqVMosr8vXO0AUQd8Y5LKe7bwMAoBE+kOFgwlmojkPq4Bcd0jjUD7MrEng 61kw==
X-Gm-Message-State: AOJu0Yx1iMVSHYJvb9jl/Ew5QF3YU+afLx9IOravLNGejFRDjIGJPkNC CjuBh9pfHXPOYFKNn+acMfhLJUCz83WoGgczfFU=
X-Google-Smtp-Source: AGHT+IFRmpBiTi4QEoXEySeiFuTi953NYOVp4/knPqSYm7yQOwZU3lzdNAkZB6oh/CD7NuDiUE4YFgteNLo5kLCJVkE=
X-Received: by 2002:ac8:5952:0:b0:41e:1ee3:ba65 with SMTP id 18-20020ac85952000000b0041e1ee3ba65mr24656185qtz.41.1698966236646; Thu, 02 Nov 2023 16:03:56 -0700 (PDT)
MIME-Version: 1.0
References: <9908D9F3-45C6-497D-B3BF-84D8A68A5013@gmail.com> <AS2PR02MB88395D3114B0DEE583BEEF65F0D7A@AS2PR02MB8839.eurprd02.prod.outlook.com> <60124119-5847-4F52-8BB8-18398A9BA4AC@gmail.com> <AS2PR02MB8839FB5A5537FC3E9F37A560F0D4A@AS2PR02MB8839.eurprd02.prod.outlook.com> <PH0PR03MB63004F32F9AF282ECDB78637F6D9A@PH0PR03MB6300.namprd03.prod.outlook.com> <AS2PR02MB88393EC50B913A5F8C3AB5E2F0D8A@AS2PR02MB8839.eurprd02.prod.outlook.com> <PH0PR03MB6300D9A7F9DC3E2E864EF11EF6D8A@PH0PR03MB6300.namprd03.prod.outlook.com> <CABNhwV30uhLOo52WHAv6YS4Wg0k9gDbkrs1ANuGPPdLzc1=dsw@mail.gmail.com> <6A2E595E-A7E6-4976-ACC9-E75402AD99E2@gmail.com> <PH0PR03MB63005F751BF04E8D5BEC982FF6A6A@PH0PR03MB6300.namprd03.prod.outlook.com> <A5218ED2-479C-48B5-8AC8-DA6B247D6665@gmail.com> <PH0PR03MB63000BC8F43B90B0CA1A1543F6A6A@PH0PR03MB6300.namprd03.prod.outlook.com> <E02A044F-4431-4559-97A8-C6B810DD7E4D@gmail.com> <AF2B1C41-55F6-4E78-AA4B-0AE7F573820B@gmail.com> <PH0PR03MB63002291B0F1F5514875018EF6A6A@PH0PR03MB6300.namprd03.prod.outlook.com> <2A85DD24-612D-472D-907D-1D90C88A95AD@gmail.com> <CABY-gOMVMb+TWLoKBdurn7jL=xF6APeVhEoSzSbH5CGnvmpLHw@mail.gmail.com> <PH0PR03MB630036D752F1A7D9A92F6558F6A6A@PH0PR03MB6300.namprd03.prod.outlook.com> <C4CC4157-DC35-40A6-9B0E-FAC7EA35D4CD@gmail.com>
In-Reply-To: <C4CC4157-DC35-40A6-9B0E-FAC7EA35D4CD@gmail.com>
From: Gyan Mishra <hayabusagsm@gmail.com>
Date: Thu, 02 Nov 2023 19:03:44 -0400
Message-ID: <CABNhwV3aPHU=EuQ+8Okjxvo_1cb+xN+h0k_vjbOEjhiX2fh8iw@mail.gmail.com>
Subject: Re: [EXTERNAL] draft-ietf-rtgwg-segment-routing-ti-lfa : A simple pathological network fragment
To: Stewart Bryant <stewart.bryant@gmail.com>
Cc: Alexander Vainshtein <Alexander.Vainshtein@rbbn.com>, Yingzhen Qu <yingzhen.ietf@gmail.com>, "draft-ietf-rtgwg-segment-routing-ti-lfa@ietf.org" <draft-ietf-rtgwg-segment-routing-ti-lfa@ietf.org>, rtgwg-chairs <rtgwg-chairs@ietf.org>, "rtgwg@ietf.org" <rtgwg@ietf.org>
Content-Type: multipart/alternative; boundary="000000000000055654060933672d"
Archived-At: <https://mailarchive.ietf.org/arch/msg/rtgwg/yeaB_Qa0Jy2LyWUr-OV1Y8dXTCI>
X-BeenThere: rtgwg@ietf.org
X-Mailman-Version: 2.1.39
Precedence: list
List-Id: Routing Area Working Group <rtgwg.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/rtgwg>, <mailto:rtgwg-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/rtgwg/>
List-Post: <mailto:rtgwg@ietf.org>
List-Help: <mailto:rtgwg-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/rtgwg>, <mailto:rtgwg-request@ietf.org?subject=subscribe>
X-List-Received-Date: Thu, 02 Nov 2023 23:04:04 -0000

Any of these options work for me.

I will be a remote  participant.

Thanks

Gyan


On Thu, Nov 2, 2023 at 1:10 PM Stewart Bryant <stewart.bryant@gmail.com>
wrote:

> Any of these work for me.
>
> Stewart
>
> On 2 Nov 2023, at 16:47, Alexander Vainshtein <
> Alexander.Vainshtein@rbbn.com> wrote:
>
> Yingzhen,
>
> Any of these options would work for me. If we can start with a side
> meeting during the IETF-118, this would speed things up.
>
>
>
> Regards,
>
> Sasha
>
>
>
> *From:* Yingzhen Qu <yingzhen.ietf@gmail.com>
> *Sent:* Thursday, November 2, 2023 6:35 PM
> *To:* Stewart Bryant <stewart.bryant@gmail.com>
> *Cc:* Alexander Vainshtein <Alexander.Vainshtein@rbbn.com>; rtgwg@ietf.org;
> rtgwg-chairs <rtgwg-chairs@ietf.org>;
> draft-ietf-rtgwg-segment-routing-ti-lfa@ietf.org; Gyan Mishra <
> hayabusagsm@gmail.com>
> *Subject:* Re: [EXTERNAL] draft-ietf-rtgwg-segment-routing-ti-lfa : A
> simple pathological network fragment
>
>
>
> Hi,
>
>
>
> The ti-lfa draft has not done WGLC yet, and we should definitely try to
> resolve this issue.
>
>
>
> I just checked the IETF 118 attendees list, and it seems not everyone will
> be onsite. I'd suggest continuing the discussion using this thread, and we
> can schedule either a side meeting during 118 or an Interim meeting on this
> topic after 118. Authors from both the ti-lfa and sr-uloop, Stewart, Sasha,
> and Gyan should be there.
>
>
>
> Please reply with your thoughts or email the chairs directly.
>
>
>
> Thanks,
>
> Yingzhen
>
>
>
> On Thu, Nov 2, 2023 at 8:45 AM Stewart Bryant <stewart.bryant@gmail.com>
> wrote:
>
> Sasha, please see inline
>
>
>
> On 2 Nov 2023, at 14:12, Alexander Vainshtein <
> Alexander.Vainshtein@rbbn.com> wrote:
>
>
>
> Stewart and all,
>
> I think I understand now the difference between Section 6.2 of RFC 5715
> <https://www.rfc-editor.org/rfc/rfc5715#section-6.2> and the SR
> Micro-Loop Avoidance
> <https://datatracker.ietf.org/doc/html/draft-bashandy-rtgwg-segment-routing-uloop-15>  draft
> – or, rather, common expectations from this draft.
>
>
>
> RFC 5715 has been published in 2010. With the tunneling techniques
> available at that time in the industry I suspect that “tunnels whose path
> is not affected by the topology change”  in this section have been
> implicitly presumed to be RSVP-TE tunnels – simply because no other
> tunneling technology was available at that time (I do not think that source
> routing in IP has been seriously considered).
>
>
>
> Nearside tunnelling does not need RSVP. Any ingress that will use the PLR
> to deliver a packet via the failed link will always be able to reach the
> PLR since it is on the nearside of the failure, so all you need to do is to
> push a label that is associated with the PLR router and the packet will get
> to the PLR where it will be popped to reveal the label associated with an
> entity reachable via the failed link which them triggers a repair action on
> that packet. I cannot remember if we wrote that down, but as I remember we
> considered it obvious at the time. This needs no signalling beyond the
> normal normal IGP and MPLS LDP.
>
>
>
>
>
>
>
> The context of the SR Micro-Loop Avoidance draft is Segment Routing, and
> RSVP-TE is, most probably, out of scope. With SR, the tunnels that are not
> affected by topology changes are implemented as contiguous lists of
> unprotected Adj-SIDs – but forwarding HW is quite limited regarding the
> length of such lists. Therefore, the common expectation from the SR
> Micro-Loop avoidance  (as well from TI-:FA) is that that it uses tunnels
> whose paths are not affected by the topology change *and that are
> implementing using a reasonably short lists of Node SIDs and Adj-SIDs*.
>
>
>
> For link failure with loop free support you never need more that two
> labels: one to get you to the edge of P space, and if the P mode is not a
> PQ node, a second table to get you into Q space.
>
>
>
> A router already has the label t reach the P router, and pre the work on
> SR we proposed to use TLDP, but now you would use an SR label.
>
>
>
> So you only ever need one label at ingress and you already have that. You
> need at most two labels at the PLR, one normal label that you already have
> and one adjacency label that you could have got from T-LDP but which you
> more conveniently get from the SR system.
>
>
>
> This is much simpler than the TiLFA approach and just works.
>
>
>
>
>
> Section 6 of the TI-LFA draft describes how such paths can be computed in
> the case of a single link failure, and the constrain of post-convergence is
> one way to guarantee that they are not affected by topology changes.
>
>
>
> SR Micro-Loop Avoidance draft, for the last 7 years, repeats the promise
> to define approaches for computing such paths in Section 3 which remains
> unchanged from the -00 version and until this day.
>
> I admit that I am not aware of another way to guarantee that the path that
> is implemented as a sequence of SISD and includes Node SIDs would not be
> affected by the topology change.
>
>
>
> What do you think?
>
>
>
> I think we are making a simple problem much harder than it needs to be.
>
>
>
> Best regards
>
>
>
> Stewart
>
>
>
>
>
> Regards,
>
> Sasha
>
>
>
>
>
>
>
>
>
>
>
> Regards,
>
> Sasha
>
>
>
> *From:* Stewart Bryant <stewart.bryant@gmail.com>
> *Sent:* Thursday, November 2, 2023 1:29 PM
> *To:* Alexander Vainshtein <Alexander.Vainshtein@rbbn.com>
> *Cc:* Stewart Bryant <stewart.bryant@gmail.com>; rtgwg@ietf.org;
> rtgwg-chairs <rtgwg-chairs@ietf.org>;
> draft-ietf-rtgwg-segment-routing-ti-lfa@ietf.org; Gyan Mishra <
> hayabusagsm@gmail.com>
> *Subject:* Re: [EXTERNAL] draft-ietf-rtgwg-segment-routing-ti-lfa : A
> simple pathological network fragment
>
>
>
> I just want to correct something
>
>
>
> You do not of course need to tunnel if the packets only go though nodes
> that are shielded from the knowledge that the link has failed and thus ego
> not reconverge. A method such as RFC5715 section 6.7 - Ordered FIB update -
> does not require a tunnel because it causes the Q space to gradually expand
> and P space to gradually contract until the PLR is subsumed into Q space.
>
>
>
> - Stewart
>
>
>
>
>
>
>
> On 2 Nov 2023, at 11:20, Stewart Bryant <stewart.bryant@gmail.com> wrote:
>
>
>
>
>
>
>
> On 2 Nov 2023, at 08:56, Alexander Vainshtein <
> Alexander.Vainshtein@rbbn.com> wrote:
>
>
>
> Stewart and all,
>
> I have looked up RFC5715 Section 6.2
> <https://www.rfc-editor.org/rfc/rfc5715#section-6.2> and I agree that it
> is similar to the SR Micro-Loop Avoidance
> <https://datatracker.ietf.org/doc/html/draft-bashandy-rtgwg-segment-routing-uloop-15>
>  draft.
>
>
>
> Specifically, explicitly mentions usage of  “tunnels whose path is not
> affected by the topology change” which is quite close IMHO to what the SR
> Micro-Loop avoidance draft is about and quite close to post-convergence
> paths used in TI-LFA.
>
> At the same time there are some differences. Specifically, RFC mentions “a
> new "loop-prevention" routing message” being issued by the router
> adjacent to failure. No such message is required in the SR Micro-Loop
> Avoidance draft.
>
>
>
> There are two ways of looking at this - new as in of a new type - and new
> as in a new message is issued.
>
>
>
> What ever you do you need a message to trigger loop prevention otherwise
> nodes remote from the failure will not know that it is needed. This could
> be done in one of two ways, either a bespoke fast flooded message, or you
> can trigger it from the routing LSP that will be issued by the PLR. It is
> not clear how fast the LSP flooding message will reach the all the nodes in
> P space.
>
>
>
> Either way you need a message to trigger loop prevention.
>
>
>
>
>
> I also think that the proposal, in the case of a link failure, to tunnel
> traffic to the nearest node adjacent to failure, is problematic. (Of
> course, the SR Micro-Loop Avoidance draft does not provide any approach for
> computing micro-loop avoiding paths with limited depth of added label
> stacks at all, it just repeats the promise to provide reference approaches
> starting from version -00 and until now).
>
>
>
> There are of course multiple ways of avoiding loops called up in RFC5715,
> but all of them require that all packets arriving at any ingress in the
> network that would originally go to the PLR either go direct to Q space or
> continue in P space to the PLR are repaired. If they are going to the PLR
> they need to be tunnelled where for the purposes of this discussion
> encapsulating in an SR packet is considered to be a tunnel.
>
>
>
> Anyway you did not comment on my point that if we need loop free anyway
> the congruence of the PLR path to the PQ node is no longer a hard
> requirement.
>
>
>
> Best regards
>
>
>
> Stewart
>
>
>
>
>
> My 2c,
>
> Sasha
>
>
>
> *From:* Stewart Bryant <stewart.bryant@gmail.com>
> *Sent:* Thursday, November 2, 2023 10:15 AM
> *To:* Alexander Vainshtein <Alexander.Vainshtein@rbbn.com>
> *Cc:* Stewart Bryant <stewart.bryant@gmail.com>; rtgwg@ietf.org;
> rtgwg-chairs <rtgwg-chairs@ietf.org>;
> draft-ietf-rtgwg-segment-routing-ti-lfa@ietf.org; Gyan Mishra <
> hayabusagsm@gmail.com>
> *Subject:* Re: [EXTERNAL] draft-ietf-rtgwg-segment-routing-ti-lfa : A
> simple pathological network fragment
>
>
>
> As far as I can see SR Microloop avoidance is RFC5715 Section 6.2 nearside
> tunnelling.
>
>
>
> That works unconditionally regardless of the Ti-LFA constraints.
>
>
>
> My point is that as soon as you recognise the need to introduce one of the
> RFC5715 micro loop avoidance methods you admit that TiLFA does not
> unconditionally address micro loops and thus the TiLFA repair topology
> constraint is no longer REQUIRED. An implementor may chose to do it, but it
> becomes OPTIONAL.
>
>
>
> I think that this needs a discussion chaired by the RTGWG chairs either
> during IETF 118 or at a side meeting or at an online interim.
>
>
>
> Stewart
>
>
>
> On 2 Nov 2023, at 07:55, Alexander Vainshtein <
> Alexander.Vainshtein@rbbn.com> wrote:
>
>
>
> Stewart and all,
>
> Please see some comments *inline below*.
>
>
>
> Regards,
>
> Sasha
>
>
>
> *From:* Stewart Bryant <stewart.bryant@gmail.com>
> *Sent:* Thursday, November 2, 2023 9:30 AM
> *To:* Gyan Mishra <hayabusagsm@gmail.com>
> *Cc:* Stewart Bryant <stewart.bryant@gmail.com>; Alexander Vainshtein <
> Alexander.Vainshtein@rbbn.com>; rtgwg@ietf.org; rtgwg-chairs <
> rtgwg-chairs@ietf.org>; draft-ietf-rtgwg-segment-routing-ti-lfa@ietf.org
> *Subject:* [EXTERNAL] Re: draft-ietf-rtgwg-segment-routing-ti-lfa : A
> simple pathological network fragment
>
>
>
> Let me ask a fundamental question.
>
>
>
> The whole point of Ti-LFA as sold to the community was that repairing
> along the post convergence path as opposed to repairing along a convenient
> temporary path avoided micro loops.
>
> *[[Sasha]] To the best of my understanding, repairing along the
> post-convergence path prevents micro-loops on these paths due to
> distributed IGP convergence – as long as traffic follows these paths.  It
> does not – and, obviously, cannot, have any impact on micro-loops happening
> elsewhere due to distributed IGP convergence – neither prevents micro-loops
> that would form without TI-LFA, nor introduces any new ones.*
>
> The repair path constraint and subsequent segment optimisations add
> complexity to the calculation of the path.
>
>
>
> Are we are now saying that micro loops can form elsewhere and as a
> consequence we need a micro loop avoidance strategy?
>
> *[[Sasha]] TI-LFA, same as any other form of LFA that I am aware of,
> handles just link/node **failures**.  Micro-loops can happen – and, from
> my experience, frequently DO happen – during **repair** of a failed
> link/node.  IMHO and FWIW this alone justifies the need for the micro-loop
> avoiding strategy/solution. *
>
>
>
> If so the fundamental premise behind TiLFA is broken and the repair can
> simply become: use SR to expeditiously route the packets into Q space and
> run a micro loop avoidance strategy. This approach removes the complexity
> of constraining the repair to the post convergence path. *[[Sasha]]
> Please see my previous comment about TI-LFA paths being micro-loop avoidant
> because they are post-convergence paths.  In other words, one possible
> micro-loop avoidance strategy is usage of post-convergence paths in the
> transient period – and this, in a nutshell, is what the SR Micro-Loop
> Avoidance draft
> <https://datatracker.ietf.org/doc/html/draft-bashandy-rtgwg-segment-routing-uloop-15> is
> about (no offence intended). *
>
>
>
> Of course an implementor could cheat and just used the simplified strategy
> I describe above and almost certainly very few operators would notice
> because:
>
>
>
> 1) In many cases the two paths would be congruent
>
>
>
> 2) The transient is short and quite difficult to instrument
>
>
>
> 3) Unless there was some security reason or traffic management reason for
> the path constraint few would care.
>
>
>
> I will look at the proposed differences later, but this sounds like it
> should be a topic for discussion in RTGWG before the text is finalised and
> sent the RFC editor.
>
> *[[Sasha]] The RTGWG agenda at IETF-118
> <https://datatracker.ietf.org/meeting/118/materials/agenda-118-rtgwg-02> seems
> tightly packed already. I wonder if a side meeting for such a discussion
> could be set in a way that allows online participation?*
>
>
>
> - Stewart
>
>
>
>
> On 2 Nov 2023, at 05:09, Gyan Mishra <hayabusagsm@gmail.com> wrote:
>
>
>
>
>
> Hi Sasha, Bruno & Stewart
>
>
>
> Thank you for going over my OPSDIR review in detail.
>
>
>
> I am good with the latest updated verbiage that Bruno had given.
>
>
>
> Comments in-line
>
>
>
> On Mon, Oct 23, 2023 at 8:41 AM Alexander Vainshtein <
> Alexander.Vainshtein@rbbn.com> wrote:
>
> Bruno,
>
> Lots of thanks for a prompt and very encouraging response!
>
>
>
> Your version of the text is definitely better than mine, I am all for
> using it.
>
>
>
> As for where the clarifying text could be inserted, I see two options:
>
> ·       A common “Applicability Statement” section (there is no such
> section in the draft)
>
>
>
> ·
>
> ·       A dedicated section on relationship between TI-LFA and
> micro-loops.
>
>     Gyan> I think this option would  be best.  This would fix the existing
> gap on uLoop.  I did mention but not sure if possible- as TI-LFA and uLoop
> are tightly coupled as a overall post convergence solution is it possible
> to combine the drafts and issue another WGLC.  (Question for authors)
>
> In any case, I defer to you and the rest of the authors to decide what, if
> anything should be done for clarifying the relationship between TI-LFA and
> micro-loops.
>
>
>
>
>
> Regards,
>
> Sasha
>
>
>
> *From:* bruno.decraene@orange.com <bruno.decraene@orange.com>
> *Sent:* Monday, October 23, 2023 3:27 PM
> *To:* Alexander Vainshtein <Alexander.Vainshtein@rbbn.com>
> *Cc:* rtgwg@ietf.org; rtgwg-chairs <rtgwg-chairs@ietf.org>;
> draft-ietf-rtgwg-segment-routing-ti-lfa@ietf.org; Stewart Bryant <
> stewart.bryant@gmail.com>
> *Subject:* [EXTERNAL] RE: draft-ietf-rtgwg-segment-routing-ti-lfa : A
> simple pathological network fragment
>
>
>
> Sasha,
>
>
>
> Thanks for the summary and the constructive proposal.
>
> Speaking for myself, this makes sense and I agree.
>
>
>
> Ø  TI-LFA is a local operation applied by the PLR when it detects failure
> of one of its local links. As such,  it does not affect:
>
> o   Micro-loops that appear – or do not appear –on the paths to the
> destination that do not pass thru TI-LFA paths
>
>
>
> As an editorial comment, depending on where such text would be inserted, I
> would propose the following change:
>
> OLD: Micro-loops that appear – or do not appear –
>
> NEW: Micro-loops that appear – or do not appear – as part of the
> distributed IGP convergence [RFC5715]
>
>
>
> Motivation: some reader could wrongly understand that such micro-loops are
> caused by TI-LFA
>
>
>
> Thanks,
>
> Regards,
>
> --Bruno
>
>
>
> Orange Restricted
>
> *From:* Alexander Vainshtein <Alexander.Vainshtein@rbbn.com>
> *Sent:* Sunday, October 22, 2023 4:21 PM
> *To:* DECRAENE Bruno INNOV/NET <bruno.decraene@orange.com>; Stewart
> Bryant <stewart.bryant@gmail.com>
> *Cc:* rtgwg@ietf.org; rtgwg-chairs <rtgwg-chairs@ietf.org>;
> draft-ietf-rtgwg-segment-routing-ti-lfa@ietf.org
> *Subject:* RE: draft-ietf-rtgwg-segment-routing-ti-lfa : A simple
> pathological network fragment
> *Importance:* High
>
>
>
> Bruno, Stewart and all,
>
> I think that most of the things about TI-LFA and micro-loops have been
> said already (if in a slightly different context)  and are mainly
> self-evident.
>
> However, I share the feeling that somehow the relationship between TI-LFA
> and micro-loop avoidance has become somewhat muddled.
>
>
>
> Therefore, I would like to suggest adding some text to the TI-LFA draft
> that clarifies this relationship, e.g., along the following lines:
>
> 1.       TI-LFA is a local operation applied by the PLR when it detects
> failure of one of its local links. As such,  it does not affect:
>
> a.       Micro-loops that appear – or do not appear –on the paths to the
> destination that do not pass thru TI-LFA paths
>
>
>
> i.      As explained in RFC 5714, such micro-loops may result in the
> traffic not reaching the PLR and therefore not following TI-LFA paths
>
>
>
> ii.      Segment Routing may be used for prevention of such micro-loops
> as described in the micro-loop avoidance draft
>
> b.       Micro-loops that appear – or do not appear - when the failed
> link is repaired (*aside: the need for this line is based on personal
> experience**☹*)
>
> 2.       TI-LFA paths are loop-free. What’s more, they follow the
> post-convergence paths, and, therefore, not subject to micro-loops due to
> difference in the IGP convergence times of the nodes thru which they pass
>
> 3.       TI-LFA paths are applied from the moment the PLR detects failure
> of a local link and until IGP convergence at the PLR is completed.
> Therefore, early (relative to the other nodes) IGP convergence at the PLR
> and the consecutive ”early” release of TI-LFA paths may cause micro-loops,
> especially if these paths have been computed using the methods described in
> Section 6.2, 6.3 or 6.4 of the draft. One of the possible ways to prevent
> such micro-loops is local convergence delay (RFC 8333).
>
> 4.       TI-LFA procedures are complementary to application of any
> micro-loop avoidance procedures in the case of link or node failure:
>
> a.       Link or node failure requires some urgent action to restore the
> traffic that passed thru the failed resource. TI-LFA paths are pre-computed
> and pre-installed and therefore suitable for urgent recovery
>
> b.       The paths used in the micro-loop avoidance procedures typically
> cannot be pre-computed.
>
>
>
> Hopefully these notes would be useful.
>
>
>
> Regards,
>
> Sasha
>
>
>
> *From:* rtgwg <rtgwg-bounces@ietf.org> *On Behalf Of *
> bruno.decraene@orange.com
> *Sent:* Thursday, October 19, 2023 7:34 PM
> *To:* Stewart Bryant <stewart.bryant@gmail.com>
> *Cc:* rtgwg@ietf.org; rtgwg-chairs <rtgwg-chairs@ietf.org>;
> draft-ietf-rtgwg-segment-routing-ti-lfa@ietf.org
> *Subject:* [EXTERNAL] RE: draft-ietf-rtgwg-segment-routing-ti-lfa : A
> simple pathological network fragment
>
>
>
> Hi Stewart,
>
>
>
> I agree with you on the technical points, so the first part of your email
> up to “So I think”.
>
>
>
> But I don’t quite follow why you want to mix IGP Convergence issues with
> this Fast ReRoute Solution.
>
> To quote RFC 5714 « IP Fast Reroute Framework”
>
>
>
> In order to reduce packet disruption times to a duration commensurate
>
>    with the failure detection times, two mechanisms may be required:
>
>
>
>    a.  A mechanism for the router(s) adjacent to the failure to rapidly
>
>        invoke a repair path, which is unaffected by any subsequent re-
>
>        convergence.
>
>
>
>    b.  In topologies that are susceptible to micro-loops, a micro-loop
>
>        control mechanism may be required [RFC5715
> <https://datatracker.ietf.org/doc/html/rfc5715>].
>
>
>
>    Performing the first task without the second may result in the repair
>
>    path being starved of traffic and hence being redundant.
>
>
>
> https://datatracker.ietf.org/doc/html/rfc5714#section-4
>
>
>
> I would assume that you agree with the above (as you are an author of this RFC and my guess would be that you wrote that text)
>
>
>
> My point is that there are two different mechanisms involved, in two different time periods:
>
> -     Fast ReRoute (“a”): this is the scope of draft-ietf-rtgwg-segment-routing-ti-lfa
>
> o   Timing: from detection time , to start of the IGP convergence
>
> -     IGP Micro-loop avoidance (“b”)
>
> o   Timing: from start of IGP convergence to end of IGP convergence
>
>
>
> The scope of draft-ietf-rtgwg-segment-routing-ti-lfa is FRR / “a”. IGP
> micro-loop is out of scope. Other documents are proposing solutions for
> this. (and for those Micro-loop documents, FRR is similarly out of scope).
>
>
>
> Personally I agree with you that both mechanisms are needed. But I think
> that this is already highlighted in RFC 5714, and that this is no different
> than RFC 7490 (RLFA). Therefore, I don’t see why the outcome/text should be
> different. Hence my proposition to reuse the text from RFC 7490 (RLFA). I
> find it adequate. You wrote it so probably find it adequate.
>
>
>
> On a side note, RFC5715, that you also wrote, seems to already cover what you are asking for. Quoting the abstract, it
>
>       provides a summary of the causes and consequences of
>
>    micro-loops and enables the reader to form a judgement on whether
>
>    micro-looping is an issue that needs to be addressed in specific
>
>    networks.
>
>
>
> Note that this RFC5715 is already cited in the proposed text.
>
>
>
> PS: If you were ready to wrote a 5715bis, I would support this.
>
>
>
> Best regards,
>
> --Bruno
>
>
>
>
>
> Orange Restricted
>
> *From:* Stewart Bryant <stewart.bryant@gmail.com>
> *Sent:* Tuesday, October 17, 2023 1:48 PM
> *To:* DECRAENE Bruno INNOV/NET <bruno.decraene@orange.com>
> *Cc:* Stewart Bryant <stewart.bryant@gmail.com>; rtgwg@ietf.org;
> rtgwg-chairs <rtgwg-chairs@ietf.org>;
> draft-ietf-rtgwg-segment-routing-ti-lfa@ietf.org
> *Subject:* Re: draft-ietf-rtgwg-segment-routing-ti-lfa : A simple
> pathological network fragment
>
>
>
> Hi Bruno
>
>
>
> I was thinking about this some more. It is something that was recognised
> in the early days, but somewhat swept aside.
>
>
>
> The case that Gyan bought up was an ECMP case, but I fear that the case is
> more common and I think we should characterise it as part of the text
> rather that giving the impression it is unusual.
>
>
>
> I think the problem occurs whenever there are two or more nodes between
> the point of packet entry and the failure.
>
>
>
> CE1 - R1 - R2 - R3 - R4 -/- R5 - CE2
>
>       |                     |
>
>       R6 - R7 - R8 - R9 — R10
>
>
>
> The normal path CE1-CE2 is via R2
>
>
>
> When R4-R5 fails it is trivial to see how the repair works with R7 as the
> entry into Q space.
>
>
>
> However unless R1, R2,  R3 converge in that order there will be microloops
> for traffic entering via any of those three nodes.
>
>
>
> So I think we can say that unless the PLR is only receiving traffic to be
> protected directly or from its immediate neighbour it is not guaranteed
> that there  will not be micro loops that are not addressable by the propose
> strategy of aligning the repair path with the post convergence path.
>
>
>
> Now thinking about the text you have below, I think we need to write in in
> terms of - Unless the operator is certain that no micro loops will form
> over any path the protected traffic will traverse between entry to the
> network and arrival at the PLR a micro loop avoidance method MUST be
> deployed. Of course I think that it would be helpful to the operator
> community for the text to provide some guidance on how to ascertain whether
> there is a danger of the formation of micro loops.
>
>
>
> I would note that the long chains of nodes show in the example above were
> probably not present in the test topologies which as I remember were all
> national scale provider networks, but unless we provide guidance otherwise
> Ti-LFA could reasonably be deployed in edge networks and in the case of
> cell systems these are often ring topologies.
>
>
>
> So I think we need to agree (as a WG) on the constrains that we are
> prepared to specify in the text and the degree of warning we need to
> provide to the operator community and then we can polish the text below.
>
>
>
> Best regards
>
>
>
> Stewart
>
>
>
>
>
>
>
>
>
> On 16 Oct 2023, at 17:25, bruno.decraene@orange.com wrote:
>
>
>
> Hi Stewart,
>
>
>
> Please see inline
>
>
>
>
>
> Orange Restricted
>
> *From:* Stewart Bryant <stewart.bryant@gmail.com>
> *Sent:* Monday, October 16, 2023 2:08 PM
> *To:* rtgwg@ietf.org; rtgwg-chairs <rtgwg-chairs@ietf.org>;
> draft-ietf-rtgwg-segment-routing-ti-lfa@ietf.org
> *Cc:* Stewart Bryant <stewart.bryant@gmail.com>
> *Subject:* draft-ietf-rtgwg-segment-routing-ti-lfa : A simple
> pathological network fragment
>
>
>
> During the operations directorate early review
> of draft-ietf-rtgwg-segment-routing-ti-lfa
> Gyan Mishra points to a simple pathological network fragment that I think
> deserves wider discussion.
>
>
>
>
> https://datatracker.ietf.org/doc/review-ietf-rtgwg-segment-routing-ti-lfa-11-opsdir-early-mishra-2023-08-25/
>
>
>
> I am not aware of any response to the RTGWG by the draft authors
> concerning the review comment and I cannot see obvious new text addressing
> this concern.
>
>
> The fragment is as follows
>
>
> CE1 –R1- R2-/-R3-CE2
>      |         |
>      R4 – R5 -R6
>
> In the pre converged network R4 is ECMP CE2 via R5 (cost 4) and via R1
> (cost also 4).
>
> We can easily build a TI-LFA repair path from R2 under link failure to CE2
> (so long as we remember that R4 is an ECMP path to CE2), but the problem
> occurs during convergence. If R1 converges before R4, R4 may ECMP packets
> addressed to CE2 back to R1 in a micro loop. Meanwhile since no packets for
> R3 are reaching R2 the Ti-LFA repair is not doing anything useful.
>
> The Ti-LFA text leads the reader to conclude that it is a loop-free
> solution, but gives no guidance on how to determine when this assumption
> breaks down. There is an informational reference to
> draft-bashandy-rtgwg-segment-routing-uloop, but this short individual
> draft does little in the way of helping the reader determine when  loop
> avoidance strategy needs to be deployed and the loop-free approach it
> describes does not seem to be fully developed.
>
>
>
> I am worried that proceeding with the Ti-LFA draft without noting that
> there is a real risk that simple network fragments can micoloop, and
> providing a fully formed mitigation strategy is a disservice to the
> operator community given the industry interest in Ti-LDA and the insidious
> nature of unexpected micro loop network transients, I am wondering what the
> view of the working group is on how to proceed.
>
>
>
> One approach would be for the Ti-LFA draft to incorporate detailed
> guidance on how to determine the risk of a micro loop in a specific
> operator network, and to provide specific mitigation advice. Another
> approach would be to  reference a developed loop avoidance strategy and
> recommending its preemptive deployment. Another approach would be to make
> draft-bashandy-rtgwg-segment-routing-uloop a normative reference and tie
> the fate of the two drafts. Another approach would be to elaborate on the
> risks and their manifestations but declare it a currently unsolved problem.
> I am sure there are other options that the WG may formulate.
>
>
>
> What is the opinion of the working group on how we should proceed
> with draft-ietf-rtgwg-segment-routing-ti-lfa when considering the possible
> formation of micro loops?
>
>
>
> FRR takes place between the failure (detection) and the IGP reconvergence.
> Those are two consecutive steps that the WG has so far addressed with
> different solutions and documents.
>
> That’s not new and that’s not specific to TI-LFA. E.g., that’s applicable
> to RLFA.
>
>
>
> Would the below text, taken verbatim from RFC 7490 (RLFA), work for you?
>
>