Re: draft-ietf-rtgwg-segment-routing-ti-lfa : A simple pathological network fragment

Stewart Bryant <stewart.bryant@gmail.com> Wed, 08 November 2023 08:37 UTC

Return-Path: <stewart.bryant@gmail.com>
X-Original-To: rtgwg@ietfa.amsl.com
Delivered-To: rtgwg@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 38F6DC1C02B5; Wed, 8 Nov 2023 00:37:47 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.106
X-Spam-Level:
X-Spam-Status: No, score=-2.106 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, FREEMAIL_FROM=0.001, HTML_MESSAGE=0.001, RCVD_IN_DNSWL_BLOCKED=0.001, RCVD_IN_ZEN_BLOCKED_OPENDNS=0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, T_SCC_BODY_TEXT_LINE=-0.01] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com
Received: from mail.ietf.org ([50.223.129.194]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id rYtny02SbPEU; Wed, 8 Nov 2023 00:37:42 -0800 (PST)
Received: from mail-lj1-x231.google.com (mail-lj1-x231.google.com [IPv6:2a00:1450:4864:20::231]) (using TLSv1.3 with cipher TLS_AES_128_GCM_SHA256 (128/128 bits) key-exchange X25519 server-signature RSA-PSS (2048 bits) server-digest SHA256) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id C3D94C1CAB31; Wed, 8 Nov 2023 00:37:42 -0800 (PST)
Received: by mail-lj1-x231.google.com with SMTP id 38308e7fff4ca-2c6b30aca06so87087281fa.3; Wed, 08 Nov 2023 00:37:42 -0800 (PST)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20230601; t=1699432660; x=1700037460; darn=ietf.org; h=references:to:cc:in-reply-to:date:subject:mime-version:message-id :from:from:to:cc:subject:date:message-id:reply-to; bh=L0kFh5inQi2h0lEz2BMY2uNItE/JfHsGKcEB8hE1iFo=; b=fF6ozHU71TyD2ANSr3OtCeKf3uCNY4jNV8LGkRZpxvSddkIxi8TmCDIFeS4CJgyZn9 njZXfAI4acHjWQl7MRkeMWzV9ncycrpO/TTrEJf18X4jyF4AzRyORLjpQ1z7cWVb4cqi JgrGbm7N2O8I6d81QfEKV76gvu6i2YtJXFfTFPG5m/6qhLyIHkSNXubVxtXeVhG5/cl6 GXdfRDJIU9Bx1pgdhCUw8pE9EaTw+1+EjU1/bDAo3Ak5Q88Q+2thbfFyvsgMgWpCCGuX k4MePfEUW7jef4UB33vdT5V2uBpTo5cXDC7Bw7e0CBfYy+RzJdxQbssVasTGpqx+fogP h78w==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1699432660; x=1700037460; h=references:to:cc:in-reply-to:date:subject:mime-version:message-id :from:x-gm-message-state:from:to:cc:subject:date:message-id:reply-to; bh=L0kFh5inQi2h0lEz2BMY2uNItE/JfHsGKcEB8hE1iFo=; b=Vsc8P2f1VxjRXWAlVIaFX8ePNS9+vbNiYpLQyQKKhl5ZmWGTw1hFvOloIDoLCKdYgK IN1uYahXldmKgh5WYhU1CaKy3bwiWKZ2BFWjC+zDE7Aaeh+kMD6Y/TR0HXkRmUN0Cqov K/yhKP21+aiWP3zKRnLQ+ubis6OGK7xKrZnOaDUo18L1aJ+N8KNcVUrxEkVu/azNqQ98 AA3k4hGj7gDJvKD6xwxTwXyIRPJ/W2stzY+SrolntPikc7/puv+BwWtYo43bO+DTz3MU YUB5Gb71Xg43HfMlLcCFcJBQ5CY/vVwAWWCTmmSMf0iCpbO8a6qvv6MeNSOMmhQUzLO0 xkyQ==
X-Gm-Message-State: AOJu0YxYc+qhezNW24KW4WUIGuv/VHxMplWZ04wGIFm3cTlOsIbgU1Gf s4MuxCmNosNMIOqIwZeP4qf4YmWJyR4=
X-Google-Smtp-Source: AGHT+IGx0tjtjkzfqP/YSsv884G8ioGn/p654IMueq9wrJG3pX/4HUwHqKCcIP8qwIa3Vz5dvUgCQw==
X-Received: by 2002:a2e:a593:0:b0:2c6:f134:79b6 with SMTP id m19-20020a2ea593000000b002c6f13479b6mr1308524ljp.16.1699432659810; Wed, 08 Nov 2023 00:37:39 -0800 (PST)
Received: from smtpclient.apple ([85.255.232.214]) by smtp.gmail.com with ESMTPSA id j8-20020a05600c1c0800b004063977eccesm18668427wms.42.2023.11.08.00.37.38 (version=TLS1_2 cipher=ECDHE-ECDSA-AES128-GCM-SHA256 bits=128/128); Wed, 08 Nov 2023 00:37:39 -0800 (PST)
From: Stewart Bryant <stewart.bryant@gmail.com>
Message-Id: <34B1B7D3-E65C-4661-A460-B75797714F2C@gmail.com>
Content-Type: multipart/alternative; boundary="Apple-Mail=_02BB8DDF-40E0-49CB-8DCB-EBA4A49EE4EC"
Mime-Version: 1.0 (Mac OS X Mail 16.0 \(3774.100.2.1.4\))
Subject: Re: draft-ietf-rtgwg-segment-routing-ti-lfa : A simple pathological network fragment
Date: Wed, 08 Nov 2023 08:37:27 +0000
In-Reply-To: <CABNhwV1ud2RyH_hCb1NOtBWiQ15e5P6Qx0mvrgs7h+tS6PyS=w@mail.gmail.com>
Cc: Stewart Bryant <stewart.bryant@gmail.com>, Ahmed Bashandy <abashandy.ietf@gmail.com>, Alexander Vainshtein <Alexander.Vainshtein@rbbn.com>, "rtgwg@ietf.org" <rtgwg@ietf.org>, "draft-ietf-rtgwg-segment-routing-ti-lfa@ietf.org" <draft-ietf-rtgwg-segment-routing-ti-lfa@ietf.org>, rtgwg-chairs <rtgwg-chairs@ietf.org>
To: Gyan Mishra <hayabusagsm@gmail.com>
References: <9908D9F3-45C6-497D-B3BF-84D8A68A5013@gmail.com> <AS2PR02MB88395D3114B0DEE583BEEF65F0D7A@AS2PR02MB8839.eurprd02.prod.outlook.com> <60124119-5847-4F52-8BB8-18398A9BA4AC@gmail.com> <AS2PR02MB8839FB5A5537FC3E9F37A560F0D4A@AS2PR02MB8839.eurprd02.prod.outlook.com> <PH0PR03MB63004F32F9AF282ECDB78637F6D9A@PH0PR03MB6300.namprd03.prod.outlook.com> <AS2PR02MB88393EC50B913A5F8C3AB5E2F0D8A@AS2PR02MB8839.eurprd02.prod.outlook.com> <PH0PR03MB6300D9A7F9DC3E2E864EF11EF6D8A@PH0PR03MB6300.namprd03.prod.outlook.com> <CABNhwV30uhLOo52WHAv6YS4Wg0k9gDbkrs1ANuGPPdLzc1=dsw@mail.gmail.com> <ef40ab1f-90b3-56d2-4d22-02a8eaab3ee0@gmail.com> <CABNhwV1ud2RyH_hCb1NOtBWiQ15e5P6Qx0mvrgs7h+tS6PyS=w@mail.gmail.com>
X-Mailer: Apple Mail (2.3774.100.2.1.4)
Archived-At: <https://mailarchive.ietf.org/arch/msg/rtgwg/KIAQPtdha0mCsGw2fTComRGLsjw>
X-BeenThere: rtgwg@ietf.org
X-Mailman-Version: 2.1.39
Precedence: list
List-Id: Routing Area Working Group <rtgwg.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/rtgwg>, <mailto:rtgwg-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/rtgwg/>
List-Post: <mailto:rtgwg@ietf.org>
List-Help: <mailto:rtgwg-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/rtgwg>, <mailto:rtgwg-request@ietf.org?subject=subscribe>
X-List-Received-Date: Wed, 08 Nov 2023 08:37:47 -0000


> On 8 Nov 2023, at 05:18, Gyan Mishra <hayabusagsm@gmail.com> wrote:
> 
> 
> 
> In the below RLFA RFC 7490 style  loop topology R1, R4, R5 are in the extended P space and  and Q space being R5, R6, R3 and TO-LFA algorithm post convergence path calculated RLFA PQ node being R5.
> 
> Using section 6.4 to build the post convergence repair path using RFC 5715 near side tunneling the repair path is NodeSid(R5), AdjSid(R6). So a near side tunnel is now built from R1 to R6.  
> 
> Looping is not an issue with R4 or R5 in looping packets back to R1 as the repair path is built from R1 to R6, tunneling over any nodes with un-converged FIBs.
> 
> Micro loop problem solved!
> 
> 
> CE1 –R1- R2-/-R3-CE2
>      |         |
>      R4 – R5 -R6

I think that it is important to note that if R1 reconverges first it will send packets to R4 using normal forwarding. However R4 is ECMP to CE2 via R1 which will micro loop back to R4.

At this point the repair is starved and no longer works.

Hence the point that I have been making and I think the point that Gyan originally made.

Without FRR the network converges in its own time and we accept micro loops and traffic discontinuity for an unknown time plus collateral damage to traffic that never used the failed link.

However once we deploy FRR we make a contract with the user that after a short while - of the order of 50ms - productive forwarding will continue uninterrupted. However this is not the case in some topologies (see above) and thus uloop prevention is required.

The thread has become somewhat difficult to follow with time, so I am now not sure what Bono’s text is. It would be helpful if it were repeated. However I think the draft has to say  that in order to warrant that FRR continues to provide traffic continuity until the network is reconverged a uloop strategy is required.

I would note as it is easily forgotten that a uloop strategy is also required when R2-R3 goes back into service. This is because if R4 converges first it will ECMP back to R1 which will send the packet back to R1.

Now we need to be clear that the micro looking is not the fault of the TiLFA design per se, but given that networks will deploy TiLFA with certain traffic continuity expectations we must clearly note to the reader that those expectations may not be met without addressing the uloop problem.

By way of referencing earlier work, RFC5714 does point to RFC5715 stating that a uloop technology is needed. In Section 10 of RFC 7490 the issue of loops is drawn to the attention of the network manager although perhaps with hindsight the text should be stronger.

- Stewart