[bess] Re: [**EXTERNAL**] Re: Re: Inverse multi-layer OAM

Joel Halpern <jmh@joelhalpern.com> Thu, 20 March 2025 03:52 UTC

Return-Path: <jmh@joelhalpern.com>
X-Original-To: bess@mail2.ietf.org
Delivered-To: bess@mail2.ietf.org
Received: from localhost (localhost [127.0.0.1]) by mail2.ietf.org (Postfix) with ESMTP id 86784F44D4E; Wed, 19 Mar 2025 20:52:33 -0700 (PDT)
X-Virus-Scanned: amavisd-new at ietf.org
X-Spam-Flag: NO
X-Spam-Score: -2.796
X-Spam-Level:
X-Spam-Status: No, score=-2.796 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, HTML_MESSAGE=0.001, RCVD_IN_DNSWL_LOW=-0.7, RCVD_IN_MSPIKE_H3=0.001, RCVD_IN_MSPIKE_WL=0.001, RCVD_IN_VALIDITY_RPBL_BLOCKED=0.001, RCVD_IN_VALIDITY_SAFE_BLOCKED=0.001, SPF_PASS=-0.001] autolearn=ham autolearn_force=no
Authentication-Results: mail2.ietf.org (amavisd-new); dkim=pass (1024-bit key) header.d=joelhalpern.com
Received: from mail2.ietf.org ([166.84.6.31]) by localhost (mail2.ietf.org [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id IHHu9HBS_gX2; Wed, 19 Mar 2025 20:52:32 -0700 (PDT)
Received: from mailb2.tigertech.net (mailb2.tigertech.net [208.80.4.154]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature ECDSA (P-256)) (No client certificate requested) by mail2.ietf.org (Postfix) with ESMTPS id 56E08F44D47; Wed, 19 Mar 2025 20:52:32 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=joelhalpern.com; s=2.tigertech; t=1742442751; bh=i8XZSmmcixKmeZ1YkPu2nLSoUwtdgqdll7g6J87GIFQ=; h=Date:Subject:To:Cc:References:From:In-Reply-To:From; b=i0DntGzaSLi0Uig7aoQWZAl3dTqrG399zDaWw/t5b+rCl3J36DmKXCefGnK8lRy8z AprBoNia6aoVeak2Eu7h14L2CWKtEWbWmA6zXIERb2PowLqRy9nT44e2wk6sKnvV2B bN8/IYH86bO9iTvqbJxn4GR2D8/8f4UwRT2yAvpM=
Received: from localhost (localhost [127.0.0.1]) by mailb2.tigertech.net (Postfix) with ESMTP id 4ZJBVR3mSPz1nsn4; Wed, 19 Mar 2025 20:52:31 -0700 (PDT)
X-Quarantine-ID: <9wb-sOev_a0V>
X-Virus-Scanned: Debian amavis at b2.tigertech.net
Received: from [192.168.21.83] (unknown [50.233.136.230]) (using TLSv1.3 with cipher TLS_AES_128_GCM_SHA256 (128/128 bits) key-exchange X25519 server-signature RSA-PSS (2048 bits)) (No client certificate requested) by mailb2.tigertech.net (Postfix) with ESMTPSA id 4ZJBVQ5M4Mz1nskQ; Wed, 19 Mar 2025 20:52:30 -0700 (PDT)
Content-Type: multipart/alternative; boundary="------------3wP3bVFNrVrfHlqPWfGKR0FU"
Message-ID: <236af686-8bcb-480a-944f-552a922e3101@joelhalpern.com>
Date: Wed, 19 Mar 2025 23:52:27 -0400
MIME-Version: 1.0
User-Agent: Mozilla Thunderbird
To: "Shah, Himanshu" <hshah@ciena.com>
References: <CA+RyBmWh8aShHPz4NaJxmtZvoTjhjH2Ecw1SAcE0VD7N3KwNdg@mail.gmail.com> <CAOj+MMGs-vUwV8V2ieGUF9zYFoRcP6q8emnVKhj9o5vJccD4-A@mail.gmail.com> <CA+RyBmWiiHj4eMK_A15cPD7xpuM+cu=GhSG0-JDwZrR0vnTE=A@mail.gmail.com> <991ec11a-8d64-42b1-ab3b-dd20a8765e13@joelhalpern.com> <MN2PR04MB598144DBAF98419520D01299AFD82@MN2PR04MB5981.namprd04.prod.outlook.com>
Content-Language: en-US
From: Joel Halpern <jmh@joelhalpern.com>
In-Reply-To: <MN2PR04MB598144DBAF98419520D01299AFD82@MN2PR04MB5981.namprd04.prod.outlook.com>
Message-ID-Hash: 62Y3K6Q6G7IQOI7NCIZTGLTF3YWGKTMQ
X-Message-ID-Hash: 62Y3K6Q6G7IQOI7NCIZTGLTF3YWGKTMQ
X-MailFrom: jmh@joelhalpern.com
X-Mailman-Rule-Misses: dmarc-mitigation; no-senders; approved; emergency; loop; banned-address; member-moderation; header-match-bess.ietf.org-0; nonmember-moderation; administrivia; implicit-dest; max-recipients; max-size; news-moderation; no-subject; digests; suspicious-header
CC: BESS <bess@ietf.org>, "draft-karboubi-spring-sidlist-optimized-cs-sr@ietf.org" <draft-karboubi-spring-sidlist-optimized-cs-sr@ietf.org>
X-Mailman-Version: 3.3.9rc6
Precedence: list
Subject: [bess] Re: [**EXTERNAL**] Re: Re: Inverse multi-layer OAM
List-Id: BGP-Enabled ServiceS working group discussion list <bess.ietf.org>
Archived-At: <https://mailarchive.ietf.org/arch/msg/bess/M7wnlZ1ybUMaj7LC3Q1xBhtNMb4>
List-Archive: <https://mailarchive.ietf.org/arch/browse/bess>
List-Help: <mailto:bess-request@ietf.org?subject=help>
List-Owner: <mailto:bess-owner@ietf.org>
List-Post: <mailto:bess@ietf.org>
List-Subscribe: <mailto:bess-join@ietf.org>
List-Unsubscribe: <mailto:bess-leave@ietf.org>

At the very least, I would expect to see some explanation of why one 
would also be running TI-LFA.  And probably a discussion of how this 
interacts with the information propagation when the local detection 
kicks in.   I can believe both points can be addressed, but it is hard 
to understand without them.

Yours,

Joel

On 3/19/2025 11:46 PM, Shah, Himanshu wrote:
>
> Disagree.
>
> We have discussed the motivation (for prioritizing e2e protection over 
> local protection) in the draft.
>
> It serves the purpose without having to disable TI-LFA on each node – 
> not a desirable option.
>
> Thanks,
>
> Himanshu
>
> *From: *Joel Halpern <jmh@joelhalpern.com>
> *Date: *Thursday, March 20, 2025 at 10:41 AM
> *To: *Greg Mirsky <gregimirsky@gmail.com>, Robert Raszuk 
> <robert@raszuk.net>
> *Cc: *Shah, Himanshu <hshah@ciena.com>, BESS <bess@ietf.org>, 
> draft-karboubi-spring-sidlist-optimized-cs-sr@ietf.org 
> <draft-karboubi-spring-sidlist-optimized-cs-sr@ietf.org>
> *Subject: *[**EXTERNAL**] Re: [bess] Re: Inverse multi-layer OAM
>
> It seems rather counter-intuitive to want to try to repair things 
> end-to-end faster than one expects local devices to detect local 
> failures.  The implied information race conditions seem an invitation 
> to trouble.
>
> Yours,
>
> Joel
>
> On 3/19/2025 11:14 PM, Greg Mirsky wrote:
>
>     Hi Robert,
>
>     I wholeheartedly agree that local and e2e OAM are complementary
>     tools in an operator's toolbox. Usually, a multi-layer OAM is
>     constructed so that e2e provides the network with a safety net. In
>     that manner, local repair of a link failure is expected to restore
>     services before the failure is detected on the e2e level. As I
>     understand it, the proposal uses a different scheme. According to
>     it, e2e network detection is expected to be more aggressive than
>     the link-level OAM. To me, that's an unusual arrangement.
>
>     As for performance monitoring, although some performance metrics
>     can be measured spatially to compose e2e metrics, e2e performance
>     monitoring is easier to deploy in many environments.
>
>     Regards,
>
>     Greg
>
>     On Wed, Mar 19, 2025 at 11:21 PM Robert Raszuk <robert@raszuk.net>
>     wrote:
>
>         Hi Greg,
>
>         I am very much in support of end to end path assurance. And by
>         assurance I mean not only e2e liveness but also e2e loss,
>         delays, jitter etc ...
>
>         The main reason is that link layer failures (even if done on
>         every link in the path) does not provide any information about
>         transit via network devices. And those can be subject to
>         packet drops, selective packet drops (brownouts), delays and
>         jitter via box fabrics in distributed systems etc ... So to me
>         even if e2e is slower then local link detection it still very
>         much a preferred way to assure end to end path quality.
>
>         Sure some of them is done at the application layer, but then
>         it is done mainly for statistics and reporting. Doing it at
>         network layer opens up possibilities to choose different path
>         (quite likely via different provider) when original path
>         experiences some issues or service degradation which with link
>         by link failure detection is invisible to the endpoints.
>
>         I think at the end of the day those two are not really
>         competing solutions but complimentary. And of course end to
>         end makes sense especially in deployments when you can have
>         diverse paths end to end.
>
>         Cheers
>
>         Robert
>
>         On Wed, Mar 19, 2025 at 4:58 AM Greg Mirsky
>         <gregimirsky@gmail.com> wrote:
>
>             Hi Himanshu,
>
>             Thank you for the presentation of
>             draft-karboubi-spring-sidlist-optimized-cs-sr
>             [datatracker.ietf.org]
>             <https://urldefense.com/v3/__https:/datatracker.ietf.org/doc/draft-karboubi-spring-sidlist-optimized-cs-sr/__;!!OSsGDw!J0VAlRE3z-g7qGfCezoeovWitrC4DFYS65Ly4YZq5r_I8SGk56sle4dQAFwya2R17BHyxx6ecg$>.
>             If I understood your response to Ali correctly, the
>             proposed mechanism is expected to use more
>             aggressive network failure detection than the link layer.
>             If that is correct, I have several questions about the
>             multi-layer OAM:
>
>               * AFAIK link-layer failures are detected within 10 ms
>                 using a connectivity check mechanism (CCM of Y.1731 or
>                 a single-hop BFD) with a 3.3 ms interval.
>               * If the link failure is detectable within 10 ms, what
>                 detection time for the path, i.e., E2E connection
>                 failure detection, is suggested? What interval between
>                 test probes will be used in that case?
>               * Furthermore, even if the path converges around the
>                 link failure before the local protection is deployed,
>                 the link failure will be detected, and the protection
>                 mechanism will be deployed despite the Orchestrator
>                 setting up its recovery path in the network. If
>                 that is correct, local defect detection and protection
>                 are unnecessary overheads. Would you agree?
>
>             Regards,
>
>             Greg
>
>             _______________________________________________
>             BESS mailing list -- bess@ietf.org
>             To unsubscribe send an email to bess-leave@ietf.org
>
>
>
>     _______________________________________________
>
>     BESS mailing list --bess@ietf.org
>
>     To unsubscribe send an email tobess-leave@ietf.org
>