Re: [Last-Call] [mpls] Rtgdir last call review of draft-ietf-mpls-p2mp-bfd-06

Greg Mirsky <gregimirsky@gmail.com> Sat, 02 March 2024 20:12 UTC

Return-Path: <gregimirsky@gmail.com>
X-Original-To: last-call@ietfa.amsl.com
Delivered-To: last-call@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 16431C14F5F6; Sat, 2 Mar 2024 12:12:32 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -7.105
X-Spam-Level:
X-Spam-Status: No, score=-7.105 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, FREEMAIL_FROM=0.001, HTML_MESSAGE=0.001, RCVD_IN_DNSWL_HI=-5, RCVD_IN_ZEN_BLOCKED_OPENDNS=0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, T_SCC_BODY_TEXT_LINE=-0.01, URIBL_DBL_BLOCKED_OPENDNS=0.001, URIBL_ZEN_BLOCKED_OPENDNS=0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com
Received: from mail.ietf.org ([50.223.129.194]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id AFI4eeYt6-Eg; Sat, 2 Mar 2024 12:12:28 -0800 (PST)
Received: from mail-yw1-x112b.google.com (mail-yw1-x112b.google.com [IPv6:2607:f8b0:4864:20::112b]) (using TLSv1.3 with cipher TLS_AES_128_GCM_SHA256 (128/128 bits) key-exchange X25519 server-signature RSA-PSS (2048 bits) server-digest SHA256) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 0D390C14F6B5; Sat, 2 Mar 2024 12:12:23 -0800 (PST)
Received: by mail-yw1-x112b.google.com with SMTP id 00721157ae682-60821136c5aso23697007b3.1; Sat, 02 Mar 2024 12:12:22 -0800 (PST)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20230601; t=1709410342; x=1710015142; darn=ietf.org; h=cc:to:subject:message-id:date:from:in-reply-to:references :mime-version:from:to:cc:subject:date:message-id:reply-to; bh=moHXjwk5okF7G8gIfFxhNxDs8ZoUbe3d3YgcmzZmtHc=; b=noGODNetCSQyvxLT6QlSSugyWwNdEeQNDDDHTnHIE4TNFjn/gYTxApHxVY1ihJbwRW CP4ShWktVxkle1tI/DdR8Rlad2qSQmjwh2PDpU2+vmSbg7pDfQe4j34YnHuIZ6ZiNc18 RF1VbhHCWg5qJBQlKz8eJ2rGy6fNsW/39qMTHCrngUg1w4tMWe0RyXK67ZMsMyaYqQIb v59pILQpyy/xuxs33+0zavjJFJ9tYRFXIQ6/7jtIC7uXR9NR+9bqWjypHYtkzDy5ApRP d9S8sE/pPEvPaWrJBNQz4JvSw9DuNRRys31tHO0mPTrqrhE5wBR8Cf7mvB4dhQmH71py n3ag==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1709410342; x=1710015142; h=cc:to:subject:message-id:date:from:in-reply-to:references :mime-version:x-gm-message-state:from:to:cc:subject:date:message-id :reply-to; bh=moHXjwk5okF7G8gIfFxhNxDs8ZoUbe3d3YgcmzZmtHc=; b=ceLyhIZHTFl7VQBCUZ2riTMrSwncy4Qw0H8zIyOOZ0w5exJw+BQaXCrV1htFAYnby6 wgTmslIFsD8Ys9Y1VvD8IPRAqoyt5FIPbJnLQ3XcMsccKufirBUkYhgnt51uEtyvtL0a n9fX9dPRG4dpBqxffcoCIMlqvKKY7xWgrcBgXDXtchWQU5wtUA7uqZvnb5rNdi0mzykk fEu6OHFBto6tjWYHap24ajHXo1gN7q9ZrD0kyCzAn71VyAn1D8neCvOlewQvcWlCZ5dV HsT4Y70BB64nZTqBcq/zbTQClNR/9B/KwqnbdE4fOoNSrCMF2yKmdpu1E/8xZb/ZtiqD 6cvQ==
X-Forwarded-Encrypted: i=1; AJvYcCX+Oq744LlvfeaAoY8JZNJRP3QI36l4E+WH5AaOVTgox6JsY7Wl148TqbPX0rylChF9JlDffAnLp5Kbk1PAi7Lcms0GXFV2XXk8+oa1iDLGg8U0ORQnCP4K+ZNWLuTQzMeDHQ4zfJJtWsZqj4liJotyjc786bXgn24=
X-Gm-Message-State: AOJu0Yxt+Dhb2DZGf1hW4VI5EjRMRjzPKL+mASk9xP8lA28WW6G/r1sM b2MkLviA1aqMnvO8zHMSrJ6iUgIHI2XFVvCfB5vM1DKvENmKONkNv6Jx3jLPR1QNxnUS6arypII oVO7WqO9n95RJWXaBmrkGwLJzwruNZAxvDOM=
X-Google-Smtp-Source: AGHT+IGAuchAR74XcApaSadr448UnSbXY1rEb8uDwM9IxqkFHUedWvwEzvpeuo7HQP0fl2LUupBT3ho30JLu6GHD2qU=
X-Received: by 2002:a0d:e905:0:b0:609:513c:a8f7 with SMTP id s5-20020a0de905000000b00609513ca8f7mr5119442ywe.0.1709410341760; Sat, 02 Mar 2024 12:12:21 -0800 (PST)
MIME-Version: 1.0
References: <170864700898.14065.4946299905740369098@ietfa.amsl.com> <CA+RyBmXitJr-57P3y_=pYEqwoHeMo4HKqPKOud-ZZ2dQQb_gGQ@mail.gmail.com> <176e1397-5b01-487f-8ae0-078bfe2f8ee7@joelhalpern.com> <CA+RyBmUMit0oc1MZTnQ0apTM8Wj_ra7Tna5JCwwMbtbKOfgyCQ@mail.gmail.com> <AS2PR02MB8839697CBBD90B7E98B65C38F05A2@AS2PR02MB8839.eurprd02.prod.outlook.com> <CA+RyBmX9ri3xSk2Q88WdJ_bmCA3M2prAcCOEOL83mWe5NV6kNw@mail.gmail.com> <AS2PR02MB883935EE24559D93FA623CD8F0592@AS2PR02MB8839.eurprd02.prod.outlook.com>
In-Reply-To: <AS2PR02MB883935EE24559D93FA623CD8F0592@AS2PR02MB8839.eurprd02.prod.outlook.com>
From: Greg Mirsky <gregimirsky@gmail.com>
Date: Sat, 02 Mar 2024 12:12:11 -0800
Message-ID: <CA+RyBmWyX_DTZ0ADt_UDSeWZ3csCg63E8dx_J3yMT+2CkbB2tg@mail.gmail.com>
To: bruno.decraene@orange.com
Cc: "rtg-dir@ietf.org" <rtg-dir@ietf.org>, "draft-ietf-mpls-p2mp-bfd.all@ietf.org" <draft-ietf-mpls-p2mp-bfd.all@ietf.org>, "last-call@ietf.org" <last-call@ietf.org>, "mpls@ietf.org" <mpls@ietf.org>, Joel Halpern <jmh@joelhalpern.com>
Content-Type: multipart/alternative; boundary="000000000000324f450612b31c82"
Archived-At: <https://mailarchive.ietf.org/arch/msg/last-call/Mwg23oEpiwg_axme3Fu__pwAZ-0>
Subject: Re: [Last-Call] [mpls] Rtgdir last call review of draft-ietf-mpls-p2mp-bfd-06
X-BeenThere: last-call@ietf.org
X-Mailman-Version: 2.1.39
Precedence: list
List-Id: IETF Last Calls <last-call.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/last-call>, <mailto:last-call-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/last-call/>
List-Post: <mailto:last-call@ietf.org>
List-Help: <mailto:last-call-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/last-call>, <mailto:last-call-request@ietf.org?subject=subscribe>
X-List-Received-Date: Sat, 02 Mar 2024 20:12:32 -0000

Hi Bruno,
thank you for sharing your thoughts. I am not sure that increasing the
range for jittering the transmission would not result in the increase of
the rate of the BFD Control packets transmitted by an egress LSR (leave)
that detected the failure of a p2mp LSP. I've added a new text in Section 5
with, what I consider, minimalistic jittering mechanism:
NEW TEXT:
   *  these BFD Control packets are transmitted at the rate of one per
      second until either it receives a control packet valid for this
      BFD session with the Final (F) bit set from the ingress LSR or the
      defect condition clears.  However, to improve the likelihood of
      notifying the ingress LSR of the failure of the p2mp MPLS LSP, the
      egress LSR SHOULD initially transmit three BFD Control packets
      defined above in short succession.  The actual transmission of the
      periodic BFD Control message MUST be jittered by up to 25% within
      one-second intervals.  Thus, the interval MUST be reduced by a
      random value of 0 to 25%, to reduce the possibility of congestion
      on the ingress LSR's data and control planes.

I am looking forward to more comments and continued discussion with other
experts.

Regards,
Greg

On Tue, Feb 27, 2024 at 5:03 AM <bruno.decraene@orange.com> wrote:

> Hi Greg,
>
>
>
> Thanks for considering my comment and for your reply.
>
> I’m not following the draft but a priori my understand is that the
> reporting from the egress to the root may happen at the discretion of the
> egress, with no constraint in term of respecting any timing. If so, I don’t
> see why we would need to be within 75% of a specific time limit. We could a
> priori choose any value for this time limit (configured on the egress or
> any default value) and pick a random number within 0% to 100% of this
> limit. But you are likely to know much better than me, so up to you.
>
>
>
> Regards,
>
> --Bruno
>
>
>
> *From:* Greg Mirsky <gregimirsky@gmail.com>
> *Sent:* Monday, February 26, 2024 6:05 PM
> *To:* DECRAENE Bruno INNOV/NET <bruno.decraene@orange.com>
> *Cc:* rtg-dir@ietf.org; draft-ietf-mpls-p2mp-bfd.all@ietf.org;
> last-call@ietf.org; mpls@ietf.org; Joel Halpern <jmh@joelhalpern.com>
> *Subject:* Re: [mpls] Rtgdir last call review of
> draft-ietf-mpls-p2mp-bfd-06
>
>
>
> Hi Bruno,
>
> thank you for your interest and the suggestion. BFD spec (RFC 5880)
> includes the mechanism intended to avoid synchronization of BFD Control
> messages:
>
>    The periodic transmission of BFD Control packets MUST be jittered on
>
>    a per-packet basis by up to 25%, that is, the interval MUST be
>
>    reduced by a random value of 0 to 25%, in order to avoid self-
>
>    synchronization with other systems on the same subnetwork.  Thus, the
>
>    average interval between packets will be roughly 12.5% less than that
>
>    negotiated.
>
> Do you think that the same randomization mechanism applied to the
> transmission of notifications to the root of p2mp LSP would be useful?
>
>
>
> Regards,
>
> Greg
>
>
>
> On Mon, Feb 26, 2024 at 2:45 AM <bruno.decraene@orange.com> wrote:
>
> Hi Greg,
>
>
>
> My 2 cents (not following the draft).
>
> Another typical option may be to allow the network operator to configure,
> on the egress, an acceptable delay before reporting to the root. The egress
> would then pick a random value in this range. Statiscally, the more egress
> the more spread the reports to the root, which a priori would be good for
> scaling.
>
> It would be up to the network operator to configure the right delay
> depending on the number of the leaves and the need for fast reporting (or
> not).
>
>
>
> Totally up to you, but that would have my vote as this is a typical issue.
> (granted this is more likely an issue with protocols handling thousands of
> customers, but even for MPLS LSR scaling, RSVP-TE scaling issues are not
> unheard)
>
>
>
> Regards,
>
> --Bruno
>
>
>
> *From:* mpls <mpls-bounces@ietf.org> *On Behalf Of *Greg Mirsky
> *Sent:* Sunday, February 25, 2024 12:25 AM
> *To:* Joel Halpern <jmh@joelhalpern.com>
> *Cc:* rtg-dir@ietf.org; draft-ietf-mpls-p2mp-bfd.all@ietf.org;
> last-call@ietf.org; mpls@ietf.org
> *Subject:* Re: [mpls] Rtgdir last call review of
> draft-ietf-mpls-p2mp-bfd-06
>
>
>
> Hi Joel,
>
> thank you for the clarification. My idea is to use a rate limiter at the
> root of the p2mp LSP that may receive notifications from the leaves
> affected by the failure. I imagine that the threshold of the rate limiter
> might be exceeded and the notifications will be discarded. As a result,
> some notifications will be processed by the headend of the p2mp BFD session
> later, as the tails transmit notifications periodically until the receive
> the BFD Control message with the Final flag set.  Thus, we cannot avoid the
> congestion but mitigate the negative effect it might cause by extending the
> convergence. Does that make sense?
>
>
>
> Regards,
>
> Greg
>
>
>
> On Sat, Feb 24, 2024 at 2:39 PM Joel Halpern <jmh@joelhalpern.com> wrote:
>
> That covers part of my concern.  But....  A failure near the root means
> that a lot of leaves will see failure, and they will all send notifications
> converging on the root.  Those notifications themselves, not just the final
> messages, seem able to cause congestion.  I am not sure what can be done
> about it, but we aren't allowed to ignore it.
>
> Yours,
>
> Joel
>
> On 2/24/2024 3:34 PM, Greg Mirsky wrote:
>
> Hi Joel,
>
> thank you for your support of this work and the suggestion. Would the
> following update of the last paragraph of Section 5 help:
>
> OLD TEXT:
>
>    An ingress LSR that has received the BFD Control packet, as described
>
>    above, sends the unicast IP/UDP encapsulated BFD Control packet with
>
>    the Final (F) bit set to the egress LSR.
>
> NEW TEXT:
>
>    As described above, an ingress LSR that has received the BFD Control
>
>    packet sends the unicast IP/UDP encapsulated BFD Control packet with
>
>    the Final (F) bit set to the egress LSR.  In some scenarios, e.g.,
>
>    when a p2mp LSP is broken close to its root, and the number of egress
>
>    LSRs is significantly large, the control plane of the ingress LSR
>
>    might be congested by the BFD Control packets transmitted by egress
>
>    LSRs and the process of generating unicast BFD Control packets, as
>
>    noted above.  To mitigate that, a BFD implementation that supports
>
>    this specification is RECOMMENDED to use a rate limiter of received
>
>    BFD Control packets passed to processing in the control plane of the
>
>    ingress LSR.
>
>
>
> Regards,
>
> Greg
>
>
>
> On Thu, Feb 22, 2024 at 4:10 PM Joel Halpern via Datatracker <
> noreply@ietf.org> wrote:
>
> Reviewer: Joel Halpern
> Review result: Ready
>
> Hello,
>
> I have been selected as the Routing Directorate reviewer for this draft.
> The
> Routing Directorate seeks to review all routing or routing-related drafts
> as
> they pass through IETF last call and IESG review, and sometimes on special
> request. The purpose of the review is to provide assistance to the Routing
> ADs.
> For more information about the Routing Directorate, please see
> https://wiki.ietf.org/en/group/rtg/RtgDir
>
> Although these comments are primarily for the use of the Routing ADs, it
> would
> be helpful if you could consider them along with any other IETF Last Call
> comments that you receive, and strive to resolve them through discussion
> or by
> updating the draft.
>
> Document: draft-name-version
> Reviewer: your-name
> Review Date: date
> IETF LC End Date: date-if-known
> Intended Status: copy-from-I-D
>
> Summary:  This document is ready for publication as a Proposed Standard.
>     I do have one question that I would appreciate being considered.
>
> Comments:
>     The document is clear and readable, with careful references for those
>     needing additional details.
>
> Major Issues: None
>
> Minor Issues:
>     I note that the security considerations (section 6) does refer to
>     congestion issues caused by excessive transmission of BFD requests.   I
>     wonder if section 5 ("Operation of Multipoint BFD with Active Tail over
>     P2MP MPLS LSP") should include a discussion of the congestion
> implications
>     of multiple tails sending notifications at the rate of 1 per second to
> the
>     head end, particularly if the failure is near the head end.  While I
>     suspect that the 1 / second rate is low enough for this to be safe,
>     discussion in the document would be helpful.
>
> ____________________________________________________________________________________________________________
>
> Ce message et ses pieces jointes peuvent contenir des informations confidentielles ou privilegiees et ne doivent donc
>
> pas etre diffuses, exploites ou copies sans autorisation. Si vous avez recu ce message par erreur, veuillez le signaler
>
> a l'expediteur et le detruire ainsi que les pieces jointes. Les messages electroniques etant susceptibles d'alteration,
>
> Orange decline toute responsabilite si ce message a ete altere, deforme ou falsifie. Merci.
>
>
>
> This message and its attachments may contain confidential or privileged information that may be protected by law;
>
> they should not be distributed, used or copied without authorisation.
>
> If you have received this email in error, please notify the sender and delete this message and its attachments.
>
> As emails may be altered, Orange is not liable for messages that have been modified, changed or falsified.
>
> Thank you.
>
> ____________________________________________________________________________________________________________
> Ce message et ses pieces jointes peuvent contenir des informations confidentielles ou privilegiees et ne doivent donc
> pas etre diffuses, exploites ou copies sans autorisation. Si vous avez recu ce message par erreur, veuillez le signaler
> a l'expediteur et le detruire ainsi que les pieces jointes. Les messages electroniques etant susceptibles d'alteration,
> Orange decline toute responsabilite si ce message a ete altere, deforme ou falsifie. Merci.
>
> This message and its attachments may contain confidential or privileged information that may be protected by law;
> they should not be distributed, used or copied without authorisation.
> If you have received this email in error, please notify the sender and delete this message and its attachments.
> As emails may be altered, Orange is not liable for messages that have been modified, changed or falsified.
> Thank you.
>
>