Re: [bess] New Version Notification for draft-wang-bess-mvpn-upstream-df-selection-03.txt

"Chensiyu (Susie)" <chensiyu27@huawei.com> Mon, 13 February 2023 12:11 UTC

Return-Path: <chensiyu27@huawei.com>
X-Original-To: bess@ietfa.amsl.com
Delivered-To: bess@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 3F075C169506 for <bess@ietfa.amsl.com>; Mon, 13 Feb 2023 04:11:27 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -6.897
X-Spam-Level:
X-Spam-Status: No, score=-6.897 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, RCVD_IN_DNSWL_HI=-5, RCVD_IN_MSPIKE_H2=-0.001, RCVD_IN_ZEN_BLOCKED_OPENDNS=0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001, URIBL_DBL_BLOCKED_OPENDNS=0.001, URIBL_ZEN_BLOCKED_OPENDNS=0.001] autolearn=unavailable autolearn_force=no
Received: from mail.ietf.org ([50.223.129.194]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id E0dfJb3cOrnO for <bess@ietfa.amsl.com>; Mon, 13 Feb 2023 04:11:22 -0800 (PST)
Received: from frasgout.his.huawei.com (frasgout.his.huawei.com [185.176.79.56]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 64BF5C169514 for <bess@ietf.org>; Mon, 13 Feb 2023 04:11:22 -0800 (PST)
Received: from lhrpeml500004.china.huawei.com (unknown [172.18.147.201]) by frasgout.his.huawei.com (SkyGuard) with ESMTP id 4PFjkG1LXkz67M3D for <bess@ietf.org>; Mon, 13 Feb 2023 20:06:46 +0800 (CST)
Received: from dggpemm500010.china.huawei.com (7.185.36.134) by lhrpeml500004.china.huawei.com (7.191.163.9) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256) id 15.1.2507.17; Mon, 13 Feb 2023 12:11:18 +0000
Received: from dggpemm500010.china.huawei.com (7.185.36.134) by dggpemm500010.china.huawei.com (7.185.36.134) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256) id 15.1.2507.6; Mon, 13 Feb 2023 20:11:16 +0800
Received: from dggpemm500010.china.huawei.com ([7.185.36.134]) by dggpemm500010.china.huawei.com ([7.185.36.134]) with mapi id 15.01.2507.017; Mon, 13 Feb 2023 20:11:16 +0800
From: "Chensiyu (Susie)" <chensiyu27@huawei.com>
To: "Jeffrey (Zhaohui) Zhang" <zzhang=40juniper.net@dmarc.ietf.org>, "bess@ietf.org" <bess@ietf.org>
CC: duanfanghong <duanfanghong@huawei.com>
Thread-Topic: RE: New Version Notification for draft-wang-bess-mvpn-upstream-df-selection-03.txt
Thread-Index: Adk/pBaqmhl2oULMSOObEobh3xluJA==
Date: Mon, 13 Feb 2023 12:11:16 +0000
Message-ID: <0f169e9431854ba295721855896e691d@huawei.com>
Accept-Language: zh-CN, en-US
Content-Language: zh-CN
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
x-originating-ip: [10.136.128.228]
Content-Type: text/plain; charset="us-ascii"
Content-Transfer-Encoding: quoted-printable
MIME-Version: 1.0
X-CFilter-Loop: Reflected
Archived-At: <https://mailarchive.ietf.org/arch/msg/bess/tPXYEtwZwzAZS4App_bqfmSpcSw>
Subject: Re: [bess] New Version Notification for draft-wang-bess-mvpn-upstream-df-selection-03.txt
X-BeenThere: bess@ietf.org
X-Mailman-Version: 2.1.39
Precedence: list
List-Id: BGP-Enabled ServiceS working group discussion list <bess.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/bess>, <mailto:bess-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/bess/>
List-Post: <mailto:bess@ietf.org>
List-Help: <mailto:bess-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/bess>, <mailto:bess-request@ietf.org?subject=subscribe>
X-List-Received-Date: Mon, 13 Feb 2023 12:11:27 -0000

Hi, Jeffrey
Thank you very much for your feedback on our draft. We will update a new version of the draft later based on our discussions. Our replies to all suggestions are as follow.
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
#1.
   For the "hot root standby"
   mechanism, all the ingress PEs, regardless of the primary or standby
   role, forward (C-S,C-G) flow to other PEs through a P-tunnel, forcing
   the egress PEs to discard all but one, which will cause the steady
   traffic redundancy throughout the backbone network.  In the scenarios
   of enterprise networks crossing provider networks, the waste of
   bandwidth is a crucial issue causing the increasement of the OpEx of
   enterprise networks, and the "warm root standby" mechanism is
   expected to be a better solution.
> Suggestion : The mentioning of Hot Root Standby (HRS) should be removed.
It is specifically designed for scenarios where it is critical for egress PEs to receive but discard duplicates and BW consumption is not a concern. Where BW is a concern, the Warm Root Standby
(WRS) can be used.
This draft should only compare to WRS.

>> Reply : We will modify the first paragraph of the Introduction Section in the new version.
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
#2.
   However, there are some problems
   when deploying the "warm root standby" mechanism described in
   [RFC9026].
   a.  Upon the failure of primary ingress PE, the standby ingress PE
       needs to send PIM join hop by hop toward multicast source for
       each multicast flow and the time when the standby ingress PE
       switches to the primary role may be inconsistent with the time
       when the leaf PE determines to accept multicast traffic sent by
       the standby root, causing which the failover time can hardly
       reach the same level of "hot root standby" mechanism.
> Suggestion : This point should be removed because it describes Cold Root Standby. With WRS in RFC9026, traffic already arrives on both PEs.

>> Reply : Agree. We will remove 'the standby ingress PE needs to send PIM join hop by hop toward multicast source'. 
                     Actually, upon the failure of primary ingress PE, the leaf PE is the one that needs to send the new C-multicast route towards the standby ingress PE without carrying the Standby PE BGP Community according to RFC9026.
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
#3.
   b.  There is no endogenous mechanism for standby ingress PEs to
       discover and detect the failure of primary ingress PEs, resulting
       in the uncertainty in deployment and implementation.

> Question : What's the problem of relying on egress PEs detecting the failure of primary PE?

>>Reply:  It takes longer switchover time. When the egress PE detects the failure of the ingress Primary PE, it will update all relevant C-multicast routes and send them to the standby ingress PE. 
                    For example, if there are 1000 (C-S, C-G)s, 1000 C-multicast routes will be updated and resent so that the standby PE can finally forward traffic. 
                    (The switchover time will be correspondingly longer than using ingress PE to detect and forward traffic directly.)
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
#4.
   c.  In [RFC9026], the standby ingress PE is determined by using
       "Standby PE Community" carried in the C-Multicast routes.  The
       premise of this mechanism is that all leaf PEs choose the same
       primary and standby ingress PEs, which may not be met due to
       transient unicast routing inconsistencies, the inconsistencies of
       P-Tunnel status determined by each leaf PE or lack of support of
       the Standby PE community on leaf PE, causing that the "warm root
       standby" mechanism is not stable and returns to "hot root
       standby" mode because the standby ingress PE also sends multicast
       traffic to backbone when the condition is not satisfied.

> Suggestion: The potential inconsistent choice among egress PEs is likely because that the tunnel branches may be up to some egress PEs while down to others.
That is actually a good reason for different choices - we don't want some egress PEs to not be able to receive from the primary PE. Besides, MVPN handles that situation well (remember that each egress PE can choose its ingress PE - see the "Another procedure ..." paragraph in https://www.rfc-editor.org/rfc/rfc6513.html#section-5.1.3).
If it desired to avoid that, the tunnel status consideration can be skipped and you'll get consistency that your draft wants, but both are at the cost of some egress PE may not be able to receive traffic.

>> Reply: Yes, we will think about appropriate solutions to solve this problem.
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
#5.
        anycast RPF checking mechanism in dataplane
> Suggestion : The word "anycast" is misleading. You should say that traffic from any of the ingress PEs can be accepted.

>> Reply: Agree. We will update it with a more appropriate word or just use the phrase of 'traffic from any of the ingress PEs can be accepted'.
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
#6.
   ... According to the negotiation community, a distinct
   C-Multicast route for (C-S, C-G) is sent to each multi-homed root PE.
   Leaf PE installs all P-Tunnels rooted from the multi-homed PEs into
   the anycast RPF tunnel checklist of the corresponding multicast
   traffic (C-S, C-G).

> Suggestion : This means, if there are four ingress PEs for the same flow,  all of them will receive the C-multicast route. With the RFC9026 procedure, only two of them will, and the selection can still be based on (s,g).
In summary, I don't see sufficient advantages of developing another solution now that RFC9026 have been implemented and deployed.

>> Reply: In RFC9026, there can be one standby PE or multiple standby PEs. 
a) Multiple standby PEs in Section 3.1.6.2:
     'If the site that contains C-S is connected to two or more PEs, a downstream PE will select one as its Primary Upstream PE, while others are considered to be Standby Upstream PEs'. 
b) One standby PE in Section 4:
     'In the case where more than two PEs are connected to the C-S site, selection of the Standby PE can be performed using one of the methods of selecting a UMH.'
In our draft, we will modify relevant descriptions so that only one PE will be selected as Standby PE. C-multicast route will only be sent to the selected Standby PE.
The advantage of our approach over RFC9026 has been mentioned in #3.
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
#7.
   For the scenarios of the same RD, this document introduces a new type
   of UMH route to be sent in MVPN SAFI, of which the NLRI key consists
   of the following fields:

> Suggestion: Instead of advertising the UMH route with the MVPN SAFI, we might as well advertise with the VPN-IP SAFI with RD/RT, and use RT to import into the default instance. It does not need any signaling changes and depending on implementation it may just work. We wrote a draft and circulated internally/ externally back in July 2022, but I am surprised that it was not posted.
I will forward you a copy for your review.
With this, it not only solves MVPN problem, but can also be used for unicast.

>> Reply:  We are reading and understanding the draft and we will reply about it later.

Thanks again for your contribution to this draft. We are looking forward to more feedback from you.
Best wishes,
Susie





-----Original Message-----
From: Jeffrey (Zhaohui) Zhang [mailto:zzhang=40juniper.net@dmarc.ietf.org] 
Sent: Wednesday, February 8, 2023 8:49 AM
To: Chensiyu (Susie) <chensiyu27@huawei.com>; bess@ietf.org
Cc: duanfanghong <duanfanghong@huawei.com>
Subject: RE: New Version Notification for draft-wang-bess-mvpn-upstream-df-selection-03.txt

Hi Siyu,

Please see comments below.

   For the "hot root standby"
   mechanism, all the ingress PEs, regardless of the primary or standby
   role, forward (C-S,C-G) flow to other PEs through a P-tunnel, forcing
   the egress PEs to discard all but one, which will cause the steady
   traffic redundancy throughout the backbone network.  In the scenarios
   of enterprise networks crossing provider networks, the waste of
   bandwidth is a crucial issue causing the increasement of the OpEx of
   enterprise networks, and the "warm root standby" mechanism is
   expected to be a better solution.

The mentioning of Hot Root Standby (HRS) should be removed.
It is specifically designed for scenarios where it is critical for egress PEs to receive but discard duplicates and BW consumption is not a concern. Where BW is a concern, the Warm Root Standby
(WRS) can be used.

This draft should only compare to WRS.

   However, there are some problems
   when deploying the "warm root standby" mechanism described in
   [RFC9026].

   a.  Upon the failure of primary ingress PE, the standby ingress PE
       needs to send PIM join hop by hop toward multicast source for
       each multicast flow and the time when the standby ingress PE
       switches to the primary role may be inconsistent with the time
       when the leaf PE determines to accept multicast traffic sent by
       the standby root, causing which the failover time can hardly
       reach the same level of "hot root standby" mechanism.

This point should be removed because it describes Cold Root Standby.
With WRS in RFC9026, traffic already arrives on both PEs.

   b.  There is no endogenous mechanism for standby ingress PEs to
       discover and detect the failure of primary ingress PEs, resulting
       in the uncertainty in deployment and implementation.

What's the problem of relying on egress PEs detecting the failure of primary PE?

   c.  In [RFC9026], the standby ingress PE is determined by using
       "Standby PE Community" carried in the C-Multicast routes.  The
       premise of this mechanism is that all leaf PEs choose the same
       primary and standby ingress PEs, which may not be meeted due to
       transient unicast routing inconsistencies, the inconsistencies of
       P-Tunnel status determined by each leaf PE or lack of support of
       the Standby PE community on leaf PE, causing that the "warm root
       standby" mechanism is not stable and returns to "hot root
       standby" mode because the standby ingress PE also sends multicast
       traffic to backbone when the condition is not satisfied.

The potential inconsistent choice among egress PEs is likely because that the tunnel branches may be up to some egress PEs while down to others.
That is actually a good reason for different choices - we don't want some egress PEs to not be able to receive from the primary PE. Besides, MVPN handles that situation well (remember that each egress PE can choose its ingress PE - see the "Another procedure ..." paragraph in https://www.rfc-editor.org/rfc/rfc6513.html#section-5.1.3).

If it desired to avoid that, the tunnel status consideration can be skipped and you'll get consistency that your draft wants, but both are at the cost of some egress PE may not be able to receive traffic.

   d.  The primary and standby role is fixed for each multicast flow,
       resulting in that ingress PEs cannot perform load balancing for
       multicast traffic.  Only two ingress PEs are selected for all
       multicast stream from a specific multicast source, causing that
       the "warm root standby" mechanism cannot be used in the scenarios
       of client nework multihoming to more than two ingress PEs.

I don't think that is true. The same Single Forward Selection rules in RFC6513, which can do load balancing (see 5.1.3 of RFC6513) among different
(C-S,C-G) flows, can be used to
choose both primary and secondary UMH (the secondary one can be chosen the same way after excluding the primary one). If RFC9026 is not clear about that, it can be clarified.

        anycast RPF checking mechanism in dataplane

The word "anycast" is misleading. You should say that traffic from any of the ingress PEs can be accepted.

   ... According to the negotiation community, a distinct
   C-Multicast route for (C-S, C-G) is sent to each multi-homed root PE.
   Leaf PE installs all P-Tunnels rooted from the multi-homed PEs into
   the anycast RPF tunnel checklist of the corresponding multicast
   traffic (C-S, C-G).

This means, if there are four ingress PEs for the same flow,  all of them will receive the C-multicast route. With the RFC9026 procedure, only two of them will, and the selection can still be based on (s,g).

In summary, I don't see sufficient advantages of developing another solution now that RFC9026 have been implemented and deployed.

   For the scenarios of the same RD, this document introduces a new type
   of UMH route to be sent in MVPN SAFI, of which the NLRI key consists
   of the following fields:

Instead of advertising the UMH route with the MVPN SAFI, we might as well advertise with the VPN-IP SAFI with RD/RT, and use RT to import into the default instance. It does not need any signaling changes and depending on implementation it may just work. We wrote a draft and circulated internally/ externally back in July 2022, but I am surprised that it was not posted.
I will forward you a copy for your review.

With this, it not only solves MVPN problem, but can also be used for unicast.

Thanks.
Jeffrey


Juniper Business Use Only

-----Original Message-----
From: BESS <bess-bounces@ietf.org> On Behalf Of Chensiyu (Susie)
Sent: Monday, November 21, 2022 7:27 AM
To: bess@ietf.org
Cc: duanfanghong <duanfanghong@huawei.com>
Subject: [bess] FW: New Version Notification for draft-wang-bess-mvpn-upstream-df-selection-03.txt

[External Email. Be cautious of content]


Hi all,
A new version of the draft https://urldefense.com/v3/__https://datatracker.ietf.org/doc/draft-wang-bess-mvpn-upstream-df-selection/__;!!NEt6yMaO-gk!HdrJ7-G-GzTAivmylUCtos02On6dZ0TiNLBVUuR9CcafA0rrdXSyAEluuJqj13mO0jxEdhdrR2FGjnJ7iMxq4w_iyAEwJo9O$  is published, in which the root PEs differentiation issue of UMH route and dual-root protection issue in Segmented Inter-AS Scenario are resolved.
We are looking forward to more comments, it would be appreciated if you can give some valuable suggestions to help us evolve it further .
Regards,
Siyu Chen

-------------------------------------------------------------------------------------------------------

A new version of I-D, draft-wang-bess-mvpn-upstream-df-selection-03.txt
has been successfully submitted by Siyu Chen and posted to the IETF repository.

Name:           draft-wang-bess-mvpn-upstream-df-selection
Revision:       03
Title:          Multicast VPN Upstream Designated Forwarder Selection
Document date:  2022-11-21
Group:          Individual Submission
Pages:          16
URL:            https://urldefense.com/v3/__https://www.ietf.org/archive/id/draft-wang-bess-mvpn-upstream-df-selection-03.txt__;!!NEt6yMaO-gk!HdrJ7-G-GzTAivmylUCtos02On6dZ0TiNLBVUuR9CcafA0rrdXSyAEluuJqj13mO0jxEdhdrR2FGjnJ7iMxq4w_iyIh-akra$
Status:         https://urldefense.com/v3/__https://datatracker.ietf.org/doc/draft-wang-bess-mvpn-upstream-df-selection/__;!!NEt6yMaO-gk!HdrJ7-G-GzTAivmylUCtos02On6dZ0TiNLBVUuR9CcafA0rrdXSyAEluuJqj13mO0jxEdhdrR2FGjnJ7iMxq4w_iyAEwJo9O$
Htmlized:       https://urldefense.com/v3/__https://datatracker.ietf.org/doc/html/draft-wang-bess-mvpn-upstream-df-selection__;!!NEt6yMaO-gk!HdrJ7-G-GzTAivmylUCtos02On6dZ0TiNLBVUuR9CcafA0rrdXSyAEluuJqj13mO0jxEdhdrR2FGjnJ7iMxq4w_iyGr5TPdY$
Diff:           https://urldefense.com/v3/__https://www.ietf.org/rfcdiff?url2=draft-wang-bess-mvpn-upstream-df-selection-03__;!!NEt6yMaO-gk!HdrJ7-G-GzTAivmylUCtos02On6dZ0TiNLBVUuR9CcafA0rrdXSyAEluuJqj13mO0jxEdhdrR2FGjnJ7iMxq4w_iyDj7YPMP$

Abstract:
   This document defines Multicast Virtual Private Network (VPN)
   extensions and procedures of designated forwarder election performed
   between ingress PEs, which is different from the one described in
   [RFC9026] in which the upstream designated fowarder determined by
   using the "Standby PE Community" carried in the C-Multicast routes.
   Based on the DF election, the failure detcetion point discovery
   mechanism between DF and standby DF is extended in MVPN procedures to
   achieve fast failover by using BFD session to track the status of
   detection point.  To realize a stable "warm root standby", this
   document obsolete the P-Tunnel status determining procedure for
   downstream PEs in regular MVPN by introducing an anycast RPF checking
   mechanism in dataplane as an instead.





The IETF Secretariat



_______________________________________________
BESS mailing list
BESS@ietf.org
https://urldefense.com/v3/__https://www.ietf.org/mailman/listinfo/bess__;!!NEt6yMaO-gk!HdrJ7-G-GzTAivmylUCtos02On6dZ0TiNLBVUuR9CcafA0rrdXSyAEluuJqj13mO0jxEdhdrR2FGjnJ7iMxq4w_iyIKMC0SE$