[Idr] 答复: I-D Action: draft-xu-idr-bgp-route-broker-02.txt

"xuxiaohu_ietf@hotmail.com" <xuxiaohu_ietf@hotmail.com> Fri, 11 August 2023 11:49 UTC

Return-Path: <xuxiaohu_ietf@hotmail.com>
X-Original-To: idr@ietfa.amsl.com
Delivered-To: idr@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 3171AC151063 for <idr@ietfa.amsl.com>; Fri, 11 Aug 2023 04:49:08 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.233
X-Spam-Level:
X-Spam-Status: No, score=-1.233 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, FORGED_HOTMAIL_RCVD2=0.874, FREEMAIL_FROM=0.001, HTML_MESSAGE=0.001, RCVD_IN_DNSWL_NONE=-0.0001, RCVD_IN_MSPIKE_H2=-0.001, RCVD_IN_ZEN_BLOCKED_OPENDNS=0.001, SPF_PASS=-0.001, T_SCC_BODY_TEXT_LINE=-0.01, URIBL_DBL_BLOCKED_OPENDNS=0.001, URIBL_ZEN_BLOCKED_OPENDNS=0.001] autolearn=no autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=hotmail.com
Received: from mail.ietf.org ([50.223.129.194]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 3LkWEHQSJuQD for <idr@ietfa.amsl.com>; Fri, 11 Aug 2023 04:49:07 -0700 (PDT)
Received: from EUR04-VI1-obe.outbound.protection.outlook.com (mail-vi1eur04olkn2036.outbound.protection.outlook.com [40.92.75.36]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id E8263C14CE2E for <idr@ietf.org>; Fri, 11 Aug 2023 04:49:06 -0700 (PDT)
ARC-Seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=LfjI8TfRzxgUEvZz2m67DArr6oGYFbRRFlSOXaFEL7Se98w7isd+1dgdcYt2fVqWRwvkfEJkkncerAOmKQHqB9QdLTR9SqfG3hhErD5zvuiWOdwtaTjLXvm0j93uQEewTv5KM601Cq7Nz55Mz/BJKDAmeqeNDNefbkXoSbR5/sOtOjWJ96Cl77hZW/PAOMd8hk/YQdwlTNiKgBSQqKZQHkcETyYmRbuHIaoq7OdMDYrk4zpIMKYaFEubNDc8lQISKUgyKKj9u6otQ1ZeVK2TVva/RybBVb0VBED3bcMpl9YNXqgP51K66QqAwMSbKSgd7d/dsZJe3kbPKQWP5bkaZw==
ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-AntiSpam-MessageData-ChunkCount:X-MS-Exchange-AntiSpam-MessageData-0:X-MS-Exchange-AntiSpam-MessageData-1; bh=AoHUKjaNTrlvbZieuws2QUGdRkFBO4acdGP1HFRHmmM=; b=XBmU+Rywkhk5h7s4Q3UV/0gMZTAwBHcQGdRn31XSOHCYeLm0y2n8d/Y9oNOHfJSOJE1ThhdOy4xKyFDCc3UiaW9XzteNc1nX9bv5vY5v84SXgKokgt5v9IEcsUpjeJVrGC5Sl/4t71lU15eswNk1PBFBkyQyRFJPUMqpUJCvZhuDbpfxeLiabImSrDO0UdRKA4JuszuB/OgvP/Zm8WTGANKgZdn3FGlzcx+hbE8cXtpXfz9zYGfymZ3gqobk2mXLRuOieL3UPS221FYXwdeCIlnbnjgnSFt4ocOOy3cnJ3aOdGYpGDMMMy4T86udkgEM2z0z7AJfJVCNUec8EbK+Fw==
ARC-Authentication-Results: i=1; mx.microsoft.com 1; spf=none; dmarc=none; dkim=none; arc=none
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=hotmail.com; s=selector1; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=AoHUKjaNTrlvbZieuws2QUGdRkFBO4acdGP1HFRHmmM=; b=duhtMwq5yNocgNvQ64FYO2FdGmcChIJD5q4I1Aa3AxOtyC3fDlBgdKnlj30Mdog99K28QPZrD3Mb5274T2uQwFoIOGieQPYrN1Yn2iSVkl5NQMRu2M6ICyfZrteFAlG89OESUI3evCQqA2Qr2xWSl1Wep4ylMoD0VsR7RZUbdfnA4PNa4q6RoFqYjBlH3pwwuTGTilwQHgMP+RBRhkadK/xJT1B8TGmeNGedBVhA23HJdOC+8pLkMTDkbUNLMzZ2QHX0Oup6y99G2qemBNRaMU9kWudZV8C+q8WfkJJlN/Ogi2zC/o/FRvFIW8A8HHITc3FZWGBkflNJBSitdrS7ug==
Received: from AM6P192MB0375.EURP192.PROD.OUTLOOK.COM (2603:10a6:209:3b::17) by DU0P192MB1746.EURP192.PROD.OUTLOOK.COM (2603:10a6:10:3bd::13) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.6652.30; Fri, 11 Aug 2023 11:49:04 +0000
Received: from AM6P192MB0375.EURP192.PROD.OUTLOOK.COM ([fe80::836e:71:9168:528e]) by AM6P192MB0375.EURP192.PROD.OUTLOOK.COM ([fe80::836e:71:9168:528e%6]) with mapi id 15.20.6678.020; Fri, 11 Aug 2023 11:49:04 +0000
From: "xuxiaohu_ietf@hotmail.com" <xuxiaohu_ietf@hotmail.com>
To: Robert Raszuk <robert@raszuk.net>
CC: "idr@ietf. org" <idr@ietf.org>
Thread-Topic: I-D Action: draft-xu-idr-bgp-route-broker-02.txt
Thread-Index: AQHZysMxx+Wy1kO3G0OnG805v2t5DK/iurD1gABmFQCAAAjM3oAAC/KAgAAAxfqAAGbsAIAAtwROgAB1iACAAAUHVQ==
Date: Fri, 11 Aug 2023 11:49:04 +0000
Message-ID: <AM6P192MB0375485F2CD604A0B54AF92B8110A@AM6P192MB0375.EURP192.PROD.OUTLOOK.COM>
References: <169157989186.10790.10412166011795082010@ietfa.amsl.com> <CAOj+MMGLTgnwT9gQ6Of7OdMkZQSsNmDuncO=hvmAZkmsJJ1JpA@mail.gmail.com> <AM6P192MB03752E1310E848D0A35149358113A@AM6P192MB0375.EURP192.PROD.OUTLOOK.COM> <CAOj+MMH0g+vmrDf+_-xQTjRFARQn8YDzMxtTWBaEw5q3k_YUNg@mail.gmail.com> <AM6P192MB037580CC8CADF6CEDFAF55F88113A@AM6P192MB0375.EURP192.PROD.OUTLOOK.COM> <CAOj+MMFTOjF8Mdb6jESOopNVSHf31nTWqR15BEKv3wpg+RJoPA@mail.gmail.com> <AM6P192MB03751B24DDAE67D9F37AF3E48113A@AM6P192MB0375.EURP192.PROD.OUTLOOK.COM> <CAOj+MMGHrBuqDBfCtW-XLB4yprWCT2KCmj0ApKWFPcbxTWOpyA@mail.gmail.com> <AM6P192MB0375CF42F53E49571B8BDB2E8110A@AM6P192MB0375.EURP192.PROD.OUTLOOK.COM> <CAOj+MMFvy_fyeDETOanzq07i+1eafHYWfxfGJJ2W=DzLrFUncg@mail.gmail.com>
In-Reply-To: <CAOj+MMFvy_fyeDETOanzq07i+1eafHYWfxfGJJ2W=DzLrFUncg@mail.gmail.com>
Accept-Language: en-US
Content-Language: zh-CN
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
x-tmn: [5ljhf7pxJBe282DCBFbX+Ob1jQ3pSTjC]
x-ms-publictraffictype: Email
x-ms-traffictypediagnostic: AM6P192MB0375:EE_|DU0P192MB1746:EE_
x-ms-office365-filtering-correlation-id: 18467ac7-86b4-4e70-af96-08db9a60f64e
x-microsoft-antispam: BCL:0;
x-microsoft-antispam-message-info: OVjwGOGJU5F/a2zGt+sduvWW0wrRarJe7llT5Wm0a1I5gjIWe8iKme4FjrABddvqIsWg6F+tb1dEIwYzddj5kjfc1FN/6VoRzI9dzr+mfMDFYQ4X1DxUkmLj3u8ZlaKGs3uVX9cOG6CATsB7Lg1yqgHOQQze3+PRLtubjWHJ588vsuLuZjCg9Bs9mg3yGmnosEbwykPaKedpojFJL/sq/qO+yIaR84oJYjFXdffFbNJCzf998gnmDO6mhbCNEFrWR7isEM3sbl8DFxf2K3jEfAWptUsRhw9kMotJ8+2J2/eiZC7s6wpeZsKL4RSxcVlFSDrnB9P/hfOYxnmTH4tFwCRrjiMIgi4mRiZIeM4W5M1VBkMF0HHOiA30kd8c+3z9bdgQevKZsC6oon12qmZXc+K/mFItuKG6IP3H2leEGtVSwD1X1stzwDpGHu5rN1bhpbg5bvPBcFeHQdc/ZJZAA8KdcbcZb0czCbd5P+bpycM8NZMsw5U6pTb8U05gFJP4Yi4bglNxpZ2cC958gMlEx5NG6OiyG0kYeQp+06N/3xKVt4gsgvNjyxJCVsA9eA0ec5ARb3UhRUHzXVxLkKucvHfAcg3FGrE5cfzYCLFdL4z3uD934wM3rY0s9ta8xYTjGbxaVNZ98Iw0g4Wz+/yQ1Q==
x-ms-exchange-antispam-messagedata-chunkcount: 1
x-ms-exchange-antispam-messagedata-0: kWUOs79s2YK9nLqgqXaBEhb40G7C8gHJzmQ8il9Tsi+xkVYUbB3L7C6Ph4yuzMTjUjCKmBq1NTHGOdboP5+UnQPp1d4zgeNNm7W9N7uZ7HvlUx/XvzSc2Rp4jsR4por2VdMPoIvue5Lt6GiJHh2jDBpW630YjVZ7nVtrgRgtHxVjRxGILfgWNN2gIK54/UBMX74YznvOdgd0kME2y6X7ggFteBFE3lFtU/C52UdGCZ2j5bqiCHe9b2/x1n7jnMgVm7dgl1aMpiBQQZLgjcml4JNzVxKUmwhvXm0rH45vQ9pm7SWMnFQFqHzXlOOktl5GsZpArfL4SL9M/MsZJ9n+sL7iTNi3227FvXPbK4dtbR7b38taZIJNmGQPeM64CgOaxzGudo/0ki/bh38kUDomt+tS/UtN9NpkeW+Zv7zj3d6ufueB4yvGX1WfFmlqTozb5A6Z1UM/TyUnXqcZxlzwTn7Gv1bT2rUIhiDt59GnLwxqBzL+vC9W+5lhOLLCPk9DUp68Scd2F4uLoqXzKisx2dtNvIzPZFH00NKlzqT8qqwviAq/L5AZbWiFLzq//WnrbrcshkSBKpLpSYFVG9R9lsv3RuBZALJ+cvnbVHIoxTa0vZlLHVCsrAVoylfmaZkfepWy4v4gV7DKttuNaruht8HFreszRvJt0aVf7EJpl0IreAZMcpIOkvzMzvi7be2i+ulEf2XQpn5pDxMoVJYnruwFlB5FkLw32goddToNW0to4A8b0Tk9Yegr62TRhcQxcAh3O5f2Q66F2sArFby1+Rn7+LAk4bnnsbbnBJ8WSanLMlCHKoSFT/cwzNNzRHp9dzEbvzQR5T2kGEPN+QucAfxDX+oIBb0DKsakIZfX8ZWP2TktVzGo52YbFwnF8wULJBMJA4dap3mgPoRqazHNioVay5UnHQYC2Na3tIocerx75SjBA3r3qfaOeMr/Cu+sA+9aFy6A12JiTb+J1FtJiT07cMbZcmUM3geLH/055mD374Fyny6lw/eaOIEV9WvAJr6J9r8OiAnidUP49v2C0IwZorwCodaxkBOScBMRaHhZRivBkXpakRyLOfovMotD+Fx6ztiQLSf1wVw2mjiS7FH+aO4alu+/ANqRVUzQQy0PByEhvF2i+D0rhdNJWEwmumxGq1qgTKKyYL/acIyDT8ooHkctnPmG1IoxidXZC0MWw2mHyeolpmC+sb4ZzBuyzRr68FE0pJJg1/qMGKqp7ahohRZp3demjOrbbFMJaXP8zDoj2PDmT1OJUqgN12z4
Content-Type: multipart/alternative; boundary="_000_AM6P192MB0375485F2CD604A0B54AF92B8110AAM6P192MB0375EURP_"
MIME-Version: 1.0
X-OriginatorOrg: sct-15-20-4755-11-msonline-outlook-fb43a.templateTenant
X-MS-Exchange-CrossTenant-AuthAs: Internal
X-MS-Exchange-CrossTenant-AuthSource: AM6P192MB0375.EURP192.PROD.OUTLOOK.COM
X-MS-Exchange-CrossTenant-RMS-PersistedConsumerOrg: 00000000-0000-0000-0000-000000000000
X-MS-Exchange-CrossTenant-Network-Message-Id: 18467ac7-86b4-4e70-af96-08db9a60f64e
X-MS-Exchange-CrossTenant-originalarrivaltime: 11 Aug 2023 11:49:04.2868 (UTC)
X-MS-Exchange-CrossTenant-fromentityheader: Hosted
X-MS-Exchange-CrossTenant-id: 84df9e7f-e9f6-40af-b435-aaaaaaaaaaaa
X-MS-Exchange-CrossTenant-rms-persistedconsumerorg: 00000000-0000-0000-0000-000000000000
X-MS-Exchange-Transport-CrossTenantHeadersStamped: DU0P192MB1746
Archived-At: <https://mailarchive.ietf.org/arch/msg/idr/47vCdXbNnTbyApqfLdszu_KMBCo>
Subject: [Idr] 答复: I-D Action: draft-xu-idr-bgp-route-broker-02.txt
X-BeenThere: idr@ietf.org
X-Mailman-Version: 2.1.39
Precedence: list
List-Id: Inter-Domain Routing <idr.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/idr>, <mailto:idr-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/idr/>
List-Post: <mailto:idr@ietf.org>
List-Help: <mailto:idr-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/idr>, <mailto:idr-request@ietf.org?subject=subscribe>
X-List-Received-Date: Fri, 11 Aug 2023 11:49:08 -0000

Hi,

发件人: Robert Raszuk <robert@raszuk.net>
日期: 星期五, 2023年8月11日 16:41
收件人: xuxiaohu_ietf@hotmail.com <xuxiaohu_ietf@hotmail.com>
抄送: idr@ietf. org <idr@ietf.org>
主题: Re: I-D Action: draft-xu-idr-bgp-route-broker-02.txt
Hi,

For the benefit of IDR WG participants could you clearly state by email or in the draft what is the problem with addressing your deployment needs by using hierarchical VPNv4/v6 route reflection (as is - no single line of code change needed) + RTC along with it ?

【Xiaohu3】Assume there are 100K vPEs/servers, 100k VRFs/tenants, 100M VPN routes (most of them are host routes) in a large-scale data center

So you have 1 VRF per vPE ? And each such VRF is advertising 1000 routes ? Not a very realistic case but let's continue ...

[Xiaohu4] As I mentioned earlier, it assumes that one vPE is attached with 20 VRFs on average. In fact, for some VRFs which contains a lot of containers, 1000 routes per VRF may not be enough.

SDN environment, you deploy a two-layer hierarchical RR architecture where there are 10 top-level RRs and 10 bottom-level RRs. As mentioned in the draft, there is no scaling pressure on top-level RRs since each just needs to store about 10 percent of the 100M VPN routes and peers with 10 bottom-level RRs.

Please observe that in practice static allocation of RT to top level RRs never work well. And now in the other mail you suggested to make it even worse with unique export  RT per VRF even in the same VPNs.

[Xiaohu4] Can you give concrete reasons why it doesn’t work well?

However, each bottom-level RR needs to peer with about 10K vPEs, in the worst case where the collection of VRFs attached to the 10K vPEs cover all VPNs within the data center, therefore each RR would have to store the 100M VPN routes.

Then increase the number of lower level RRs. If you have 100K PEs, deploy 1000 of lower level RRs.

[Xiaohu4] It may be acceptable provided the cost associated with 1K RRs was not a problem.


Furthermore, it’s almost impossible to use the peer-group attribute for the 10K vPE peers since the VRFs attached to one vPE are different from that of another vPE.

Usage of statically configured peer-groups is an archaism. Modern BGP stacks use the concept of dynamic update groups and peer templates with inheritance. Besides this is VPN env so update generation is optimized by attribute based walk.

【Xiaohu】I doubt the effect of dynamic update group in the data center SDN environment where it’s unacceptable to attach the same VRFs to a group of vPEs.

As a result, each bottom-level RR would have to hold a local-RIB containing 100M VPN route entries and 10K RIB-out tables with each consists of about 20K VPN routes (assume each vPE is attached with about 20 VRFs). Besides, the route update frequency would be very high due to the widely adoption of containers.

Again you have made a suboptimal choice of the number of RRs and now you are drawing conclusions based on it.

BTW, there is no need to introduce the headache issue associated with RTC in the hierarchical RR architecture as mentioned in https://www.ietf.org/archive/id/draft-mohanty-idr-rtc-hierarchical-rr-00.html.

It's not a headache but a solution.

Besides, you can still drop locally. Dropping is cheap.

For an example, there is no need for the top-level RRs to reflect the RT membership advertisement received from the route brokers.

With hierarchy of RRs you do not need to do that either. You can push your static RT ranges configured on the top level RRs and be done just like in your current proposal. But very soon you will see that static RT range assignment on top level RRs is suboptimal.

【Xiaohu4】It seems straightforward for top-level RRs acting as centralized collectors and distributors of VPN routes, to be preconfigured with a block of RTs and then advertise them towards its peers.

[Xiaohu3] It’s true that Virtual RRs are widely used in the data center SDN environment. However, it’s not that easy to scale the well-known BGP-based SDN solution to support more than 5K vPEs.

Then maybe it's time to consider not using the well-known BGP based SDN solution for such deployments ?

[Xiaohu4] Maybe RTC is not a good choice for data center SDN scenarios.  ORF-based route pull modes as described in (https://datatracker.ietf.org/doc/draft-xu-bess-l3vpn-prefix-orf/) may be more efficient since it doesn’t need to distribute any given RT membership from PEs to PEs.

[Xiaohu3] Route brokers don’t need to be capable of handling all VPN routes at any given period of time. There are many ways to achieve that goal. BTW, it’s acceptable to cost a relatively long period of time for VPN route convergence when restarting all vPEs within a data center due to some unexpected reasons (e.g.,power interruption in the scope of a data center).

Putting aside a debate if this is good or bad idea to space in time BGP processing from any peer till some other peer's routes are accepted and all propagated to the top level RRs - how are you going to to do that ?

Are you going to keep rejecting BGP OPEN session establishment or if the session is already established are you going to artificially stop peers from sending by say TCP zero-window block ?  Just curious how are you going to delay this.

[Xiaohu] Honestly speaking, the above approaches seems to me a little bit clumsy, sorry. In fact, there are many elegant ways to achieve the above goal, for example, you can make the vPEs as passive BGP peers which would not actively initialize BGP session with route brokers. In other words, route brokers could initialize BGP sessions with those passive BGP peers deliberately according to their current resource status.

Best regards,
Xiaohu

Thx,
Robert