Re: [Idr] What are the solutions to address large number of routes convergence caused by Cloud Infrastructure failure described in draft-ietf-rtgwg-net2cloud-problem-statement?

Linda Dunbar <linda.dunbar@futurewei.com> Tue, 02 August 2022 21:27 UTC

Return-Path: <linda.dunbar@futurewei.com>
X-Original-To: idr@ietfa.amsl.com
Delivered-To: idr@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 7AE64C157B5C; Tue, 2 Aug 2022 14:27:53 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.007
X-Spam-Level:
X-Spam-Status: No, score=-2.007 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, HTML_MESSAGE=0.001, HTTPS_HTTP_MISMATCH=0.1, RCVD_IN_MSPIKE_H2=-0.001, RCVD_IN_ZEN_BLOCKED_OPENDNS=0.001, SPF_PASS=-0.001, T_SCC_BODY_TEXT_LINE=-0.01, URIBL_BLOCKED=0.001, URIBL_DBL_BLOCKED_OPENDNS=0.001, URIBL_ZEN_BLOCKED_OPENDNS=0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (1024-bit key) header.d=futurewei.com
Received: from mail.ietf.org ([50.223.129.194]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id d883qEdMJLHz; Tue, 2 Aug 2022 14:27:49 -0700 (PDT)
Received: from NAM02-DM3-obe.outbound.protection.outlook.com (mail-dm3nam02on2108.outbound.protection.outlook.com [40.107.95.108]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 486FBC14CF11; Tue, 2 Aug 2022 14:27:49 -0700 (PDT)
ARC-Seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=Ar702jppKquzQ9DmoaEV/lVprR8zqlCaA8HAbuoaYmADu7eIBYEW5viWXwlluHEJhE0mehl396Sg/vfVGu6Oy6Q7Gcy1akSKMHDPeEdkE4X68IHWWi/L9vSQwcvbf3oI4rcwYAo+X/zeF1OraGVCmOSKy9lyZ7uRUOInoDA6/1LhYuQDregMg97aGuo/vT2hzFJRvO2jIQsOm/2DK9d+9wAZm8Xe9OzFcYnmQkkZHQLwcGRdaXedjLmDqKyNlTXF9mWn2gQUlno4E8XbVY2FeEoltGZP2Xdd0OQvNErtPgpqwrrizSqirEcCZ9KhEG13lhgSns/wSVYqTjN4eUWLHQ==
ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-AntiSpam-MessageData-ChunkCount:X-MS-Exchange-AntiSpam-MessageData-0:X-MS-Exchange-AntiSpam-MessageData-1; bh=R+TptlTS7mFCTiea3n/FExTH7EP5itssFwtmfMiaN7g=; b=Hh3pEEAHe1tYbeuF6c4/fhKYjwl+A3Hj1BQWpjBApEIew7UC5PB3qepcLING+5kO8+E4mQSXu7U+sswTkorHXhOqcMusAKyZ+YeWbwcdUwgmvePs2MnSKTfUnE9Is7ehMeZK7hV690ie4aLG+UTdfEaeEecPaq5wSWMpw3I2nBgnob0/pHXYQBo60agNHcFOrG29V6jcSfYZVOnVKwCJQEoBK8vCUWFj8Bux5GQwKncM7uLnQtBnrubw47ARyQcmgsEwob0nvoihBnCT16IrOgHpDAxgIoUxlq8LUV8ESPLjlBxczp7+9cdXi0le+7ZjbVwlQd/q3ISvjGvYBGTnNA==
ARC-Authentication-Results: i=1; mx.microsoft.com 1; spf=pass smtp.mailfrom=futurewei.com; dmarc=pass action=none header.from=futurewei.com; dkim=pass header.d=futurewei.com; arc=none
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=Futurewei.com; s=selector2; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=R+TptlTS7mFCTiea3n/FExTH7EP5itssFwtmfMiaN7g=; b=TaQ1yYJrevBuWGbjZc4b/hqPSijRwtQGmIBitCon86X5pbcanWxjcBgZUfq/MQU0UPPh9VBHkGDzAbeIG2MgFNhyJLWoCYr/TlKTh9l69630eXWDO3XmNYw9SWrgxpYayI2yndwt4jLg15Ic7zfOPFBiq/XDihqpFlCjLKqDsXA=
Received: from CO1PR13MB4920.namprd13.prod.outlook.com (2603:10b6:303:f7::17) by CH0PR13MB4650.namprd13.prod.outlook.com (2603:10b6:610:c0::12) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.5504.10; Tue, 2 Aug 2022 21:27:43 +0000
Received: from CO1PR13MB4920.namprd13.prod.outlook.com ([fe80::355e:24d6:38ac:2eef]) by CO1PR13MB4920.namprd13.prod.outlook.com ([fe80::355e:24d6:38ac:2eef%5]) with mapi id 15.20.5504.014; Tue, 2 Aug 2022 21:27:43 +0000
From: Linda Dunbar <linda.dunbar@futurewei.com>
To: Chongfeng Xie <chongfeng.xie@foxmail.com>, RTGWG <rtgwg@ietf.org>, idr <idr@ietf.org>
Thread-Topic: [Idr] What are the solutions to address large number of routes convergence caused by Cloud Infrastructure failure described in draft-ietf-rtgwg-net2cloud-problem-statement?
Thread-Index: AdiMAvlcxm8zN76eTOCPQ76D2ADrwAXIdCZDAONdguA=
Date: Tue, 02 Aug 2022 21:27:43 +0000
Message-ID: <CO1PR13MB4920255AE2EB7438BD50BD65859D9@CO1PR13MB4920.namprd13.prod.outlook.com>
References: <CO1PR13MB49207D39F25D7416585F9BDF85BB9@CO1PR13MB4920.namprd13.prod.outlook.com> <tencent_7D7EA1487EDBF500200F879ED96D6C70A506@qq.com>
In-Reply-To: <tencent_7D7EA1487EDBF500200F879ED96D6C70A506@qq.com>
Accept-Language: en-US
Content-Language: en-US
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
authentication-results: dkim=none (message not signed) header.d=none;dmarc=none action=none header.from=futurewei.com;
x-ms-publictraffictype: Email
x-ms-office365-filtering-correlation-id: 5e66fa18-9ab9-4658-8606-08da74cdd5d8
x-ms-traffictypediagnostic: CH0PR13MB4650:EE_
x-ms-exchange-senderadcheck: 1
x-ms-exchange-antispam-relay: 0
x-microsoft-antispam: BCL:0;
x-microsoft-antispam-message-info: hXvfCFpT6oxP0uXFFZkEIt/cyrZVTnouXHceXYMCrgB/u987UGS67BTt6vScBx4Ttv/JY3QSYL4I6EKwlIiCKbJI1dzWy5xN1/hq5rIBzOqiUGqOjueQK1F3kcGP89IUa/Qt5cBX9X1BoU+BnhbEkCn4wAf9+kdBDORBYeg12aprnksvMtHs3cuw1Hz8uEOFmm7KKtvRunw8boIkPs/XiyqRqNah5Txo2vYyOE9OtQKrl1BfqbHKkAJaDGG7x22RAFTk6K2Ds0g5g30INncApv66JUzhggye+UsVVAFLfNeK565yloWoibbXq3k2RYdM/RhTKrK6zOa9CPAEuD2G8mmQfWLBe6BgaRdzmer51EbhcpViu9/I3At0Ih5HtqBMJqHV+TMqCFqH+8GgrbN/J5terJvhzC+VQP/uleo8VWua5Yx0TxG28SvoeooIOp2w6ECwku2naL4ATCI5DxvpmhED559iyII6nxLQLL9oYCodJz3JfH3hfSDm0+xgsm29zTfwb14OowzG+B0Zm+phD8oBCFNdId5aUclIL8WcmqPBFIJr/hrVyDWGSVHA5+zElo/Bikmf1LpHf/wRavrpE+Z/8DZ8WkqJCs0tGSfwbBJk1Hue1V2+ROG2TOaoSZTwpP3umlUCS0kUh+xDyM7dMWNg7IlW2Um8OO06vCYfEF/5KqkG+dI2MpJ6XcNwUnGCkssEcMB626G/qwhvY6HpPAD++wwf3Eck57lh1aWT53BGAIGXj3R1E95+KNcW1eYXtH8fX9aKym0Dxvc+pK3YddfI9BXLfMPUng6TKKtqI5lfjm920rgEnEMRaZtvIFD6wsRFld/WkgVCu8X4V7Hb2F7CZpdIjLlyC9v10pHrzH3RlOnZxKRZSblxoQNdjltPRFIUFd54/KF2XHQai92jApKG4xTHoQ3lo1MJeHRqoUf8pmnqJJaJey6zwpq3Ua4a
x-forefront-antispam-report: CIP:255.255.255.255; CTRY:; LANG:en; SCL:1; SRV:; IPV:NLI; SFV:NSPM; H:CO1PR13MB4920.namprd13.prod.outlook.com; PTR:; CAT:NONE; SFS:(13230016)(4636009)(39840400004)(396003)(366004)(376002)(346002)(136003)(44832011)(186003)(166002)(966005)(38100700002)(478600001)(122000001)(38070700005)(83380400001)(5660300002)(66946007)(8676002)(86362001)(66556008)(64756008)(66446008)(76116006)(66476007)(53546011)(6506007)(9686003)(7696005)(52536014)(71200400001)(41300700001)(110136005)(26005)(316002)(55016003)(2906002)(33656002)(8936002)(48020200002); DIR:OUT; SFP:1102;
x-ms-exchange-antispam-messagedata-chunkcount: 1
x-ms-exchange-antispam-messagedata-0: XfrCMIZz+qrg6vH3Pg9CGVvR9nz8EClTknfJG6lRBvS8TRQieRBbmItOVHU/ViA+V3S/m4AhWj2i9Gdv8q3SG0n477s9cGF2OGBCa86AggHUtmQJopzRXGU4bryoTI8p42r+9/80vA4jIP0tU60dsTI7DxVgiTtWJOhEctJ6YdYDQtZOjgdHdFw19TVlyIXa9pIR3XTipuQZIaiUeHBN/3l7P5hybYRkXSVPrPi2zeoYnJQkqjGWNdNoRSA/7iHCLtJXgBfSIH3Na/+f5iwLxKeoNjlUab1PwgB4cjstaVXb6HCo3/NNFcVuawJqIRqFLozmE72kDNz09XdsxdXIRFA4bDjA83Q0l6k/HPMDdqicgzRhH2dbDF6Gf2nysNzA4cdyOn2jwISuzCoydsRjdNmg3rCP8lKjLwHvBhqlSFfJGYIZOLbb7zkCwFfx0kOM80uUAeOYrc7qYQ7AfZWW9GHbZCQf/Gn5omV89614esE/Fdg58fYp1rHxctzxXyhnG+OOuxmYfYveASd5+yqmmgX5EXoNO8yDFn5HTojlANtgArif4pvziXjtbz773bSxmUo5PkHZtCrt8EcFcNjDizkt32OWMvDAbwPTz1xleD3FSHh2KASStzSs4HifxXSH6iGCrtxDciH6kupL4CmFnTOjwr+xUZGglZZrAJvgRtMXyU8jDvHg/MnFyKdWzWgjL8hksuT+t9139SBailcLP8vihUuoQezkC9//dEhKCYzrZbTAnnRNvkEkegU1WvAY3HUUWhrEtC0HnrSMoz00/OjCe5m0UhAWPXOAwMuNWGd8gTTg5MZXm857U21AklMA32PMw5oYop8yegjix8mrSuziTfymoZYKR/EcZ5n4djtWLHMggk8MsckGGDJBaJ2tEZT3Ob78YojbdzqPzUfMiinuIKoJ1J6bOATIX1VdbnN7uDLGQzrOjF9JyVW2VFuJ7L8zWs/zqQZdS1SmD3Ag3Z2LC7GoSuc0aHfWLmQIRqq+uEdDNpRJfmPfrhy0oC4VLsf5ZTkHgfKhOVTXLoxflLAaMsrwvFpcorbrBKuv1rUMWVMfc5XbSOEYNH8/XcqFJQvLIvTQ7KX0fDY9oeWLfdmHOH5SrD92IeUeauprggzZvQTgon/VHub5MdUYMYgpfpGhyXbxtrkltRh7Dn0iTsJnGNgpQzkxV15s8wLs2+4eurJvfEgRBNVnX2w0jGFUSpntZ2phBBwH0swxVc3ERhc4F3s0SSjIKB6E2ojgw3+ZdQzQ8cS698jw2rj3wpQxRtkiKu0/reFA/Q1x7P006el/p7eA9gHFP7Ic3btXaMDdXJDVnryj0ukIL0blOmn16p0zV3n/8tTwckg3wznztP+2Zff9DvbatYbEOdpoCJ2iO7pu5opViSr8u5yx7yBjw06KMlDgHiSQE+BGPeEN1wVQ5VxgS9eKEykI/IzQHE56QkE6aJqjQqu7YqhJyC6Z11Ck/Y2oKihoWaPEV/+VHYCD8iwFDZ5bpXfWERGe0ZUuN79UGDtTuQ/bsAmQgbEUn4jCkcinWEVgV3RqrF1+qT5L0RLmogRpryi64PkrAOHbKi0a8Zj3J7MfvpYvRqce
Content-Type: multipart/alternative; boundary="_000_CO1PR13MB4920255AE2EB7438BD50BD65859D9CO1PR13MB4920namp_"
MIME-Version: 1.0
X-OriginatorOrg: Futurewei.com
X-MS-Exchange-CrossTenant-AuthAs: Internal
X-MS-Exchange-CrossTenant-AuthSource: CO1PR13MB4920.namprd13.prod.outlook.com
X-MS-Exchange-CrossTenant-Network-Message-Id: 5e66fa18-9ab9-4658-8606-08da74cdd5d8
X-MS-Exchange-CrossTenant-originalarrivaltime: 02 Aug 2022 21:27:43.0888 (UTC)
X-MS-Exchange-CrossTenant-fromentityheader: Hosted
X-MS-Exchange-CrossTenant-id: 0fee8ff2-a3b2-4018-9c75-3a1d5591fedc
X-MS-Exchange-CrossTenant-mailboxtype: HOSTED
X-MS-Exchange-CrossTenant-userprincipalname: n2uWUjHb65WphD0imfSlo12BzwmciKFzUus3yTE0898lqkzO5QuoUV2fP+FFA/0q7p1GyBYgySXzTIjnoNDOCA==
X-MS-Exchange-Transport-CrossTenantHeadersStamped: CH0PR13MB4650
Archived-At: <https://mailarchive.ietf.org/arch/msg/idr/hgi0ozOobGrLOUeDj4YgMn1pWc0>
Subject: Re: [Idr] What are the solutions to address large number of routes convergence caused by Cloud Infrastructure failure described in draft-ietf-rtgwg-net2cloud-problem-statement?
X-BeenThere: idr@ietf.org
X-Mailman-Version: 2.1.39
Precedence: list
List-Id: Inter-Domain Routing <idr.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/idr>, <mailto:idr-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/idr/>
List-Post: <mailto:idr@ietf.org>
List-Help: <mailto:idr-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/idr>, <mailto:idr-request@ietf.org?subject=subscribe>
X-List-Received-Date: Tue, 02 Aug 2022 21:27:53 -0000

ChongFeng,

When a UE roams from one UPF to another and keep the same IP address, a closer Edge DC might have the service instances that the UE is accessing.
Depending on the stickiness of the service, a very small number of services might need to stick to the original service instance closer to the UPF before the moving.
See this draft for detail: https://datatracker.ietf.org/doc/draft-dunbar-6man-5g-edge-compute-sticky-service/

Linda




From: Chongfeng Xie <chongfeng.xie@foxmail.com>
Sent: Friday, July 29, 2022 3:26 AM
To: Linda Dunbar <linda.dunbar@futurewei.com>; RTGWG <rtgwg@ietf.org>; idr <idr@ietf.org>
Subject: Re: [Idr] What are the solutions to address large number of routes convergence caused by Cloud Infrastructure failure described in draft-ietf-rtgwg-net2cloud-problem-statement?


Hi,Linda,
You mentioned the mobile case in your presentation yesterday.  When the mobile terminal roams from one BS to another BS,  because UPF is the access anchor point, the address of the termianl will not change as long as it is continuous served by the same UPF of mobile network, is ther any specific requirement or effect to Net2cloud in this case?

Best regards
Chongfeng
________________________________
chongfeng.xie@foxmail.com<mailto:chongfeng.xie@foxmail.com>

From: Linda Dunbar<mailto:linda.dunbar@futurewei.com>
Date: 2022-06-30 05:49
To: rtgwg@ietf.org<mailto:rtgwg@ietf.org>; idr@ietf.org<mailto:idr@ietf.org>
Subject: [Idr] What are the solutions to address large number of routes convergence caused by Cloud Infrastructure failure described in draft-ietf-rtgwg-net2cloud-problem-statement?
BGP experts:

The Section 3.2 of https://datatracker.ietf.org/doc/draft-ietf-rtgwg-net2cloud-problem-statement/<https://nam11.safelinks.protection.outlook.com/?url=https%3A%2F%2Fdatatracker.ietf.org%2Fdoc%2Fdraft-ietf-rtgwg-net2cloud-problem-statement%2F&data=05%7C01%7Clinda.dunbar%40futurewei.com%7C7fa1ad7f0a674005675a08da713bea99%7C0fee8ff2a3b240189c753a1d5591fedc%7C1%7C0%7C637946799410688623%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=51LhdN4fM2i1DEBC51z4piqNe6U%2BLTFjzAzpG9RBZMM%3D&reserved=0> describes a problem of a Cloud DC infrastructure failure, that may lead to massive route changes.

   As described in RFC7938, Cloud DC BGP might not have an IGP to route
   around link/node failures within the Assess. Fiber-cut is not uncommon
   within Cloud DCs or between sites. Sometimes, an entire cloud data
   center goes dark caused by a variety of reasons, such as too many
   changes and updates at once, changes of outside of maintenance
   windows, cybersecurity threats attacks, cooling failures,
   insufficient backup power, etc. When those events happen, massive
   numbers of routes need to be changed.

   The large number of routes switching over to another site can also
   cause overloading that triggers more failures.

   In addition, the routes (IP addresses) in a Cloud DC cannot be
   aggregated nicely, triggering very large number of BGP UPDATE
   messages when a failure occurs.

EVPN [RFC7432] defined mass withdraw mechanism to signal a large number  of routes being changed to remote PE nodes.

Is Mass withdrawn supported by all networks?

Thank you
Linda Dunbar