Re: [Tsv-art] Tsvart early review of draft-ietf-rtgwg-net2cloud-problem-statement-22

Linda Dunbar <linda.dunbar@futurewei.com> Mon, 17 April 2023 21:51 UTC

From: Linda Dunbar <linda.dunbar@futurewei.com>
To: Lukasz Bromirski <lukasz.bromirski@gmail.com>
CC: David Black <david.black@dell.com>, "tsv-art@ietf.org" <tsv-art@ietf.org>, "draft-ietf-rtgwg-net2cloud-problem-statement.all@ietf.org" <draft-ietf-rtgwg-net2cloud-problem-statement.all@ietf.org>, "rtgwg@ietf.org" <rtgwg@ietf.org>
Thread-Topic: Tsvart early review of draft-ietf-rtgwg-net2cloud-problem-statement-22
Thread-Index: AQHZZnEFlDaRI8oI00272vvtZ3jEC68rFW7AgABhNICAAakFAIACrr4AgABHr6A=
Date: Mon, 17 Apr 2023 21:51:14 +0000
Message-ID: <CO1PR13MB4920BAB57B60E159ADB3FBFE859C9@CO1PR13MB4920.namprd13.prod.outlook.com>
References: <168055635654.11507.17750417804419163710@ietfa.amsl.com> <PH0PR13MB49229EDCFEC1D54173EA590585999@PH0PR13MB4922.namprd13.prod.outlook.com> <FDAE23AC-5834-4EC3-B368-249F94E9DE9F@gmail.com> <CO1PR13MB492041AD9B39FFE205EAF5AA859F9@CO1PR13MB4920.namprd13.prod.outlook.com> <A7BCC79A-45FA-4427-8068-BCC0162BA538@gmail.com>
In-Reply-To: <A7BCC79A-45FA-4427-8068-BCC0162BA538@gmail.com>
Archived-At: <https://mailarchive.ietf.org/arch/msg/tsv-art/MzyW1pRIENtJZSjDVqQgwyxiYiA>
Subject: Re: [Tsv-art] Tsvart early review of draft-ietf-rtgwg-net2cloud-problem-statement-22

Lukasz,

Thank you very much for the feedback. Please see below for the detailed resolutions.

Linda

From: Lukasz Bromirski <lukasz.bromirski@gmail.com>
Sent: Monday, April 17, 2023 11:57 AM
To: Linda Dunbar <linda.dunbar@futurewei.com>
Cc: David Black <david.black@dell.com>; tsv-art@ietf.org; draft-ietf-rtgwg-net2cloud-problem-statement.all@ietf.org; rtgwg@ietf.org
Subject: Re: Tsvart early review of draft-ietf-rtgwg-net2cloud-problem-statement-22

Linda,

Thanks - my responses inline:


On 16 Apr 2023, at 03:16, Linda Dunbar <linda.dunbar@futurewei.com> wrote:

Lukasz,

Thank you very much for reviewing the document and the comments.
Please see below for the resolutions to your comments.

Linda

From: Łukasz Bromirski <lukasz.bromirski@gmail.com>
Sent: Friday, April 14, 2023 5:38 PM
To: Linda Dunbar <linda.dunbar@futurewei.com>
Cc: David Black <david.black@dell.com>; tsv-art@ietf.org; draft-ietf-rtgwg-net2cloud-problem-statement.all@ietf.org; rtgwg@ietf.org
Subject: Re: Tsvart early review of draft-ietf-rtgwg-net2cloud-problem-statement-22

Hi Linda, Group,

Let me offer some points related to the latest version of the draft:

1. "DSVPN" - this is a Huawei-specific term describing VPNs that allow dynamic connections between spokes, which is itself a 1:1 copy of Cisco DMVPN, down to the use of NHRP and mGRE (https://support.huawei.com/enterprise/en/doc/EDOC1100112360/a485316c/overview-of-dsvpn). Shouldn't we avoid vendor-specific product/solution names in RFC documents?

It's actually called out again in point 4.2 later on along with Cisco's DMVPN callout at the same time (which itself is not defined anywhere).
[Linda] Agree with your point. Is "NHRP [RFC2735] based multi-point VPN" a better name? Or can you suggest a name to indicate the NHRP-based multi-point-to-point or multi-point-to-multi-point tunnels among the clients' own virtual routers?

NHRP is just a piece of the wider architecture, but yes, if you're looking for something short to put as a description of the concept, I'd say something like "Dynamic VPN solution for p2p or p2mp".

[Linda] Changed in -25 version per your suggestion.

However, I'd question whether this definition is even needed, given it's mentioned only 5 times, four of which are "DMVPN or DSVPN", while a DMVPN definition is nowhere to be found. There's a clear skew toward Huawei-specific naming here.

[Linda] All references to DMVPN/DSVPN have been removed in the -25 revision.

It can easily be replaced with the definition above without explicitly mentioning vendor-specific names at all, given how it's used to describe specific use cases.


2.

"3.1: [...] Cloud GWs need to peer with a larger variety of parties, via private circuits or IPsec over public internet."

As far as I understood, the whole 3.1 section tries to underline the need for a flexible/resilient BGP implementation, and I agree with that. However, I'd argue that a lot of cloud-based connections happen via BGP over the Internet directly, not necessarily through private circuits or IPsec. The 4.2 section of that draft even mentions some examples of that use case.

[Linda] Azure's ExpressRoute (https://azure.microsoft.com/en-us/products/expressroute) and AWS's Direct Connect (https://aws.amazon.com/directconnect/) run over private circuits and are widely used. They all limit the inbound routes, both via Direct Connect and via the Internet.

That's correct, but that's not the point I was trying to make. Some connections are directly over BGP without need to either use MPLS VPNs (who even allows that?) or IPsec. There's so much focus in this doc that the connectivity to the cloud is either of those two options (MPLS VPN and IPsec) and that's simply not the case.

Direct peerings and/or GRE tunneling are used as well. Not to mention any way of tunneling if Customer is deploying their own virtual routers at the edge of specific cloud service which is very common at this point.

[Linda] Will add a section on the direct peering over GRE in version -25.


Also, case in point below:

There's so much focus in the document on only two types of connection - MPLS VPN or IPsec. The actual use case of connecting your workload to the cloud can be easily addressed by any type of overlay routing, like GRE or VXLAN/GENEVE terminated on the virtual cloud gateway.
[Linda] When the Cloud connection is via the Internet, IPsec is exclusively used as the outer header. Within the IPsec payload, client routes can be encapsulated by VXLAN/GENEVE, which is not the focus of the document.

It is not.

Look at the GRE options for example (https://docs.aws.amazon.com/vpc/latest/tgw/tgw-connect.html) or general connectivity for private direct peerings.

[Linda] Thanks for the link. It covers the transit gateway for interconnecting over AWS Direct Connect. I will add a section on direct peering in version -25.

On top of that, Enterprises do deploy their own virtual routers at the edge of public clouds and bring their own way of extending overlays - up to and including EVPN-based solutions on top of GRE or VXLAN/GENEVE encaps. There's no need for IPsec in those cases, as traffic is typically encrypted via TLS in the app layer anyway, or in some cases enterprises simply don't care about encrypting the traffic. There are a number of reasons that's the case, not relying on a specific internetworking configuration being one of the most important ones.

"When inbound routes exceed the maximum routes threshold for a peer, the current common practice is generating out of band alerts (e.g., Syslog) via management system to the peer, or terminating the BGP session (with cease notification messages [RFC 4486] being sent)."

For completeness' sake, shouldn't we explicitly state what the action is in the first case? Typically, the additional routes above the threshold are ignored, and this in turn may lead to other reachability problems.
[Linda] At the current time, there is no standard procedure for when inbound routes exceed the maximum limits or cross certain thresholds. We are planning to write a standards-track draft in the IDR WG to kick-start the discussion, covering options like sending notifications when a threshold is crossed, ignoring routes that are not originated by the clients, or having some kind of policy on ignoring additional routes. There will be a lot of debate on this subject. The IDR WG has made many attempts at this in the past. None has reached consensus.

Got it, but current gen of routers typically give you two options - drop the session (described as option #2) or keep the session up and ignore additional prefixes. It's not really that important here, but just wanted to ask if it wouldn't make sense to clarify what happens in option #1.
[Linda] Somehow it has been important to many people in IDR in the past. I hope you can join us on the new draft.
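
The two behaviours discussed in this exchange (tear the session down, or keep it up and ignore prefixes above the limit) can be contrasted with a small sketch. This is only an illustrative model, not any router's implementation: the names `PeerSession`, `OverLimitAction` and `simulate` are hypothetical, and the model assumes the limit is a simple count of accepted prefixes.

```python
from dataclasses import dataclass, field
from enum import Enum


class OverLimitAction(Enum):
    TEARDOWN = "teardown"      # drop the session once the limit is exceeded
    IGNORE_EXCESS = "ignore"   # keep the session, discard prefixes above the limit


@dataclass
class PeerSession:
    """Hypothetical model of one BGP peer with an inbound prefix limit."""
    max_prefixes: int
    action: OverLimitAction
    established: bool = True
    accepted: set = field(default_factory=set)
    ignored: set = field(default_factory=set)

    def receive(self, prefix: str) -> None:
        # Nothing to do if the session is down or the prefix is already known.
        if not self.established or prefix in self.accepted:
            return
        if len(self.accepted) < self.max_prefixes:
            self.accepted.add(prefix)
        elif self.action is OverLimitAction.TEARDOWN:
            # Session is terminated; every route learned from the peer is lost.
            self.established = False
            self.accepted.clear()
        else:
            # Session stays up, but this prefix is silently unreachable.
            self.ignored.add(prefix)


def simulate(action: OverLimitAction, prefixes: list, limit: int) -> PeerSession:
    """Feed a list of prefixes to a fresh session and return its final state."""
    session = PeerSession(max_prefixes=limit, action=action)
    for prefix in prefixes:
        session.receive(prefix)
    return session
```

In this model the "ignore" policy keeps the session established but leaves the excess prefixes unreachable, which is the reachability concern raised in the comment above.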

"3.4.1: [...] Therefore, the edge Cloud that is the closest doesn't contribute much to the overall latency."

How is that a problem?

[Linda] Here is what it is intended to say:

1. The difference in routing distances to multiple server instances in different edge Clouds is relatively small. Therefore, the edge Cloud with the shortest routing distance might not provide the best overall latency.

Yeah, that version makes more sense. The one I quoted should be fixed then?
[Linda] Yes, in version -25.
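
The reworded point (the edge Cloud with the shortest routing distance is not necessarily the one with the best overall latency) can be illustrated with a short sketch. The site names and all numbers are made-up assumptions for illustration, and overall latency is modelled simply as network round-trip time plus server processing time.

```python
def nearest_site(sites: dict) -> str:
    """Pick the site with the smallest network RTT (shortest routing distance)."""
    return min(sites, key=lambda name: sites[name]["rtt_ms"])


def best_site(sites: dict) -> str:
    """Pick the site with the smallest RTT plus server processing time."""
    return min(
        sites,
        key=lambda name: sites[name]["rtt_ms"] + sites[name]["processing_ms"],
    )


# Hypothetical measurements: RTTs differ by only 2 ms, server load differs a lot.
sites = {
    "edge-a": {"rtt_ms": 4.0, "processing_ms": 30.0},  # closest, but busy
    "edge-b": {"rtt_ms": 6.0, "processing_ms": 10.0},  # slightly farther, lightly loaded
}

print(nearest_site(sites))  # edge-a
print(best_site(sites))     # edge-b
```

Because the routing distances are close, the processing term dominates and the slightly farther site gives the lower total.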
"4.3: [...] However, traditional MPLS-based VPN solutions are sub-optimized for dynamically connecting to workloads/applications in cloud DCs."

The whole section says existing MPLS VPNs and/or IPsec tunnels are being used to connect to Cloud DCs. So how exactly are the "traditional MPLS-based VPNs" "sub-optimized" if at the same time they're the exact means the document mentions for solving the problem?
[Linda] "sub-optimal" because
The Provider Edge (PE) nodes of the enterprise's VPNs might not have direct connections to the third-party cloud DCs used by the enterprise to provide easy access to its end users. When the user base changes, the enterprise's workloads/applications may be migrated to a new cloud DC location closest to the new user base. The existing MPLS VPN provider might not have PEs at the new location. Deploying PEs routers at new locations is not trivial, which defeats one of the benefits of Clouds' geographically diverse locations allowing workloads to be as close to their end-users as possible.

Yeah, that's the point I make below. Given how distributed current ISP infra is (where it can provide MPLS VPNs to Customers) versus how centralized and limited cloud DC physical connectivity is, this statement is not true. It's easier to find an MPLS VPN offering in a given geography than to find a cloud DC there - they are very limited in number. Let's take a look at the AWS site map:
https://aws.amazon.com/about-aws/global-infrastructure/ and compare this with any major ISP PoP map.

"4.3. [...] The existing MPLS VPN provider might not have PEs at the new location. Deploying PEs routers at new locations is not trivial, which defeats one of the benefits of Clouds' geographically diverse locations allowing workloads to be as close to their end-users as possible."

When reading this literally, I'd say that any SP offering MPLS VPNs will be anyway more flexible in terms of reach (if it covers given geo) than pretty much fixed and limited number of cloud DCs available. However, I sense the intent here was to underline role of "agile" DCs set up by for example "cloud" stacks of 5G services (and similar services), and if so - that likely would require some clarification to be well understood.
[Linda] Setting up MPLS circuits takes weeks/months.

Sure, but that's not what the point says.


"4.3. [...] As MPLS VPNs provide more secure and higher quality services, choosing a PE closest to the Cloud GW for the IPsec tunnel is desirable to minimize the IPsec tunnel distance over the public Internet."

MPLS VPNs provide more secure and higher quality services.... than what?
[Linda] MPLS VPNs utilize private links. Entry to MPLS VPNs through edge filters provides additional filtering. These are more secure than the public Internet.

I could agree with that, but such statements should be stated explicitly (why we believe that's so). Different people will think about "more secure" in different ways. Some focus on encryption, some on authentication, some on the routing security you seem to mention. I assume the point was about all of that, so let's clarify that.



"4.3. [...] As multiple Cloud DCs are interconnected by the Cloud provider's own internal network, the Cloud GW BGP session might advertise all of the prefixes of the enterprise's VPC, regardless of which Cloud DC a given prefix is actually in. This can result in inefficient routing for the end-to-end data path."

That's true, but either we praise the use of anycast (in the doc above) or we claim it's inferior to polluting the routing table (announcing more prefixes) or limiting visibility (announcing fewer prefixes). You can't really have it both ways.
[Linda] The intent of the section is to document the problem and describe a workaround:
To get around this problem, virtual routers in Cloud DCs can be used to attach metadata (e.g., GENEVE header or IPv6 optional header) to indicate Geo-location of the Cloud DCs.
Can you suggest a better text?

Maybe something like:

"As multiple Cloud DCs are interconnected by the Cloud provider's own internal network, its topology and routing policies are not transparent or even visible to the Enterprise Customer. While normally Cloud GW BGP sessions will provide prefixes across Enterprise VPCs, and that typically achieves the goals of universal connectivity, load-balancing (due to ECMP) and high availability (multiple Cloud DC points can go down and the rest will still provide service), it's worth noting that the default configuration may not provide the best or even a stable end-to-end data path for Customer traffic."
[Linda] Good suggestion. Still a little different from the original intent:

  *   BGP typically advertises all the attached routes.
  *   One Cloud GW BGP session might advertise all of the prefixes of the enterprise's VPC, regardless of which Cloud DC a given prefix is actually located in.
  *   This can cause improper path selection (in ECMP or weighted cost multi-path).
To get around: virtual routers in Cloud DCs can be used to attach metadata (e.g., GENEVE header or IPv6 optional header) to indicate Geo-location of the Cloud DCs

How about the following:
As multiple Cloud DCs are interconnected by the Cloud provider's own internal network, its topology and routing policies are not transparent or even visible to the Enterprise Customer. One Cloud GW BGP session might advertise prefixes across Enterprise VPCs, which can cause improper path selection by the enterprise's on-prem routers. To get around this problem, virtual routers in Cloud DCs can be used to attach metadata (e.g., GENEVE header or IPv6 optional header) to indicate Geo-location of the Cloud DCs.


"7. [...] Potential risk of augmenting the attack surface with inter-Cloud DC connection by means of identity spoofing, man-in-the-middle, eavesdropping or DDoS attacks. One example of mitigating such attacks is using DTLS to authenticate and encrypt MPLS-in-UDP encapsulation (RFC 7510)."

How is it different from the protection offered by IPsec?
[Linda] This section is about attacks on the public-facing "interface" that supports IPsec.

Yeah, but DTLS would be susceptible to the same or even easier DDoS than IPsec. So that would likely need a complete rewrite to address both at the same time, just like the point below:


"7. [...] When IPsec tunnels established from enterprise on-premises CPEs are terminated at the Cloud DC gateway where the workloads or applications are hosted, traffic to/from an enterprise's workload can be exposed to others behind the data center gateway (e.g., exposed to other organizations that have workloads in the same data center).

To ensure that traffic to/from workloads is not exposed to unwanted entities, IPsec tunnels may go all the way to the workload (servers, or VMs) within the DC."

How would that problem statement be different from the DTLS solution/protection from the beginning of the section?

[Linda] DTLS is at the Transport Layer. Here we are talking about the IP layer. The answer to that security question is long. As you know, IPsec has different attack planes than DTLS, at different costs. Are you looking for a chart that compares this facet? Or can you simply reference the appropriate RFCs?

No, the whole point of my commenting on that and the DTLS section was that they try to describe risks but reference each other as the solution to the problem. First of all, neither of them is actually a solution to the problem.

[Linda] The document is in the RTGWG, for routers that only forward the received packets and do not terminate them. DTLS is at the transport layer, for hosts that terminate the packets.

Thanks,
-
Łukasz Bromirski