Re: [nvo3] Shepherd's review of draft-ietf-nvo3-vmm-06

"Bocci, Matthew (Nokia - GB)" <matthew.bocci@nokia.com> Fri, 21 February 2020 16:45 UTC

Return-Path: <matthew.bocci@nokia.com>
X-Original-To: nvo3@ietfa.amsl.com
Delivered-To: nvo3@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 44B131201EA; Fri, 21 Feb 2020 08:45:06 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.9
X-Spam-Level:
X-Spam-Status: No, score=-1.9 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIMWL_WL_HIGH=-0.001, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, HTML_MESSAGE=0.001, RCVD_IN_DNSWL_NONE=-0.0001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (1024-bit key) header.d=nokia.onmicrosoft.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id iFtQe3qiUckm; Fri, 21 Feb 2020 08:45:01 -0800 (PST)
Received: from EUR03-DB5-obe.outbound.protection.outlook.com (mail-eopbgr40139.outbound.protection.outlook.com [40.107.4.139]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 479531200F1; Fri, 21 Feb 2020 08:45:01 -0800 (PST)
ARC-Seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=ONAji9qmXGDO6kVCcfjnQIXgxcqxhAU9ioM9foX3MpiqT8yfZhfeOqp/18440X0QrajCh52GIZBpmUudsrJJ19LFEmCMHqXAK0xFg122mAV7B1BGIi9GOzi3vDtIrMwPAPhLMLUcKsmm/lzIMpfqijStNa+S8CHd7N/JBCtiZfmlipR7R+pdxoOeLeyaax297sdBYAik1ymsoQc50YzcckUI/00pVNQhiY8jhVWVZ87JEpjfSKaMkSFVrAJoOfeobhjVkmNQLA/6TD8UrNOnXLquh1rNYYVIQhi9CsyvbsH0NWShZw7WaJ1HsMwEsogFmsGtWkefGGayGnq6YiGBWQ==
ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=JSMJ5y6ilI46FGsQRnpI38pb9OeMo33agQ+uuIG7APQ=; b=J24KZehQTPgPh+kgk6FOgYAKiaBmyZOJQh7U16T/2oyUeoQOIMe2RxjwCEOT3dc4ChkdhONo/Y4ryfLK42q2qrcxS13Kauue4wDPJLMg7nMjDwnueTqIK857fjVwSM+4AEv+YnNs8bhTIXy5G+NdXd4Ddbv142HG6SW8GhwlryMPG1XnYa4WvrILm0erYCNv8M1Edw5Gd88xdVPNgXglzT6W68r1KReHNSucq9hMSIyb8dzWHT3h85kFntRXp9N8nBJVPnRPNQvfL0CG1TWkwbypRczCUBZQGOoHuq6ISR3fnmz6ryxApuxExCkzh+XlU36hvUnhSLWvRHXSR87iew==
ARC-Authentication-Results: i=1; mx.microsoft.com 1; spf=pass smtp.mailfrom=nokia.com; dmarc=pass action=none header.from=nokia.com; dkim=pass header.d=nokia.com; arc=none
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=nokia.onmicrosoft.com; s=selector1-nokia-onmicrosoft-com; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=JSMJ5y6ilI46FGsQRnpI38pb9OeMo33agQ+uuIG7APQ=; b=kGaQsTgiGK0ko61uaxRo3iZBeZJ9f0e0WDNgtrAkND3EwMnOxlwIP5w8KfHn/0UcS34pLCGXj7hn2k6bzB1QL/HeSYAfk7a8EpLMYhZ7zwebr+OzI71VrgnvO8nooiG97dUBLgtiriN4dWsC/mD4iJFZwQ0CHRwJlAzLdL1fvIA=
Received: from DB7PR07MB4106.eurprd07.prod.outlook.com (52.134.103.159) by DB7PR07MB3948.eurprd07.prod.outlook.com (52.134.97.18) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.2750.8; Fri, 21 Feb 2020 16:44:58 +0000
Received: from DB7PR07MB4106.eurprd07.prod.outlook.com ([fe80::d1d:70e6:1bc8:7437]) by DB7PR07MB4106.eurprd07.prod.outlook.com ([fe80::d1d:70e6:1bc8:7437%7]) with mapi id 15.20.2750.021; Fri, 21 Feb 2020 16:44:58 +0000
From: "Bocci, Matthew (Nokia - GB)" <matthew.bocci@nokia.com>
To: "draft-ietf-nvo3-vmm@ietf.org" <draft-ietf-nvo3-vmm@ietf.org>, Linda Dunbar <linda.dunbar@futurewei.com>
CC: NVO3 <nvo3@ietf.org>
Thread-Topic: Shepherd's review of draft-ietf-nvo3-vmm-06
Thread-Index: AQHV506G8r/CExDZbUuMIkUXDsuycqgkibfQgAD6mQCAAFLOEIAABptC
Date: Fri, 21 Feb 2020 16:44:58 +0000
Message-ID: <DB7PR07MB4106EBC19BD973F1E72F6104EB120@DB7PR07MB4106.eurprd07.prod.outlook.com>
References: <F3AE9A2C-20AA-452F-AC3B-DC1E4380634B@nokia.com> <MWHPR1301MB209646A4B133F35DFE3F57C385120@MWHPR1301MB2096.namprd13.prod.outlook.com> <29C0C34D-69DF-47D7-8ACD-2356FDB015BB@nokia.com>, <MWHPR1301MB2096305A69C1E433BDB5231F85120@MWHPR1301MB2096.namprd13.prod.outlook.com>
In-Reply-To: <MWHPR1301MB2096305A69C1E433BDB5231F85120@MWHPR1301MB2096.namprd13.prod.outlook.com>
Accept-Language: en-GB, en-US
Content-Language: en-GB
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
authentication-results: spf=none (sender IP is ) smtp.mailfrom=matthew.bocci@nokia.com;
x-originating-ip: [81.108.178.133]
x-ms-publictraffictype: Email
x-ms-office365-filtering-ht: Tenant
x-ms-office365-filtering-correlation-id: 8346aefb-d0a8-4fd9-9e41-08d7b6ed632f
x-ms-traffictypediagnostic: DB7PR07MB3948:
x-microsoft-antispam-prvs: <DB7PR07MB3948FD556D6147FD1E6F5EF2EB120@DB7PR07MB3948.eurprd07.prod.outlook.com>
x-ms-oob-tlc-oobclassifiers: OLM:9508;
x-forefront-prvs: 0320B28BE1
x-forefront-antispam-report: SFV:NSPM; SFS:(10019020)(4636009)(396003)(346002)(136003)(366004)(39860400002)(376002)(199004)(189003)(478600001)(26005)(66556008)(71200400001)(316002)(186003)(64756008)(53546011)(66476007)(66446008)(91956017)(110136005)(66946007)(2906002)(7696005)(55236004)(45080400002)(76116006)(6506007)(19627235002)(55016002)(5660300002)(52536014)(9686003)(33656002)(8676002)(30864003)(8936002)(81156014)(81166006)(86362001)(4326008); DIR:OUT; SFP:1102; SCL:1; SRVR:DB7PR07MB3948; H:DB7PR07MB4106.eurprd07.prod.outlook.com; FPR:; SPF:None; LANG:en; PTR:InfoNoRecords; A:1; MX:1;
received-spf: None (protection.outlook.com: nokia.com does not designate permitted sender hosts)
x-ms-exchange-senderadcheck: 1
x-microsoft-antispam: BCL:0;
x-microsoft-antispam-message-info: ztmC60rVc6VpoARrtmf6TBOqlyUhvGHVDW/kjtcj4CpYeQtdnYw0tbpAHt84nRV6QldhFxGnXuiCgFBe1ltw5ry81AdHLIrd65KaJkSVEORH8fGSU8ilVwXlSfdEGHtiACNShDetSjxNvdJmrQv4LwUISRM9DFOKuncAWK70vHn3wYoBfvbcHJR0TpUZc2TWnuQne+fMqzPSUAtrL7+rblwxAsdodrOemCku/pPzWRZ92Fbc7zHmSAIr829E2J+4Z5un2uw3dbH6dYGJMsmp0ywKNc4bd4v7XpheazZxysqhMeljInlhr2dlH6F9bxCqga7FDcgprrDBcuFQu5E0tybKcKa8K1Szy0y61aXy0aqobOTIwYDuW6YZavAmDKV0B03liHb8sakvA6su1R8QdxpQ53K2bn/G6ApUB5CsvA3Vh9ied3tmJ7dueOe3l+QUI/veC9Qq7fGjV9FpFXQtp3SN7Ic/gQwCNPLwk/EuV74QMj8QedYyKEaVEgM/6NAT58Wtyrp7IORzhPAEy6alDQ==
x-ms-exchange-antispam-messagedata: Cr/t+jU1zniKSZ8ilFdNBZZqN4yN6/KcLsbfZ1mwPPYbzALQnhVeID3RXxQsVz+Mf2PuXVcPdettQ+XMkFOptCL1ygo4scg0qtEprIpW+mPxVDuCtaHvJnZE2MSnuEY5Vy5+v/OL5anER4GGHeQeMw==
x-ms-exchange-transport-forked: True
Content-Type: multipart/alternative; boundary="_000_DB7PR07MB4106EBC19BD973F1E72F6104EB120DB7PR07MB4106eurp_"
MIME-Version: 1.0
X-OriginatorOrg: nokia.com
X-MS-Exchange-CrossTenant-Network-Message-Id: 8346aefb-d0a8-4fd9-9e41-08d7b6ed632f
X-MS-Exchange-CrossTenant-originalarrivaltime: 21 Feb 2020 16:44:58.3234 (UTC)
X-MS-Exchange-CrossTenant-fromentityheader: Hosted
X-MS-Exchange-CrossTenant-id: 5d471751-9675-428d-917b-70f44f9630b0
X-MS-Exchange-CrossTenant-mailboxtype: HOSTED
X-MS-Exchange-CrossTenant-userprincipalname: u7QKXwEkUjvyLp+q3kZht1n1B+iZAaDeZvDVDzseRlqbXkvdKCa4/XU9T/A9YNU9D22eZhc3sxb8pXtMzIcytA==
X-MS-Exchange-Transport-CrossTenantHeadersStamped: DB7PR07MB3948
Archived-At: <https://mailarchive.ietf.org/arch/msg/nvo3/qqMXapHsq-4qGapyawFda-skENI>
Subject: Re: [nvo3] Shepherd's review of draft-ietf-nvo3-vmm-06
X-BeenThere: nvo3@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: "Network Virtualization Overlays \(NVO3\) Working Group" <nvo3.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/nvo3>, <mailto:nvo3-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/nvo3/>
List-Post: <mailto:nvo3@ietf.org>
List-Help: <mailto:nvo3-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/nvo3>, <mailto:nvo3-request@ietf.org?subject=subscribe>
X-List-Received-Date: Fri, 21 Feb 2020 16:45:07 -0000

Hi Linda

Ok let's go forward with that revised wording.

Thanks
Matthew

Get Outlook for Android<https://aka.ms/ghei36>

________________________________
From: Linda Dunbar <linda.dunbar@futurewei.com>
Sent: Friday, February 21, 2020 4:36:21 PM
To: Bocci, Matthew (Nokia - GB) <matthew.bocci@nokia.com>; draft-ietf-nvo3-vmm@ietf.org <draft-ietf-nvo3-vmm@ietf.org>
Cc: NVO3 <nvo3@ietf.org>
Subject: RE: Shepherd's review of draft-ietf-nvo3-vmm-06




Matthew,





How about the following change to address your comment?

Current: “Old NVE gets New NVE address from NVA in the request to move the VM.”



Revised: “Old NVE gets New NVE address from its NVA, assuming that the NVA gets the notification when a VM is moved from one NVE to another. It is out of the scope of this document on which entity manages the VM move and how NVA gets notified of the move”.





Thank you.



Linda





MB> How does the old NVE know this? I assume the NVA pushes this to the old NVE, but it is not at all clear from the text. Please clarify.

[Linda] “Old NVE gets New NVE address from NVA in the request to move the VM”. Not clear?



MB2> The directionality of the ‘request to move the NVE’ is not clear to me. I assume what happens is that when some entity decides to move the VM from one server to another then it tells the NVA to update all the NVEs in the DC VPN accordingly.

So the NVA pushes the information about how to reach the VM at the new NVE from the old NVE. I don’t think the NVA actually moves the VM, since the NVA is just managing the network, not the spin up/down/migration of the VMs themselves. Likewise, I don’t think the NVE asks the NVA to move a VM as it is not much more than a tunnel endpoint.  It would be helpful if you can clarify the sequence of steps and the directionality of any messaging here.





From: Bocci, Matthew (Nokia - GB) <matthew.bocci@nokia.com>
Sent: Friday, February 21, 2020 5:24 AM
To: Linda Dunbar <linda.dunbar@futurewei.com>; draft-ietf-nvo3-vmm@ietf.org
Cc: NVO3 <nvo3@ietf.org>
Subject: Re: Shepherd's review of draft-ietf-nvo3-vmm-06



Hi Linda,



Thanks for your responses. I am fine with most of these.



I have just a couple of comments below.



Matthew



From: Linda Dunbar <linda.dunbar@futurewei.com<mailto:linda.dunbar@futurewei.com>>
Date: Friday, 21 February 2020 at 00:41
To: "Bocci, Matthew (Nokia - GB)" <matthew.bocci@nokia.com<mailto:matthew.bocci@nokia.com>>, "draft-ietf-nvo3-vmm@ietf.org<mailto:draft-ietf-nvo3-vmm@ietf.org>" <draft-ietf-nvo3-vmm@ietf.org<mailto:draft-ietf-nvo3-vmm@ietf.org>>
Cc: NVO3 <nvo3@ietf.org<mailto:nvo3@ietf.org>>
Subject: RE: Shepherd's review of draft-ietf-nvo3-vmm-06



Matthew,



Thank you very much for the detailed comments. They are very helpful.

Inserted below are the detailed changes per your comments.



Can you let us know if they are acceptable?



Thank you.



Linda



From: Bocci, Matthew (Nokia - GB) <matthew.bocci@nokia.com<mailto:matthew.bocci@nokia.com>>
Sent: Wednesday, February 19, 2020 12:01 PM
To: draft-ietf-nvo3-vmm@ietf.org<mailto:draft-ietf-nvo3-vmm@ietf.org>
Cc: NVO3 <nvo3@ietf.org<mailto:nvo3@ietf.org>>
Subject: Shepherd's review of draft-ietf-nvo3-vmm-06



Authors



Here is my document shepherd’s review. Please treat these comments as you would any other last call comment.



Thanks



Matthew



The document is missing the definite article (‘the’, ‘a’ etc) in numerous places. Please go through the document carefully and correct these.

[Linda] thank you. We are going through to add them.



Please expand the term ‘DC’ on first use.

[Linda] added.



Section 3, Requirements

VM mobility should not require changing their IP addresses after the

   move.



MB> Who is ‘their’? Do you mean ‘a VM’s’ ?

[Linda]. Yes. Changed the sentence to “Virtual machine (VM) mobility should not require changing VMs’ IP addresses after the move.”



   There exist "Hot Migration" where transport service continuity is

   maintained, and "Cold Migration" where the transport service needs

   to be restarted, i.e., execution of the tasks   is stopped on the

   "Old" NVE, moved to the "New" NVE and the task is restarted.



MB> What is the requirement here? Can you rephrase this paragraph?



[Linda] Add the following sentences to the paragraph:

“Not all DCs support “Hot Migration. DCs that only support Cold Migration should make their customers aware of the potential service interruption during the Cold migration.”



4.1. VM Migration in Layer-2 Network

[...]

Therefore, this scheme is highly desirable for utilization in

     large scale multi-tenant DCs.



MB> Which scheme? Do you mean dynamic VM mobility? Please can you clarify this sentence.

[Linda] Yes. Revised the sentence to the following:

“Therefore, dynamic VM Mobility is highly desirable for large scale multi-tenant DCs.”



[...]



Such a change enables all NVEs

     to encapsulate the outgoing MAC frames with the current target NVE

     IP address. It may take some time to refresh the ARP/ND cache when

     a VM has moved to a New NVE.  During this period, a tunnel is

     needed for that Old NVE to forward packets destined to the VM

     under the New NVE.



MB> How does the old VM know which tunnel to use?

[Linda] the procedure is explained in the paragraph right following this sentence:

“In IPv4, the VM immediately after the move should send a gratuitous ARP request message containing its IPv4 and Layer 2 MAC address in its new NVE.  This message's destination address is the broadcast address.  Upon receiving this message, both Old and New NVEs should update the VM's ARP entry in the central directory at the NVA, to update its mappings to record the IPv4 address & MAC address of the moving VM along with the new NVE IPv4 address.  An NVE-to-NVA protocol is used for this purpose [RFC8014].”



[...]



Reverse ARP (RARP) which enables the host to discover its IPv4

     address when it boots from a local server [RFC0903], is not used

     by VMs because the VM already knows its IPv4 address. Next, we

     describe a case where RARP is used.



MB> Please clarify. First you say RARP is not used, but then you say it is. Perhaps it is just the was this paragraph is phrased.

[Linda] Changed the sentence to the following:

“Reverse ARP (RARP) which enables the host to discover its IPv4 address when it boots from a local server [RFC0903], is not used by VMs if the VM already knows its IPv4 address (most common scenario). Next, we describe a case where RARP is used.



4.2. Task Migration in Layer-3 Network



     Layer-2 based DC networks become quickly prohibitive because

     ARP/neighbor caches don't scale.



MB> That is a very strong statement for a BCP. Perhaps you mean to say that ARP/neighbour cache scalability considerations can limit the size of Layer-2 based DC networks?

[Linda] That is correct. Changed to your suggested wording.



[...]



Cold task migration, which is a common practice in many data

     centers, involves the following steps:



     - Stop running the task.

     - Package the runtime state of the job.

     - Send the runtime state of the task to the new NVE where the

        task is to run.

     - Instantiate the task's state on the new machine.

     - Start the tasks   continuing it from the point at which it was

        stopped.





     Address migration and connection migration in moving tasks or VMs

     are addressed next.



MB> This last sentence seems redundant. I suggest removing it.

[Linda] removed. Thanks.



[...]



5. Handling Packets in Flight



     The Old NVE may receive packets from the VM's ongoing

     communications. These packets should not be lost; they should be

     sent to the New NVE to be delivered to the VM.  The steps involved

     in handling packets in flight are as follows:



     Preparation Step:  It takes some time, possibly a few seconds for

     a VM to move from its Old NVE to a New NVE. During this period, a

     tunnel needs to be established so that the Old NVE can forward

     packets to the New NVE. Old NVE gets New NVE address from NVA in

     the request to move the VM. The Old NVE can store the New NVE

     address for the VM with a timer. When the timer expired, the entry

     for the New NVE for the VM can be deleted.



MB> How does the old NVE know this? I assume the NVA pushes this to the old NVE, but it is not at all clear from the text. Please clarify.

[Linda] “Old NVE gets New NVE address from NVA in the request to move the VM”. Not clear?



MB2> The directionality of the ‘request to move the NVE’ is not clear to me. I assume what happens is that when some entity decides to move the VM from one server to another then it tells the NVA to update all the NVEs in the DC VPN accordingly.

So the NVA pushes the information about how to reach the VM at the new NVE from the old NVE. I don’t think the NVA actually moves the VM, since the NVA is just managing the network, not the spin up/down/migration of the VMs themselves. Likewise, I don’t think the NVE asks the NVA to move a VM as it is not much more than a tunnel endpoint.  It would be helpful if you can clarify the sequence of steps and the directionality of any messaging here.



[...]



6. Moving Local State of VM



MB> This whole section seems to be out of scope of the DC VPN network. Therefore, I would think it is out of scope of this draft and should be removed.

[Linda] How about we remove the description for the detailed State Transferring. And states that it is out of the scope of this document?

6. Moving Local State of VM

In addition to the VM mobility related signaling (VM Mobility Registration Request/Reply), the VM state needs to be transferred to the New NVE.  The state includes its memory and file system if the VM cannot access the memory and the file system after moving to the New NVE.

The mechanism of transferring VM States and file system is out of the scope of this document.







[...]

There is also a Hot Standby option in addition to the Hot

     Mobility, where there are VMs in both primary and secondary NVEs.



MB> This section title says Hot Mobility, but only talks about Hot Standby.

[Linda] Added a new subtitle “Other VM Mobility Options”



     They have identical information and can provide services

     simultaneously as in load-share mode of operation.  If the VM in

     the primary NVE fails, there is no need to actively move the VM to

     the secondary NVE because the VM in the secondary NVE already

     contains identical information.  The Hot Standby option is the

     costliest mechanism, and hence this option is utilized only for

     mission-critical applications and services.  In Hot Standby

     option, regarding TCP connections, one option is to start with and

     maintain TCP connections to two different VMs at the same time.

     The least loaded VM responds first and starts providing service

     while the sender (origin) still continues to receive Ack from the

     heavily loaded (secondary) VM and chooses not to use the service

     of the secondary responding VM.  If the situation (loading

     condition of the primary responding VM) changes the secondary VM

     may start providing service to the sender (origin).



[...]



8. VM Operation

     Once a VM moves to a new NVE, the VM's IP address does not change

     and the VM should be able to continue to receive packets to its

     address(es).



MB> How does that work for the hot standby case? Do you swap the old/new VM IP addresses?

[Linda] moved the description of the behaviour on Hot VM Mobility to be under the Section 7 (Handling Hot, Warm and Cold VM Mobility). Changed the section title to “VM Lifecycle Management” (which is out of the scope of the document).