Re: [armd] address resolution requirement from hosts to overlay edge nodes. Any opinion?

Igor Gashinsky <igor@yahoo-inc.com> Wed, 15 February 2012 22:08 UTC

Return-Path: <igor@yahoo-inc.com>
X-Original-To: armd@ietfa.amsl.com
Delivered-To: armd@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 4C6C321E80A2 for <armd@ietfa.amsl.com>; Wed, 15 Feb 2012 14:08:54 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -18.599
X-Spam-Level:
X-Spam-Status: No, score=-18.599 tagged_above=-999 required=5 tests=[BAYES_00=-2.599, RCVD_IN_DNSWL_LOW=-1, USER_IN_DEF_WHITELIST=-15]
Received: from mail.ietf.org ([12.22.58.30]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id kVc2x8tWvLG7 for <armd@ietfa.amsl.com>; Wed, 15 Feb 2012 14:08:50 -0800 (PST)
Received: from mrout2.yahoo.com (mrout2.yahoo.com [216.145.54.172]) by ietfa.amsl.com (Postfix) with ESMTP id 1348B21E8032 for <armd@ietf.org>; Wed, 15 Feb 2012 14:08:49 -0800 (PST)
Received: from netops1.corp.bf1.yahoo.com (netops1.corp.bf1.yahoo.com [98.139.254.110]) by mrout2.yahoo.com (8.14.4/8.14.4/y.out) with ESMTP id q1FM8VLK011533; Wed, 15 Feb 2012 14:08:31 -0800 (PST)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=yahoo-inc.com; s=cobra; t=1329343711; bh=DpmZOM9EWe4B88hr+7sTdkmeY4NUjgoPAbKj7cwoG5Q=; h=Date:From:To:cc:Subject:In-Reply-To:Message-ID:References: MIME-Version:Content-Type:Content-ID; b=fPBh346CeGQkXvYGqVDNSqa99iuacoOKd8PUlbZvQti+uR7z826vKw9EPFUxqjc13 puddmAMMDUrEZcoW7E7QEuAXU8GG5Pq7yJpFTrB/4hgMUVQAwWk3BHtfv6Y8BZU6o6 6MHb0y8rdM+lE4KgAJ+zxytiNFKExuE21h7mKu/c=
Date: Wed, 15 Feb 2012 14:08:30 -0800 (PST)
From: Igor Gashinsky <igor@yahoo-inc.com>
X-X-Sender: igor@netops1.corp.bf1.yahoo.com
To: Mike McBride <mmcbride7@gmail.com>
In-Reply-To: <CAL3FGfwx=n9kKjwcARg6-ge2a-t-R+7RmR=d-qRJx=TdzNHMAQ@mail.gmail.com>
Message-ID: <alpine.LRH.2.00.1202141450530.7083@netops1.corp.bf1.yahoo.com>
References: <CA+-tSzzNeLP4N=Nv1EeBML51KTpmxPP3NWut+vnaWFy8RtUViA@mail.gmail.com> <7AE6A4247B044C4ABE0A5B6BF427F8E291E1A5@dfweml503-mbx> <CA+-tSzyvoDfwnKc7Yt65abQWSqMg2jF0iQax=wcYkmwtNGxZng@mail.gmail.com> <60C093A41B5E45409A19D42CF7786DFD522A9BE1F1@EUSAACMS0703.eamcs.ericsson.se> <CA+-tSzwZVYyEO62ngYGojwSrkSBBY2SWr93PDQmAp7a3y_7TMQ@mail.gmail.com> <CAL3FGfy0iyo_TTr-iuSzQuqRm8Li753UFWQsk=RGWh_nCdPMMw@mail.gmail.com> <CA+-tSzwFWBWd0_QZ4CqgQmjTUaXnBafNVdk8oZvK6oRTCR4Jqg@mail.gmail.com> <CAL3FGfwx=n9kKjwcARg6-ge2a-t-R+7RmR=d-qRJx=TdzNHMAQ@mail.gmail.com>
User-Agent: Alpine 2.00 (LRH 1167 2008-08-23)
MIME-Version: 1.0
Content-Type: MULTIPART/MIXED; BOUNDARY="-1956483330-726143141-1329260620=:7083"
Content-ID: <alpine.LRH.2.00.1202151352530.31155@netops1.corp.bf1.yahoo.com>
X-Mailman-Approved-At: Wed, 15 Feb 2012 14:11:20 -0800
Cc: Thomas Narten <narten@us.ibm.com>, "armd@ietf.org" <armd@ietf.org>
Subject: Re: [armd] address resolution requirement from hosts to overlay edge nodes. Any opinion?
X-BeenThere: armd@ietf.org
X-Mailman-Version: 2.1.12
Precedence: list
List-Id: "Discussion of issues associated with large amount of virtual machines being introduced in data centers and virtual hosts introduced by Cloud Computing." <armd.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/armd>, <mailto:armd-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/armd>
List-Post: <mailto:armd@ietf.org>
List-Help: <mailto:armd-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/armd>, <mailto:armd-request@ietf.org?subject=subscribe>
X-List-Received-Date: Wed, 15 Feb 2012 22:08:54 -0000

On Tue, 14 Feb 2012, Mike McBride wrote:

:: Anoop,
:: 
:: On Tue, Feb 14, 2012 at 12:32 PM, Anoop Ghanwani <anoop@alumni.duke.edu> wrote:
:: > The whole problem of sending L2 multicast over a campus or data center
:: > backbone, in any sort of significant way, is a new one enabled for the first
:: > time by overlays.  There are interesting challenges when pushing large
:: > amounts of multicast traffic through a network, and  have thus far been dealt
:: > with using purpose-built networks.  While the overlay proposals have
:: > been careful not to impose new protocol requirements, they have not
:: > addressed the issues of performance and scalability, nor the large-scale
:: > availability of these protocols.
:: 
:: There are interesting challenges with unicast and multicast in big
:: flat DC networks. That's why we are here. One of the constant themes
:: we hear with multicast is the FUD that it doesn't scale. Especially
:: from vendors who are pushing another solution or who have a crappy
:: multicast implementation. Meanwhile multicast is being deployed in
:: large scale networks without scaling issue. Large L2 overlay networks?
:: I don't know. Would be good to find out from the community about
:: performance and scalability of multicast in the DC.

I've so far stayed pretty quiet on this, but, on this, I have to strongly 
disagree. It is not FUD that multicast doesn't scale well inside large 
datacenters -- it is a simple fact, and I speak as an operator of what 
several of my vendors called the 2nd largest multicast deployment they 
have ever seen, with many 10's of thousands (S,G) entries. 

Sorry, but many of us who operate really large scale datacenters with 
a lot of variability in the communication flows have removed all 
*application* use of multicast from the network due to scalability 
and stability reasons (ie by the time an (S,G) entry is programmed into 
hardware, we might be 3k+ packets deep into the flow, or the flow is over 
already, or what happens when you need to scale to 20k+ (S,G)'s?). 
Obviously, network protocols are still able to use multicast, but due 
to the very low volumes of flows and (S,G)s that is still acceptable.

Yes, we could have pushed the multicast scaling capabilities a bit higher 
by use of BIDIR, and yes, if the network hardware/software was built 
better it could have scaled a bit further, but, that's not the world we 
live in, and that would have cost significant troubleshooting capability, and a 
lot of extra complexity which we chose to instead solve via unicast, more 
bandwidth, and better application software.

So, whatever overlay protocols are proposed, I would suggest that counting 
on there being multicast underneath to transport you is not a very good 
idea...

Thanks,
-igor