Re: [bess] VXLAN BGP EVPN Question

Jeff Tantsura <jefftant.ietf@gmail.com> Thu, 23 April 2020 22:04 UTC

Return-Path: <jefftant.ietf@gmail.com>
X-Original-To: bess@ietfa.amsl.com
Delivered-To: bess@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id B14673A0A99 for <bess@ietfa.amsl.com>; Thu, 23 Apr 2020 15:04:41 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.097
X-Spam-Level:
X-Spam-Status: No, score=-2.097 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, FREEMAIL_FROM=0.001, HTML_MESSAGE=0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id UPd7PQttYb8F for <bess@ietfa.amsl.com>; Thu, 23 Apr 2020 15:04:39 -0700 (PDT)
Received: from mail-pj1-x102c.google.com (mail-pj1-x102c.google.com [IPv6:2607:f8b0:4864:20::102c]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 7870C3A146C for <bess@ietf.org>; Thu, 23 Apr 2020 15:04:39 -0700 (PDT)
Received: by mail-pj1-x102c.google.com with SMTP id hi11so3071083pjb.3 for <bess@ietf.org>; Thu, 23 Apr 2020 15:04:39 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=date:from:to:message-id:in-reply-to:references:subject:mime-version; bh=O6rIX4f+LviS76fBGbQ9v+jWH0R6hNQ+kiTlgLU7+Ds=; b=oSh8anCToYIYl0Zuohec4AMQW5iSQyt+XhK5/UGkfmYuiCTQpQ7TIk4gJWDjGvgBuD sbP+LD5B88ThzRv8GTiJToKvsg0Mx+XYbPCyIgARNdOXuz7ApCNpn5nPDJDHOfsj+tuh Y6jKphO+5pauM+UkqGk5NhFqVAJC+8nhQXzHIfyTw45aeOP9n8JGgkyyydcBZhWOBox6 1sX9zg/jQV2x/i4l6u1f62wq4ED9B2G30+8wao2lfg98cFlYm4QZ3NqIy08epjuE+7dm f7hCJx8puF7YhFyoLinYGd8twTA9jx0KuFtH+9wSvDszMl3XE6bu8YFxCWN3tUAAgCqL E/mg==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:date:from:to:message-id:in-reply-to:references :subject:mime-version; bh=O6rIX4f+LviS76fBGbQ9v+jWH0R6hNQ+kiTlgLU7+Ds=; b=SMf88m4I9yRlQptXsGBy8js15UrzwFU5MFrgK9aL1jhOqFKTMmTXBVJO6h3cDvfSQE /qEmAPajbod1bBcW606QBylZwEZSRkYMpC68WYRfR6M4RAtbS05QfMAn1e5IuBn5m3oX siPxUDFD2drjkkvZheYXRdqRGDZOwn7J7mjbyCqS+fe3SU0z5b+y9iyGNgVoO1KP9Mtw 0oaDks0cEYmK6CFQZvkPOODcgywz9fAQMaJsorv1vD8mMbzgC0h4pjp2sHRG7sX3IorW 3r8Y01CM9WpUQDL9pwfY1kzs86aL4073l8NvoorVhGRaFPBELdlYoJoQB04rphQ6BsVl Ts5A==
X-Gm-Message-State: AGi0PuaZqUfwJHQGPbK5WsIupkhfw9DFEQmSriNtxUlt6vzQ6lGUZHhp z6J6BeyaroB0YWXinAt2FwEBUdmc
X-Google-Smtp-Source: APiQypJV27cmiJNWfAY8GsSoR8s7GvHqt6Z8mOJ6jm6gnHLlfaKKVADoyz6DQsP1pnbDKFsYBvN/BA==
X-Received: by 2002:a17:90a:71c2:: with SMTP id m2mr2939612pjs.21.1587679477293; Thu, 23 Apr 2020 15:04:37 -0700 (PDT)
Received: from [192.168.1.5] (c-73-63-232-212.hsd1.ca.comcast.net. [73.63.232.212]) by smtp.gmail.com with ESMTPSA id h9sm3515106pfo.129.2020.04.23.15.04.35 (version=TLS1_2 cipher=ECDHE-ECDSA-AES128-GCM-SHA256 bits=128/128); Thu, 23 Apr 2020 15:04:36 -0700 (PDT)
Date: Thu, 23 Apr 2020 15:04:26 -0700
From: Jeff Tantsura <jefftant.ietf@gmail.com>
To: BESS <bess@ietf.org>, Gyan Mishra <hayabusagsm@gmail.com>
Message-ID: <8193067d-3b28-40fc-8c96-3f3c528ece6c@Spark>
In-Reply-To: <CABNhwV0Tww-8SxBocQBZ4vuj7DMW44ymN4ux4h1JoaYNENnioQ@mail.gmail.com>
References: <CABNhwV1kPDhRcuqS8GOpTiKoyk_QHKVXa582JUznLvKfXQe54w@mail.gmail.com> <CABNhwV352jKVSu2Jwf9dRgzmjc05gvmLo_CL5EGuR12en-7Z_A@mail.gmail.com> <CABNhwV0Tww-8SxBocQBZ4vuj7DMW44ymN4ux4h1JoaYNENnioQ@mail.gmail.com>
X-Readdle-Message-ID: 8193067d-3b28-40fc-8c96-3f3c528ece6c@Spark
MIME-Version: 1.0
Content-Type: multipart/alternative; boundary="5ea210f2_9daf632_181f7"
Archived-At: <https://mailarchive.ietf.org/arch/msg/bess/sR9c5VkZPagldLXD_TbRCLiUCSc>
Subject: Re: [bess] VXLAN BGP EVPN Question
X-BeenThere: bess@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: BGP-Enabled ServiceS working group discussion list <bess.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/bess>, <mailto:bess-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/bess/>
List-Post: <mailto:bess@ietf.org>
List-Help: <mailto:bess-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/bess>, <mailto:bess-request@ietf.org?subject=subscribe>
X-List-Received-Date: Thu, 23 Apr 2020 22:04:42 -0000

Gyan,

"Multi site” is not really an IETF terminology, this is a solution implement by NX-OS, there’s a draft though. Its main functionality is to localize VxLAN tunnels and provide segmented path vs end2end full mesh of VxLAN tunnels (participating in the same EVI). We are talking HER here.
The feature is heavily HW dependent as it requires BUM re-encapsulation at the boundaries (leaf1->BGW1-BGW2->leaf2..n). So good luck seeing it soon on low end silicon.
It doesn’t eliminate BUM traffic but significantly reduces the span of “broadcast domain” and reduces the need for large flood domains (modern HW gives you ~512 large flood groups, obviously depending on HW)

Wrt your question about Mac conversation learning - this is an implementation issue, nothing in EVPN specifications precludes you of doing so, moreover in the implementation I was designing (in my previous life) we indeed decoupled data plane learning from control plane advertisement so control plane was aware of “Active” flows.  Needless to say - this creates  an additional layer of complexity and all kinds of funky states in the system ;-).

Hope this helps

Cheers,
Jeff
On Apr 23, 2020, 8:30 AM -0700, Gyan Mishra <hayabusagsm@gmail.com>, wrote:
>
>
> Slight clarification with the arp traffic.  What I meant was broadcast traffic translated into BUM traffic with the EVPN architecture is there any way to reduce the amount of BUM traffic with a data center design requirement with vlan anywhere sprawl with 1000s of type 2 Mac mobility routes being learned between all the leaf VTEPs.
>
> The elimination of broadcast is a tremendous gain and with broadcast suppression of multicast that does help but it would be nice to not have such massive Mac tables type 2 route churn chatter with a conversation learning where only active flows are are in the type 2 rib.
>
> Kind regards
>
> Gyan
>
> > On Wed, Apr 22, 2020 at 6:47 PM Gyan Mishra <hayabusagsm@gmail.com> wrote:
> > >
> > > In the description of the vxlan BGP evpn scenario has a typo on the multisite feature segmented LSP inter pod with the RT auto rewrite which is similar to MPLS inter-as option b not a.
> > >
> > > Kind regards
> > >
> > > Gyan
> > >
> > > > On Wed, Apr 22, 2020 at 5:57 PM Gyan Mishra <hayabusagsm@gmail.com> wrote:
> > > > >
> > > > > All
> > > > >
> > > > > Had a question related to vxlan BGP EVPN architecture specifications defined in BGP EVPN NVO3 overlay RFC 8365 and VXLAN data plane RFC 7348.
> > > > >
> > > > > In a Data Center environment where you have a multiple PODs individual fabrics per POD connected via a super spine extension using a Multi site feature doing auto rewrite of RTs to stitch the NVE tunnel between pods similar to inter-as option A.
> > > > >
> > > > > So in this scenario where you have vlan sprawl everywhere with L2 and L3 VNIs everywhere as if it were a a single L2 domain.  The topology is a typical vxlan spine leaf topology where the L3 leafs are the TOR switch so very small physical  L2 fault domain. So I was wondering if with the vxlan architecture if this feature below is possible or if their is a way to do so in the current specification.
> > > > >
> > > > > Cisco use to have a DC product called “fabric path” which was based on conversation learning.
> > > > >
> > > > > Is there any way with existing vxlan BGP evpn specification or maybe future enhancement to have a Mac conversation learning capability so that only the active mac’s that are part of a conversations flow are the mac that are flooded throughout the vxlan fabric.  That would really help tremendously with arp storms so if new arp entries are generated locally on a leaf they are not flooded through the fabric unless their are active flows between leafs.
> > > > >
> > > > > Also is there a way to filter type 2 Mac mobility routes between leaf switches at the control plane level based on remote vtep or maybe other parameters..  That would also reduce arp storms BUM traffic.
> > > > >
> > > > > Kind regards
> > > > >
> > > > > Gyan
> > > > > --
> > > > > Gyan  Mishra
> > > > > Network Engineering & Technology
> > > > > Verizon
> > > > > Silver Spring, MD 20904
> > > > > Phone: 301 502-1347
> > > > > Email: gyan.s.mishra@verizon.com
> > > > >
> > > > >
> > > --
> > > Gyan  Mishra
> > > Network Engineering & Technology
> > > Verizon
> > > Silver Spring, MD 20904
> > > Phone: 301 502-1347
> > > Email: gyan.s.mishra@verizon.com
> > >
> > >
> --
> Gyan  Mishra
> Network Engineering & Technology
> Verizon
> Silver Spring, MD 20904
> Phone: 301 502-1347
> Email: gyan.s.mishra@verizon.com
>
>
> _______________________________________________
> BESS mailing list
> BESS@ietf.org
> https://www.ietf.org/mailman/listinfo/bess