Re: [trill] Thoughts on active-active edge

Radia Perlman <radiaperlman@gmail.com> Wed, 12 December 2012 05:59 UTC

Return-Path: <radiaperlman@gmail.com>
X-Original-To: trill@ietfa.amsl.com
Delivered-To: trill@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 9D88A21F84CF for <trill@ietfa.amsl.com>; Tue, 11 Dec 2012 21:59:48 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -3.598
X-Spam-Level:
X-Spam-Status: No, score=-3.598 tagged_above=-999 required=5 tests=[BAYES_00=-2.599, HTML_MESSAGE=0.001, RCVD_IN_DNSWL_LOW=-1]
Received: from mail.ietf.org ([64.170.98.30]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id MXCbOwosQf1W for <trill@ietfa.amsl.com>; Tue, 11 Dec 2012 21:59:46 -0800 (PST)
Received: from mail-vb0-f45.google.com (mail-vb0-f45.google.com [209.85.212.45]) by ietfa.amsl.com (Postfix) with ESMTP id 5AD2621F84CA for <trill@ietf.org>; Tue, 11 Dec 2012 21:59:46 -0800 (PST)
Received: by mail-vb0-f45.google.com with SMTP id p1so330166vbi.18 for <trill@ietf.org>; Tue, 11 Dec 2012 21:59:42 -0800 (PST)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :cc:content-type; bh=78aUUtGXcVgeeQiEB8dnTs9vfqMHe2UcS1MT436OoS8=; b=RsdAPDsF1GpbW4WsQCR8XyKVqsZrviuOo10apYO+E2zDIdz0PAcwLcabbAkEF5n43Q F0T95mtJewdZGyS7WDkOX6pghQwkDVolaO+wmMwbCRqSGXlsJXfMZFpdnWNiy9zB5AbG 5rahSvgmYQFmjFUlIRpsWgfbIN5j0nIXBBLDMDyP+NW281iaZpEdLcmgPxKKWYmwchdc NPhT7jxpHcs21GGwrGfJdJEWfOGv46Bb22z/deIul18Rw/dCGpYvco+m2yFqI5d5hIrb FTaIcOuvs9e9DowTjpbpUef3V3dN0RADYiUAhLBLpBdMmIPOH71Zp8WJpKRt6PhlQFm+ XguA==
MIME-Version: 1.0
Received: by 10.58.222.40 with SMTP id qj8mr336506vec.36.1355291982532; Tue, 11 Dec 2012 21:59:42 -0800 (PST)
Received: by 10.58.207.138 with HTTP; Tue, 11 Dec 2012 21:59:40 -0800 (PST)
In-Reply-To: <FBEA3E19AA24F847BA3AE74E2FE1935628892DF6@xmb-rcd-x08.cisco.com>
References: <CAFOuuo4zvX5AtD-oGRRftuZaKmhY7C7-SvDjznMOdzUj+Q3fGQ@mail.gmail.com> <FBEA3E19AA24F847BA3AE74E2FE1935628892DF6@xmb-rcd-x08.cisco.com>
Date: Tue, 11 Dec 2012 21:59:40 -0800
Message-ID: <CAFOuuo5LP1EzajpeBri2KhTT-wf+vv=JwmTLma9_mxg7dM5PvQ@mail.gmail.com>
From: Radia Perlman <radiaperlman@gmail.com>
To: "Tissa Senevirathne (tsenevir)" <tsenevir@cisco.com>
Content-Type: multipart/alternative; boundary="047d7bdc8fd02f8e8e04d0a1818a"
Cc: "trill@ietf.org" <trill@ietf.org>
Subject: Re: [trill] Thoughts on active-active edge
X-BeenThere: trill@ietf.org
X-Mailman-Version: 2.1.12
Precedence: list
List-Id: "Developing a hybrid router/bridge." <trill.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/trill>, <mailto:trill-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/trill>
List-Post: <mailto:trill@ietf.org>
List-Help: <mailto:trill-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/trill>, <mailto:trill-request@ietf.org?subject=subscribe>
X-List-Received-Date: Wed, 12 Dec 2012 05:59:48 -0000

Tissa, there are always technical tradeoffs between approaches.

Tissa said:
>>[Answer] This depends on the traffic Radia. You cannot assume all
multicast hash to Node R1 and all unicast to >>R2 and R3 and hence
multicast unicast address ping pong is rare. In the contrary, you could
have heavy >>multicast stream and a unicast stream both hashing in to the
same RBridge RB1.

I'm not sure what relevance there is to whether unicast hashes to R1, R2,
R3, or alternates between them, since the rule is that they all use the
pseudonode nickname when ingressing from E. The only way there will be
frequent endnode moves for E is if E frequently alternates sending
multicast and unicast.  All unicast from E, whether it is sent to R1, R2,
or R3, will be perceived by R8 as coming from pseudonode P.  Multicast, on
the other hand, with the proposed rule, could bounce between R1, R2, and
R3.  Though as an optimization, if the pseudonode P is attached to R2, then
R2 could use P when ingressing either unicast or multicast, so multicast
that happens to hash to R2 will not be perceived as an endnode move,
whereas multicast hashing to R1 or R3 would.

Tissa said about why endnode moves are a problem:

>>[Answer] Yes it is big time.  Unlike ancient devices modern devices to
hardware based learning. When address >>association start moving, it will
constantly bombard the CPU with new address or address move notifications.

Your description sounds to me like it's only a problem if there are a lot
of moves.  Since it seems unlikely that E will keep alternating between
sending multicast and unicast, or even if there were one node that was
doing that for some reason, that there would be lots of nodes acting this
way, it seems like the amount of moves seen, relative to the total amount
of traffic received by R8, should be really small.

So, the disadvantages of this proposal, as Sunny pointed out, are
1) the active-active RBs might need a hardware change in order to
selectively use a different ingress nickname depending on whether the
packet is multicast or unicast  (which implementations is this true for?)
2) distant RBs (say R8), will perceive E moving when E switches from
sending unicast to multicast.

But as I said, all approaches have pluses and minuses, so the minuses have
to be weighed against the pluses.

The advantages of this proposed approach relative to the Affinity TLV are:

a) you only need one tree for the campus, and you can have arbitrarily many
active-active nodes.  Or to put it a different way, you can have more
active-active RBs than there are trees.

b) furthermore, it allows the active-active RBs to do load splitting of
multicast (choosing more than one tree), without requiring n*k trees, where
n is the number of active-active RBs, and k is the number of trees each
might want to use for load splitting.

c) you don't need all RBs to change at once in the campus (as you would
with the Affinity proposal).  If R8 is unmodified, R8 will see E changing
when E changes from multicast to unicast, but will not have incorrect
behavior.  With affinity, absolutely every RB in the campus must be
modified or there will be incorrect (not just suboptimal) behavior.


Radia