Re: 答复: [rbridge] Call for draft-tissa-trill-cmt-00 to WG draft

Jon Hudson <jon.hudson@gmail.com> Fri, 30 March 2012 21:33 UTC

Return-Path: <rbridge-bounces@postel.org>
X-Original-To: ietfarch-trill-archive-Osh9cae4@ietfa.amsl.com
Delivered-To: ietfarch-trill-archive-Osh9cae4@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id F1A7921F8650 for <ietfarch-trill-archive-Osh9cae4@ietfa.amsl.com>; Fri, 30 Mar 2012 14:33:27 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -3.143
X-Spam-Level:
X-Spam-Status: No, score=-3.143 tagged_above=-999 required=5 tests=[AWL=-0.062, BAYES_00=-2.599, DATE_IN_PAST_06_12=1.069, HTML_MESSAGE=0.001, J_CHICKENPOX_72=0.6, MIME_8BIT_HEADER=0.3, MIME_HTML_MOSTLY=0.001, MIME_QP_LONG_LINE=1.396, RCVD_IN_DNSWL_MED=-4, SARE_SUB_ENC_UTF8=0.152]
Received: from mail.ietf.org ([12.22.58.30]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id PseqKFBrQin6 for <ietfarch-trill-archive-Osh9cae4@ietfa.amsl.com>; Fri, 30 Mar 2012 14:33:26 -0700 (PDT)
Received: from boreas.isi.edu (boreas.isi.edu [128.9.160.161]) by ietfa.amsl.com (Postfix) with ESMTP id E40D621F864F for <trill-archive-Osh9cae4@lists.ietf.org>; Fri, 30 Mar 2012 14:33:25 -0700 (PDT)
Received: from boreas.isi.edu (localhost [127.0.0.1]) by boreas.isi.edu (8.13.8/8.13.8) with ESMTP id q2UL5UdY019654; Fri, 30 Mar 2012 14:05:32 -0700 (PDT)
Received: from mail-yx0-f180.google.com (mail-yx0-f180.google.com [209.85.213.180]) by boreas.isi.edu (8.13.8/8.13.8) with ESMTP id q2UL4uuS019588 (version=TLSv1/SSLv3 cipher=RC4-SHA bits=128 verify=NOT) for <rbridge@postel.org>; Fri, 30 Mar 2012 14:05:05 -0700 (PDT)
Received: by yenl4 with SMTP id l4so917003yen.39 for <rbridge@postel.org>; Fri, 30 Mar 2012 14:04:56 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=subject:references:from:content-transfer-encoding:content-type :in-reply-to:message-id:date:cc:to:mime-version:x-mailer; bh=tvXitxZ06budh8fWCUdAeOW+xvyaKvxFapHIQ2NZF8w=; b=iY188m7TD5tD8xk7uyq0yEYn0Ka7y/XmymghLBSKW8rPoxfn8GML/Cf5IWGJCqeZ4k u3qxM3MOnqqJnI0tTvPtLz+2Rtqh4/JJ6ESLfMgXi1fZ7FPEofGcf2aQXy/J6SFlj/Ap 9fGOzzmJy+9KDH27f1d8vW3dc3hS+jCZtDw0qkAqIMNhJPrTd91N4Ujy+l+szyNeXDQt 5eXjDrAa9MRciwGP9GNGeGkvcV7CM7feOWrQBoTN6ceHSOq8phCvOtPnNzxxQglwEdDl /nv8IYpQjEY//9C2q5JdAh/ztDlfRn6ObO6sbgDzYORG0lwq8ETokJp66357P9jXyjxY Qy6A==
Received: by 10.68.195.169 with SMTP id if9mr536783pbc.68.1333141495457; Fri, 30 Mar 2012 14:04:55 -0700 (PDT)
Received: from [10.17.99.24] ([166.205.137.19]) by mx.google.com with ESMTPS id y8sm8114408pbg.43.2012.03.30.14.04.38 (version=TLSv1/SSLv3 cipher=OTHER); Fri, 30 Mar 2012 14:04:52 -0700 (PDT)
Subject: Re: 答复: [rbridge] Call for draft-tissa-trill-cmt-00 to WG draft
References: <344037D7CFEFE84E97E9CC1F56C5F4A5DA0303@xmb-sjc-214.amer.cisco.com> <OFB0DE3615.39F65513-ON482579CE.003C2CB1-482579CE.003E7761@zte.com.cn> <4552F0907735844E9204A62BBDD325E728CAD030@SZXEML507-MBS.china.huawei.com> <344037D7CFEFE84E97E9CC1F56C5F4A5DA05D7@xmb-sjc-214.amer.cisco.com> <CAFOuuo79GLE1QQ3MEn=Wx0z-eY8Vsnscy-bVw_8dmKSw8Efe_Q@mail.gmail.com> <4552F0907735844E9204A62BBDD325E728CAE193@SZXEML507-MBS.china.huawei.com> <CD31C838133B34469D6E01406350E68C0EA39F2F@xmb-sjc-215.amer.cisco.com> <CE35792847FBE84687FF9117132C8EFD4711441353@EXCH-CLUSTER-11.force10networks.com> <4552F0907735844E9204A62BBDD325E728CAE2F1@SZXEML507-MBS.china.huawei.com> <CE35792847FBE84687FF9117132C8EFD4711441389@EXCH-CLUSTER-11.force10networks.com> <4552F0907735844E9204A62BBDD325E728CAE77F@SZXEML507-MBS.china.huawei.com> <4552F0907735844E9204A62BBDD325E728CB086C@SZXEML507-MBS.china.huawei.com> <CE35792847FBE84687FF9117132C8EFD47114414C4@EXCH-CLUSTER-11.force10networks.com> <4552F0907735844E9204A62BBDD325E728CB099E@SZXEML507-MBS.china.huawei.com>
From: Jon Hudson <jon.hudson@gmail.com>
In-Reply-To: <4552F0907735844E9204A62BBDD325E728CB099E@SZXEML507-MBS.china.huawei.com>
Message-Id: <0AA2AFF7-87AE-4300-9DC2-784A22C628DC@gmail.com>
Date: Fri, 30 Mar 2012 13:56:03 +0200
To: Mingui Zhang <zhangmingui@huawei.com>
Mime-Version: 1.0 (1.0)
X-Mailer: iPhone Mail (9B176)
X-ISI-4-43-8-MailScanner: Found to be clean
X-MailScanner-From: jon.hudson@gmail.com
Cc: "rbridge@postel.org" <rbridge@postel.org>, Janardhanan Pathangi Narasimhan <jana@force10networks.com>
X-BeenThere: rbridge@postel.org
X-Mailman-Version: 2.1.6
Precedence: list
List-Id: "Developing a hybrid router/bridge." <rbridge.postel.org>
List-Unsubscribe: <http://mailman.postel.org/mailman/listinfo/rbridge>, <mailto:rbridge-request@postel.org?subject=unsubscribe>
List-Archive: <http://mailman.postel.org/pipermail/rbridge>
List-Post: <mailto:rbridge@postel.org>
List-Help: <mailto:rbridge-request@postel.org?subject=help>
List-Subscribe: <http://mailman.postel.org/mailman/listinfo/rbridge>, <mailto:rbridge-request@postel.org?subject=subscribe>
Content-Type: multipart/mixed; boundary="===============0478304954=="
Sender: rbridge-bounces@postel.org
Errors-To: rbridge-bounces@postel.org

If I may make one minor point, the last statement is not correct. 

> ZMG> Think about it. When an operator have a LAG connected to a group of RBridges, he has to decide which port to plug the cables in. He need enter the console window to input the command line to configure parameters and start the LAG. He also need to configure the tuple used for hashing.

I know of two solutions (one shipping) where default hash parameters are set and where any two cables plugged into the same two RBridges will auto create a LAG.

Not directly pertinent to the conversation at hand, but manual input is not a given. Zero-config is the goal for many of us.

Just a side note.

Jon

On Mar 29, 2012, at 8:56 PM, Mingui Zhang <zhangmingui@huawei.com> wrote:

> Hi Jana,
> 
> Let us assume that I have 20 RBridges in the campus which have end systems connected to it, and all of them need to be reachable. So if a few of them are converted to MTRA RBridges to support active-active, does it mean that RBridges which are not yet converted will not be able to anymore see the nodes connected to these virtual RBridges? Or am I missing something here?
> 
> ZMG> Although physical reachability can be all to all, it does not mean than the end systems will really communicate in an all to all manner. It is restricted by the VLAN configuration of the campus. Let me give you an example, RB1, RB2, RB3 and RB4 have an all to all connection. The access ports of RB1 and RB3 are configured as the VLAN for end systems of the Dept of Computer Science while the access ports of RB2 and RB4 are configured as another VLAN for end systems of the Dept of Literature. End systems from the same department can talk with each other. But end systems from two different departments will not talk with each other.
> 
> I am not sure if that is practical, and always possible may depend on the nodes connected and also what controls are available on them
> 
> ZMG> Think about it. When an operator have a LAG connected to a group of RBridges, he has to decide which port to plug the cables in. He need enter the console window to input the command line to configure parameters and start the LAG. He also need to configure the tuple used for hashing.
> 
> Thanks,
> Mingui
> 
> 
> 
> 发件人: Janardhanan Pathangi Narasimhan [jana@force10networks.com]
> 发送时间: 2012年3月30日 0:49
> 到: Mingui Zhang
> Cc: rbridge@postel.org
> 主题: RE: [rbridge] Call for draft-tissa-trill-cmt-00 to WG draft
> 
> Hi Mingui,
>  
> I am slightly confused.
>  
> For the intra-topology scenario, MT unaware RBridges will not appear in a specific topology other than the based topology. In other words, MTRA RBridges only talks only with MTRA RBridges. Since they all support MT, MTRA can work very well among them. In this scenario, MTRA RBridges and old RBridges co-exist in the same campus. Therefore, this is a solid example that MTRA can be deployed incrementally. Remember, CMT does not allow such kind of deployment.
>  
> Let us assume that I have 20 RBridges in the campus which have end systems connected to it, and all of them need to be reachable. So if a few of them are converted to MTRA RBridges to support active-active, does it mean that RBridges which are not yet converted will not be able to anymore see the nodes connected to these virtual RBridges? Or am I missing something here?
>  
> For the inter-topology scenario, we assume MTRA RBridges have to talk with MT unaware RBridges (This is a rigorous assumption.) If operators do want to realize  such kind of connection, it is reasonable to assume they will accept the requirement that they need to configure their hashing function to remove the possible MAC flip-flop. 
>  
> I am not sure if that is practical, and always possible may depend on the nodes connected and also what controls are available on them
>  
> Thanks
> Jana
>  
> Thanks,
> Mingui
> 发件人: rbridge-bounces@postel.org [rbridge-bounces@postel.org] 代表 Mingui Zhang [zhangmingui@huawei.com]
> 发送时间: 2012年3月29日 18:50
> 到: Janardhanan Pathangi Narasimhan
> Cc: rbridge@postel.org
> 主题: 答复: [rbridge] Call for draft-tissa-trill-cmt-00 to WG draft
> 
> Hi Jana, 
>  
> >In section 4.1, could you please explain how the frame that is ingressed by RB1 through RBv001 is received at RBx? 
>  
> Two scenarios are given in Section 4. Section 4.1 talks about the intra-topology scenario. Here, RBx is not in topology 1 so it is unreachable by RB1. It need not set up any forwarding path to RBx.
>  
> Thanks,
> Mingui
> 发件人: Janardhanan Pathangi Narasimhan [jana@force10networks.com]
> 发送时间: 2012年3月28日 22:52
> 到: Mingui Zhang
> Cc: rbridge@postel.org
> 主题: RE: [rbridge] Call for draft-tissa-trill-cmt-00 to WG draft
> 
> Hi Mingui,
>  
> Section 4, does not give backward compatibility. For e.g. in section 4.2 multi destination frames will be ingressed with two different bridge ID causing MAC move, and having to impose restrictions on hashing functions at the external node may or may not be acceptable . I am not sure that this can be classified as backward compatible since it is not a full solution and can also cause frequent MAC flip flops etc.
>  
> In section 4.1, could you please explain how the frame that is ingressed by RB1 through RBv001 is received at RBx?
>  
> Thanks
> Jana
>  
>  
> From: Mingui Zhang [mailto:zhangmingui@huawei.com] 
> Sent: Wednesday, March 28, 2012 7:15 PM
> To: Janardhanan Pathangi Narasimhan
> Cc: rbridge@postel.org
> Subject: 答复: [rbridge] Call for draft-tissa-trill-cmt-00 to WG draft
>  
> Hi Jana,
>  
> I think I should answer your question. I am Mingui Zhang.
> Please refer to the draft http://tools.ietf.org/html/draft-zhang-trill-multi-topo-rpfc-00. Section 4 is right for the topic you are interested. As a starter draft on the RPFC using Multi-topology. I will be happy to receive your comments.
>  
> Thanks,
> Mingui
> 发件人: Janardhanan Pathangi Narasimhan [jana@force10networks.com]
> 发送时间: 2012年3月28日 19:14
> 到: Rohit Watve (rwatve); Mingui Zhang; Radia Perlman; Tissa Senevirathne (tsenevir)
> Cc: rbridge@postel.org
> 主题: RE: [rbridge] Call for draft-tissa-trill-cmt-00 to WG draft
> 
> Hi Zhai,
>  
> In the Multi-topology case I am not sure what is meant by incrementally deployable. Assuming that all the RBridges in the campus do not understand MT except for the RBridges which participate in the Virtual RBrodge, it looks like this will cause MAC flip flop on the other RBridges and other issues since all the other RBridges do not understand MT.
>  
> Could you please explain how MT can work in such scenarios?
> 
> Thanks
> Jana
>  
>  
> From: rbridge-bounces@postel.org [mailto:rbridge-bounces@postel.org] On Behalf Of Rohit Watve (rwatve)
> Sent: Wednesday, March 28, 2012 3:53 PM
> To: Mingui Zhang; Radia Perlman; Tissa Senevirathne (tsenevir)
> Cc: rbridge@postel.org
> Subject: Re: [rbridge] Call for draft-tissa-trill-cmt-00 to WG draft
>  
> Hi Mingui,
> 
> 5. Fast convergence on failure recovery. We know that resilience is one of the two main purposes of "aggregation". This dimension discusses whether the solution offers a way to do fast failure recovery when there is a failure of an aggregated RBridge or link.
> 
> PN: NO. The new distribution tree need to be recomputed.
> 
> CMT: NO. A overall re-configuration is necessary.
> 
> MTRA: YES.
> 
> CMT does not need reconfiguration. If one Rbridge advertizing affinity tlv goes down, the tree can be rooted at the other Egress Rbridge ID advertizing affinity tlv.
> 
> Also, consider adding following to the comparison table
> 
> Number of New TLVs (and resulting complexity)
> 
> PN: 3
> 
> CMT: 1
> 
> MTRA: multiple
> 
> Thanks
> Rohit
> 
> -----Original Message-----
> From: rbridge-bounces@postel.org [mailto:rbridge-bounces@postel.org] On Behalf Of Mingui Zhang
> Sent: Wednesday, March 28, 2012 2:10 AM
> To: Radia Perlman; Tissa Senevirathne (tsenevir)
> Cc: rbridge@postel.org
> Subject: 答复: [rbridge] Call for draft-tissa-trill-cmt-00 to WG draft
> 
> Hi Radia,
> 
> It is a good idea to list the Pros and Cons of each solution. There are three candidate solutions for the same RPFC problem: PN, CMT and MTRA (Multi-Topology TRILL for RBridge Aggregation). Let me compare them in the following dimensions.
> 
> 1. Whether the solution is incrementally deploy-able.
> 
> PN: YES.
> 
> CMT: NO.
> 
> MTRA: YES.
> 
> 2. Use only existing silicons.
> 
> PN: NO. New silicon is necessary to support "tunneling" or else MAC addresses may flip-flop.
> 
> CMT: YES.
> 
> MTRA: Currently NO. But after multi-topology is supported by TRILL, it is a "YES".
> 
> 3. Multi-cast traffic up from RBv can be load-balanced.
> 
> PN: NO.
> 
> CMT: YES.
> 
> MTRA: YES.
> 
> 4. Multi-cast traffic down to RBv can be load-balanced.
> 
> PN: NO.
> 
> CMT: YES.
> 
> MTRA: YES.
> 
> 5. Fast convergence on failure recovery. We know that resilience is one of the two main purposes of "aggregation". This dimension discusses whether the solution offers a way to do fast failure recovery when there is a failure of an aggregated RBridge or link.
> 
> PN: NO. The new distribution tree need to be recomputed.
> 
> CMT: NO. A overall re-configuration is necessary.
> 
> MTRA: YES.
> 
> The following table lists all above comparisons (A jpg version is also attached just in case.). MTRA is always YES. I think it deserves our follow-up.
> 
> +--------+------------+-----------+----------+------------+-------------+
> 
> |Solution|Incre-Deploy|Old Silicon|LB Upwards|LB Downwards|Fast Converge|
> 
> +--------+------------+-----------+----------+------------+-------------|
> 
> |PN      |YES         |NO         |NO        |NO          |NO           |
> 
> +--------+------------+-----------+----------+------------+-------------|
> 
> |CMT     |NO          |YES        |YES       |YES         |NO           |
> 
> +--------+------------+-----------+----------+------------+-------------|
> 
> |MTRA    |YES         |YES(will)  |YES       |YES         |YES          |
> 
> +--------+------------+-----------+----------+------------+-------------+
> 
> Thanks,
> 
> Mingui
> 
>  
> 
> 发件人: rbridge-bounces@postel.org [rbridge-bounces@postel.org] 代表 Radia Perlman [radiaperlman@gmail.com]
> 
> 发送时间: 2012年3月28日 12:52
> 
> 到: Tissa Senevirathne (tsenevir)
> 
> Cc: rbridge@postel.org
> 
> 主题: Re: [rbridge] Call for draft-tissa-trill-cmt-00 to WG draft
> 
> 
> 
> 
> 
> 
> 
> It is my understanding that
> 
> a) CMT does require changing all the RBs in the campus
> 
> b) CMT requires there be at least as many trees as there are active/active RBs on any link
> 
>  
> 
> Of course, no solution is ideal.  It might have been nice to write up all the proposed solutions to this and pros/cons. Maybe it's
> 
> still worth doing.
> 
>  
> 
> So from memory...other proposals for allowing R1, R2, and R3 to be active/active/active on a link:
> 
>  
> 
> First note: regardless of whether their port to the link is in a tree, unicast works, and they can use the pseudonode
> 
> nickname when encapsulating unicast.
> 
>  
> 
> Also, if Ri's port to the link is in at least one tree, Ri can use the pseudonode nickname for that link when encapsulating
> 
> multidestination frames for ingress, and the RPF check will work without any problems.
> 
>  
> 
> However, if Ri's port to the link is not in any of the trees,
> 
> here were some of the proposed solutions.  So, let's say that R1 and R2's port to the shared link is in at least one tree, and R3's
> 
> port to the link is not in any of the trees, and R3 needs to encapsulate a multidestination frame:
> 
>  
> 
> 1) R3 could tunnel it to one of {R1, R2}, and let them inject the packet (with pseudonode nickname) into the campus.
> 
>  
> 
> Pro: Doesn't affect any RBs other than the ones on the link; backwards compatible.  Con: more hops, might be difficult
> 
> in some implementations for R3 to forward by tunneling, might be difficult for R2 to accept a tunneled packet.
> 
>  
> 
> 2) R3 could use its own nickname instead of the pseudonode nickname in this case
> 
>  
> 
> Pro: Doesn't affect any RBs other than the ones on the link; backwards compatible.  Con: MAC learning in distant RBs will have
> 
> frequent learning changes of the source MAC S between being on the pseudonode nickname (whenever S sends unicast,
> 
> or multicast through R1 or R2), or being on R3 (when R3 has to encapsulate a multidestination frame from S).
> 
>  
> 
> ------
> 
> There might have been some other proposals, but they wound up not to work.
> 
>  
> 
> Personally, I don't love the CMT thing because of the two disadvantages I mentioned above, but I can live with it.
> 
>  
> 
> Radia
> 
>  
> 
> _______________________________________________
> rbridge mailing list
> rbridge@postel.org
> http://mailman.postel.org/mailman/listinfo/rbridge
_______________________________________________
rbridge mailing list
rbridge@postel.org
http://mailman.postel.org/mailman/listinfo/rbridge