Re: [pim] Call for adoption of sr-mpls-multicast-framework

David Allan I <david.i.allan@ericsson.com> Thu, 09 August 2018 19:38 UTC

Return-Path: <david.i.allan@ericsson.com>
X-Original-To: pim@ietfa.amsl.com
Delivered-To: pim@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 68D96130ECB for <pim@ietfa.amsl.com>; Thu, 9 Aug 2018 12:38:34 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -4.31
X-Spam-Level:
X-Spam-Status: No, score=-4.31 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, RCVD_IN_DNSWL_MED=-2.3, SPF_PASS=-0.001, T_DKIMWL_WL_HIGH=-0.01, URIBL_BLOCKED=0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (1024-bit key) header.d=ericsson.com header.b=dxAK8dV/; dkim=pass (1024-bit key) header.d=ericsson.com header.b=hpd97V2z
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id rmDb7jkbJbb3 for <pim@ietfa.amsl.com>; Thu, 9 Aug 2018 12:38:30 -0700 (PDT)
Received: from sessmg22.ericsson.net (sessmg22.ericsson.net [193.180.251.58]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 52306130EBE for <pim@ietf.org>; Thu, 9 Aug 2018 12:38:30 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; d=ericsson.com; s=mailgw201801; c=relaxed/simple; q=dns/txt; i=@ericsson.com; t=1533843508; h=From:Sender:Reply-To:Subject:Date:Message-ID:To:Cc:MIME-Version:Content-Type: Content-Transfer-Encoding:Content-ID:Content-Description:Resent-Date:Resent-From: Resent-Sender:Resent-To:Resent-Cc:Resent-Message-ID:In-Reply-To:References:List-Id: List-Help:List-Unsubscribe:List-Subscribe:List-Post:List-Owner:List-Archive; bh=mCNKdYW2ePSmmhZ6ftDbPgYnys2bybsP7oSPmO/Fe1c=; b=dxAK8dV/nq3WazS7O2KnR5nCue6G5vSO3Z/dpDAcXZJb63cvpE1+ZCsKhqkE/Ro3 6bhjM79IqAyUaZI0W2sSw3iPxvMxy+Q0wQr5SV6YbBfAnbmO8ZzLxdykltMioo7o mWNmqewYXRysUwCblqH4GSvKRPcNa6jNRH8iwNnWeKU=;
X-AuditID: c1b4fb3a-481ff7000000145f-b5-5b6c9834358e
Received: from ESESSMB502.ericsson.se (Unknown_Domain [153.88.183.120]) by sessmg22.ericsson.net (Symantec Mail Security) with SMTP id 38.1B.05215.4389C6B5; Thu, 9 Aug 2018 21:38:28 +0200 (CEST)
Received: from ESESSMB505.ericsson.se (153.88.183.166) by ESESSMB502.ericsson.se (153.88.183.163) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_128_CBC_SHA256_P256) id 15.1.1466.3; Thu, 9 Aug 2018 21:38:27 +0200
Received: from NAM05-CO1-obe.outbound.protection.outlook.com (153.88.183.157) by ESESSMB505.ericsson.se (153.88.183.166) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_128_CBC_SHA256_P256) id 15.1.1466.3 via Frontend Transport; Thu, 9 Aug 2018 21:38:27 +0200
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=ericsson.com; s=selector1; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=mCNKdYW2ePSmmhZ6ftDbPgYnys2bybsP7oSPmO/Fe1c=; b=hpd97V2zc2fAMrGLIjFM6//tNdhhRjmM/dj08EN3DaZ0kfGcaT/V9VC0lQMZo9T76R1X7Hn+9DtWKqRw3soBJezNhzvnlC7Air50xQcyqvFEykzQIGfYmwi5hBTFSAnDGEJM341o/TF8C4IfhoW6I/NhFXfsZde6KAOMkrClTbE=
Received: from CY1PR15MB0874.namprd15.prod.outlook.com (10.169.22.140) by CY1PR15MB0442.namprd15.prod.outlook.com (10.163.235.12) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.1038.23; Thu, 9 Aug 2018 19:38:24 +0000
Received: from CY1PR15MB0874.namprd15.prod.outlook.com ([fe80::c83c:4d8d:bc1b:88e4]) by CY1PR15MB0874.namprd15.prod.outlook.com ([fe80::c83c:4d8d:bc1b:88e4%2]) with mapi id 15.20.1038.023; Thu, 9 Aug 2018 19:38:24 +0000
From: David Allan I <david.i.allan@ericsson.com>
To: Eric C Rosen <erosen@juniper.net>, Michael McBride <Michael.McBride@huawei.com>, "pim@ietf.org" <pim@ietf.org>
Thread-Topic: [pim] Call for adoption of sr-mpls-multicast-framework
Thread-Index: AQHUL+Zqqpra4qWZ60G7gn0/0hczzaS3yosA
Date: Thu, 09 Aug 2018 19:38:24 +0000
Message-ID: <CY1PR15MB0874332C5952796D2C0A7776D0250@CY1PR15MB0874.namprd15.prod.outlook.com>
References: <8CCB28152EA2E14A96BBEDC15823481A1CC89175@sjceml521-mbx.china.huawei.com> <aecd258e-2d0d-db07-147e-826e6e4c1943@juniper.net>
In-Reply-To: <aecd258e-2d0d-db07-147e-826e6e4c1943@juniper.net>
Accept-Language: en-US
Content-Language: en-US
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
authentication-results: spf=none (sender IP is ) smtp.mailfrom=david.i.allan@ericsson.com;
x-originating-ip: [198.24.6.220]
x-ms-publictraffictype: Email
x-microsoft-exchange-diagnostics: 1; CY1PR15MB0442; 6:vA+SRfusDPPsHsdw7xwmZI6sFYz2VhyHmLNFkMtN9LC8aBSCQwbsdjEzd7PpuvvjjB0av22WNMMjdniQjQk2VZer3XGg7baQBwmT0sLhyculxc/f9JstekCvVM47Yri7xBGRHDDySPL/xiN161znAWbuNjFM4d62bHmMIc0on/VCfAvUEEE9t7zBGvNgnh1xL1Tq2f6Cfgt9nJgP9ASyE6WvPgeRws+3mXBs2jumXN1OsyyM4MFYpvCs5ovmv0mW64QIC876aIhSZdzdYcUKszYxnH+oeYCjxVHvrCMKaZ99SDUTDS7reFcGF0v41frzH2vLO/8NrVIxdH6no2ftJCNdME6H0HXV7QBQGmvZ8JaaLEebEfxPpysaKOm4CIFTtzJIvlm+IezdnsA+D+JKq0pskRT73iMrY8N8CXg+jRQZONEapWWgLqHMWI0R58yM3iqVzo8neHp47lRonz1jgg==; 5:EifU0Gdw1awKn0QLNbnBghPxXnb2+28iVRKHiyKxxs9C5EeUaLp9FQgKshUgVO52koJPnavRntG7S4TnEvb4ApduDCM1WfiT6RjMWaUa49VNXjbGu4MiHuYOGNp1qfraX0Dtet4szRipGroUyqMbmaIE8ZgtdK/v53c9XD+4OTo=; 7:K0CHPedUPcpRdWNvpSFcWnoz77mNJfEWmCv33DOu2YvBYrX55woqldT8Fk02KoU8zSiah56lbCDIP52IF4mnqW7cTRvkhmk6kmfWZXT4YdFOoXKx5kW79U5UUPwFoDlE3XrcnMW8cSlhyMcEG9jpYCgCyBZQVzHD1dAGJiqtxy1mpTdq/63tXWgTbPkzG41bK8NQE+zV70FhcuyStSyy9ZXamPIg9dqzD2TfdVgcWVZrBYIjDsTkVBDhtjLbb7fI
x-ms-exchange-antispam-srfa-diagnostics: SOS;
x-ms-office365-filtering-correlation-id: 9a87dbd6-ae3e-4e27-1161-08d5fe2fac00
x-microsoft-antispam: BCL:0; PCL:0; RULEID:(7020095)(4652040)(8989117)(4534165)(4627221)(201703031133081)(201702281549075)(8990107)(5600074)(711020)(2017052603328)(7153060)(7193020); SRVR:CY1PR15MB0442;
x-ms-traffictypediagnostic: CY1PR15MB0442:
x-microsoft-antispam-prvs: <CY1PR15MB044227CB0BC764B61AA2E7F7D0250@CY1PR15MB0442.namprd15.prod.outlook.com>
x-exchange-antispam-report-test: UriScan:(155532106045638);
x-ms-exchange-senderadcheck: 1
x-exchange-antispam-report-cfa-test: BCL:0; PCL:0; RULEID:(8211001083)(6040522)(2401047)(5005006)(8121501046)(10201501046)(3002001)(3231311)(944501410)(52105095)(93006095)(93001095)(149027)(150027)(6041310)(20161123558120)(20161123564045)(20161123560045)(20161123562045)(201703131423095)(201702281528075)(20161123555045)(201703061421075)(201703061406153)(6072148)(201708071742011)(7699016); SRVR:CY1PR15MB0442; BCL:0; PCL:0; RULEID:; SRVR:CY1PR15MB0442;
x-forefront-prvs: 0759F7A50A
x-forefront-antispam-report: SFV:NSPM; SFS:(10009020)(396003)(366004)(39860400002)(136003)(346002)(376002)(51914003)(189003)(199004)(26005)(102836004)(2900100001)(6436002)(476003)(81156014)(486006)(86362001)(2906002)(186003)(8676002)(105586002)(106356001)(7736002)(8936002)(97736004)(305945005)(5024004)(966005)(1941001)(6246003)(256004)(66066001)(53936002)(3846002)(6306002)(478600001)(5250100002)(2501003)(110136005)(9686003)(6116002)(55016002)(14454004)(561944003)(74316002)(6506007)(68736007)(99286004)(25786009)(7696005)(81166006)(316002)(11346002)(446003)(76176011)(229853002)(5660300001)(33656002); DIR:OUT; SFP:1101; SCL:1; SRVR:CY1PR15MB0442; H:CY1PR15MB0874.namprd15.prod.outlook.com; FPR:; SPF:None; LANG:en; PTR:InfoNoRecords; MX:1; A:1;
received-spf: None (protection.outlook.com: ericsson.com does not designate permitted sender hosts)
x-microsoft-antispam-message-info: e25eLCV5v3gZ3klh22yJnve8XeBv7q96HebfWEkWIcV6PGNt00vPIgnbKwMfxAOc2kriP35HmLAjJZZ+7E7u96yO1lsxiLJoCctHTwaSGdThxdrs2aaIs8P21P2ksgYRBIVD0DyO+nGNJZjgRL/2R9IwquHy0rv6hu/xgxz2fXZ1oPDMnd5AZHDhNDVX7cF7JZOZ0JWV1XvFIQw2g4/woWii2/YmOfpQGHQBhHQHx8z9IhGeoipZ/G+yIVbTVX+gwYkr6V6te05kjQ/F/ab1F7wDYAyFFXB6Xfuq+2dEPoBn5T8E3PVUaeCnluqa5dOzsGNJYeDTJS4fMpZ0l3ToJMDvjBYznnv6sfkkYZ6Fz3E=
spamdiagnosticoutput: 1:99
spamdiagnosticmetadata: NSPM
Content-Type: text/plain; charset="utf-8"
Content-Transfer-Encoding: base64
MIME-Version: 1.0
X-MS-Exchange-CrossTenant-Network-Message-Id: 9a87dbd6-ae3e-4e27-1161-08d5fe2fac00
X-MS-Exchange-CrossTenant-originalarrivaltime: 09 Aug 2018 19:38:24.4355 (UTC)
X-MS-Exchange-CrossTenant-fromentityheader: Hosted
X-MS-Exchange-CrossTenant-id: 92e84ceb-fbfd-47ab-be52-080c6b87953f
X-MS-Exchange-Transport-CrossTenantHeadersStamped: CY1PR15MB0442
X-OriginatorOrg: ericsson.com
X-Brightmail-Tracker: H4sIAAAAAAAAA+NgFprIKsWRmVeSWpSXmKPExsUyM2J7ha7JjJxog6cXuC3WbfjAbPH3wWwm iy8PbzI7MHu0HHnL6rFkyU8mj+tNV9kDmKO4bFJSczLLUov07RK4Mt6s3cdYcC6xYvWCY4wN jD3xXYycHBICJhLPL3xh7WLk4hASOMoosXveDkYI5yujxJ+uP1DOYiaJc8dmsYE4LAITmCVW 3p0H1TOJSWLhlu3sEM5DRokfN3rZQSazCRhI7Pn/hRHEFhEokejb/AwsLizgLPHiZC8zRNxF onvSGXYI20hi6o6LrCA2i4CKxKdb61hAbF6BGIkjO09DLehilNj6ZzXYUE4Be4me3WfAbEYB MYnvp9YwgdjMAuISt57MZ4J4T0BiyZ7zzBC2qMTLx/9YIepjJR4dbWCDiCtInLrawAhhy0pc mt8N9rSEwD52iYlztrNAJHQlPkydCjXIV+L4e5A4SNFxRok73/+xQyS0JC693Ag1NVvixO2X ULa1xOzJd6EGyUms6n0IZe9jllhyNG8Co8EsJIfPYuQAsjUl1u/ShwgrSkzpfsg+CxwYghIn Zz5hWcDIsopRtDi1uDg33chIL7UoM7m4OD9PLy+1ZBMjMKEc3PLbagfjweeOhxgFOBiVeHi/ teREC7EmlhVX5h5ilOBgVhLhtfUBCvGmJFZWpRblxxeV5qQWH2KU5mBREud1SrOIEhJITyxJ zU5NLUgtgskycXBKNTAutzpU9ui4QMI32VPfayzC03n7beSTZfb613xQ75u28m/vL5/VCku6 DI6skgg9Nll++zK18mvvTkuyGm39r6TW1DRlW0GuTrJN04xJU3ZUf1WTCLuov+9vtV8ZR/c8 59nRVl9Whuf4OoVunR7S/O383YSei6ukpF4ks/C0XFpoWXmUecq2ChslluKMREMt5qLiRAC6 RKUiJAMAAA==
Archived-At: <https://mailarchive.ietf.org/arch/msg/pim/0_1MGx9WoEWRndoEowqSCkVlEFs>
Subject: Re: [pim] Call for adoption of sr-mpls-multicast-framework
X-BeenThere: pim@ietf.org
X-Mailman-Version: 2.1.27
Precedence: list
List-Id: Protocol Independent Multicast <pim.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/pim>, <mailto:pim-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/pim/>
List-Post: <mailto:pim@ietf.org>
List-Help: <mailto:pim-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/pim>, <mailto:pim-request@ietf.org?subject=subscribe>
X-List-Received-Date: Thu, 09 Aug 2018 19:38:35 -0000

Hi Eric:

Replies in line, prefaced with DA>

I am opposed to the adoption of this draft.

The document is not very clear, and it is not always possible to tell what is actually being proposed.  The parts I think I understand have numerous problems which do not appear to have been properly considered.

DA> Then I'll try to respond to the comments you have, and perhaps we can work our way through them.

The document appears to specify a way of setting up P2MP LSPs (let's call them "trees") using IGP as the control plane. Every tree is associated with an IPv4 or IPv6 group address.  Through some unspecified magic, each node determines whether it has "transmit interest" or "receive interest" for a given group address.  This interest is advertised via the IGP.

DA> If it is magic then clearly you understand the trick and where the wires are. 😊

It is not clear whether these group addresses have anything to do with the group addresses that appear in IP multicast packets.  If they don't, it is not clear why a tree has to be associated with an IPv4/IPv6 address at all.  If they do, it is not clear how multiple user flows can be aggregated on a single tree.

DA> Good feedback. Personally I do not see adding support for them as being a particularly big deal. Explicit mention of this will be in the next version.

The draft seems to explicitly prohibit a tree from having parallel links between a given pair of nodes; this means that ECMP load balancing is not possible, which suggests that a tree is to carry only one flow. (I'm not sure why this prohibition is made, but it does seem to be there in rule 2 of section 5.2.2.)  
DA> There I think you have misunderstood the pruning algorithm. If at some point you have parallel paths you can consider them to be a single path from the point of view of resolving where state is going to end up.  It is certainly not a prohibition against ECMP. The algorithm is specifically designed to make ECMP as painless as possible. What you are commenting on is a simplification of the topology model as an intermediate step in determining where state will need to be installed.

On the other hand, there are a few sentences suggesting that these trees can be used to instantiate MVPN/EVPN PMSIs, which suggests that the trees aggregate flows and thus could benefit from ECMP.
DA> See replies above.

The idea of using IGP to advertise interest in sending or receiving particular multicast flows is not new.  It recurs every few years, and generally disappears when people start to examine the evident scaling problems.  The number of groups, transmitters, and receivers is not bounded, and their rate of change is not bounded.  The last thing one wants to do is create an unbounded amount of IGP overhead. Surprisingly, the impact on IGP scaling is not mentioned at all in the draft.  (Imagine if one had to kick the IGP every time MVPN/EVPN initiates a new S-PMSI!)
DA> Deployments of 802.1aq shortest path bridging would provide an existence proof for using the IGP without detrimental effects and I'm aware of some sizeable networks using it. It does have a virtuous circle in that all multicast state exchange is not tied to topology changes, so when a topology change does occur the control plane can simply get on with it with a minimum of messaging.  

Each <transmitter, group address> pair seems to be assigned a domain-wide unique label, or SID.  I think the intention is that the transmitter gets configured with this SID, and then advertises it in the IGP messages.  Assigning these domain-wide unique identifiers to trees that may appear and disappear with some frequency seems like an interesting problem.  However, the authors seem to regard this problem as out of scope for the document; that's a bit strange for a document that purports to be describing a framework.
DA> Again that is good feedback. We'll address this in the next version.

Whether a given node has transmit and/or receive interest for a given group address is, according to the document, determined by management configuration.  There is no consideration of having the edge nodes support PIM/IGMP/MLD/mLDP, etc.  There is some mention of MVPN/EVPN, but only a mention, not an actual proposal.  That also seems strange for a framework document.  Surely a framework document should clearly delineate the boundary between overlay and underlay, and address the interactions between them.
DA> That is an oversight in the content itself, interactions with IGMP, PIM etc. have been discussed outside the draft content. For the next version.

Once the IGP distributes information about who has transmit and/or receive interest in each group address, a computation is performed to figure out the best tree for a particular <transmitter, group> pair. Some rules are sketched out for figuring this out, but the rules are not clear or precise enough to result in interoperable implementations.
DA> If you could enumerate where you think it is not clear, that would be very useful input.

Once the tree is determined, per-tree state needs to be established at each node that is either a transmitter, a receiver, or a replication point.  If a node doesn't fall into one of these categories for a particular tree, that node will not need the per-tree state.  If a particular replication point on a tree is not directly attached to the next replication point downstream, unicast tunneling is used to get the packets from one replication point to the next.
DA> Correct

Much is made of the fact that per-tree data plane state exists only at nodes that are transmitters, replication points, or receivers for the tree.  This is presented as significant reduction in data plane state when compared to, say, PIM or mLDP.  Of course, whether there is any signficant reduction in state depends on the size and shape of the trees.  No analysis is given to show that the state reduction is significant, just a lot of handwaving.
DA> I would have thought it was self evident as described in the PIM and RTGWG presentations where the worst case is that (2xleaf count) nodes need to install state per tree irrespective of network size, and a fairly simple example provided. I don’t recall seeing you there. Paraphrasing a comment of yours from years ago you said that the role of a draft or RFC was to describe "what" and not "why", but if now you think some more "why" is needed we're happy to oblige.

On the other hand, there is an enormous increase in control plane state resulting from the fact that every single node in the network is aware of every tree and all its members.  Existing multicast protocols do not require a node to know about a tree unless the tree passes through it, and do not require all nodes along a tree to know about all the leaf nodes of the tree.  The large increase in the amount of control plane state is not considered at all, for some reason.
DA> I guess that is a bias where control plane memory is cheaper, and therefore more plentiful than dataplane state, making DP state something to optimize around.  We'll think about how to handle this.

To send a data packet along a particular tree, the SID for that tree is pushed onto the packet's label stack.  Each node that receives a packet and sees a particular SID at the top of the label stack will replicate the packet and send (or tunnel) each copy to the next replication point, where the same SID will be seen.  Since unicast tunneling is generally used to get a packet from one replication point to the next, packets traveling along a particular tree may arrive over any interface.  This makes it impossible to do a data plane RPF check (such as PIM would do).  
DA> If PHP is employed, then the multicast SID would be top label, and split horizon filtering would be possible as a form of an RPF check. We'll add this to the draft.   However this is a useful insight to be reminded of when considering what technologies the approach is applicable to as suggested by Toerless's post.  

Since the tree has a domain-wide unique SID, you can't tell from the label who the previous node was, so you can't reject packets from the "wrong previous node" (such as mLDP would do).  This may lead to potentially disastrous multicast loops during periods when a tree is being modified, especially if node A was previously upstream of node B, but the change puts it downstream of node B.  (Modifications can be caused by topology changes or simply by a change in the set of
receivers.)  These issues do not appear to have been considered.
DA> They were, just not elucidated in the draft itself.

I do not see how this document can be considered for WG adoption. Given the set of signficant issues that have not received proper consideration, it is not possible to say that this document is a good basis for future work.
DA> Personally I’m not seeing any show stoppers in your list of concerns. But that is my opinion.

Thanks for the feedback
Dave



_______________________________________________
pim mailing list
pim@ietf.org
https://www.ietf.org/mailman/listinfo/pim