Re: [Tsv-art] Tsvart telechat review of draft-ietf-pim-source-discovery-bsr-08
"Black, David" <David.Black@dell.com> Wed, 24 January 2018 17:40 UTC
Return-Path: <David.Black@dell.com>
X-Original-To: tsv-art@ietfa.amsl.com
Delivered-To: tsv-art@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 89F1B12711E; Wed, 24 Jan 2018 09:40:50 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.721
X-Spam-Level:
X-Spam-Status: No, score=-2.721 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, RCVD_IN_DNSWL_LOW=-0.7, RCVD_IN_MSPIKE_H3=-0.01, RCVD_IN_MSPIKE_WL=-0.01, SPF_PASS=-0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (1024-bit key) header.d=dell.com header.b=vp8glueq; dkim=fail (1024-bit key) reason="fail (message has been altered)" header.d=emc.com header.b=gLM3BVix
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id bsnkmLK7q2Ow; Wed, 24 Jan 2018 09:40:48 -0800 (PST)
Received: from esa3.dell-outbound.iphmx.com (esa3.dell-outbound.iphmx.com [68.232.153.94]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 1551C1270FC; Wed, 24 Jan 2018 09:40:48 -0800 (PST)
DKIM-Signature: v=1; a=rsa-sha256; c=simple/simple; d=dell.com; i=@dell.com; q=dns/txt; s=smtpout; t=1516815277; x=1548351277; h=from:cc:to:subject:date:message-id:references: in-reply-to:content-transfer-encoding:mime-version; bh=YsVm7rJbKg7zLiM5xIBGl2gTgBz61bEkJi844SjT3Sw=; b=vp8glueqRH1+TvtytML1LW3YoOkh+lSdxmC6jx0/FO0NeUt671aGrK74 ZQEFTO5EdkIzoVZGbZKFui7VfF4vLpPVPyoIIYiRthd3fHbXTwF5rSZm6 QFqin8+PPdvPWl8wqneejuKHf6H85jEiPJS6NVsmYluIZEB3hCN04pL2A M=;
X-IronPort-Anti-Spam-Filtered: true
X-IronPort-Anti-Spam-Result: A2GIAAD+xGhah8mZ6EReGgEBAQEBAgEBAQEIAQEBAYJsgSyBBCcHg1aZC4ICiRGORIE/QwqCAYM6AhqEZ1cVAQEBAQEBAQECAQIQAQEBCgsJCCgvgjgkAQ5LIQYyAQEBAQEBAQEBAQEBAQEBAQEBFwI9EwIYAQEBBCMRDB8aAQsEAgEIEQQBAQECAgYdAwICAh8RFAEICAIEARIIE4oCAxUBsnaCJ4MRhDENgwsBAQEBAQEBAQEBAQEBAQEBAQEBAQEVCIEPg0OBNl+BWIQHWDaCa0QEgVgZFYMAMYI0o043BgKQWYchhh+Laop5gyaJCwIEAgQFAhqBPDWBdHCCfIJkgXN4jROBFwEBAQ
X-IPAS-Result: A2GIAAD+xGhah8mZ6EReGgEBAQEBAgEBAQEIAQEBAYJsgSyBBCcHg1aZC4ICiRGORIE/QwqCAYM6AhqEZ1cVAQEBAQEBAQECAQIQAQEBCgsJCCgvgjgkAQ5LIQYyAQEBAQEBAQEBAQEBAQEBAQEBFwI9EwIYAQEBBCMRDB8aAQsEAgEIEQQBAQECAgYdAwICAh8RFAEICAIEARIIE4oCAxUBsnaCJ4MRhDENgwsBAQEBAQEBAQEBAQEBAQEBAQEBAQEVCIEPg0OBNl+BWIQHWDaCa0QEgVgZFYMAMYI0o043BgKQWYchhh+Laop5gyaJCwIEAgQFAhqBPDWBdHCCfIJkgXN4jROBFwEBAQ
Received: from esa1.dell-outbound2.iphmx.com ([68.232.153.201]) by esa3.dell-outbound.iphmx.com with ESMTP/TLS/DHE-RSA-AES256-GCM-SHA384; 24 Jan 2018 11:34:29 -0600
From: "Black, David" <David.Black@dell.com>
Cc: "tsv-art@ietf.org" <tsv-art@ietf.org>, "ietf@ietf.org" <ietf@ietf.org>, "pim@ietf.org" <pim@ietf.org>, "draft-ietf-pim-source-discovery-bsr.all@ietf.org" <draft-ietf-pim-source-discovery-bsr.all@ietf.org>, "Black, David" <David.Black@dell.com>
Received: from mailuogwhop.emc.com ([168.159.213.141]) by esa1.dell-outbound2.iphmx.com with ESMTP/TLS/DHE-RSA-AES256-GCM-SHA384; 24 Jan 2018 23:34:23 +0600
Received: from maildlpprd04.lss.emc.com (maildlpprd04.lss.emc.com [10.253.24.36]) by mailuogwprd03.lss.emc.com (Sentrion-MTA-4.3.1/Sentrion-MTA-4.3.0) with ESMTP id w0OHeZjY003606 (version=TLSv1.2 cipher=ECDHE-RSA-AES256-GCM-SHA384 bits=256 verify=NO); Wed, 24 Jan 2018 12:40:35 -0500
X-DKIM: OpenDKIM Filter v2.4.3 mailuogwprd03.lss.emc.com w0OHeZjY003606
DKIM-Signature: v=1; a=rsa-sha1; c=relaxed/relaxed; d=emc.com; s=jan2013; t=1516815636; bh=e6JqFG/eEXrtRE6i9kiTMw/DJ8k=; h=From:To:CC:Subject:Date:Message-ID:References:In-Reply-To: Content-Type:Content-Transfer-Encoding:MIME-Version; b=gLM3BVix1PIvmkxAzPcK1Xp1uuRGPGvGbg6N28RfjB9Iwj7nPWQUAH+510PCFbAZ6 irlWiAi20RQMNg66OUtrtRGRHJryWKX+BLpQfmDTMqAJGOpfguZWKfS1pGTtpTUNxi gpJ7+A6Cz68h8ROzgrJqSnbRvH1D8OfGZDndoImY=
X-DKIM: OpenDKIM Filter v2.4.3 mailuogwprd03.lss.emc.com w0OHeZjY003606
Received: from mailusrhubprd54.lss.emc.com (mailusrhubprd54.lss.emc.com [10.106.48.19]) by maildlpprd04.lss.emc.com (RSA Interceptor); Wed, 24 Jan 2018 12:40:19 -0500
Received: from MXHUB310.corp.emc.com (MXHUB310.corp.emc.com [10.146.3.36]) by mailusrhubprd54.lss.emc.com (Sentrion-MTA-4.3.1/Sentrion-MTA-4.3.0) with ESMTP id w0OHeNKu022360 (version=TLSv1.2 cipher=ECDHE-RSA-AES256-SHA384 bits=256 verify=FAIL); Wed, 24 Jan 2018 12:40:23 -0500
Received: from MX307CL04.corp.emc.com ([fe80::849f:5da2:11b:4385]) by MXHUB310.corp.emc.com ([10.146.3.36]) with mapi id 14.03.0352.000; Wed, 24 Jan 2018 12:40:22 -0500
To: Stewart Bryant <stewart.bryant@gmail.com>, Stig Venaas <stig@venaas.com>
Thread-Topic: Tsvart telechat review of draft-ietf-pim-source-discovery-bsr-08
Thread-Index: AQHTlKx8O014Dp6Y2UCNgS7hHZaaXqODLyUAgABgtgD//7uGoA==
Date: Wed, 24 Jan 2018 17:40:22 +0000
Message-ID: <CE03DB3D7B45C245BCA0D243277949362FFAB5E4@MX307CL04.corp.emc.com>
References: <151675081688.15722.801207813861297527@ietfa.amsl.com> <CAHANBt+a1eoMGNJs5tKvNOtLKKBbW5CHE00ZaUGS62OP5goviw@mail.gmail.com> <CE03DB3D7B45C245BCA0D243277949362FFAB39B@MX307CL04.corp.emc.com> <622506ad-8fb6-f194-3b70-403c26f67d02@gmail.com>
In-Reply-To: <622506ad-8fb6-f194-3b70-403c26f67d02@gmail.com>
Accept-Language: en-US
Content-Language: en-US
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
x-originating-ip: [10.238.44.138]
Content-Type: text/plain; charset="utf-8"
Content-Transfer-Encoding: base64
MIME-Version: 1.0
X-Sentrion-Hostname: mailusrhubprd54.lss.emc.com
X-RSA-Classifications: public
Archived-At: <https://mailarchive.ietf.org/arch/msg/tsv-art/coJwHmRnvdpMbxtf6b4CgswreqI>
Subject: Re: [Tsv-art] Tsvart telechat review of draft-ietf-pim-source-discovery-bsr-08
X-BeenThere: tsv-art@ietf.org
X-Mailman-Version: 2.1.22
Precedence: list
List-Id: Transport Area Review Team <tsv-art.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/tsv-art>, <mailto:tsv-art-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/tsv-art/>
List-Post: <mailto:tsv-art@ietf.org>
List-Help: <mailto:tsv-art-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/tsv-art>, <mailto:tsv-art-request@ietf.org?subject=subscribe>
X-List-Received-Date: Wed, 24 Jan 2018 17:40:51 -0000
That works for me, Thanks, --David > -----Original Message----- > From: Stewart Bryant [mailto:stewart.bryant@gmail.com] > Sent: Wednesday, January 24, 2018 11:45 AM > To: Black, David <david.black@emc.com>; Stig Venaas <stig@venaas.com> > Cc: tsv-art@ietf.org; ietf@ietf.org; pim@ietf.org; draft-ietf-pim-source- > discovery-bsr.all@ietf.org > Subject: Re: Tsvart telechat review of draft-ietf-pim-source-discovery-bsr-08 > > The problem with complex processing under error conditions is that that > is where all the software bugs hang out because they are hard to test > and don't show up until you have the problem they are trying to fix. > > This is a case where you want the simplest possible process like a small > burst followed by your 60s interval which seems unlikely to stress any > sensibly designed implementation on a reasonably sized network. > > - Stewart > > > On 24/01/2018 16:30, Black, David wrote: > > Hi Stig, > > > >> I agree with all you wrote and will update the document. However, > >> there is one slight issue with the minimum time between origination of > >> each message. When a new source is detected, we would like to > >> originate a message ASAP so that receivers can start receiving the > >> multicast without much delay. A 10s delay would be a rather long time > >> if a source was detected right after the previous message was > >> originated. I think some delay would be warranted though, in > >> particular in a case where perhaps a router starts up and a large > >> number of directly connected sources could be detected within a short > >> time frame. I think an exponential back-off could make sense here. > >> E.g., if it is just one new source, maybe trigger a message ASAP. If a > >> new source is detected right after the previous one, wait a bit > >> longer, which also allows for aggregation of multiple sources in one > >> messages if several are detected later. In extreme cases one could > >> over time keep increasing the delay until the next update. > >> If sufficient we could maybe have a fixed minimum delay of 1s or not, > >> but that is probably too short in those extreme cases. Hence maybe an > >> exponential back-off. > > Exponential back-off sounds like a very good idea - I'd suggest adding > something starting from RFC 5059's back-off functionality. > > > >> I would appreciate some further guidance what you think is reasonable > >> here, and perhaps whether I can borrow something here from other > >> protocols/drafts. Part of the experiment here might be to find out > >> what minimum values, or how rapid back-off, is needed based on the > >> size of the network, the amount of sources, the types of links etc. > > In addition to burst scenarios (e.g., router starts up, lots of new sources > detected quickly as a result), I strongly suggest thinking about chaos > scenarios where links and/or routers are coming and going so rapidly that the > source population is in a constant state of flux. If things are really bad, the > best thing to do may be to shut up and hope that the chaos settles out, as > not much useful will happen until it does, and send messages about > observed changes risks make things worse. Again, exponential back-off > makes sense, possibly quite aggressive, e.g., back-off from 10 seconds by a > small factor a few times, and if things still look bad, wait at least a minute or > two with further back-off from that longer time until things stabilize. This > needs more thought on how to adjust the back-off factor, as that off-the- > top-of my-head example probably exhibits peculiar behavior in scenarios > that just are on the edge of tripping the long delay - some thinking about > what stability means and how to get there may help in figuring out the > relative merits and applicability of backing off further vs. some kind of > dramatic reset, analogous to TCP's congestion window reset on timeout. > > > > As this is intended to be an experimental RFC, I don’t think a completely > worked-out solution is expected or required - a good discussion of the > problems and explanation of areas that need investigation as part of the > experiment ought to suffice, as suggested in last sentence quoted above. I > would add some initial exponential back-off functionality as a starting point. > > > >> Also note that the general mechanism can be used for many types of > >> information. It depends on the information how urgent it is to > >> distribute it. Source discovery is particular is fairly urgent. > > And that should be discussed, perhaps in Section 3 somewhere. > > > > Thanks, --David > > > > > >> -----Original Message----- > >> From: Stig Venaas [mailto:stig@venaas.com] > >> Sent: Tuesday, January 23, 2018 7:44 PM > >> To: Black, David <david.black@emc.com> > >> Cc: tsv-art@ietf.org; draft-ietf-pim-source-discovery-bsr.all@ietf.org; > >> ietf@ietf.org; pim@ietf.org > >> Subject: Re: Tsvart telechat review of draft-ietf-pim-source-discovery- > bsr-08 > >> > >> Hi, thanks for the great comments. > >> > >> I agree with all you wrote and will update the document. However, > >> there is one slight issue with the minimum time between origination of > >> each message. When a new source is detected, we would like to > >> originate a message ASAP so that receivers can start receiving the > >> multicast without much delay. A 10s delay would be a rather long time > >> if a source was detected right after the previous message was > >> originated. I think some delay would be warranted though, in > >> particular in a case where perhaps a router starts up and a large > >> number of directly connected sources could be detected within a short > >> time frame. I think an exponential back-off could make sense here. > >> E.g., if it is just one new source, maybe trigger a message ASAP. If a > >> new source is detected right after the previous one, wait a bit > >> longer, which also allows for aggregation of multiple sources in one > >> messages if several are detected later. In extreme cases one could > >> over time keep increasing the delay until the next update. > >> If sufficient we could maybe have a fixed minimum delay of 1s or not, > >> but that is probably too short in those extreme cases. Hence maybe an > >> exponential back-off. > >> > >> I would appreciate some further guidance what you think is reasonable > >> here, and perhaps whether I can borrow something here from other > >> protocols/drafts. Part of the experiment here might be to find out > >> what minimum values, or how rapid back-off, is needed based on the > >> size of the network, the amount of sources, the types of links etc. > >> > >> Also note that the general mechanism can be used for many types of > >> information. It depends on the information how urgent it is to > >> distribute it. Source discovery is particular is fairly urgent. > >> > >> Stig > >> > >> > >> On Tue, Jan 23, 2018 at 3:40 PM, David Black <david.black@dell.com> > wrote: > >>> Reviewer: David Black > >>> Review result: Ready with Issues > >>> > >>> I've reviewed this document as part of TSV-ART's ongoing effort to > review key > >>> IETF documents. These comments were written primarily for the > transport area > >>> directors, but are copied to the document's authors for their information > and > >>> to allow them to address any issues raised. When done at the time of > IETF Last > >>> Call, the authors should consider this review together with any other > last-call > >>> comments they receive. Please always CC tsv-art@ietf.org if you reply to > or > >>> forward this review. > >>> > >>> This draft describes an experimental PFM (PIM Flooding Mechanism) > mechanism for > >>> flooding PIM information among multicast routers that is a generalized > form of > >>> the RFC 5059 PIM BSR (BootStrap Router) mechanism, and applies this > mechanism > >>> to distribution of source group mappings (PFM-SD). > >>> > >>> Early implementation experience with PFM-SD on low bandwidth radio > links > >>> (described Section 2) suggests that the mechanism is able to work better > than > >>> PIM-SM without starving other traffic in the fashion that PIM-DM may. > This is > >>> promising and (in this reviewer's opinion) justifies experimentation at > larger > >>> scale and in other network environments. In general, this is a well- > written > >>> document and the authors should be commended for including the > "running code" > >>> implementation experience report in Section 2. > >>> > >>> Flooding mechanisms are very useful, but the time periods that govern > sending > >>> of flooding messages are crucial to avoid excessive consumption of > network > >>> resources. Section 5 of RFC 5059 has a solid discussion of the time > periods > >>> that apply to use of flooding by the BSR mechanism. The discussion in > this > >>> draft is somewhat weaker, raising a couple of minor issues: > >>> > >>> 1) For PFM-SD, Section 4.2 provides a reasonable discussion of time > periods > >>> that apply, but appears to be missing a minimum time period between > sending > >>> messages. Section 5 of RFC 5059 recommends a default of 10 seconds > for that > >>> minimum time period by comparison to a default PIM BSR sending > interval of 60 > >>> seconds. That 10 second minimum default should be added to this draft, > as the > >>> same default sending interval of 60 seconds is used. > >>> > >>> 2) For future use of PFM for other purposes, Section 3.3 provides the > following > >>> guidance: > >>> > >>> Each TLV definition will need to define when a triggered PFM message > needs > >>> to be originated, and also whether to send periodic messages, and > how > >>> frequent. > >>> > >>> That guidance is correct as far as it goes, but it's not particularly helpful > >>> to future protocol designers. Text should be added to at least point to > the > >>> examples in section 4.2 of this draft and/or part of Section 5 of RFC 5059 > to > >>> suggest the sorts of values that have proven to be workable, and > perhaps also > >>> strongly encourage (SHOULD use) a default minimum time between > messages of at > >>> least 10 seconds. > >>> > >>> Understanding this draft requires that the reader be familiar with > multicast > >>> and PIM, which is reasonable. In addition, an understanding of PIM BSR > is also > >>> required, which is perhaps somewhat less reasonable. An example that > this > >>> reviewer tripped over is that Section 3 of this draft states that "Like BSR, > >>> messages are forwarded hop by hop." There is no further explanation > or > >>> definition of "forwarded hop by hop," making it necessary to consult RFC > 5059 > >>> to understand that term, e.g., this has nothing to do with IPv6 hop-by- > hop > >>> options. A sentence or two of explanation of this hop by hop forwarding > >>> concept ought to be copied and adapted from RFC 5059, and it would be > good to > >>> check for other concepts that rely on RFC 5059 for definitions. > >>> > >>> >
- [Tsv-art] Tsvart telechat review of draft-ietf-pi… David Black
- Re: [Tsv-art] Tsvart telechat review of draft-iet… Stig Venaas
- Re: [Tsv-art] Tsvart telechat review of draft-iet… Mirja Kuehlewind (IETF)
- Re: [Tsv-art] Tsvart telechat review of draft-iet… Black, David
- Re: [Tsv-art] Tsvart telechat review of draft-iet… Stewart Bryant
- Re: [Tsv-art] Tsvart telechat review of draft-iet… Black, David
- Re: [Tsv-art] Tsvart telechat review of draft-iet… Stig Venaas
- Re: [Tsv-art] Tsvart telechat review of draft-iet… Black, David
- Re: [Tsv-art] Tsvart telechat review of draft-iet… Stig Venaas
- Re: [Tsv-art] Tsvart telechat review of draft-iet… Black, David
- Re: [Tsv-art] Tsvart telechat review of draft-iet… Stig Venaas
- Re: [Tsv-art] Tsvart telechat review of draft-iet… Black, David