Re: WGLC for draft-rtgwg-mrt-frr-architecture

Stewart Bryant <stewart.bryant@gmail.com> Wed, 23 December 2015 12:37 UTC

Return-Path: <stewart.bryant@gmail.com>
X-Original-To: rtgwg@ietfa.amsl.com
Delivered-To: rtgwg@ietfa.amsl.com
Received: from localhost (ietfa.amsl.com [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 4ED9C1ACDBF for <rtgwg@ietfa.amsl.com>; Wed, 23 Dec 2015 04:37:49 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.999
X-Spam-Level:
X-Spam-Status: No, score=-1.999 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, FREEMAIL_FROM=0.001, HTML_MESSAGE=0.001, SPF_PASS=-0.001] autolearn=ham
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id vo-lubb09Sk7 for <rtgwg@ietfa.amsl.com>; Wed, 23 Dec 2015 04:37:47 -0800 (PST)
Received: from mail-wm0-x229.google.com (mail-wm0-x229.google.com [IPv6:2a00:1450:400c:c09::229]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id CC2CB1ACEFA for <rtgwg@ietf.org>; Wed, 23 Dec 2015 04:37:46 -0800 (PST)
Received: by mail-wm0-x229.google.com with SMTP id p187so147012958wmp.0 for <rtgwg@ietf.org>; Wed, 23 Dec 2015 04:37:46 -0800 (PST)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=subject:to:references:from:message-id:date:user-agent:mime-version :in-reply-to:content-type; bh=gAokDSjW4Uuy77kpEpKt48cWTF4M5EiUGso1fAcXJJw=; b=H4VEt6KQPeJztFFBe4RQAbvLAcmDdR50ckQI2VJzlqGEhlzLLGAkTv2VALzZzXfgP1 7EV1VC1h99+vx0kHaoUbqDQIKVq+Fqa/nE9Tdct5rIIrr+ERYLoOVPKxcIYKx8XWTZTc 8SUuZ4OZUMtgFZI8gfX7D8JLVI+4/jYnyndyaf1OqnfdFiE6ocSInCx/Untijma6qAbc cAMcV4x6NZ7oLfIScKA4d9X5t7MIQ2g8OiXmAUdBBk3DnvVCm2ktX91kbFLnJLn4XYtb 43ZXELth4Jue1S5yS+7jaznGjbpc/1sFraCfvR5dKqDxAr9kL1EWr1D8rC4p8CmV1GjN BPjw==
X-Received: by 10.194.222.135 with SMTP id qm7mr33592110wjc.106.1450874265348; Wed, 23 Dec 2015 04:37:45 -0800 (PST)
Received: from [192.168.2.126] (host213-123-124-182.in-addr.btopenworld.com. [213.123.124.182]) by smtp.gmail.com with ESMTPSA id gy2sm17869727wjc.12.2015.12.23.04.37.43 (version=TLSv1/SSLv3 cipher=OTHER); Wed, 23 Dec 2015 04:37:43 -0800 (PST)
Subject: Re: WGLC for draft-rtgwg-mrt-frr-architecture
To: Rob Shakir <rjs@rob.sh>, Chris Bowers <cbowers@juniper.net>, "rtgwg@ietf.org" <rtgwg@ietf.org>, "draft-rtgwg-mrt-frr-architecture@tools.ietf.org" <draft-rtgwg-mrt-frr-architecture@tools.ietf.org>, Alvaro Retana <aretana@cisco.com>
References: <566083D0.1020607@gmail.com> <etPan.566efcf6.5e5c4ea5.122@piccolo.rob.sh> <CO2PR05MB619B671A75A6BB2075DD25DA9E40@CO2PR05MB619.namprd05.prod.outlook.com> <etPan.56781edf.6b1fb64.10d@latte>
From: Stewart Bryant <stewart.bryant@gmail.com>
Message-ID: <567A9597.5030308@gmail.com>
Date: Wed, 23 Dec 2015 12:37:43 +0000
User-Agent: Mozilla/5.0 (Windows NT 10.0; WOW64; rv:38.0) Gecko/20100101 Thunderbird/38.4.0
MIME-Version: 1.0
In-Reply-To: <etPan.56781edf.6b1fb64.10d@latte>
Content-Type: multipart/alternative; boundary="------------060206060404090408000204"
Archived-At: <http://mailarchive.ietf.org/arch/msg/rtgwg/LhS4rUbgwqgHFqXoyF3Rmf37dCQ>
X-BeenThere: rtgwg@ietf.org
X-Mailman-Version: 2.1.15
Precedence: list
List-Id: Routing Area Working Group <rtgwg.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/rtgwg>, <mailto:rtgwg-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/rtgwg/>
List-Post: <mailto:rtgwg@ietf.org>
List-Help: <mailto:rtgwg-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/rtgwg>, <mailto:rtgwg-request@ietf.org?subject=subscribe>
X-List-Received-Date: Wed, 23 Dec 2015 12:37:49 -0000


On 21/12/2015 15:46, Rob Shakir wrote:
> Chris,
>
> Thanks for the clarification around the fact that MRT can be run as 
> part of a multi-technology approach. I wouldn’t endorse such an 
> approach operationally — why run multiple technologies alongside each 
> other that one must understand vs. a single one that could meet 
> multiple requirements - but some operators may find such an approach 
> useful.
>
> On 21 December, 2015 at 2:47:20 PM, Chris Bowers (cbowers@juniper.net 
> <mailto:cbowers@juniper.net>) wrote:
>
>> I also want to comment on the fact that remote LFA produces multiple 
>> alternates to choose from. With respect to determining if an 
>> alternate provides node protection or not, the fact that remote LFA 
>> computes many possible alternate paths could be viewed as a drawback, 
>> as opposed to an advantage. For a given PLR and failure mode and 
>> destination, in general it will be the case that many nodes qualify 
>> as PQ nodes. In order to determine if the complete repair path from 
>> PLR to PQ-node and PQ-node to destination is node-protecting, 
>> additional computation is needed. The most efficient approach seems 
>> to be to run a forward SPF from the PQ-node being evaluated. In some 
>> topologies, it is not uncommon for many nodes to qualify as PQ nodes. 
>> In order to avoid spending too much time churning away at running 
>> forward SPFs rooted at PQ-nodes, some implementations may find it 
>> useful to limit the number of PQ nodes evaluated for node-protection.
>>
>> By comparison, for roughly the computational cost of evaluating three 
>> PQ nodes for node-protection, MRT produces a path which is guaranteed 
>> to be node-protecting, if node protection is possible. In cases where 
>> node-protection and maximum coverage is important, it seems 
>> reasonable to give operators the option of having an efficient means 
>> of generating a node-protecting path as opposed to the trial and 
>> error approach of evaluating large numbers of PQ nodes, which may or 
>> may not ultimately provide a node-protecting path.
>
> I feel this analysis misses a fundamental point — ‘cost’ does not 
> equate only to the number of cycles that we must spend to find an 
> alternate. Instead we need to consider the whole picture. The question 
> one really needs to consider here is whether the “cost” of using CPU 
> cycles is something that we want to optimise for, over the cost of 
> investing in operational expertise/tooling to ensure manageability and 
> capacity.
>
I certainly agree with that statement. Compute limit is much less of a 
problem than when any of this work was started.

Stewart

> Much of the work that LFA manageability and TI-LFA do is to make flows 
> align with what is “expected” to happen in the network - such that it 
> is easy to understand for operational personnel, but also, it does not 
> drive investment in new capacity which is used *only* in repair 
> scenarios (such investment is generally required IMHO, in order to not 
> congest and degrade all application traffic during that repair).
>
> In my experience (and of course, YMMV), optimising for the latter 
> operational reasons is very much worth spending more cycles on the 
> control-plane, especially now that there are tending to be more 
> resources available there. I think characterisations such as the one 
> above are very academically interesting, but I find that in this case, 
> that has less relevance when we come to actually operating a network.
>
> I think there’s interesting work here, but I’ll continue to struggle 
> with whether it really is usable operationally.
>
> Regards,
>
> r.
>