Re: [Anima] Self-Managed Networks

"Toy, Mehmet" <Mehmet_Toy@cable.comcast.com> Thu, 15 October 2015 03:02 UTC

Return-Path: <Mehmet_Toy@cable.comcast.com>
X-Original-To: anima@ietfa.amsl.com
Delivered-To: anima@ietfa.amsl.com
Received: from localhost (ietfa.amsl.com [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id AB2001B2F5C for <anima@ietfa.amsl.com>; Wed, 14 Oct 2015 20:02:21 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: 0.225
X-Spam-Level:
X-Spam-Status: No, score=0.225 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, HELO_EQ_MODEMCABLE=0.768, HOST_EQ_MODEMCABLE=1.368, SPF_PASS=-0.001, T_RP_MATCHES_RCVD=-0.01] autolearn=no
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id qgrty6BeDknX for <anima@ietfa.amsl.com>; Wed, 14 Oct 2015 20:02:20 -0700 (PDT)
Received: from pacdcmhout01.cable.comcast.com (PACDCMHOUT01.cable.comcast.com [68.87.31.167]) (using TLSv1 with cipher DHE-RSA-AES256-SHA (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id EAC1E1B2F5B for <anima@ietf.org>; Wed, 14 Oct 2015 20:02:19 -0700 (PDT)
X-AuditID: 44571fa7-f79776d000005d5f-26-561f1739ee78
Received: from PACDCEXHUB01.cable.comcast.com (cas-umc02.ndceast.pa.bo.comcast.net [68.87.34.28]) (using TLS with cipher AES256-SHA (256/256 bits)) (Client did not present a certificate) by pacdcmhout01.cable.comcast.com (SMTP Gateway) with SMTP id 32.6C.23903.A371F165; Wed, 14 Oct 2015 23:02:18 -0400 (EDT)
Received: from VAADCEX36.cable.comcast.com (147.191.103.213) by PACDCEXHUB01.cable.comcast.com (24.40.56.114) with Microsoft SMTP Server (TLS) id 14.3.181.6; Wed, 14 Oct 2015 23:02:17 -0400
Received: from VAADCEX37.cable.comcast.com (147.191.103.214) by VAADCEX36.cable.comcast.com (147.191.103.213) with Microsoft SMTP Server (TLS) id 15.0.1044.25; Wed, 14 Oct 2015 23:02:16 -0400
Received: from VAADCEX37.cable.comcast.com ([fe80::3aea:a7ff:fe12:38b0]) by VAADCEX37.cable.comcast.com ([fe80::3aea:a7ff:fe12:38b0%19]) with mapi id 15.00.1044.021; Wed, 14 Oct 2015 23:02:05 -0400
From: "Toy, Mehmet" <Mehmet_Toy@cable.comcast.com>
To: Toerless Eckert <eckert@cisco.com>
Thread-Topic: Self-Managed Networks
Thread-Index: AdD/kRO6wwWTW9ATTCGfmWDsg6vmTgHZrtGAAAGfsnA=
Date: Thu, 15 Oct 2015 03:02:04 +0000
Message-ID: <5fe9c36320aa4921b12eb0279463d2cb@VAADCEX37.cable.comcast.com>
References: <9accdec6dd894df6afed38215b28b494@VAADCEX36.cable.comcast.com> <20151014231557.GA13294@cisco.com>
In-Reply-To: <20151014231557.GA13294@cisco.com>
Accept-Language: en-US
Content-Language: en-US
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
x-ms-exchange-transport-fromentityheader: Hosted
x-originating-ip: [96.114.156.8]
Content-Type: text/plain; charset="us-ascii"
Content-Transfer-Encoding: quoted-printable
MIME-Version: 1.0
X-CFilter-Loop: Forward
X-Brightmail-Tracker: H4sIAAAAAAAAA+NgFvrMIsWRmVeSWpSXmKPExsXiEq4ko2slLh9msHG2gcWc3h/sFg8XXWey aLu4j8ni688frBZ7l99js7g27yKzA5vHwZVz2D2m/N7I6rFz1l12jyVLfjJ5fLn8mS2ANYrL JiU1J7MstUjfLoEr4/zNSUwF+00rvj/5ztLAuFi7i5GTQ0LARGLes6VsELaYxIV764FsLg4h gV1MEj++TIFyDjJK/D+1mB2kSkjgMKPE3aMcEImTjBJn155mBUmwCRhJzDtylQXEFhFQkzh7 sgesm1lgEpPE3GtPwXYICyhJ3Nx6jx2iSFni4altzBC2lcSJ2/PBalgEVCVW/VvGCGLzCnhJ nL3exAaxOV/i/olVYL2cAvoSO2YsB7MZge7+fmoNE4jNLCAucevJfCaIfwQkluw5zwxhi0q8 fPyPFcI2kNi6dB8LhK0gsX3/NhaIXh2JBbs/sUHY2hLLFr5mhrhBUOLkzCcsEDdoSkxadhFq prjE4SM7WCcwSs9CsnoWklGzkIyahWTUAkaWVYxyBYnJKcm5GfmlJQaGesmJSTmpesn5ucmJ xSUgehMjOCnIL9/BeO+F0yFGAQ5GJR7eCnb5MCHWxLLiytxDjBIczEoivNob5MKEeFMSK6tS i/Lji0pzUosPMUpzsCiJ826+9ytUSCA9sSQ1OzW1ILUIJsvEwSnVwMijzXH6pJjY2gmlh+sd Gq6tmfCs8Z3Y3xKvH59fql9/7Xsolt012aRCesktydoF049eCkncx2rKI2DQKZvwvpRF82RD 38w1RYxXXFh3T7F39Nps/Wjf/hpHjZpIK3mes42XXS7We2o9XBB2WVL1Ot9+hWtaqa+4Pkxb WWc752n2q0kPliqfVVNiKc5INNRiLipOBADBvPFQBgMAAA==
Archived-At: <http://mailarchive.ietf.org/arch/msg/anima/sDAnAh5_5L2RE8O8QLd4SAJK35s>
Cc: "Romascanu, Dan (Dan) (dromasca@avaya.com)" <dromasca@avaya.com>, "anima@ietf.org" <anima@ietf.org>, "mbehring@cisco.com" <mbehring@cisco.com>, "anima-chairs@tools.ietf.org" <anima-chairs@tools.ietf.org>
Subject: Re: [Anima] Self-Managed Networks
X-BeenThere: anima@ietf.org
X-Mailman-Version: 2.1.15
Precedence: list
List-Id: Autonomic Networking Integrated Model and Approach <anima.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/anima>, <mailto:anima-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/anima/>
List-Post: <mailto:anima@ietf.org>
List-Help: <mailto:anima-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/anima>, <mailto:anima-request@ietf.org?subject=subscribe>
X-List-Received-Date: Thu, 15 Oct 2015 03:02:21 -0000

Toerless,
Please see below.
Thanks
Mehmet

-----Original Message-----
From: Toerless Eckert [mailto:eckert@cisco.com] 
Sent: Wednesday, October 14, 2015 7:16 PM
To: Toy, Mehmet
Cc: anima-chairs@tools.ietf.org; brian.e.carpenter@gmail.com; mbehring@cisco.com; Romascanu, Dan (Dan) (dromasca@avaya.com)
Subject: Re: Self-Managed Networks

Toy:

Unless you feel its more appropriate to keep this discussion in this closed mailing list, i would suggest to have it on the anima mailing list. 

First off: You've compressed really a lot of crucial arch design from your detailed first slidedeck into this a lot more pithy, yet comprehensive first draft. I think my main concern is how we can work on the SW engineering aspects of how to build it and build upon the anima infrastructure.

I think i did ask you some IETFs ago when you presented your slides, that it would be good if we could focus on that aspect.

Eg: Could we add a section to the document outlining a summary of how this framework would leverage and benefit from the the currently planned anima infrastructure - why/ how to use it - so we know the requirements we have, especially if those are any we would need to consider during recharter.
MT: As you can see from my previous email, I am not on board with current architectural definitions.  Once we are sync on the definitions of architectural components, I should make an attempt to address your concerns.

If you'd ask me, primarily as a contributor considering what would best help to make this happen - and move ANIMA forward, i think the key aspects are: 

1) easily, ideally autonomous downloadable agent/application infra.

Eg: I don't see any way that either vendors or operators could realistically build this framework if these components are tied into the classical router OS software delivery model where it takes months to validate a sofftware update and more months to deploy it. If this can be downloaded, as separate apps then its so much easier to incrementally build/deploy it on a totally different deployment process. 

How exactly should this be done so operators like comcast will most likely start incrementally deploying this ?

Eg: I am sure autonomic agents will be the next re-charter item, the functionality you describe for fault-management via such agents is great, and i can see how that is just good work to refine, so i am mostly worred about the sW-engineering and operational aspects of such autonomic agents.

MT: What I propose is unlikely to be done completely just by autonomic agents or software, but can be partially done by autonomic agents.  This is the key reason for these email exchanges.

2) Clear APIs to existing router operations.

We would need to have an idea how these new components would talk to the rest of the router. This seems like an eay "oh, we can have those agents do 30 year old router CLI locally, but we'd prefer Netconf/Yang with IETF standardized models, and we do need the following yang models for the first important FP operations". There is also talk about expanding into Thrift for higher performance data collection (beyond netconf, still with yang).

MT: Agree, but we are far from these steps.  We have to agree on the fundamentals.
3) connectivity between the agents.

This is primarily where at the lowest layer the ACP would come in and it would be great to see argument if/how security, zero-touch build-up and indestructibility are required and/or beneficial for these agents functions.  You already mention the security aspect in your security section. Great. 

Next layer is leveraging GRASP as a protocol for agent-to-agent communication. Is it feasible/beneficial to expect a common new message signaling layer with GRAP ? If so, it would be great to describee why and how. Eg: what communication patterns would your agents have, do the patterns GRASP support suit your agents ?
MT: In general I am not in favor of defining another control layer for this purposes. Therefore, my proposed design  does not need and does not have a protocol for communications.  

Your section 7 already has one specific layer of message communicartions, but it is directly tied to ethernet frame. I'd like to understand this better. I for once would think one would define for the message eleemnts a Yang/CBOR data model "failure management yang model". Then we look how to carry it. agent-to-agent, i would propose to actually run over GRASP (using CBOR representation of course), but to humans behind an NMS its more likely via netconf/yang from an agent into the NMS.
MT: Ethernet frame is just an example. We can collectively decide to have something else. The key point here is to have one message format for all FM related communications.

There may be other aspects, but if this makes sense to you, it would be great if we could get such a section added. Let me know if you need any help with that.
MT: Appreciate the offer. Again we need to sync up on the fundamentals first.

Cheers
    Toerless

On Mon, Oct 05, 2015 at 05:14:19PM +0000, Toy, Mehmet wrote:
> Dear All:
> I couldn't attend the Prague meeting, but luckily Dan was able to present my slides on "Self-Managed Networks with Fault Management Hierarchy".   The feedback was to position the work in the ANIMA WG scope and framework.
> 
> ANIMA charter in "M. Behringer, et. al., A Reference Model for Autonomic Networking
> draft-behringer-anima-reference-model-03" refers to "self-healing".  RFC7575,  "M. Behringer, et al.,   Autonomic Networking: Definitions and Design Goals",  refers to "self-management". However, both documents do not  articulate fault management aspect of the self-management.  It is possible to interpret the fault management aspect of autonomic networks as part of "self-healing" and therefore as part of the ANIMA charter.  In that case, the "Architectural Framework for Self-Managed Networks with Fault Management Hierarchy, draft-mtoy-anima-self-faultmang-framework-00.txt" contribution can target to fill that gap.  The control plane aspect of self-healing is addressed by "M. Behringer, et al., An Autonomic Control Plane, draft-behringer-anima-autonomic-control-plane-03".  I believe these contributions are complementary to each other. I can try to address that in the contribution.
> 
> Please let me know if you agree with me. If not, I suggest to modify the charter since without covering fault management aspect of the autonomic networks, the concept of autonomic network will be incomplete.
> 
> Regards,
> Mehmet
> 
> 

--
---
Toerless Eckert, eckert@cisco.com