Re: [Anima] Self-Managed Networks

"Toerless Eckert (eckert)" <eckert@cisco.com> Sun, 18 October 2015 01:03 UTC

Return-Path: <eckert@cisco.com>
X-Original-To: anima@ietfa.amsl.com
Delivered-To: anima@ietfa.amsl.com
Received: from localhost (ietfa.amsl.com [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 678491A0364 for <anima@ietfa.amsl.com>; Sat, 17 Oct 2015 18:03:10 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -14.51
X-Spam-Level:
X-Spam-Status: No, score=-14.51 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, HTML_MESSAGE=0.001, RCVD_IN_DNSWL_HI=-5, SPF_PASS=-0.001, T_RP_MATCHES_RCVD=-0.01, USER_IN_DEF_DKIM_WL=-7.5] autolearn=ham
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 8jp8YskOfjXv for <anima@ietfa.amsl.com>; Sat, 17 Oct 2015 18:03:06 -0700 (PDT)
Received: from rcdn-iport-8.cisco.com (rcdn-iport-8.cisco.com [173.37.86.79]) (using TLSv1 with cipher RC4-SHA (128/128 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 772E91A01D6 for <anima@ietf.org>; Sat, 17 Oct 2015 18:03:06 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=cisco.com; i=@cisco.com; l=20944; q=dns/txt; s=iport; t=1445130186; x=1446339786; h=from:to:cc:subject:date:message-id:references: in-reply-to:reply-to:mime-version; bh=h+9P2dIEV4i6ebXkiy+IFkCYCEg9ZQ7wPw7ROef3AKo=; b=DzzKff3w1cdt76qB4MF+UUb16Mi/Yjv+zRhGc6KMUmydHu6eHcJ2dCLD C1p11owWLZe78mEf2roUWOi8LhXMp7BosKo0KA+j4Xo9O6bjIcMViCd5s I+mpyhrGJkTKncI0wiGAGDDQs8LfgEDLk273T0myYjVFxIcnqV44Ba+U+ E=;
X-IronPort-Anti-Spam-Filtered: true
X-IronPort-Anti-Spam-Result: A0AdAgCn7iJW/4cNJK1egmlNVG+9dQENgVoXAQmCMxCDOgKBHDgUAQEBAQEBAYEKhC0BAQEDAQEBAWsLBQcEAgEIDgMEAQEBCR4HDxIGCw4GCQgCBA4FiBsDCggNtm6HFQ2EfAEBAQEBAQEBAQEBAQEBAQEBAQEBARQEim+BBoFPgQGBWhEBJicEBwaEKAWGBQmQFQGLJ4F1gViEPI07f4Nbg24BHwEBQoIRHYFVcgGEHwcXB4EiAQEB
X-IronPort-AV: E=Sophos; i="5.17,695,1437436800"; d="scan'208,217"; a="36708586"
Received: from alln-core-2.cisco.com ([173.36.13.135]) by rcdn-iport-8.cisco.com with ESMTP; 18 Oct 2015 01:03:04 +0000
Received: from XCH-ALN-009.cisco.com (xch-aln-009.cisco.com [173.36.7.19]) by alln-core-2.cisco.com (8.14.5/8.14.5) with ESMTP id t9I134KU006827 (version=TLSv1/SSLv3 cipher=AES256-SHA bits=256 verify=FAIL); Sun, 18 Oct 2015 01:03:04 GMT
Received: from xch-rcd-003.cisco.com (173.37.102.13) by XCH-ALN-009.cisco.com (173.36.7.19) with Microsoft SMTP Server (TLS) id 15.0.1104.5; Sat, 17 Oct 2015 20:02:48 -0500
Received: from xch-rcd-003.cisco.com ([173.37.102.13]) by XCH-RCD-003.cisco.com ([173.37.102.13]) with mapi id 15.00.1104.000; Sat, 17 Oct 2015 20:02:48 -0500
From: "Toerless Eckert (eckert)" <eckert@cisco.com>
To: John Strassner <strazpdj@gmail.com>, "Toy, Mehmet" <Mehmet_Toy@cable.comcast.com>
Thread-Topic: [Anima] Self-Managed Networks
Thread-Index: AQHRBtZHwwWTW9ATTCGfmWDsg6vmTp5sMeIAgAQvzoCAABIM3A==
Date: Sun, 18 Oct 2015 01:02:48 +0000
Message-ID: <db3pj962kuhreh9qlbsvy2kh.1445129845742@email.android.com>
References: <9accdec6dd894df6afed38215b28b494@VAADCEX36.cable.comcast.com> <20151014231557.GA13294@cisco.com> <5fe9c36320aa4921b12eb0279463d2cb@VAADCEX37.cable.comcast.com>, <CAJwYUrGKdk1ywkGcUV7JO+ArTCGYMm=R3voNoOfGe5+0pmxPgg@mail.gmail.com>
In-Reply-To: <CAJwYUrGKdk1ywkGcUV7JO+ArTCGYMm=R3voNoOfGe5+0pmxPgg@mail.gmail.com>
Accept-Language: en-US
Content-Language: en-US
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
x-ms-exchange-transport-fromentityheader: Hosted
Content-Type: multipart/alternative; boundary="_000_db3pj962kuhreh9qlbsvy2kh1445129845742emailandroidcom_"
MIME-Version: 1.0
Archived-At: <http://mailarchive.ietf.org/arch/msg/anima/9Kh7DsyFEgGTv5xcnpSezOtWHdc>
Cc: "Romascanu, Dan (Dan) (dromasca@avaya.com)" <dromasca@avaya.com>, "Michael Behringer (mbehring)" <mbehring@cisco.com>, "anima@ietf.org" <anima@ietf.org>, "anima-chairs@tools.ietf.org" <anima-chairs@tools.ietf.org>
Subject: Re: [Anima] Self-Managed Networks
X-BeenThere: anima@ietf.org
X-Mailman-Version: 2.1.15
Precedence: list
Reply-To: "Toerless Eckert (eckert)" <eckert@cisco.com>
List-Id: Autonomic Networking Integrated Model and Approach <anima.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/anima>, <mailto:anima-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/anima/>
List-Post: <mailto:anima@ietf.org>
List-Help: <mailto:anima-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/anima>, <mailto:anima-request@ietf.org?subject=subscribe>
X-List-Received-Date: Sun, 18 Oct 2015 01:03:10 -0000

Of course, being easily in@tallable and uninstallable does not make the majorities of android/ios apps useful, but if ios/android only had preinstalled software and you had to upgrade the os to get other apps - those os would not exist.

I dont understand how router vendor os survive without that concept, and if at all possible id like to make  sure we do worry about aspects like these to nmake the autonomic vision operationally successful.




Sent from my Samsung Captivate Glide on AT&T

John Strassner <strazpdj@gmail.com> wrote:
Hi all,

Toerless wrote:

> 1) easily, ideally autonomous downloadable agent/application infra.
<jcs>
Frankly, I've never thought of that. However, I'm not sure that this is a
complete answer. Without specifications that define how the agents
and app infrastructure are built, how they communicate, etc., I don't
see how this works. So shouldn't the ANIMA WG build those specs?

Just because something can be downloaded and deployed doesn't
mean it is useful. Remember the disaster that was Active Networks,
which set AI in networking back two decades.
</jcs>

Mehmet wrote:

MT: What I propose is unlikely to be done completely just by autonomic agents or software, but can be partially done by autonomic agents.  This is the key reason for these email exchanges.
<jcs>
You lost me. If it isn't software, then it must be hardware. I like ASICs
and FPGAs, but they can't do everything, especially in an adaptive
environment. Please elaborate more.
</jcs>

 ...

Mehmet wrote:
MT: In general I am not in favor of defining another control layer for this purposes. Therefore, my proposed design  does not need and does not have a protocol for communications.
<jcs>
Then how do different autonomic elements find each other and form
themselves into a group that can do something useful?
</jcs>
...

regards,
John

On Wed, Oct 14, 2015 at 8:02 PM, Toy, Mehmet <Mehmet_Toy@cable.comcast.com<mailto:Mehmet_Toy@cable.comcast.com>> wrote:
Toerless,
Please see below.
Thanks
Mehmet

-----Original Message-----
From: Toerless Eckert [mailto:eckert@cisco.com<mailto:eckert@cisco.com>]
Sent: Wednesday, October 14, 2015 7:16 PM
To: Toy, Mehmet
Cc: anima-chairs@tools.ietf.org<mailto:anima-chairs@tools.ietf.org>; brian.e.carpenter@gmail.com<mailto:brian.e.carpenter@gmail.com>; mbehring@cisco.com<mailto:mbehring@cisco.com>; Romascanu, Dan (Dan) (dromasca@avaya.com<mailto:dromasca@avaya.com>)
Subject: Re: Self-Managed Networks

Toy:

Unless you feel its more appropriate to keep this discussion in this closed mailing list, i would suggest to have it on the anima mailing list.

First off: You've compressed really a lot of crucial arch design from your detailed first slidedeck into this a lot more pithy, yet comprehensive first draft. I think my main concern is how we can work on the SW engineering aspects of how to build it and build upon the anima infrastructure.

I think i did ask you some IETFs ago when you presented your slides, that it would be good if we could focus on that aspect.

Eg: Could we add a section to the document outlining a summary of how this framework would leverage and benefit from the the currently planned anima infrastructure - why/ how to use it - so we know the requirements we have, especially if those are any we would need to consider during recharter.
MT: As you can see from my previous email, I am not on board with current architectural definitions.  Once we are sync on the definitions of architectural components, I should make an attempt to address your concerns.

If you'd ask me, primarily as a contributor considering what would best help to make this happen - and move ANIMA forward, i think the key aspects are:

1) easily, ideally autonomous downloadable agent/application infra.

Eg: I don't see any way that either vendors or operators could realistically build this framework if these components are tied into the classical router OS software delivery model where it takes months to validate a sofftware update and more months to deploy it. If this can be downloaded, as separate apps then its so much easier to incrementally build/deploy it on a totally different deployment process.

How exactly should this be done so operators like comcast will most likely start incrementally deploying this ?

Eg: I am sure autonomic agents will be the next re-charter item, the functionality you describe for fault-management via such agents is great, and i can see how that is just good work to refine, so i am mostly worred about the sW-engineering and operational aspects of such autonomic agents.

MT: What I propose is unlikely to be done completely just by autonomic agents or software, but can be partially done by autonomic agents.  This is the key reason for these email exchanges.

2) Clear APIs to existing router operations.

We would need to have an idea how these new components would talk to the rest of the router. This seems like an eay "oh, we can have those agents do 30 year old router CLI locally, but we'd prefer Netconf/Yang with IETF standardized models, and we do need the following yang models for the first important FP operations". There is also talk about expanding into Thrift for higher performance data collection (beyond netconf, still with yang).

MT: Agree, but we are far from these steps.  We have to agree on the fundamentals.
3) connectivity between the agents.

This is primarily where at the lowest layer the ACP would come in and it would be great to see argument if/how security, zero-touch build-up and indestructibility are required and/or beneficial for these agents functions.  You already mention the security aspect in your security section. Great.

Next layer is leveraging GRASP as a protocol for agent-to-agent communication. Is it feasible/beneficial to expect a common new message signaling layer with GRAP ? If so, it would be great to describee why and how. Eg: what communication patterns would your agents have, do the patterns GRASP support suit your agents ?
MT: In general I am not in favor of defining another control layer for this purposes. Therefore, my proposed design  does not need and does not have a protocol for communications.

Your section 7 already has one specific layer of message communicartions, but it is directly tied to ethernet frame. I'd like to understand this better. I for once would think one would define for the message eleemnts a Yang/CBOR data model "failure management yang model". Then we look how to carry it. agent-to-agent, i would propose to actually run over GRASP (using CBOR representation of course), but to humans behind an NMS its more likely via netconf/yang from an agent into the NMS.
MT: Ethernet frame is just an example. We can collectively decide to have something else. The key point here is to have one message format for all FM related communications.

There may be other aspects, but if this makes sense to you, it would be great if we could get such a section added. Let me know if you need any help with that.
MT: Appreciate the offer. Again we need to sync up on the fundamentals first.

Cheers
    Toerless

On Mon, Oct 05, 2015 at 05:14:19PM +0000, Toy, Mehmet wrote:
> Dear All:
> I couldn't attend the Prague meeting, but luckily Dan was able to present my slides on "Self-Managed Networks with Fault Management Hierarchy".   The feedback was to position the work in the ANIMA WG scope and framework.
>
> ANIMA charter in "M. Behringer, et. al., A Reference Model for Autonomic Networking
> draft-behringer-anima-reference-model-03" refers to "self-healing".  RFC7575,  "M. Behringer, et al.,   Autonomic Networking: Definitions and Design Goals",  refers to "self-management". However, both documents do not  articulate fault management aspect of the self-management.  It is possible to interpret the fault management aspect of autonomic networks as part of "self-healing" and therefore as part of the ANIMA charter.  In that case, the "Architectural Framework for Self-Managed Networks with Fault Management Hierarchy, draft-mtoy-anima-self-faultmang-framework-00.txt" contribution can target to fill that gap.  The control plane aspect of self-healing is addressed by "M. Behringer, et al., An Autonomic Control Plane, draft-behringer-anima-autonomic-control-plane-03".  I believe these contributions are complementary to each other. I can try to address that in the contribution.
>
> Please let me know if you agree with me. If not, I suggest to modify the charter since without covering fault management aspect of the autonomic networks, the concept of autonomic network will be incomplete.
>
> Regards,
> Mehmet
>
>

--
---
Toerless Eckert, eckert@cisco.com<mailto:eckert@cisco.com>

_______________________________________________
Anima mailing list
Anima@ietf.org<mailto:Anima@ietf.org>
https://www.ietf.org/mailman/listinfo/anima



--
regards,
John