Re: [lmap] draft-ietf-lmap-information-model-05: Controller timeout

"Carey, Timothy (Timothy)" <timothy.carey@alcatel-lucent.com> Wed, 13 May 2015 12:37 UTC

Return-Path: <timothy.carey@alcatel-lucent.com>
X-Original-To: lmap@ietfa.amsl.com
Delivered-To: lmap@ietfa.amsl.com
Received: from localhost (ietfa.amsl.com [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 59A9E1A8984 for <lmap@ietfa.amsl.com>; Wed, 13 May 2015 05:37:10 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -6.91
X-Spam-Level:
X-Spam-Status: No, score=-6.91 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, RCVD_IN_DNSWL_HI=-5, T_RP_MATCHES_RCVD=-0.01] autolearn=ham
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id q2z_S2YRw5cg for <lmap@ietfa.amsl.com>; Wed, 13 May 2015 05:37:08 -0700 (PDT)
Received: from smtp-fr.alcatel-lucent.com (fr-hpgre-esg-01.alcatel-lucent.com [135.245.210.22]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 126C61A8983 for <lmap@ietf.org>; Wed, 13 May 2015 05:37:07 -0700 (PDT)
Received: from us70tusmtp1.zam.alcatel-lucent.com (unknown [135.5.2.63]) by Websense Email Security Gateway with ESMTPS id E41D8EF34DDC1; Wed, 13 May 2015 12:37:02 +0000 (GMT)
Received: from US70TWXCHHUB04.zam.alcatel-lucent.com (us70twxchhub04.zam.alcatel-lucent.com [135.5.2.36]) by us70tusmtp1.zam.alcatel-lucent.com (GMO) with ESMTP id t4DCb3To027306 (version=TLSv1/SSLv3 cipher=AES128-SHA bits=128 verify=FAIL); Wed, 13 May 2015 08:37:03 -0400
Received: from US70UWXCHMBA05.zam.alcatel-lucent.com ([169.254.10.167]) by US70TWXCHHUB04.zam.alcatel-lucent.com ([135.5.2.36]) with mapi id 14.03.0195.001; Wed, 13 May 2015 08:37:03 -0400
From: "Carey, Timothy (Timothy)" <timothy.carey@alcatel-lucent.com>
To: Juergen Schoenwaelder <j.schoenwaelder@jacobs-university.de>
Thread-Topic: [lmap] draft-ietf-lmap-information-model-05: Controller timeout
Thread-Index: AdCMHecUFUgxoWRISVOQIyTHkXEC2AAnxEkAAATNJ5AAAJpsgAAH9ouw///bqQD//tQ9MA==
Date: Wed, 13 May 2015 12:37:01 +0000
Message-ID: <9966516C6EB5FC4381E05BF80AA55F77BA2626E6@US70UWXCHMBA05.zam.alcatel-lucent.com>
References: <9966516C6EB5FC4381E05BF80AA55F77BA2612C2@US70UWXCHMBA05.zam.alcatel-lucent.com> <20150512100724.GC26662@elstar.local> <9966516C6EB5FC4381E05BF80AA55F77BA2616F5@US70UWXCHMBA05.zam.alcatel-lucent.com> <20150512124209.GC41964@elstar.local> <9966516C6EB5FC4381E05BF80AA55F77BA261858@US70UWXCHMBA05.zam.alcatel-lucent.com> <20150512142006.GB57299@elstar.local>
In-Reply-To: <20150512142006.GB57299@elstar.local>
Accept-Language: en-US
Content-Language: en-US
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
x-originating-ip: [135.5.27.17]
Content-Type: text/plain; charset="us-ascii"
Content-Transfer-Encoding: quoted-printable
MIME-Version: 1.0
Archived-At: <http://mailarchive.ietf.org/arch/msg/lmap/t0ogFqJTg3qi9HGypw0asIGiOlo>
Cc: "lmap@ietf.org" <lmap@ietf.org>
Subject: Re: [lmap] draft-ietf-lmap-information-model-05: Controller timeout
X-BeenThere: lmap@ietf.org
X-Mailman-Version: 2.1.15
Precedence: list
List-Id: Large Scale Measurement of Access network Performance <lmap.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/lmap>, <mailto:lmap-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/lmap/>
List-Post: <mailto:lmap@ietf.org>
List-Help: <mailto:lmap-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/lmap>, <mailto:lmap-request@ietf.org?subject=subscribe>
X-List-Received-Date: Wed, 13 May 2015 12:37:10 -0000

Juergen,

If you do not want to disable the scheduled tasks - I would actually say the Parent tasks not the schedules are disabled. The reason is because we are talking about wanted to disable Tasks that are not related to the communication between the MA and Controller or proper functioning of the MA.  The MA knows this as part of their TaskCapability reporting (which somehow got deleted in draft 05).

As to the reporting of the status I would do the following:
Add an OperationalStatus attribute with the following values to the action object (which was scheduled task): enabled, disabled-schedule, disabled-task, disabled-other.



I would also add an OperationalStatus to:
1) Schedule object with the following values: enabled, disabled-suppressed, disabled-other.
2) Task object (I guess it could go in Task Status) with the following values: enabled, disabled-suppressed, disabled-ma, disabled-other.
3) Measurement Agent object with the following values: enabled, degraded-controller-communication, degraded-other, disabled.

We would always include a disabled-other to allow other conditions. For example in TR-069 we have the capability to administratively enable/disable the multi-instance objects by conventions - so if for some reason these were administratively disabled - we would set the OperationalStatus to disabled-other or just disabled for the MA.

These status' would provide a clear operational status of the elements that can be disabled or degraded based on what is currently documented in the framework and information model. I would suspect as we implement protocols there might be other values we want to support - e.g., problems with the collector-MA communication...

BR,
Tim
-----Original Message-----
From: Juergen Schoenwaelder [mailto:j.schoenwaelder@jacobs-university.de] 
Sent: Tuesday, May 12, 2015 9:20 AM
To: Carey, Timothy (Timothy)
Cc: lmap@ietf.org
Subject: Re: [lmap] draft-ietf-lmap-information-model-05: Controller timeout

On Tue, May 12, 2015 at 01:04:49PM +0000, Carey, Timothy (Timothy) wrote:

> So does that now mean we need to have a operational status now for a 
> scheduled task? I can see it now as attribute with values like:
> enabled, disabled, suppressed and errored or possibly a conditions 
> object?

I think there two questions here:

a) If I loose connectivity, do I disable all schedules (and thus
   implicitely all actions) or do I disable scheduled tasks? My idea
   was to simply disable all schedules (and as such the schedules have
   their state changed to disabled).

   The suppression mechanism can suppress both a list of tasks and a
   list of schedules and it is configurable what is being suppressed.

   One option is that loss of connectivity simply causes suppression
   to be activated. Another option is that loss of connectivity
   supresses all tasks that are not essential for communication with a
   controller and house keeping. Another option is that loss of
   connectivity suppresses all schedules. We need to select one.

b) Do we expose the internal state that is maintained? Right now, we
   do not expose any operational state for schedules. But we do expose
   operational state for tasks. If loss of connectivity means we
   disable all tasks that are not essential for communication with a
   controller and house keeping, than exposing that information would
   be a rather minor change to the information and data model.

   A task state would then be enabled (default), suppressed or
   disabled. Error information is already covered in the last-failed
   attributes of a task. (And this is useful in situations where tasks
   fail occasionally so you can get information about the last failure
   even though the task is working fine right now).

/js

-- 
Juergen Schoenwaelder           Jacobs University Bremen gGmbH
Phone: +49 421 200 3587         Campus Ring 1 | 28759 Bremen | Germany
Fax:   +49 421 200 3103         <http://www.jacobs-university.de/>