Re: [Teas-ns-dt] Availability

Eric Gray <eric.gray@ericsson.com> Tue, 02 June 2020 16:24 UTC

Return-Path: <eric.gray@ericsson.com>
X-Original-To: teas-ns-dt@ietfa.amsl.com
Delivered-To: teas-ns-dt@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id F079E3A0C78 for <teas-ns-dt@ietfa.amsl.com>; Tue, 2 Jun 2020 09:24:52 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.101
X-Spam-Level:
X-Spam-Status: No, score=-2.101 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIMWL_WL_HIGH=-0.001, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, HTML_MESSAGE=0.001, RCVD_IN_MSPIKE_H2=-0.001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=unavailable autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (1024-bit key) header.d=ericsson.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id XKWpAK6MBcnM for <teas-ns-dt@ietfa.amsl.com>; Tue, 2 Jun 2020 09:24:50 -0700 (PDT)
Received: from NAM12-BN8-obe.outbound.protection.outlook.com (mail-bn8nam12on2081.outbound.protection.outlook.com [40.107.237.81]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 65E4A3A0C75 for <teas-ns-dt@ietf.org>; Tue, 2 Jun 2020 09:24:50 -0700 (PDT)
ARC-Seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=ivdJGeRVMmgTOedYAG2lobjS4uwOi9F0Av6cprZvVCMPaarz0Bu82/f9aGbYLxe/QQKV1enbvuUGNxGnGrx/VqXrvk1VRmTzdJe4/V9nT0nwNVmOn4QI3C8de/I0mxtazBUzHgZosXRKZAFltGaRo5PkErWPSRWI8nG3KicaHHKEqzkXhGKQPo4xyCYuUFUTG4IU5g5Aawx6hUCk9Lh7vIUDy0Svh9nhhR8l+RQuErB2es/y9WV9m8BZjB5ILoAz4aidpAlDxZHCYvNnkvMDBqiBIpOOoJ5NSl3K2990q8a0OimgsNLRylWMv55/zq+xZF59pi7dunH3qlxuj3bhhA==
ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=eqUdXc70BnoA5NwP0mHD4Jbb46OFFHhDjTfdODHRj6Y=; b=mvUiX8yViI6e16Pv4a5LVEF4Io6OA0CnUbzLSEYkhOoCFYP6f1wuks9mrMM3IYex+pNcH5w7Ng1j3/RKJnEuCP+sYKvfkJTOyKRse4432XwrxZHsIkXSmZxhsFjDPuj1Tl9dhDw42FK38UrklxTMQb5voHKBvtYctlT5aJU7Fu6LEeIvdxxGaYRtQY1Tl68WMhyALOubv+2ECB+auUNzWCNyu2RQ6NZpRU2XZnvNTBgRjAJ6FLFncGFIqH71RgeRM3PGPeLm3SX0cTQPRzHeIh1J0xFF2Fk5XNX6cpHEQpc52L+Pw4N4EKdhBTDA7tVJX3/R3oc2FamgX8Ha6UXKCg==
ARC-Authentication-Results: i=1; mx.microsoft.com 1; spf=pass smtp.mailfrom=ericsson.com; dmarc=pass action=none header.from=ericsson.com; dkim=pass header.d=ericsson.com; arc=none
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=ericsson.com; s=selector1; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=eqUdXc70BnoA5NwP0mHD4Jbb46OFFHhDjTfdODHRj6Y=; b=lUwrkh1FM3b9Z1Ec7PPwzWQGDU3ji4NP2GIX/HKUMOE83UsEFT1cdg3pbyLBATRFlwTHd0E5yx+5NNzPQXIq2fQFDBkPskp1jvJMhp+eQGkCCMDjwcSomqcDOASa3GbQD1c5SwS/BGZQ7aiJxXoLKtyVzX7zk8f6zdMImY9qcHM=
Received: from MN2PR15MB3103.namprd15.prod.outlook.com (2603:10b6:208:f9::10) by MN2PR15MB3022.namprd15.prod.outlook.com (2603:10b6:208:e9::21) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.3045.19; Tue, 2 Jun 2020 16:24:46 +0000
Received: from MN2PR15MB3103.namprd15.prod.outlook.com ([fe80::6dab:2470:4c23:1471]) by MN2PR15MB3103.namprd15.prod.outlook.com ([fe80::6dab:2470:4c23:1471%5]) with mapi id 15.20.3066.018; Tue, 2 Jun 2020 16:24:46 +0000
From: Eric Gray <eric.gray@ericsson.com>
To: Greg Mirsky <gregimirsky@gmail.com>
CC: Kiran Makhijani <kiranm@futurewei.com>, "teas-ns-dt@ietf.org" <teas-ns-dt@ietf.org>, Eric Gray <eric.gray=40ericsson.com@dmarc.ietf.org>, Jeff Tantsura <jefftant.ietf@gmail.com>
Thread-Topic: [Teas-ns-dt] Availability
Thread-Index: AdY4HfMdDO+ydZ5wRD6WLlyw/XAJzAAF+XTAAAeTRAAAAOHBgAAA0V0AAABIawAAAH2egAAAUCuAAABvAwAAJRAnAA==
Date: Tue, 2 Jun 2020 16:24:46 +0000
Message-ID: <MN2PR15MB310321EE807E84AEB909CC77978B0@MN2PR15MB3103.namprd15.prod.outlook.com>
References: <MN2PR15MB31030C424C7AEF28118B5704978A0@MN2PR15MB3103.namprd15.prod.outlook.com> <BYAPR13MB24377B5E3FD0DEB85599C724D98A0@BYAPR13MB2437.namprd13.prod.outlook.com> <22dcfd45-85ce-4300-a973-765b8575c4dd@Spark> <CA+RyBmXXKjGPA7Fgwp+axVfnjx-iUySjyW8JF4Au3awDOUHTrQ@mail.gmail.com> <09306ffd-5ac5-4006-a9fc-4ede36b5b4d3@Spark> <CA+RyBmVMcfKhr4dDTnb00muPuSWaAaLvkteZ+To8BXj5v0CfUA@mail.gmail.com> <0432c69e-1151-404d-893c-cd240c5531a3@Spark> <CA+RyBmVBSph4dkgNUSNLmx0x67mJZAqTM31J-B4VJ2x5xrO4gA@mail.gmail.com> <a9aee371-521c-4a5a-ae60-3d742b58e77b@Spark>
In-Reply-To: <a9aee371-521c-4a5a-ae60-3d742b58e77b@Spark>
Accept-Language: en-US
Content-Language: en-US
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
authentication-results: gmail.com; dkim=none (message not signed) header.d=none;gmail.com; dmarc=none action=none header.from=ericsson.com;
x-originating-ip: [73.248.143.71]
x-ms-publictraffictype: Email
x-ms-office365-filtering-correlation-id: 7afa23af-d75b-4cf7-41e4-08d80711772d
x-ms-traffictypediagnostic: MN2PR15MB3022:
x-microsoft-antispam-prvs: <MN2PR15MB30221994A6EF8FC5313C4FED978B0@MN2PR15MB3022.namprd15.prod.outlook.com>
x-ms-oob-tlc-oobclassifiers: OLM:10000;
x-forefront-prvs: 0422860ED4
x-ms-exchange-senderadcheck: 1
x-microsoft-antispam: BCL:0;
x-microsoft-antispam-message-info: ffCk/YHaqnNdB6Bpwx3f2NhZSRpgj3mH6qfkO5mtxRURyDW4twgGDQwzR6GkvIwNDtf9T4nTBax2OdzIUgjExxtqWUv/UT5ImmJrOmqYdMlZrkSlH23NclSdF20zfYLjS5zX+hybRDzWz3p+FIH5Jbx6+q4YtWF5RQIdogmTLPMv6xCj5lra5HVzx6MPW/N8/QHtXoKIRiAGk8WgJxn9wvD3G5nC20AJmKVy7hJPPRAXFm33x6cYgjTDS8HxiSieO980C/ojNK8FqxHHnUOde0CQUAu0LPZlFwqvliHQBx1oeU8rTqtEiNlM764114Rts+kZsIPRPTLeD9C7Xt41EWA2VmQxxinpDr0w1I0AfTzhiTtskthPNMt73/lfVeLHj0cjc2qYwWy85dFcj/yBOw==
x-forefront-antispam-report: CIP:255.255.255.255; CTRY:; LANG:en; SCL:1; SRV:; IPV:NLI; SFV:NSPM; H:MN2PR15MB3103.namprd15.prod.outlook.com; PTR:; CAT:NONE; SFTY:; SFS:(4636009)(39860400002)(346002)(376002)(396003)(366004)(136003)(7696005)(66574014)(6506007)(26005)(53546011)(2906002)(33656002)(44832011)(5660300002)(166002)(186003)(71200400001)(83380400001)(4326008)(316002)(8936002)(66476007)(54906003)(6916009)(66446008)(64756008)(66556008)(76116006)(66946007)(52536014)(478600001)(86362001)(9686003)(8676002)(55016002)(966005); DIR:OUT; SFP:1101;
x-ms-exchange-antispam-messagedata: Fuvnp+dccUkqDz6pxNqLHZ+hB9f+bPeOH2mi8EqeB4jU/n2RQ/gM6+PV0u/jjE9C3DzSUqjM5WFtwLq4p/HXe6vXhlw4vvbc1kP7bATPV1edmHEdlX5fWlJ/7kNFrhLv8neI2t5hwoMF8pgUTCf1PoaGrKSSmp3oF+wk1WZlVsRoO0+2xCJgaUyTAIlpC0o7muAK3Bq3ReXkwPRkCG74X/AWddDajyWLyQ57MfeNllqXL79UUXNAg0kpJolHsYzNyOq9KZmmKoyEbacDJ5ZTouKG+GVc22sLRJ+a1lJq2YV/5Jd65pBWW1OjyAAfEhrsE30i13u8ghxOA48ed6eSQQPfBgirkEM54desBBfrXyOZ2u5HW5gcVJgu7eAmrF+0AkfK4jtmUessP+KMn7Mazmq9OoM72r9GS1XlzBiNLqwTShXMz4+tREVBSFc02i2OhKwJdB5fBiMwndyYzQB80UJN3Mw6OwLD7XTGhASn06I=
x-ms-exchange-transport-forked: True
Content-Type: multipart/alternative; boundary="_000_MN2PR15MB310321EE807E84AEB909CC77978B0MN2PR15MB3103namp_"
MIME-Version: 1.0
X-OriginatorOrg: ericsson.com
X-MS-Exchange-CrossTenant-Network-Message-Id: 7afa23af-d75b-4cf7-41e4-08d80711772d
X-MS-Exchange-CrossTenant-originalarrivaltime: 02 Jun 2020 16:24:46.5080 (UTC)
X-MS-Exchange-CrossTenant-fromentityheader: Hosted
X-MS-Exchange-CrossTenant-id: 92e84ceb-fbfd-47ab-be52-080c6b87953f
X-MS-Exchange-CrossTenant-mailboxtype: HOSTED
X-MS-Exchange-CrossTenant-userprincipalname: M/syWWMWSxBbkfc4f28OSOZDXT0dijR/jsu1CAz+JqQZZmFY027fSesBMichaLUtNa+FK2ZY3Kk/6U4AER3rzw==
X-MS-Exchange-Transport-CrossTenantHeadersStamped: MN2PR15MB3022
Archived-At: <https://mailarchive.ietf.org/arch/msg/teas-ns-dt/3tHn1fUe__O8rmKXoia7yXrPa7s>
Subject: Re: [Teas-ns-dt] Availability
X-BeenThere: teas-ns-dt@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: TEAS Network Slicing Design Team <teas-ns-dt.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/teas-ns-dt>, <mailto:teas-ns-dt-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/teas-ns-dt/>
List-Post: <mailto:teas-ns-dt@ietf.org>
List-Help: <mailto:teas-ns-dt-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/teas-ns-dt>, <mailto:teas-ns-dt-request@ietf.org?subject=subscribe>
X-List-Received-Date: Tue, 02 Jun 2020 16:24:53 -0000

Greg,

              As mentioned in a separate thread, it is possible to look at availability as a parameter to be associated with each individual SLO.  I would need to see evidence that it is worth the extra complexity, however, before I would agree that we should do it that way.

              For example, it is conceivable that a service might be defined where the availability is different for packet loss and delay variation.

              There are simpler ways to express this sort of unequal weighting of SLO parameters, however, and my impression (from talking to provider representatives in relevant standards work over multiple decades) is that defining a single availability is the usual case.

              For example, packet loss and availability are both expressed as a percentage, then you can choose a value for packet loss percentage that – in combination with a given availability percentage – results in any desirable SLO loss percentage.

              In other words – if you’re looking to define an SLA that might allow for out-of-bounds delay variation 5 minutes a year, and want to define a packet loss SLO with a consistent availability, you have only to choose an appropriate packet loss percentage (where losing less than the acceptable percentage 99.999 % of a year and then 100% of all packets for 5 minutes is about equal to that percentage).

              As you should see from this example, if the availability is a very high percentage, this sort of math produces a difference that is way down in the statistical “noise” – hence very few people bother dealing with this level of complexity and simply set a single availability for the service.

              Note that another key reason for having a single availability number is that – in the event of a severe service outage (loss, impairment or degradation) – many (if not all) SLO parameters are likely to be out of bounds for approximately the same amount of time.

              Since the case Jeff refers to (where all SLO values are essentially ANDed) is the common case, it is simplest to treat failing to meet any SLO as service unavailability.

              Lastly, in any case where the provider detects the service failure, the way that they will likely have implemented violating their SLA with a subscriber will typically involve switching the service to an alternative – which will likely impact the service’s ability to meet other SLOs.

--
Eric

From: Teas-ns-dt <teas-ns-dt-bounces@ietf.org> On Behalf Of Jeff Tantsura
Sent: Monday, June 1, 2020 6:07 PM
To: Greg Mirsky <gregimirsky@gmail.com>
Cc: Kiran Makhijani <kiranm@futurewei.com>om>; teas-ns-dt@ietf.org; Eric Gray <eric.gray=40ericsson.com@dmarc.ietf.org>
Subject: Re: [Teas-ns-dt] Availability

Hi Greg,

A service (SLA) could have 1 or more SLO’s associated with it.
A SLO is met (TRUE), when its objective is met (within boundaries specified).
Usually a SLA is composed of a set of SLO’s with logical AND, e.g if any of SLO’s is FALSE -> SLA (or else)

Example:
If  SLO (availability) is met but SLO (packet_loss) isn’t, availability becomes an irrelevant objective.

Cheers,
Jeff
On Jun 1, 2020, 2:55 PM -0700, Greg Mirsky <gregimirsky@gmail.com<mailto:gregimirsky@gmail.com>>, wrote:

Hi Jeff,
thank you for the clarification. Does measuring uptime considers whether all metrics included in SLO are within their respective acceptable limits? In other words, if the quality of the TS degraded, due to, for example, excessive packet loss, below the requested threshold, would that time period be attributed to the Service uptime period? In my experience, uptime of a node (router, server) is easy to express. Uptime of a service? Much appreciate it if you help with an example or a reference to the definition.

Regards,
Greg

On Mon, Jun 1, 2020 at 2:46 PM Jeff Tantsura <jefftant.ietf@gmail.com<mailto:jefftant.ietf@gmail.com>> wrote:
Hi Greg,

SLO - is an objective (as the name suggests), not a metric. A metric without a context is meaningless.
SLO makes use of the metrics gathered to derive whether the objective has been met.

Example:
SLO (availability) = uptime 90% over 10 hours
total_time=10h
uptime=8h

using the metrics above we can conclude that the total_availability = 80%, which is less than the service objective set (90%) ->  SLA(or else)

Hope this clarifies

Cheers,
Jeff
On Jun 1, 2020, 2:32 PM -0700, Greg Mirsky <gregimirsky@gmail.com<mailto:gregimirsky@gmail.com>>, wrote:

Hi Jeff,
in my reading of the definition, it is the intersection of metrics already listed in the SLO. If that is the case, how useful is another metric that is only a reflection of other metrics?

Regards,
Greg

On Mon, Jun 1, 2020 at 2:24 PM Jeff Tantsura <jefftant.ietf@gmail.com<mailto:jefftant.ietf@gmail.com>> wrote:
Greg,

I thought the definition provided was pretty clear and comprehendible, why do we need to rephrase it?

Cheers,
Jeff
On Jun 1, 2020, 2:00 PM -0700, Greg Mirsky <gregimirsky@gmail.com<mailto:gregimirsky@gmail.com>>, wrote:

Hi Jeff,
if we define availability as the ratio of the period all requested in the SLO metrics are within an acceptable range to the time since the service was handed to the customer (I propose to refer to this metric as "availability ratio"), then I think it can be expressed as
[\bigcap _{i=1}^{n}A_{i}], where Ai is the time period the particular metric remained within its acceptable boundary.

Regards,
Greg

On Mon, Jun 1, 2020 at 1:35 PM Jeff Tantsura <jefftant.ietf@gmail.com<mailto:jefftant.ietf@gmail.com>> wrote:
Mostly agree with Eric/Kiran

It should not be removed, but further clarified.
Network/service availability is a measurable metric, availability = uptime/total_time(uptime+downtime)
Rule of thumb - a service is deemed available when all the SLO’s associated with it are met(TRUE).
In a complex/multidimensional service, different objects might have different availability metrics .
For simplicity sake - total_availability(normalized metric) = Σ(subservice-1..subservice-n), so both, per SLO as well as composite metrics can be used.

Cheers,
Jeff
On Jun 1, 2020, 10:08 AM -0700, Kiran Makhijani <kiranm@futurewei.com<mailto:kiranm@futurewei.com>>, wrote:

Thanks! I support not removing it.
Sticking with individual SLO seems to be a right decision but can be deferred to NBI document. we need not state that here.
-Kiran

From: Teas-ns-dt <teas-ns-dt-bounces@ietf.org<mailto:teas-ns-dt-bounces@ietf.org>> On Behalf Of Eric Gray
Sent: Monday, June 1, 2020 7:35 AM
To: teas-ns-dt@ietf.org<mailto:teas-ns-dt@ietf.org>
Subject: [Teas-ns-dt] Availability

I agree that the definition needs to be cleaned up, but I disagree that it should be omitted.

A part of what probably should be cleaned up is the part that talks about service degradation.  In general, this is an important factor in determining availability, but it is a bit vague for the purpose of definition.

I also disagree that availability is not measurable.

As a proof of concept for measuring , if there are any mandatory measurable objectives, then failing to meet any of those objectives makes the service measurably unavailable.  That is, if you can determine if specific mandatory objectives are being met, then you can determine if they are not being met and therefore determine if the service is unavailable.

Availability is an important aspect of any service, because it is understood that the higher the required availability, the more difficult (and thus expensive) it is to provide that service.

Defining availability as a fraction as we have done in the draft, allows for services that may experience a certain amount of outages over a service period.  A service request may ask for as high an availability as the provider and requester have agreed to (under the terms they agreed to) in advance.

Note that this elevates the importance of having (at least mostly) measurable objectives, simply because you cannot determine if a non-measurable objective is being met – hence you cannot (necessarily) determine the availability of any service that depends on that objective.

It is further interesting to note that the notion of a service depending on objectives that it cannot determine are not being met is a non-sequitur.

Measuring availability in terms of mandatory objectives – as a whole – is the simplest approach; one could group one or more mandatory objectives and define an availability separately for the group – thus allowing for a higher degree of acceptance for failing to meet one set of service objectives compared to others.

If we were going to do that, it would probably be better to define availability as a parameter that applies individually to service objectives.

In my opinion we should at least initially stick to the simple case, where availability is defined as a service objective, rather than as a parameter of every service objective – but I am willing to go either way.

--
Eric
--
Teas-ns-dt mailing list
Teas-ns-dt@ietf.org<mailto:Teas-ns-dt@ietf.org>
https://www.ietf.org/mailman/listinfo/teas-ns-dt
--
Teas-ns-dt mailing list
Teas-ns-dt@ietf.org<mailto:Teas-ns-dt@ietf.org>
https://www.ietf.org/mailman/listinfo/teas-ns-dt