Re: [Teas-ns-dt] Availability

Greg Mirsky <gregimirsky@gmail.com> Tue, 02 June 2020 15:35 UTC

Return-Path: <gregimirsky@gmail.com>
X-Original-To: teas-ns-dt@ietfa.amsl.com
Delivered-To: teas-ns-dt@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 343703A0B51 for <teas-ns-dt@ietfa.amsl.com>; Tue, 2 Jun 2020 08:35:17 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.097
X-Spam-Level:
X-Spam-Status: No, score=-2.097 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, FREEMAIL_FROM=0.001, HTML_MESSAGE=0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id dsPoAK-wVm7u for <teas-ns-dt@ietfa.amsl.com>; Tue, 2 Jun 2020 08:35:14 -0700 (PDT)
Received: from mail-lj1-x22b.google.com (mail-lj1-x22b.google.com [IPv6:2a00:1450:4864:20::22b]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 85A423A0C66 for <teas-ns-dt@ietf.org>; Tue, 2 Jun 2020 08:35:04 -0700 (PDT)
Received: by mail-lj1-x22b.google.com with SMTP id c11so13158793ljn.2 for <teas-ns-dt@ietf.org>; Tue, 02 Jun 2020 08:35:04 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=SZ21+z0+NZ0bf1BjyoiXMHaa9EhctHBX15/OdebKTuE=; b=phXQeJUKlzywTOsdSa60YzaToxCRdmDdgsgetVRSjRWDBlc/uokdL5x258c794TgdB pfAZxsrgDM8SeSF18Myz94wKY15E3Iq9HEgWXrYW9LOIe6OZnBM5EAvuuqmdoYY93QNY hIx/wPRX+GKWorek7COaJinLHd63tXbBtPO9MGr7NcvkDFbPRnW2C1B9Zb1s9q8l/9/d WbG56GYoSrVgumF7KFTLUjHEfWcIpKP2zBJH+KdHTqlR1bKEcAal3FQIAYWUthLuMGnU 67ig98InRVHllpui+bURAfYbU/qBovDPrhzqTvje42E0iNulb6HrS97ZpaXfU7/xecaY mP/A==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=SZ21+z0+NZ0bf1BjyoiXMHaa9EhctHBX15/OdebKTuE=; b=i231Z2xKqQol0ZSAxxOrkhvQRoc7jUB0/49JkT5BMc7PC+O7Fy9RoQ36nE2yRBy5KY IrbrZ5QTRx05TpMDqvYZC2Cg8F7F3O0tbNhIqvoh9MNVnlvscoOMtxgGCWvJyiKBy020 PDbCNrePnAQn4OnoR5vokvTgW0/OxQmv+0PCO7/udp56B/jQIHauNMPlduXEoEbunITM MTChhz0sfAVvc2xxilo+7N0NaCGJgFpHL88RldfSD+7Q95l4+Rrx9MS3kjePmz82zWzn 9ZA/ZgFEPxdOWlz+Jz8qNOXwTdrQr713z3NxFZXRAWT/2C/zcrfLOi5A05e1JGfzga79 XQzw==
X-Gm-Message-State: AOAM533dt16i3+OEHp7XN7BsSYHVl8gnLHgeqPLSG+LMZbdAdQ5MbmFb JgMJKlM9byWTScOYxBaHAe5gsP+wPK9oIgfoOY6a6C3W/oM=
X-Google-Smtp-Source: ABdhPJzPfTJE4a+eVYIwERbjY2CL2aIwP203qEt6gohsO+A4pbrfrR6hyo6eq/ZhT7vZjmQ6kg3lO7WBLpgQZZj0w+Q=
X-Received: by 2002:a05:651c:1103:: with SMTP id d3mr3886217ljo.110.1591112102550; Tue, 02 Jun 2020 08:35:02 -0700 (PDT)
MIME-Version: 1.0
References: <MN2PR15MB31030C424C7AEF28118B5704978A0@MN2PR15MB3103.namprd15.prod.outlook.com> <BYAPR13MB24377B5E3FD0DEB85599C724D98A0@BYAPR13MB2437.namprd13.prod.outlook.com> <22dcfd45-85ce-4300-a973-765b8575c4dd@Spark> <CA+RyBmXXKjGPA7Fgwp+axVfnjx-iUySjyW8JF4Au3awDOUHTrQ@mail.gmail.com> <09306ffd-5ac5-4006-a9fc-4ede36b5b4d3@Spark> <CA+RyBmVMcfKhr4dDTnb00muPuSWaAaLvkteZ+To8BXj5v0CfUA@mail.gmail.com> <0432c69e-1151-404d-893c-cd240c5531a3@Spark> <CA+RyBmVBSph4dkgNUSNLmx0x67mJZAqTM31J-B4VJ2x5xrO4gA@mail.gmail.com> <a9aee371-521c-4a5a-ae60-3d742b58e77b@Spark> <CA+RyBmUOFJ+Wti5mMRjBAAgA3-Arbjxo4dGAN+vMBv1yztnBxg@mail.gmail.com> <2eda7ee1-2284-480c-a697-83fe761e9204@Spark> <CA+RyBmXQddq6zoRz0iYQxjkwwsu3KgYpqEjT=aKmhnjsp92Zvw@mail.gmail.com> <beea33aa-3931-49c2-8a0d-9a80787c650b@Spark>
In-Reply-To: <beea33aa-3931-49c2-8a0d-9a80787c650b@Spark>
From: Greg Mirsky <gregimirsky@gmail.com>
Date: Tue, 2 Jun 2020 08:34:51 -0700
Message-ID: <CA+RyBmWqfLbQb4Zmy=PyFH8KBCxtGeemNZYbQTVkk1CCM9Z9ng@mail.gmail.com>
To: Jeff Tantsura <jefftant.ietf@gmail.com>
Cc: Eric Gray <eric.gray=40ericsson.com@dmarc.ietf.org>, "teas-ns-dt@ietf.org" <teas-ns-dt@ietf.org>, Kiran Makhijani <kiranm@futurewei.com>
Content-Type: multipart/alternative; boundary="000000000000ab9ea905a71ba7cf"
Archived-At: <https://mailarchive.ietf.org/arch/msg/teas-ns-dt/OuzckuuUNYwhKL-MPxIdSrS4TdA>
Subject: Re: [Teas-ns-dt] Availability
X-BeenThere: teas-ns-dt@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: TEAS Network Slicing Design Team <teas-ns-dt.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/teas-ns-dt>, <mailto:teas-ns-dt-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/teas-ns-dt/>
List-Post: <mailto:teas-ns-dt@ietf.org>
List-Help: <mailto:teas-ns-dt-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/teas-ns-dt>, <mailto:teas-ns-dt-request@ietf.org?subject=subscribe>
X-List-Received-Date: Tue, 02 Jun 2020 15:35:17 -0000

Hi Jeff,
perhaps you can share how you interpret the definition of the availability
in the current version and, particularly, what is "performance":
      Availability: It a measure of how often a customer defined service
      is lost or degraded to the point of unacceptable performance due
      to any fault in the network.  It is a ratio of time the transport
      slice meets agreed SLO over the total time where the transport
      slice is contracted.
Also, what is the measurement unit of the SLO that is used as the threshold
that separates "meets SLO" from "doesn't meet SLO"?

Regards,
Greg

On Mon, Jun 1, 2020 at 11:59 PM Jeff Tantsura <jefftant.ietf@gmail.com>
wrote:

> Hi Greg,
>
> No, I disagree, what you are describing is a performance SLA.
> Mixing non comparable objectives within a single objective (that is really
> binary, either TRUE or FALSE) is a bad idea.
>
> Cheers,
> Jeff
> On Jun 1, 2020, 6:14 PM -0700, Greg Mirsky <gregimirsky@gmail.com>om>, wrote:
>
> Hi Jeff,
> that is an interesting quote, thank you. I agree with the relationship
> between SLO and SLA. What I'm not finding - the definition of availability
> (and quality but that is outside the scope of our discussion). Looking once
> again at the current definition:
>       Availability: It a measure of how often a customer defined service
>       is lost or degraded to the point of unacceptable performance due
>       to any fault in the network.  It is a ratio of time the transport
>       slice meets agreed SLO over the total time where the transport
>       slice is contracted.
> The definition refers to the performance. I understand performance as a
> combination of bandwidth, packet loss, and packet latency/jitter. Would you
> agree?
> More, "agreed SLO", in my reading, is "performance", which is a
> compounded. Thus, in my interpretation of the definition, availability is a
> compounded objective. Perhaps we can change the definition.
>
> Regards,
> Greg
>
>
> On Mon, Jun 1, 2020 at 4:16 PM Jeff Tantsura <jefftant.ietf@gmail.com>
> wrote:
>
>> Copied from wikipedia:
>> "The SLA is the entire agreement that specifies what *service* is to be
>> provided, how it is supported, times, locations, costs, performance, and
>> responsibilities of the parties involved.
>> SLOs are specific measurable characteristics of the SLA such as
>> availability, throughput, frequency, response time, or quality."
>>
>> Compounded SLAs are a common thing, compounded SLOs aren’t, and perhaps
>> we should not use them either.
>>
>> Cheers,
>> Jeff
>> On Jun 1, 2020, 3:18 PM -0700, Greg Mirsky <gregimirsky@gmail.com>om>,
>> wrote:
>>
>> Hi Jeff,
>> many thanks for your patience and clarification. the picture in my head
>> was not the correct one, agreed. But if the Availability is as defined in
>> the current text, it appears to me being dependent on other SLOs, other
>> metrics, e.g., packet loss ratio, packet delay, as well as, path
>> continuity. In other words, the availability does not appear as an
>> independent metric but a composite of other metrics, of other SLOs.
>>
>> Regards,
>> Greg
>>
>>
>>
>> On Mon, Jun 1, 2020 at 3:07 PM Jeff Tantsura <jefftant.ietf@gmail.com>
>> wrote:
>>
>>> Hi Greg,
>>>
>>> A service (SLA) could have 1 or more SLO’s associated with it.
>>> A SLO is met (TRUE), when its objective is met (within boundaries
>>> specified).
>>> Usually a SLA is composed of a set of SLO’s with logical AND, e.g if any
>>> of SLO’s is FALSE -> SLA (or else)
>>>
>>> Example:
>>> If  SLO (availability) is met but SLO (packet_loss) isn’t, availability
>>> becomes an irrelevant objective.
>>>
>>> Cheers,
>>> Jeff
>>> On Jun 1, 2020, 2:55 PM -0700, Greg Mirsky <gregimirsky@gmail.com>om>,
>>> wrote:
>>>
>>> Hi Jeff,
>>> thank you for the clarification. Does measuring uptime considers whether
>>> all metrics included in SLO are within their respective acceptable limits?
>>> In other words, if the quality of the TS degraded, due to, for example,
>>> excessive packet loss, below the requested threshold, would that time
>>> period be attributed to the Service uptime period? In my experience, uptime
>>> of a node (router, server) is easy to express. Uptime of a service? Much
>>> appreciate it if you help with an example or a reference to the definition.
>>>
>>> Regards,
>>> Greg
>>>
>>> On Mon, Jun 1, 2020 at 2:46 PM Jeff Tantsura <jefftant.ietf@gmail.com>
>>> wrote:
>>>
>>>> Hi Greg,
>>>>
>>>> SLO - is an objective (as the name suggests), not a metric. A metric
>>>> without a context is meaningless.
>>>> SLO makes use of the metrics gathered to derive whether the objective
>>>> has been met.
>>>>
>>>> Example:
>>>> SLO (availability) = uptime 90% over 10 hours
>>>> total_time=10h
>>>> uptime=8h
>>>>
>>>> using the metrics above we can conclude that the total_availability =
>>>> 80%, which is less than the service objective set (90%) ->  SLA(or else)
>>>>
>>>> Hope this clarifies
>>>>
>>>> Cheers,
>>>> Jeff
>>>> On Jun 1, 2020, 2:32 PM -0700, Greg Mirsky <gregimirsky@gmail.com>om>,
>>>> wrote:
>>>>
>>>> Hi Jeff,
>>>> in my reading of the definition, it is the intersection of metrics
>>>> already listed in the SLO. If that is the case, how useful is another
>>>> metric that is only a reflection of other metrics?
>>>>
>>>> Regards,
>>>> Greg
>>>>
>>>> On Mon, Jun 1, 2020 at 2:24 PM Jeff Tantsura <jefftant.ietf@gmail.com>
>>>> wrote:
>>>>
>>>>> Greg,
>>>>>
>>>>> I thought the definition provided was pretty clear and comprehendible,
>>>>> why do we need to rephrase it?
>>>>>
>>>>> Cheers,
>>>>> Jeff
>>>>> On Jun 1, 2020, 2:00 PM -0700, Greg Mirsky <gregimirsky@gmail.com>om>,
>>>>> wrote:
>>>>>
>>>>> Hi Jeff,
>>>>> if we define availability as the ratio of the period all requested in
>>>>> the SLO metrics are within an acceptable range to the time since the
>>>>> service was handed to the customer (I propose to refer to this metric as
>>>>> "availability ratio"), then I think it can be expressed as
>>>>> [image: \bigcap _{i=1}^{n}A_{i}], where Ai is the time period the
>>>>> particular metric remained within its acceptable boundary.
>>>>>
>>>>> Regards,
>>>>> Greg
>>>>>
>>>>> On Mon, Jun 1, 2020 at 1:35 PM Jeff Tantsura <jefftant.ietf@gmail.com>
>>>>> wrote:
>>>>>
>>>>>> Mostly agree with Eric/Kiran
>>>>>>
>>>>>> It should not be removed, but further clarified.
>>>>>> Network/service availability is a measurable metric, availability =
>>>>>> uptime/total_time(uptime+downtime)
>>>>>> Rule of thumb - a service is deemed available when all the SLO’s
>>>>>> associated with it are met(TRUE).
>>>>>> In a complex/multidimensional service, different objects might have
>>>>>> different availability metrics .
>>>>>> For simplicity sake - total_availability(normalized metric) =
>>>>>> Σ(subservice-1..subservice-n), so both, per SLO as well as composite
>>>>>> metrics can be used.
>>>>>>
>>>>>> Cheers,
>>>>>> Jeff
>>>>>> On Jun 1, 2020, 10:08 AM -0700, Kiran Makhijani <kiranm@futurewei.com>om>,
>>>>>> wrote:
>>>>>>
>>>>>> Thanks! I support not removing it.
>>>>>>
>>>>>> Sticking with individual SLO seems to be a right decision but can be
>>>>>> deferred to NBI document. we need not state that here.
>>>>>>
>>>>>> -Kiran
>>>>>>
>>>>>>
>>>>>>
>>>>>> *From:* Teas-ns-dt <teas-ns-dt-bounces@ietf.org> *On Behalf Of* Eric
>>>>>> Gray
>>>>>> *Sent:* Monday, June 1, 2020 7:35 AM
>>>>>> *To:* teas-ns-dt@ietf.org
>>>>>> *Subject:* [Teas-ns-dt] Availability
>>>>>>
>>>>>>
>>>>>>
>>>>>> I agree that the definition needs to be cleaned up, but I disagree
>>>>>> that it should be omitted.
>>>>>>
>>>>>>
>>>>>>
>>>>>> A part of what probably should be cleaned up is the part that talks
>>>>>> about service degradation.  In general, this is an important factor in
>>>>>> determining availability, but it is a bit vague for the purpose of
>>>>>> definition.
>>>>>>
>>>>>>
>>>>>>
>>>>>> I also disagree that availability is not measurable.
>>>>>>
>>>>>>
>>>>>>
>>>>>> As a proof of concept for measuring , if there are any mandatory
>>>>>> measurable objectives, then failing to meet any of those objectives makes
>>>>>> the service measurably unavailable.  That is, if you can determine if
>>>>>> specific mandatory objectives are being met, then you can determine if they
>>>>>> are not being met and therefore determine if the service is unavailable.
>>>>>>
>>>>>>
>>>>>>
>>>>>> Availability is an important aspect of any service, because it is
>>>>>> understood that the higher the required availability, the more difficult
>>>>>> (and thus expensive) it is to provide that service.
>>>>>>
>>>>>>
>>>>>>
>>>>>> Defining availability as a fraction as we have done in the draft,
>>>>>> allows for services that may experience a certain amount of outages over a
>>>>>> service period.  A service request may ask for as high an availability as
>>>>>> the provider and requester have agreed to (under the terms they agreed to)
>>>>>> in advance.
>>>>>>
>>>>>>
>>>>>>
>>>>>> Note that this elevates the importance of having (at least mostly)
>>>>>> measurable objectives, simply because you cannot determine if a
>>>>>> non-measurable objective is being met – hence you cannot (necessarily)
>>>>>> determine the availability of any service that depends on that objective.
>>>>>>
>>>>>>
>>>>>>
>>>>>> It is further interesting to note that the notion of a service
>>>>>> depending on objectives that it cannot determine are not being met is a
>>>>>> non-sequitur.
>>>>>>
>>>>>>
>>>>>>
>>>>>> Measuring availability in terms of mandatory objectives – as a whole
>>>>>> – is the simplest approach; one could group one or more mandatory
>>>>>> objectives and define an availability separately for the group – thus
>>>>>> allowing for a higher degree of acceptance for failing to meet one set of
>>>>>> service objectives compared to others.
>>>>>>
>>>>>>
>>>>>>
>>>>>> If we were going to do that, it would probably be better to define
>>>>>> availability as a parameter that applies individually to service objectives.
>>>>>>
>>>>>>
>>>>>>
>>>>>> In my opinion we should at least initially stick to the simple case,
>>>>>> where availability is defined as a service objective, rather than as a
>>>>>> parameter of every service objective – but I am willing to go either way.
>>>>>>
>>>>>>
>>>>>>
>>>>>> --
>>>>>>
>>>>>> Eric
>>>>>> --
>>>>>> Teas-ns-dt mailing list
>>>>>> Teas-ns-dt@ietf.org
>>>>>> https://www.ietf.org/mailman/listinfo/teas-ns-dt
>>>>>>
>>>>>> --
>>>>>> Teas-ns-dt mailing list
>>>>>> Teas-ns-dt@ietf.org
>>>>>> https://www.ietf.org/mailman/listinfo/teas-ns-dt
>>>>>>
>>>>>