Re: [Teas-ns-dt] Availability

Greg Mirsky <gregimirsky@gmail.com> Tue, 02 June 2020 01:14 UTC

Return-Path: <gregimirsky@gmail.com>
X-Original-To: teas-ns-dt@ietfa.amsl.com
Delivered-To: teas-ns-dt@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id BCE013A07A5 for <teas-ns-dt@ietfa.amsl.com>; Mon, 1 Jun 2020 18:14:28 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.097
X-Spam-Level:
X-Spam-Status: No, score=-2.097 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, FREEMAIL_FROM=0.001, HTML_MESSAGE=0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id Pu3IsOKluQa6 for <teas-ns-dt@ietfa.amsl.com>; Mon, 1 Jun 2020 18:14:26 -0700 (PDT)
Received: from mail-lj1-x229.google.com (mail-lj1-x229.google.com [IPv6:2a00:1450:4864:20::229]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id AC23F3A07A6 for <teas-ns-dt@ietf.org>; Mon, 1 Jun 2020 18:14:25 -0700 (PDT)
Received: by mail-lj1-x229.google.com with SMTP id e4so10546335ljn.4 for <teas-ns-dt@ietf.org>; Mon, 01 Jun 2020 18:14:25 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=uyDbAoupc58iSZD5DR12c9258HXKtPI04vOhE+YYEuc=; b=AFFSyuoIDyu/2Qy/SJTCVYxHKF7ryAE02ghj/bSlafmBQuBo6WUVT85eoBBTRVKQZq kNPSlLt/2ESltH2K4ELyygmX8a7fjfLeKTePvRUETh0EjIc2UUMCA3c8orbAoEyBW5Gq MtoRFwIdPTmznd0llUnwfs4/EuXnNn9HhDOq2/+Qcd7jb5kXCn+gip19WcyJR5o7HyOt JpVvfWg44QReLDX+6+vgmwzxs5d4HzFTcV0DfxuWY7z6OFaX0RObHSWX9fImveN002tI X3Iissn/Bnj/1+ngqmiDpjc5dsV5Uw1mY/idhzLtYMO2Sssg7MjPjbIZ4sI9HUVg6moR dP5w==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=uyDbAoupc58iSZD5DR12c9258HXKtPI04vOhE+YYEuc=; b=W6Egqa28TO+OxHRMPzdb6bapiwAkp5f7oJVcUrg6IyPdLKsPiJ7i4RaxYykapYdROP IA3JV/fxKG9UdMJHm4VjZnohccnHAmZ9ToQxGAay4IMpKoAlmmhvDaTqp5njNDHNaqHU n/ND1N67sGtNsFp5iqoq/3/3595hYaiieWzP2IQiRRdyIAELwkR2LFBt8jqfs658RLu5 X9hks2I4gkQPwNZ2bzAjwN94jhbYGGCj12wewr3XixnA75JstN+p0yWB8h2tF2JdmE7z Z+KouHEQoLzQmbmHF9WoY270VXcGumOZenKQA9gLy+QdFRQ0r8w0CNm++nIUgRP1f1rI 0CAg==
X-Gm-Message-State: AOAM532Fou7IREsljP5YgSqZE0wNLL4u+I2PSEfJmGWd+TpWfZqlhmoF T1jslX8KV/OdkKwWh8v5nYGWKv7I3aQ7sd7XBOk=
X-Google-Smtp-Source: ABdhPJyz0uMXdo4sjwfgFvdj9OQzzgMYKEYp5K8SSNNVIk46ES/ynTLZq0P+mXjr5iGmiTKcBfricfx1+cDc3eqGves=
X-Received: by 2002:a05:651c:54e:: with SMTP id q14mr8482546ljp.279.1591060463668; Mon, 01 Jun 2020 18:14:23 -0700 (PDT)
MIME-Version: 1.0
References: <MN2PR15MB31030C424C7AEF28118B5704978A0@MN2PR15MB3103.namprd15.prod.outlook.com> <BYAPR13MB24377B5E3FD0DEB85599C724D98A0@BYAPR13MB2437.namprd13.prod.outlook.com> <22dcfd45-85ce-4300-a973-765b8575c4dd@Spark> <CA+RyBmXXKjGPA7Fgwp+axVfnjx-iUySjyW8JF4Au3awDOUHTrQ@mail.gmail.com> <09306ffd-5ac5-4006-a9fc-4ede36b5b4d3@Spark> <CA+RyBmVMcfKhr4dDTnb00muPuSWaAaLvkteZ+To8BXj5v0CfUA@mail.gmail.com> <0432c69e-1151-404d-893c-cd240c5531a3@Spark> <CA+RyBmVBSph4dkgNUSNLmx0x67mJZAqTM31J-B4VJ2x5xrO4gA@mail.gmail.com> <a9aee371-521c-4a5a-ae60-3d742b58e77b@Spark> <CA+RyBmUOFJ+Wti5mMRjBAAgA3-Arbjxo4dGAN+vMBv1yztnBxg@mail.gmail.com> <2eda7ee1-2284-480c-a697-83fe761e9204@Spark>
In-Reply-To: <2eda7ee1-2284-480c-a697-83fe761e9204@Spark>
From: Greg Mirsky <gregimirsky@gmail.com>
Date: Mon, 1 Jun 2020 18:14:11 -0700
Message-ID: <CA+RyBmXQddq6zoRz0iYQxjkwwsu3KgYpqEjT=aKmhnjsp92Zvw@mail.gmail.com>
To: Jeff Tantsura <jefftant.ietf@gmail.com>
Cc: Eric Gray <eric.gray=40ericsson.com@dmarc.ietf.org>, "teas-ns-dt@ietf.org" <teas-ns-dt@ietf.org>, Kiran Makhijani <kiranm@futurewei.com>
Content-Type: multipart/alternative; boundary="000000000000c0d54105a70fa17a"
Archived-At: <https://mailarchive.ietf.org/arch/msg/teas-ns-dt/3jKmzL16zx0Tfva3frXaUxo7jCk>
Subject: Re: [Teas-ns-dt] Availability
X-BeenThere: teas-ns-dt@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: TEAS Network Slicing Design Team <teas-ns-dt.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/teas-ns-dt>, <mailto:teas-ns-dt-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/teas-ns-dt/>
List-Post: <mailto:teas-ns-dt@ietf.org>
List-Help: <mailto:teas-ns-dt-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/teas-ns-dt>, <mailto:teas-ns-dt-request@ietf.org?subject=subscribe>
X-List-Received-Date: Tue, 02 Jun 2020 01:14:29 -0000

Hi Jeff,
that is an interesting quote, thank you. I agree with the relationship
between SLO and SLA. What I'm not finding - the definition of availability
(and quality but that is outside the scope of our discussion). Looking once
again at the current definition:
      Availability: It a measure of how often a customer defined service
      is lost or degraded to the point of unacceptable performance due
      to any fault in the network.  It is a ratio of time the transport
      slice meets agreed SLO over the total time where the transport
      slice is contracted.
The definition refers to the performance. I understand performance as a
combination of bandwidth, packet loss, and packet latency/jitter. Would you
agree?
More, "agreed SLO", in my reading, is "performance", which is a compounded.
Thus, in my interpretation of the definition, availability is a compounded
objective. Perhaps we can change the definition.

Regards,
Greg


On Mon, Jun 1, 2020 at 4:16 PM Jeff Tantsura <jefftant.ietf@gmail.com>
wrote:

> Copied from wikipedia:
> "The SLA is the entire agreement that specifies what *service* is to be
> provided, how it is supported, times, locations, costs, performance, and
> responsibilities of the parties involved.
> SLOs are specific measurable characteristics of the SLA such as
> availability, throughput, frequency, response time, or quality."
>
> Compounded SLAs are a common thing, compounded SLOs aren’t, and perhaps we
> should not use them either.
>
> Cheers,
> Jeff
> On Jun 1, 2020, 3:18 PM -0700, Greg Mirsky <gregimirsky@gmail.com>om>, wrote:
>
> Hi Jeff,
> many thanks for your patience and clarification. the picture in my head
> was not the correct one, agreed. But if the Availability is as defined in
> the current text, it appears to me being dependent on other SLOs, other
> metrics, e.g., packet loss ratio, packet delay, as well as, path
> continuity. In other words, the availability does not appear as an
> independent metric but a composite of other metrics, of other SLOs.
>
> Regards,
> Greg
>
>
>
> On Mon, Jun 1, 2020 at 3:07 PM Jeff Tantsura <jefftant.ietf@gmail.com>
> wrote:
>
>> Hi Greg,
>>
>> A service (SLA) could have 1 or more SLO’s associated with it.
>> A SLO is met (TRUE), when its objective is met (within boundaries
>> specified).
>> Usually a SLA is composed of a set of SLO’s with logical AND, e.g if any
>> of SLO’s is FALSE -> SLA (or else)
>>
>> Example:
>> If  SLO (availability) is met but SLO (packet_loss) isn’t, availability
>> becomes an irrelevant objective.
>>
>> Cheers,
>> Jeff
>> On Jun 1, 2020, 2:55 PM -0700, Greg Mirsky <gregimirsky@gmail.com>om>,
>> wrote:
>>
>> Hi Jeff,
>> thank you for the clarification. Does measuring uptime considers whether
>> all metrics included in SLO are within their respective acceptable limits?
>> In other words, if the quality of the TS degraded, due to, for example,
>> excessive packet loss, below the requested threshold, would that time
>> period be attributed to the Service uptime period? In my experience, uptime
>> of a node (router, server) is easy to express. Uptime of a service? Much
>> appreciate it if you help with an example or a reference to the definition.
>>
>> Regards,
>> Greg
>>
>> On Mon, Jun 1, 2020 at 2:46 PM Jeff Tantsura <jefftant.ietf@gmail.com>
>> wrote:
>>
>>> Hi Greg,
>>>
>>> SLO - is an objective (as the name suggests), not a metric. A metric
>>> without a context is meaningless.
>>> SLO makes use of the metrics gathered to derive whether the objective
>>> has been met.
>>>
>>> Example:
>>> SLO (availability) = uptime 90% over 10 hours
>>> total_time=10h
>>> uptime=8h
>>>
>>> using the metrics above we can conclude that the total_availability =
>>> 80%, which is less than the service objective set (90%) ->  SLA(or else)
>>>
>>> Hope this clarifies
>>>
>>> Cheers,
>>> Jeff
>>> On Jun 1, 2020, 2:32 PM -0700, Greg Mirsky <gregimirsky@gmail.com>om>,
>>> wrote:
>>>
>>> Hi Jeff,
>>> in my reading of the definition, it is the intersection of metrics
>>> already listed in the SLO. If that is the case, how useful is another
>>> metric that is only a reflection of other metrics?
>>>
>>> Regards,
>>> Greg
>>>
>>> On Mon, Jun 1, 2020 at 2:24 PM Jeff Tantsura <jefftant.ietf@gmail.com>
>>> wrote:
>>>
>>>> Greg,
>>>>
>>>> I thought the definition provided was pretty clear and comprehendible,
>>>> why do we need to rephrase it?
>>>>
>>>> Cheers,
>>>> Jeff
>>>> On Jun 1, 2020, 2:00 PM -0700, Greg Mirsky <gregimirsky@gmail.com>om>,
>>>> wrote:
>>>>
>>>> Hi Jeff,
>>>> if we define availability as the ratio of the period all requested in
>>>> the SLO metrics are within an acceptable range to the time since the
>>>> service was handed to the customer (I propose to refer to this metric as
>>>> "availability ratio"), then I think it can be expressed as
>>>> [image: \bigcap _{i=1}^{n}A_{i}], where Ai is the time period the
>>>> particular metric remained within its acceptable boundary.
>>>>
>>>> Regards,
>>>> Greg
>>>>
>>>> On Mon, Jun 1, 2020 at 1:35 PM Jeff Tantsura <jefftant.ietf@gmail.com>
>>>> wrote:
>>>>
>>>>> Mostly agree with Eric/Kiran
>>>>>
>>>>> It should not be removed, but further clarified.
>>>>> Network/service availability is a measurable metric, availability =
>>>>> uptime/total_time(uptime+downtime)
>>>>> Rule of thumb - a service is deemed available when all the SLO’s
>>>>> associated with it are met(TRUE).
>>>>> In a complex/multidimensional service, different objects might have
>>>>> different availability metrics .
>>>>> For simplicity sake - total_availability(normalized metric) =
>>>>> Σ(subservice-1..subservice-n), so both, per SLO as well as composite
>>>>> metrics can be used.
>>>>>
>>>>> Cheers,
>>>>> Jeff
>>>>> On Jun 1, 2020, 10:08 AM -0700, Kiran Makhijani <kiranm@futurewei.com>om>,
>>>>> wrote:
>>>>>
>>>>> Thanks! I support not removing it.
>>>>>
>>>>> Sticking with individual SLO seems to be a right decision but can be
>>>>> deferred to NBI document. we need not state that here.
>>>>>
>>>>> -Kiran
>>>>>
>>>>>
>>>>>
>>>>> *From:* Teas-ns-dt <teas-ns-dt-bounces@ietf.org> *On Behalf Of* Eric
>>>>> Gray
>>>>> *Sent:* Monday, June 1, 2020 7:35 AM
>>>>> *To:* teas-ns-dt@ietf.org
>>>>> *Subject:* [Teas-ns-dt] Availability
>>>>>
>>>>>
>>>>>
>>>>> I agree that the definition needs to be cleaned up, but I disagree
>>>>> that it should be omitted.
>>>>>
>>>>>
>>>>>
>>>>> A part of what probably should be cleaned up is the part that talks
>>>>> about service degradation.  In general, this is an important factor in
>>>>> determining availability, but it is a bit vague for the purpose of
>>>>> definition.
>>>>>
>>>>>
>>>>>
>>>>> I also disagree that availability is not measurable.
>>>>>
>>>>>
>>>>>
>>>>> As a proof of concept for measuring , if there are any mandatory
>>>>> measurable objectives, then failing to meet any of those objectives makes
>>>>> the service measurably unavailable.  That is, if you can determine if
>>>>> specific mandatory objectives are being met, then you can determine if they
>>>>> are not being met and therefore determine if the service is unavailable.
>>>>>
>>>>>
>>>>>
>>>>> Availability is an important aspect of any service, because it is
>>>>> understood that the higher the required availability, the more difficult
>>>>> (and thus expensive) it is to provide that service.
>>>>>
>>>>>
>>>>>
>>>>> Defining availability as a fraction as we have done in the draft,
>>>>> allows for services that may experience a certain amount of outages over a
>>>>> service period.  A service request may ask for as high an availability as
>>>>> the provider and requester have agreed to (under the terms they agreed to)
>>>>> in advance.
>>>>>
>>>>>
>>>>>
>>>>> Note that this elevates the importance of having (at least mostly)
>>>>> measurable objectives, simply because you cannot determine if a
>>>>> non-measurable objective is being met – hence you cannot (necessarily)
>>>>> determine the availability of any service that depends on that objective.
>>>>>
>>>>>
>>>>>
>>>>> It is further interesting to note that the notion of a service
>>>>> depending on objectives that it cannot determine are not being met is a
>>>>> non-sequitur.
>>>>>
>>>>>
>>>>>
>>>>> Measuring availability in terms of mandatory objectives – as a whole –
>>>>> is the simplest approach; one could group one or more mandatory objectives
>>>>> and define an availability separately for the group – thus allowing for a
>>>>> higher degree of acceptance for failing to meet one set of service
>>>>> objectives compared to others.
>>>>>
>>>>>
>>>>>
>>>>> If we were going to do that, it would probably be better to define
>>>>> availability as a parameter that applies individually to service objectives.
>>>>>
>>>>>
>>>>>
>>>>> In my opinion we should at least initially stick to the simple case,
>>>>> where availability is defined as a service objective, rather than as a
>>>>> parameter of every service objective – but I am willing to go either way.
>>>>>
>>>>>
>>>>>
>>>>> --
>>>>>
>>>>> Eric
>>>>> --
>>>>> Teas-ns-dt mailing list
>>>>> Teas-ns-dt@ietf.org
>>>>> https://www.ietf.org/mailman/listinfo/teas-ns-dt
>>>>>
>>>>> --
>>>>> Teas-ns-dt mailing list
>>>>> Teas-ns-dt@ietf.org
>>>>> https://www.ietf.org/mailman/listinfo/teas-ns-dt
>>>>>
>>>>