Re: [Teas-ns-dt] Availability

Greg Mirsky <gregimirsky@gmail.com> Tue, 02 June 2020 16:00 UTC

Return-Path: <gregimirsky@gmail.com>
X-Original-To: teas-ns-dt@ietfa.amsl.com
Delivered-To: teas-ns-dt@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id CBC833A0C36 for <teas-ns-dt@ietfa.amsl.com>; Tue, 2 Jun 2020 09:00:27 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.097
X-Spam-Level:
X-Spam-Status: No, score=-2.097 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, FREEMAIL_FROM=0.001, HTML_MESSAGE=0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=unavailable autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id XYBoCf6GvumV for <teas-ns-dt@ietfa.amsl.com>; Tue, 2 Jun 2020 09:00:25 -0700 (PDT)
Received: from mail-lj1-x233.google.com (mail-lj1-x233.google.com [IPv6:2a00:1450:4864:20::233]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id E3ABA3A08F8 for <teas-ns-dt@ietf.org>; Tue, 2 Jun 2020 09:00:24 -0700 (PDT)
Received: by mail-lj1-x233.google.com with SMTP id z18so13192681lji.12 for <teas-ns-dt@ietf.org>; Tue, 02 Jun 2020 09:00:24 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=PIgRGGTW6hYhbs8XrNttwfOK2Z6YDRZ7tWy7Z9KNjp0=; b=DRO6kWwQX7IsZfyx+6vdE+E1lx39jgpRlJINjz3SOHaR2OBztoy7wUcJwvfgulxaLd omTkGy7zpjXOdxHQnu5PoQLm0EeUrS8wJwbRQZxLzVqDPBwKVPhRJeGTLFLeNWiFm9GX kgFIUksAozyFuuTauoGtzFqxaJbyRKku9wfZ+70ireudbwph4Tm+kiihj+SaDbX54P9o 7bcv2rVvZyBIZ5oZHb7lHwYo0ZxInP+AIVCxq+zDX6V1Z08LFVTyRp+20if+8phH1sKM zKs0K8BxizBueHUOQ7iD18slHvFMFEsXiJ2AFQtixaSp3F0jSRr7wDcBM9q5peJ04r0r Ua2w==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=PIgRGGTW6hYhbs8XrNttwfOK2Z6YDRZ7tWy7Z9KNjp0=; b=U4gC48bMPnhlCS96SuzyVZTpfhna3/HIxHrEYyAi/BHxjcFSk3FZ27dLJVi6BafFZx HgTaKeSHR/bZeC7SZ3JYvBK0ywuO95BkIOGTujmGcK9iKOz3ahNzmaINaLcNlQ5yb6KY NXRunLvELmwLDV6NrjJnvmHYOaxLbo6/YD2juoCVmM2cKrElQZsJg5raij5juAt3+vWe is5q2kPu5s6vqkNIiHMw1u3VlSzg3MRu2d123KCVls3tzWpp3lpRNiTKI3XYKKa0p7jY 3CWg07f1MbsNWSAI5SiQSLQF5rOr8z/8hscb7By6clphX3YNKpqkB5I5nus0pflpFC3F JJNQ==
X-Gm-Message-State: AOAM531bX/fvLw/tMofQhZEKShRcs1sfsZCj+rRoJckSeWK5vyW2QofB gs2aKIcxYpSWFSLO3BqQH6J564lGz3QoyMbTRKg=
X-Google-Smtp-Source: ABdhPJxE4gpwEixmhryVSQ/ZYk0BTtcl3+QDLE0CgulHxhzdUybPUciqm+vBiVLrPSrBbDowWDO2ayFPIyvr1xwxrY0=
X-Received: by 2002:a05:651c:1103:: with SMTP id d3mr3937747ljo.110.1591113622909; Tue, 02 Jun 2020 09:00:22 -0700 (PDT)
MIME-Version: 1.0
References: <MN2PR15MB31030C424C7AEF28118B5704978A0@MN2PR15MB3103.namprd15.prod.outlook.com> <BYAPR13MB24377B5E3FD0DEB85599C724D98A0@BYAPR13MB2437.namprd13.prod.outlook.com> <22dcfd45-85ce-4300-a973-765b8575c4dd@Spark> <CA+RyBmXXKjGPA7Fgwp+axVfnjx-iUySjyW8JF4Au3awDOUHTrQ@mail.gmail.com> <09306ffd-5ac5-4006-a9fc-4ede36b5b4d3@Spark> <CA+RyBmVMcfKhr4dDTnb00muPuSWaAaLvkteZ+To8BXj5v0CfUA@mail.gmail.com> <0432c69e-1151-404d-893c-cd240c5531a3@Spark> <CA+RyBmVBSph4dkgNUSNLmx0x67mJZAqTM31J-B4VJ2x5xrO4gA@mail.gmail.com> <a9aee371-521c-4a5a-ae60-3d742b58e77b@Spark> <MN2PR15MB310332371723B9153EBC8F29978B0@MN2PR15MB3103.namprd15.prod.outlook.com>
In-Reply-To: <MN2PR15MB310332371723B9153EBC8F29978B0@MN2PR15MB3103.namprd15.prod.outlook.com>
From: Greg Mirsky <gregimirsky@gmail.com>
Date: Tue, 2 Jun 2020 09:00:11 -0700
Message-ID: <CA+RyBmVLGN9xxi5wFp6PcTzuOBJZ-+2C7B2mxYGfko0AYFuhnA@mail.gmail.com>
To: Eric Gray <eric.gray@ericsson.com>
Cc: Jeff Tantsura <jefftant.ietf@gmail.com>, Kiran Makhijani <kiranm@futurewei.com>, "teas-ns-dt@ietf.org" <teas-ns-dt@ietf.org>, Eric Gray <eric.gray=40ericsson.com@dmarc.ietf.org>
Content-Type: multipart/alternative; boundary="0000000000004a75eb05a71c0233"
Archived-At: <https://mailarchive.ietf.org/arch/msg/teas-ns-dt/vKj_FRYi1nCuAtXYofTcNOXG1gg>
Subject: Re: [Teas-ns-dt] Availability
X-BeenThere: teas-ns-dt@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: TEAS Network Slicing Design Team <teas-ns-dt.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/teas-ns-dt>, <mailto:teas-ns-dt-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/teas-ns-dt/>
List-Post: <mailto:teas-ns-dt@ietf.org>
List-Help: <mailto:teas-ns-dt-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/teas-ns-dt>, <mailto:teas-ns-dt-request@ietf.org?subject=subscribe>
X-List-Received-Date: Tue, 02 Jun 2020 16:00:28 -0000

Hi Eric,
it is much better to have a use case to discuss, thank you. Do you think
that availability includes all performance-related parameters, e.g., packet
loss ratio, latency, and jitter? My interpretation of the definition in the
draft is that availability is the intersection of time periods when each
performance-related SLO is within the agreed range. And I think that it is
important to see it as an intersection because if during the same period,
for example, the packet loss ratio is 0% but latency is above its
threshold, that period of time is considered as unavailability period for
the given TS. What do you think?

Regards,
Greg

On Tue, Jun 2, 2020 at 8:48 AM Eric Gray <eric.gray@ericsson.com> wrote:

> Jeff,
>
>
>
>               I have to disagree.
>
>
>
>               In a simple example, if availability requires a service to
> be conformant with all SLO parameters at least 99.999 % of the time, then
> it is possible to have 100% packet loss for the remaining .001 % of the
> time and still have the service be within SLA.
>
>
>
>               To help people to get their heads around this percentage,
> .001 % of a year is a little over 5 minutes and 15 seconds.
>
>
>
>               So, a service may not be meeting any or all of its other SLO
> parameters and – as long as it is still meeting its availability SLO
> parameter, it is still within SLA.
>
>
>
>               Obviously, under those circumstances, availability does
> matter.
>
>
>
> --
>
> Eric
>
>
>
> *From:* Teas-ns-dt <teas-ns-dt-bounces@ietf.org> *On Behalf Of *Jeff
> Tantsura
> *Sent:* Monday, June 1, 2020 6:07 PM
> *To:* Greg Mirsky <gregimirsky@gmail.com>
> *Cc:* Kiran Makhijani <kiranm@futurewei.com>om>; teas-ns-dt@ietf.org; Eric
> Gray <eric.gray=40ericsson.com@dmarc.ietf.org>
> *Subject:* Re: [Teas-ns-dt] Availability
>
>
>
> Hi Greg,
>
> A service (SLA) could have 1 or more SLO’s associated with it.
> A SLO is met (TRUE), when its objective is met (within boundaries
> specified).
> Usually a SLA is composed of a set of SLO’s with logical AND, e.g if any
> of SLO’s is FALSE -> SLA (or else)
>
> Example:
> If  SLO (availability) is met but SLO (packet_loss) isn’t, availability
> becomes an irrelevant objective.
>
>
>
> Cheers,
>
> Jeff
>
> On Jun 1, 2020, 2:55 PM -0700, Greg Mirsky <gregimirsky@gmail.com>om>, wrote:
>
> Hi Jeff,
>
> thank you for the clarification. Does measuring uptime considers whether
> all metrics included in SLO are within their respective acceptable limits?
> In other words, if the quality of the TS degraded, due to, for example,
> excessive packet loss, below the requested threshold, would that time
> period be attributed to the Service uptime period? In my experience, uptime
> of a node (router, server) is easy to express. Uptime of a service? Much
> appreciate it if you help with an example or a reference to the definition.
>
>
>
> Regards,
>
> Greg
>
>
>
> On Mon, Jun 1, 2020 at 2:46 PM Jeff Tantsura <jefftant.ietf@gmail.com>
> wrote:
>
> Hi Greg,
>
> SLO - is an objective (as the name suggests), not a metric. A metric
> without a context is meaningless.
> SLO makes use of the metrics gathered to derive whether the objective has
> been met.
>
> Example:
> SLO (availability) = uptime 90% over 10 hours
> total_time=10h
> uptime=8h
>
> using the metrics above we can conclude that the total_availability = 80%,
> which is less than the service objective set (90%) ->  SLA(or else)
>
> Hope this clarifies
>
>
>
> Cheers,
>
> Jeff
>
> On Jun 1, 2020, 2:32 PM -0700, Greg Mirsky <gregimirsky@gmail.com>om>, wrote:
>
> Hi Jeff,
>
> in my reading of the definition, it is the intersection of metrics already
> listed in the SLO. If that is the case, how useful is another metric that
> is only a reflection of other metrics?
>
>
>
> Regards,
>
> Greg
>
>
>
> On Mon, Jun 1, 2020 at 2:24 PM Jeff Tantsura <jefftant.ietf@gmail.com>
> wrote:
>
> Greg,
>
> I thought the definition provided was pretty clear and comprehendible, why
> do we need to rephrase it?
>
>
>
> Cheers,
>
> Jeff
>
> On Jun 1, 2020, 2:00 PM -0700, Greg Mirsky <gregimirsky@gmail.com>om>, wrote:
>
> Hi Jeff,
>
> if we define availability as the ratio of the period all requested in the
> SLO metrics are within an acceptable range to the time since the service
> was handed to the customer (I propose to refer to this metric as
> "availability ratio"), then I think it can be expressed as
>
> [image: \bigcap _{i=1}^{n}A_{i}], where Ai is the time period the
> particular metric remained within its acceptable boundary.
>
>
>
> Regards,
>
> Greg
>
>
>
> On Mon, Jun 1, 2020 at 1:35 PM Jeff Tantsura <jefftant.ietf@gmail.com>
> wrote:
>
> Mostly agree with Eric/Kiran
>
> It should not be removed, but further clarified.
> Network/service availability is a measurable metric, availability =
> uptime/total_time(uptime+downtime)
> Rule of thumb - a service is deemed available when all the SLO’s
> associated with it are met(TRUE).
> In a complex/multidimensional service, different objects might have
> different availability metrics .
> For simplicity sake - total_availability(normalized metric) =
> Σ(subservice-1..subservice-n), so both, per SLO as well as composite
> metrics can be used.
>
>
>
> Cheers,
>
> Jeff
>
> On Jun 1, 2020, 10:08 AM -0700, Kiran Makhijani <kiranm@futurewei.com>om>,
> wrote:
>
> Thanks! I support not removing it.
>
> Sticking with individual SLO seems to be a right decision but can be
> deferred to NBI document. we need not state that here.
>
> -Kiran
>
>
>
> *From:* Teas-ns-dt <teas-ns-dt-bounces@ietf.org> *On Behalf Of* Eric Gray
> *Sent:* Monday, June 1, 2020 7:35 AM
> *To:* teas-ns-dt@ietf.org
> *Subject:* [Teas-ns-dt] Availability
>
>
>
> I agree that the definition needs to be cleaned up, but I disagree that it
> should be omitted.
>
>
>
> A part of what probably should be cleaned up is the part that talks about
> service degradation.  In general, this is an important factor in
> determining availability, but it is a bit vague for the purpose of
> definition.
>
>
>
> I also disagree that availability is not measurable.
>
>
>
> As a proof of concept for measuring , if there are any mandatory
> measurable objectives, then failing to meet any of those objectives makes
> the service measurably unavailable.  That is, if you can determine if
> specific mandatory objectives are being met, then you can determine if they
> are not being met and therefore determine if the service is unavailable.
>
>
>
> Availability is an important aspect of any service, because it is
> understood that the higher the required availability, the more difficult
> (and thus expensive) it is to provide that service.
>
>
>
> Defining availability as a fraction as we have done in the draft, allows
> for services that may experience a certain amount of outages over a service
> period.  A service request may ask for as high an availability as the
> provider and requester have agreed to (under the terms they agreed to) in
> advance.
>
>
>
> Note that this elevates the importance of having (at least mostly)
> measurable objectives, simply because you cannot determine if a
> non-measurable objective is being met – hence you cannot (necessarily)
> determine the availability of any service that depends on that objective.
>
>
>
> It is further interesting to note that the notion of a service depending
> on objectives that it cannot determine are not being met is a non-sequitur.
>
>
>
> Measuring availability in terms of mandatory objectives – as a whole – is
> the simplest approach; one could group one or more mandatory objectives and
> define an availability separately for the group – thus allowing for a
> higher degree of acceptance for failing to meet one set of service
> objectives compared to others.
>
>
>
> If we were going to do that, it would probably be better to define
> availability as a parameter that applies individually to service objectives.
>
>
>
> In my opinion we should at least initially stick to the simple case, where
> availability is defined as a service objective, rather than as a parameter
> of every service objective – but I am willing to go either way.
>
>
>
> --
>
> Eric
>
> --
> Teas-ns-dt mailing list
> Teas-ns-dt@ietf.org
> https://www.ietf.org/mailman/listinfo/teas-ns-dt
>
> --
> Teas-ns-dt mailing list
> Teas-ns-dt@ietf.org
> https://www.ietf.org/mailman/listinfo/teas-ns-dt
>
>