Re: [alto] AD review of draft-ietf-alto-performance-metrics-15

Martin Duke <martin.h.duke@gmail.com> Fri, 02 July 2021 22:23 UTC

Return-Path: <martin.h.duke@gmail.com>
X-Original-To: alto@ietfa.amsl.com
Delivered-To: alto@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 434D23A1313 for <alto@ietfa.amsl.com>; Fri, 2 Jul 2021 15:23:31 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.096
X-Spam-Level:
X-Spam-Status: No, score=-2.096 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, FREEMAIL_FROM=0.001, HTML_MESSAGE=0.001, RCVD_IN_DNSWL_BLOCKED=0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id RSkR0D6WlbSu for <alto@ietfa.amsl.com>; Fri, 2 Jul 2021 15:23:26 -0700 (PDT)
Received: from mail-il1-x12b.google.com (mail-il1-x12b.google.com [IPv6:2607:f8b0:4864:20::12b]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id A041A3A0AB2 for <alto@ietf.org>; Fri, 2 Jul 2021 15:22:45 -0700 (PDT)
Received: by mail-il1-x12b.google.com with SMTP id a11so11146699ilf.2 for <alto@ietf.org>; Fri, 02 Jul 2021 15:22:45 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=xiEYjcM7B5mRFxDmCQLAor0R4isasf81HkP5nalBfew=; b=HNPT8kmRmODe4QMdIxGKKP7/18UiUnQioZZqDUA8UbaAKXEP5KZKiJz3Xf/WbaACwR ytnhNRM/5/bM0nBm6eT50VZ9b6TrXZD42k4aMMRCXy/UhO0HTf739x0U+ywCm/Ywf2Yf HtlYg6+2rKhjQEsn578y16hvBj8OB8dljSLT3R2KU1dnZ552N9sOVbKgNhwSWSRVmFik AvXjFyYEKLSDWUQRjcdFfYbF+Tv11JMt0Ibjzbro/bW4Hm2Hv/vvBNoQ208I3LeZSlhL BnoWAHEBmyRwRkybYUxR+tmkcmNAogpe1PKfzayYODb0cih5XZ4dVsAfiyuhzzPrJR+n sAPQ==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=xiEYjcM7B5mRFxDmCQLAor0R4isasf81HkP5nalBfew=; b=SmrQAE8IQgRt6cHKYKPQmdZr9pENbmBq3Nz5VBiB7oY+Ypv24c1TouwNLsRML3ez1C Yz0Sg3AFDMwG9zUFM+MapGWpQgX//Min78T59QmOERWWvaX8GVXQfnocpEomoDVxr0Su j8HBY0MF+nMz+ns1j0PbkddzJGXtjqXQzKH5SsU8pyaLqKf22v+fzf1X6j4+FZE6/9ro xSk76JbRh1s7kLU4P7QpAjaV380J643xcW8l29YhLZe1qqsLkHoraET0n483Qj+aVt2m VUblpMMV7WdEL0lNspPpfKFrf9WDDv0lheMQLkE0ZqBIZAhem4AV9aWQcvBLWAfgqBMw HkHQ==
X-Gm-Message-State: AOAM532BclXuKqMFqd3aBPMFFCuloU26IQO6+y1n2Uxo3PhOiFQZEVea J1XntNydLj4oVr+VryKRT3mZtsvC7Ov5OWAqQPI=
X-Google-Smtp-Source: ABdhPJyc4tpMBzb8W0LVLpX+SDX03j2m+JJ9W9+lCl7wLJp4qaJN+7K3W8gmpEOUmqjH14imiMBhXUEn1DR/UQQNjcs=
X-Received: by 2002:a92:d245:: with SMTP id v5mr1325129ilg.287.1625264564024; Fri, 02 Jul 2021 15:22:44 -0700 (PDT)
MIME-Version: 1.0
References: <7f82686c324449dbae46f975f3bcc0e5@huawei.com>
In-Reply-To: <7f82686c324449dbae46f975f3bcc0e5@huawei.com>
From: Martin Duke <martin.h.duke@gmail.com>
Date: Fri, 02 Jul 2021 15:22:33 -0700
Message-ID: <CAM4esxSH=8_TbbG5xT=9jHPN+=Qo8ruk0nezV2nMAVn=azH0vA@mail.gmail.com>
To: Qin Wu <bill.wu@huawei.com>
Cc: "Y. Richard Yang" <yry@cs.yale.edu>, IETF ALTO <alto@ietf.org>
Content-Type: multipart/alternative; boundary="0000000000000139cf05c62b65cd"
Archived-At: <https://mailarchive.ietf.org/arch/msg/alto/i7GyF9u4OxeVEYYwrYGNFi5uJ8Q>
Subject: Re: [alto] AD review of draft-ietf-alto-performance-metrics-15
X-BeenThere: alto@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: "Application-Layer Traffic Optimization \(alto\) WG mailing list" <alto.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/alto>, <mailto:alto-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/alto/>
List-Post: <mailto:alto@ietf.org>
List-Help: <mailto:alto-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/alto>, <mailto:alto-request@ietf.org?subject=subscribe>
X-List-Received-Date: Fri, 02 Jul 2021 22:23:39 -0000

This has been in "Revised I-D needed" for 95 days. Is there an issue I
should be aware of, or will I see a revision soon?

On Wed, Jun 9, 2021 at 5:36 AM Qin Wu <bill.wu@huawei.com> wrote:

> Hi, Richard and Martin:
>
>
>
> *发件人:* alto [mailto:alto-bounces@ietf.org] *代表 *Martin Duke
> *发送时间:* 2021年5月4日 2:01
> *收件人:* Y. Richard Yang <yry@cs.yale.edu>
> *抄送:* Brian Trammell <ietf@trammell.ch>; IETF ALTO <alto@ietf.org>
> *主题:* Re: [alto] AD review of draft-ietf-alto-performance-metrics-15
>
>
>
> Hi Richard,
>
>
>
> Replies inline.
>
>
>
> On Thu, Apr 8, 2021 at 1:04 PM Y. Richard Yang <yry@cs.yale.edu> wrote:
>
> If the intent is that it be machine-readable, then there are several
> places where this standard is going to need more standardization (i.e.
> precise definition of text fields).
>
>
>
> Some of the authors discussed the issue and feel that going down the path
> of making the content machine-readable, in a systematic way, adds
> substantial complexity.
>
>
>
> OK, let's just make the intent clear in the draft.
>
>
>
> .
>
> But zooming out: I understand that the point is that "the client knows
> more about the context," which is pretty much what 5.1 says. But I don't
> understand if the "client" is the user or a user agent, and what either one
> would actually do with the information. Would the application execute a
> policy based on the source? Why would it use a latency that came from an
> sla, but not from measurement? etc.
>
>
>
> This is a good comment. Here is an example (
> https://ipnetwork.bgtmo.ip.att.net/pws/averages.html) that motivated some
> early discussions. In this example, AT&T post both target (aka sla. should
> we change the name from sla to service-level-target?) and actual
> measurements. In this sense, ALTO can be considered as a standard way of
> providing and update the information. Both target and actual values can be
> useful. Make sense?
>
>
>
> So this use case is about allowing users to query performance metrics for
> consumption by humans, rather than by automated clients trying to pick a
> server? This seems like a lot of machinery for that purpose, but if this is
> important to the WG than OK. Let's just write down how this context info
> can be used, if this is the strongest use case.
>
>
>
> [Qin] I think operators may need to monitor the performance where there is SLA and proactively detect when the service is degrading. The regulators are also interested in monitoring the performance of broadband service and compare performance between ISPs or between different countries. All of these use cases have been documented in RFC7536 which performance metric draft could reference.
>
>
>
>
>
>
>
>
>
> - Sec 2.1 I am confused about the meaning of the "sla" cost-source. Does
> this refer to an SLA the ALTO client has with the network? Between the
> target IP and the network? Or something else? If the first, does this link
> to client authentication in some way? If the second, what are the privacy
> implications of exposing these SLAs?
>
>
>
> It is target IP and the network. Here is some text in the current version
> on the authentication and privacy aspects (Sec. 6):
> "Indeed, TE
>
>    performance is a highly sensitive ISP information; therefore, sharing
>    TE metric values in numerical mode requires full mutual confidence
>    between the entities managing the ALTO server and the ALTO client.
>    ALTO servers will most likely distribute numerical TE performance to
>    ALTO clients under strict and formal mutual trust agreements.  On the
>    other hand, ALTO clients must be cognizant on the risks attached to
>    such information that they would have acquired outside formal
>    conditions of mutual trust."
>
>
>
> Will this be OK?
>
>
>
> That privacy information is alright, but exposing the details of
> third-party SLAs deserves special attention.
>
>
> Yes.
>
>
>
> But to follow up your answer: if the client has a better SLA than the
> target, this won't show up in the metrics at all?
>
>
>
> Now I see that I need to clarify. The metric is end-to-end, from src IP to
> dst (target) IP. Does this clarify? I can add a sentence to clarify this.
>
>
>
> Yes, that clarifies it. It also spurs my first followup question: does
> this link to client authentication in some way, or can anyone impersonate a
> client to get its SLA?
>
> [Qin]: I agree with Martin we should prevent Excess disclosure of the ALTO service provider's data to the unauthorized client or third party,RFC7285 defines protection strategy in the security section to address these risks such as adopt HTTP Digestion authentication or TLS client authentication.
>
> I suggest to quote some text in RFC7285 to address this issue.
>
>
>
>
>
>
>
>
>
>
>
> - Sec 2.1. Related to the above, the text suggests that any cost-source
> expressed as "import" could also be expressed as "estimation". Why would
> the server do this? The text should say, or perhaps it would be
> conceptually cleaner if "estimation" and "import" were mutually exclusive
> sources by definition.
>
>
>
> In the early WG discussion, they were considered separate, and then the
> agreement was that import is a special case of estimation, with more
> specific dependency tracking. Consider data provenance of how the ALTO data
> are computed. Estimation means that the server does not want to indicate
> the specific details, and the important gives a precise indication of the
> exact protocols.
>
>
>
> OK, I now understand that "import" implies a specific set of parameters. I
> can't understand what value this distinction has, but that just circles
> around to me not understanding the cost-source information at all.
>
>
>
>
> I see. Let me try to clarify slightly differently. import means that the
> ALTO server can provide a precise source of information using specific
> parameters, and estimate is that it comes from a black-box (the server does
> not reveal). Thinking about it a bit more, we can go down the path of
> specifying a precise format (rfc/section just as ippm) when specifying
> import. Will this be a direction that you want to go?
>
>
>
> I don't have a strong opinion on the outcome, except that it is strongly
> motivated by a use case and have semantics consistent with that use case.
> There can be loosely defined contexts that are designed to be human
> readable, which allows end users to see the QoS they're getting. Or, it can
> be machine readable allowing policy to execute off of it, but I'm not 100%
> sure why policy would care about the context.
>
> [Qin]: My impression is that distinguish the measured value from SLA value
> make sense given the use case provided above. Regarding the distinguish
> import from estimate, my feeling the client doesn’t need to care about
> whether it is one hop metric or multiple hop metric (i.e., end to end
> metric) as long as we have already specify the source address and
> destination address in the request/response. I think distinguish measured
> value from estimated value make sense to me, I think the key difference is
> the measure value focus on directly measured value while the estimated
> value focuses on compound derived value. Maybe we should reference RFC6792
> for the definition of direct metric, cumulative metric and sampled metric
>
> “
>
>    Direct metrics
>
>
>
>       Metrics that can be directly measured or calculated and are not
>
>       dependent on other metrics.
>
>
>
>    Cumulative metrics
>
>
>
>       Metrics measured over several reporting intervals for accumulating
>
>       statistics.  The time period over which measurements are
>
>       accumulated can be the complete RTP session, or some other
>
>       interval signaled using an RTCP Measurement Information XR Block
>
>       [RFC6776 <https://datatracker.ietf.org/doc/html/rfc6776>].  An example cumulative metric is the total number of
>
>       RTP packets lost since the start of the RTP session.
>
>
>
>    Sampled metrics
>
>
>
>       Metrics measured at a particular time instant and sampled from the
>
>       values of a continuously measured or calculated metric within a
>
>       reporting interval (generally, the value of some measurement as
>
>       taken at the end of the reporting interval).  An example is the
>
>       inter-arrival jitter reported in RTCP SR and RR packets, which is
>
>
>
> ”
>
> Make sense?
>
> If this make sense, I think distinction nominal, sla from estimated make
> sense, without introducing cost context, we may get different results of
> performance metrics.
>
> Maybe the issues we need a better terminology since the current term such
> estimation, import may be a little bit misleading,  import should go away
> in my opinion based on clarification above.
>
>
>
>
>
> - Sec 5.4.1: "...the ALTO server may provide the client with the validity
> period of the exposed metric values."
>
>
>
> Shouldn't there be a standard format for this? Or are you implying the use
> of cost-calendar?
>
>
>
> Good catch. The decision of the WG at the time was to use HTTP whenever
> possible. For example, the freshness is indicated by HTTP timestamp (see
> Sec. 5.2); by consistency, then, we should use HTTP Expires. We can add
> this. Agree?
>
>
>
> Agreed.
>