[ippm] Re: Responsiveness under working conditions

Magnus Olden <magnus@domos.no> Mon, 11 November 2024 13:36 UTC

Return-Path: <magnus@domos.no>
X-Original-To: ippm@ietfa.amsl.com
Delivered-To: ippm@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id D7210C14F617 for <ippm@ietfa.amsl.com>; Mon, 11 Nov 2024 05:36:38 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: 3.096
X-Spam-Level: ***
X-Spam-Status: No, score=3.096 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, GB_SUMOF=5, HTML_MESSAGE=0.001, RCVD_IN_DNSWL_BLOCKED=0.001, RCVD_IN_ZEN_BLOCKED_OPENDNS=0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, T_SCC_BODY_TEXT_LINE=-0.01, URIBL_BLOCKED=0.001, URIBL_DBL_BLOCKED_OPENDNS=0.001, URIBL_ZEN_BLOCKED_OPENDNS=0.001] autolearn=no autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=domos-no.20230601.gappssmtp.com
Received: from mail.ietf.org ([50.223.129.194]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id IuM-jKWh989F for <ippm@ietfa.amsl.com>; Mon, 11 Nov 2024 05:36:34 -0800 (PST)
Received: from mail-ed1-x52d.google.com (mail-ed1-x52d.google.com [IPv6:2a00:1450:4864:20::52d]) (using TLSv1.3 with cipher TLS_AES_128_GCM_SHA256 (128/128 bits) key-exchange X25519 server-signature ECDSA (P-256) server-digest SHA256) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id A522AC14F605 for <ippm@ietf.org>; Mon, 11 Nov 2024 05:36:34 -0800 (PST)
Received: by mail-ed1-x52d.google.com with SMTP id 4fb4d7f45d1cf-5ced377447bso6534487a12.1 for <ippm@ietf.org>; Mon, 11 Nov 2024 05:36:34 -0800 (PST)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=domos-no.20230601.gappssmtp.com; s=20230601; t=1731332192; x=1731936992; darn=ietf.org; h=cc:to:subject:message-id:date:from:in-reply-to:references :mime-version:from:to:cc:subject:date:message-id:reply-to; bh=v0IiPzu78Et4woREE5hrQ7LyOTI+pB776hBVVatGJEQ=; b=vDu71rQxUCvEnKYy7cZHUTWD+Nk1bQIkiNKgLIUq45SOqcIiPIT0HKK00LFzjgLu8R XxWHFcA0A9cPKBRgdm/QOBT5xOAhICa/k8sNw5ZOf2VrAgyYyRB5i2HbwuRWalJQPEFn F1jHpOPPPfsLbpmgUnUy2Z01FcS20vyDS3DB3quqG1TJQKieVJGHjaqUDzTgwqOYDjwg tSERL23usBwLmhfOfkMw58H9heuqy6Q3ECKcBK0YgFh7FMRWEyzfGvTZhGVItmtuTIL3 l9L+9FKl9Gpjjl+nDYH+p7XPIdB69922ybv+518boLq9vz2NS+3/qAkgwE4bRH4ePzjB d14w==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1731332192; x=1731936992; h=cc:to:subject:message-id:date:from:in-reply-to:references :mime-version:x-gm-message-state:from:to:cc:subject:date:message-id :reply-to; bh=v0IiPzu78Et4woREE5hrQ7LyOTI+pB776hBVVatGJEQ=; b=LK5WINSGX315lY9nw6MP04tYTLh2ZlK2kBEZ13ANrX2QEMXGjvphbTrGwWTYofyUzI G+nsGuWLGPntHIaehcS1M3lCKye/3mMY/9G5m91r8IjDxbde2yS/69b4YeqlepdnnRae XJYjzW6A0JpRgU6VaLCW1W1Q7d820pPt0ajowFN0TOT++Jae+RvoAJj0yJqYT8xg8D5+ xBrExNxjsLJxgdaumQUVUl+nb8pEOUN8wC59B/R46A4uEMMYOGwjwYIGpEPjUBsTsro5 o0KW9Vsj9Gddjd6+qAYVBMb6Nru3yJz2pxZWfF4p1aXvwnTcQ6auszVn5MD8Q3OZ+/cv QgTw==
X-Gm-Message-State: AOJu0YwJeMQR0KW4QRErs4qhttpOXVHk2PWBkP+p2uPsQMdreW1yngcY s4ChGLZBmNpxgd3tG/cNuKu7H+DWHKeugT+nMyEyJxxPa9eGvsGcBX/uNSvEVQlcN80BinPy93W d2WguyReGUI26G3A1FR56MTJpnbzwSKx9jWw4TBYlY5QC4fTPBfY=
X-Google-Smtp-Source: AGHT+IHiGmQWhc+UyguTuiWlzZvw+wLMZifKghrdrQIBe8HvrKE97T8upLt6AL+VRGSWOdTSFwquSsEExspmyzm+gJ4=
X-Received: by 2002:a05:6402:42d1:b0:5c9:3ff:2734 with SMTP id 4fb4d7f45d1cf-5cf0a31de82mr8321905a12.12.1731332190819; Mon, 11 Nov 2024 05:36:30 -0800 (PST)
MIME-Version: 1.0
References: <CACJ6M16-2y9RT15SooPVZjfuVY+2-LOP3menYf6DR6reaJh1SQ@mail.gmail.com>
In-Reply-To: <CACJ6M16-2y9RT15SooPVZjfuVY+2-LOP3menYf6DR6reaJh1SQ@mail.gmail.com>
From: Magnus Olden <magnus@domos.no>
Date: Mon, 11 Nov 2024 14:36:21 +0100
Message-ID: <CAF6F6UQ4zj0Mn-HN2+zqowKbxCb6dLdbC=E1NCQSoHj-k7yzAw@mail.gmail.com>
To: Chris Box <chris.box.ietf@gmail.com>, Bjørn Ivar Teigen <bjorn@domos.no>
Content-Type: multipart/alternative; boundary="00000000000038f6d20626a33098"
Message-ID-Hash: 6RD5DR46SVJURJ5JC7AIGZCRCYITVH74
X-Message-ID-Hash: 6RD5DR46SVJURJ5JC7AIGZCRCYITVH74
X-MailFrom: magnus@domos.no
X-Mailman-Rule-Misses: dmarc-mitigation; no-senders; approved; emergency; loop; banned-address; member-moderation; header-match-ippm.ietf.org-0; nonmember-moderation; administrivia; implicit-dest; max-recipients; max-size; news-moderation; no-subject; digests; suspicious-header
CC: ippm@ietf.org
X-Mailman-Version: 3.3.9rc6
Precedence: list
Subject: [ippm] Re: Responsiveness under working conditions
List-Id: IETF IP Performance Metrics Working Group <ippm.ietf.org>
Archived-At: <https://mailarchive.ietf.org/arch/msg/ippm/6Z3_yMbN5bNlhlhjzNcinISdeTs>
List-Archive: <https://mailarchive.ietf.org/arch/browse/ippm>
List-Help: <mailto:ippm-request@ietf.org?subject=help>
List-Owner: <mailto:ippm-owner@ietf.org>
List-Post: <mailto:ippm@ietf.org>
List-Subscribe: <mailto:ippm-join@ietf.org>
List-Unsubscribe: <mailto:ippm-leave@ietf.org>

Hi Chris,

Reading your email, it seems you are running into a problem we ran into
when designing QoO <https://datatracker.ietf.org/wg/ippm/about/>, which
later became an important pillar of the design.

When mapping network metrics to app performance, there will be multiple
relevant requirements. As you mention, the mean/median and max latency will
play a role. Going down the rabbit hole, you see that some apps can drop
some packets without meaningful app degradation, hence the 99th percentile
latency may be more important than the max, but not always.

Some apps have jitter buffers that are tuned by the 5% worst latencies,
which means the 95th percentile may also be important, other apps implement
their jitter buffers in different ways (e.g., like you mention with a fixed
jitter buffer). It seems to us that you are bound to end up with a set of
network requirements rather than one, for example:

Network Requirement 1: Median latency < 100 ms
Network Requirement 2:  95th percentile latency  < 150 ms
...
Network Requirement N-1: 99th percentile latency < 250ms
Network Requirement N: Max (100th) percentile latency < 300 ms

Where the app performance is bound by the worst network requirement.

Not sure how this is directly relatable to RPM calculations, but I figured
it was useful input anway. Both  @Bjørn Ivar Teigen <bjorn@domos.no> and
I would be happy to walk you through our approach, and the underlying maths
<https://wiki.broadband-forum.org/display/RESOURCES/Broadband+Forum+Published+Resources#tf-filters=%7B%22selectfilters%22%3A%5B%5D%2C%22userfilters%22%3A%5B%22Number%22%5D%2C%22numberfilters%22%3A%5B%5D%2C%22datefilters%22%3A%5B%5D%2C%22globalfilter%22%3Atrue%2C%22columnhider%22%3Afalse%2C%22iconfilters%22%3A%5B%5D%2C%22defaults%22%3A%5B%22TR-452.2%22%2C%22%22%5D%2C%22width%22%3A%5B%22150%22%2C%22150%22%5D%2C%22inverse%22%3A%5Bfalse%2Cfalse%5D%2C%22order%22%3A%5B0%2C1%5D%2C%22ddSeparator%22%3A%5B%5D%2C%22ddOperator%22%3A%5B%5D%2C%22sorts%22%3A%5B%22Date%20%E2%87%A9%22%5D%7D>
and
justifications.


Best Regards,
Magnus Olden | CTO | Domos | Bridging the gap between applications and
networks

The content of this email is intended for the person or entity to which it
is addressed only. This email may contain confidential information. If you
are not the person to whom this message is addressed, be aware that any
use, reproduction, or distribution of this message is strictly prohibited.
If you received this in error, please contact the sender and immediately
delete this email and any attachments.


On Sat, 9 Nov 2024 at 10:25, Chris Box <chris.box.ietf@gmail.com> wrote:

> Hi everyone
>
> I watched Stuart's presentation on Monday and I'm personally convinced
> that getting this responsiveness test right is one of the most important
> tasks of internet engineering right now. Responsiveness is the next
> frontier of quality improvement, and the current experience of most is
> plentiful bandwidth but frequently poor responsiveness. We can do better,
> and this test is the key enabler for that.
>
> My personal aims sound very similar to Stuart's:
>
> Primary goals (must have): Relevance and Repeatability
> Secondary goal (desirable): Convenience
>
>
> When I heard Jonathan describe the once-per-minute wifi freeze, I
> concluded this is a case where we ought to sacrifice convenience in favour
> of relevance and repeatability. If end users are being impacted by that
> 500ms gap, then the test ought to measure that. It's much less helpful if
> it ignores that degradation. Of course periodic (or even sporadic)
> disruptions can occur on longer timescales and we have to draw a line
> somewhere. But perhaps we can consider a "full test" and a "quick test"
> mode.
>
> The other major open question is how to convert from a large set of
> samples to a single RPM value. I agree with Abhishek that the message from
> Bjorn's QoO distribution is that the outliers have the most significant
> effect on user experience. Those little black circles are the ones we
> notice the most. So I suggest we do not discard any values. They all count.
> My view is that the packet with the maximum delay should form a significant
> component of the RPM. It's not everything of course, and repeatability
> probably implies we need some way to compute a "worst delay" figure from
> multiple packets, e.g. the worst 10 or 20.
>
> ITU Recommendation Y.1514 defines network performance requirements. It
> says for class 0 audio over a measurement period of one minute, the mean IP
> Packet Transfer Delay should not exceed 100ms, and IP Packet Delay
> Variation (RFC3393) should not exceed 50ms. To have good enough
> responsiveness for a video conference, this means mean delay in each
> direction <100ms, and no audio-bearing packets should arrive later than
> 150ms. With responsiveness we're measuring the sum of both directions, so
> it will depend on the assymetry of delay, for example in a mobile network
> uplink packets need to wait until granted permission. What we can say is
> that for all networks, http_l of <100 mean and <150 max is sufficient to
> meet the class 0 requirement. For a symmetrical network, http_l of <200
> mean and <300 max is sufficient.
>
> Also ITU G.114 discusses delay due to the use of an IP delay variation
> buffer. It says that for planning purposes, it is recommended to assume
> that a de-jitter buffer adds one half of its peak delay to the mean network
> delay.  A jitter buffer designed to compensate for 50 ms packet delay
> variation range will introduce 25 ms additional delay, on average.
>
> So what should we compute the RPM from? I suggest it needs to be some
> combination of the worst delay and the typical delay, that in testing
> proves itself to be both Relevant and Repeatable.
>
> Chris
>
>
> _______________________________________________
> ippm mailing list -- ippm@ietf.org
> To unsubscribe send an email to ippm-leave@ietf.org
>