Re: [tsvwg] new tests of L4S RTT fairness and intra-flow latency: defaults ready for testing

Jonathan Morton <chromatix99@gmail.com> Wed, 18 November 2020 00:28 UTC

Return-Path: <chromatix99@gmail.com>
X-Original-To: tsvwg@ietfa.amsl.com
Delivered-To: tsvwg@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 54B993A10FE for <tsvwg@ietfa.amsl.com>; Tue, 17 Nov 2020 16:28:15 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.847
X-Spam-Level:
X-Spam-Status: No, score=-1.847 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, FREEMAIL_ENVFROM_END_DIGIT=0.25, FREEMAIL_FROM=0.001, HTML_MESSAGE=0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id inh5eJXlNArn for <tsvwg@ietfa.amsl.com>; Tue, 17 Nov 2020 16:28:12 -0800 (PST)
Received: from mail-lf1-x130.google.com (mail-lf1-x130.google.com [IPv6:2a00:1450:4864:20::130]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id B5AB03A110F for <tsvwg@ietf.org>; Tue, 17 Nov 2020 16:28:11 -0800 (PST)
Received: by mail-lf1-x130.google.com with SMTP id e139so466791lfd.1 for <tsvwg@ietf.org>; Tue, 17 Nov 2020 16:28:11 -0800 (PST)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=from:message-id:mime-version:subject:date:in-reply-to:cc:to :references; bh=wrdzuOXTiUFaYY27GflnrWDHXo7Zsnq2jfUWaDZxTgY=; b=mkixkPzJYFmnyIoHNc8TjC6pkS+97bEjJ4x8nMyI2G+gQJPoua5J90RSc1rNqmpuwk DDjSBXwHJM9spsbuqfkyyJvMAsW74Nwu2xLQUf3Jao2MTBTD2R+cmWh2kUpXB3bbB7Ia /i1wBHruFum0AqxoFNO7FYg6+gje966JoZXD543jRctSqi7N1VzDupXA3GJsu+VQYyNH WY9bZawuGh1wSVG4XQh8YSpuF9wQZAWOaQL4BZvVX4AcGGxxvXTGJWd1MLChBYPEwq0M hZZBJHFALyvJy+830Pbb8bbx5drNwfpM+DRF2ArOSrhppOxU14nZNln/a6B1Mt44sMZS oS/Q==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:from:message-id:mime-version:subject:date :in-reply-to:cc:to:references; bh=wrdzuOXTiUFaYY27GflnrWDHXo7Zsnq2jfUWaDZxTgY=; b=V2qvYQRkRB7aX7Uqj77fdTMwCYyuB3EmoSbr9aF6/QmmkY1uDaT+qNQNbFm6xeFr91 JmstJNu3Ot4zu7xfKale+qIigaJctiRCfumj/rKKQFEi1eYfG6Sz0p84yTA0RPqrRFhl kkQtNYmgm45oR49x70swN1UC/MMKBD9xnH5gJm6SxxAlaBwtyYqEr6x4Q6iOBX8mQlj0 6mE0/xkWHS1LgE8EkiAQWcjkwe5vIoUi4MZAAveB4X3bWPOhjmevaqDD8Mtxtxfw/2Un i0TpvRe5XE0jkNAv2qlA5JFZlGKU3fBZsTypH4l+ZYHachiyaUY74cB71v/tVIEsKT63 z4hw==
X-Gm-Message-State: AOAM5306C7+O9r8PQJR+K5dZqRKZymTUKmKKotzi2hS/jWw+w0fUd+/g lrGu1cdNH0RlGG5LmjT+yw8=
X-Google-Smtp-Source: ABdhPJw+Lkk+4H4208ofFqHbXPj2ZgRXDi1Dj1qjFb38jf5SBTY1Wr3+8bvCND+104pnrVjttlBvpA==
X-Received: by 2002:a19:207:: with SMTP id 7mr2865957lfc.251.1605659289783; Tue, 17 Nov 2020 16:28:09 -0800 (PST)
Received: from jonathartonsmbp.lan (178-55-159-67.bb.dnainternet.fi. [178.55.159.67]) by smtp.gmail.com with ESMTPSA id z19sm3240356lfd.128.2020.11.17.16.28.06 (version=TLS1_2 cipher=ECDHE-ECDSA-AES128-GCM-SHA256 bits=128/128); Tue, 17 Nov 2020 16:28:08 -0800 (PST)
From: Jonathan Morton <chromatix99@gmail.com>
Message-Id: <811A76DD-3D48-43D3-A962-3F15AE9E858B@gmail.com>
Content-Type: multipart/alternative; boundary="Apple-Mail=_9D32B758-E145-4409-A616-16AD4647015C"
Mime-Version: 1.0 (Mac OS X Mail 11.5 \(3445.9.7\))
Date: Wed, 18 Nov 2020 02:28:05 +0200
In-Reply-To: <AM8PR07MB7476081896E0A1C4897FFBA3B9E20@AM8PR07MB7476.eurprd07.prod.outlook.com>
Cc: Lars Eggert <lars@eggert.org>, Ingemar Johansson S <ingemar.s.johansson@ericsson.com>, tsvwg IETF list <tsvwg@ietf.org>
To: "De Schepper, Koen (Nokia - BE/Antwerp)" <koen.de_schepper@nokia-bell-labs.com>
References: <AM8PR07MB7476081896E0A1C4897FFBA3B9E20@AM8PR07MB7476.eurprd07.prod.outlook.com>
X-Mailer: Apple Mail (2.3445.9.7)
Archived-At: <https://mailarchive.ietf.org/arch/msg/tsvwg/hAvwq9HiojXHc7HvcPp1OPl5Lbc>
Subject: Re: [tsvwg] new tests of L4S RTT fairness and intra-flow latency: defaults ready for testing
X-BeenThere: tsvwg@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: Transport Area Working Group <tsvwg.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/tsvwg>, <mailto:tsvwg-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/tsvwg/>
List-Post: <mailto:tsvwg@ietf.org>
List-Help: <mailto:tsvwg-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/tsvwg>, <mailto:tsvwg-request@ietf.org?subject=subscribe>
X-List-Received-Date: Wed, 18 Nov 2020 00:28:15 -0000

> On 17 Nov, 2020, at 3:32 pm, De Schepper, Koen (Nokia - BE/Antwerp) <koen.de_schepper@nokia-bell-labs.com> wrote:
> 
> The RTT-independence was implemented, available and demonstrated several meetings ago already and as presented working very well according to our tests. The following parameters are now set as default, so can be tested out of the box:
> 
> All Prague flows with an RTT below 25ms will now converge to the same rate, independent of their real base RTT. This means that flows with a bigger RTT than 25 ms will never have to compete against smaller than 25ms RTT flows. 
> 
> Now the defaults are set, I'm looking forward to independent evaluations.

Since our tests are quite well automated, we were able to run a subset of them (all at 50Mbps) against the new defaults this evening.

I'll give you credit: there is some improvement in some of the tests.  However, we could still draw most of the same conclusions from the new data as we did from last week's data; the big-picture problems are still present and in some cases have actually deteriorated.

I'll focus on two major concerns in particular:

1: Prague outcompetes CUBIC in DualPI2, at a common baseline RTT.  This only stops being true when the BDP is large enough for Prague to have difficulty growing to steady state in a reasonable amount of time.

With the new code, the Jain's index improves from .823 to .987 at 10ms (the advantage in both cases being to Prague), but actually worsens from .880 to .838 at 20ms, and from .936 to .890 at 80ms.  All of these are sampled after allowing two minutes for the flows to converge to steady-state.

2: Prague vs Prague competition on differing RTTs.

Here is Figure 3 from the test report we recently posted, followed by an equivalent chart generated from the new data this evening.  Let's play spot the difference:




I can say that the throughput ratio for Prague vs Prague via DualPI2 is, in fact, slightly improved in the new data, but it is still significantly worse even than the 16:1 ratio expected from the baseline RTTs at identical average cwnd.  In a similar test with 80ms versus 20ms RTTs, the two Prague flows also have more than the expected 4:1 throughput ratio.  I don't have an immediate explanation for that.

Notice that with both the old and new code, CodelAF gets very close to parity in throughput with the same traffic load, and that even through DualPI2, a pair of CUBIC flows is closer to parity than a pair of Prague flows.  That is not, overall, an improvement in RTT independence from switching to TCP Prague and/or DualPI2.

However, we did find an improvement in fairness, compared to the older code, when comparing 20ms vs 10ms Prague flows.  That's what you were going for, wasn't it?  A shame that, in achieving that singular success, so many other things are left unresolved.

I'm sure we will have the opportunity to run more tests on your future efforts.  For the moment, with limited time on our hands, this will have to do.

 - Jonathan Morton