Re: [tcpPrague] TCP Prague parameters?

"De Schepper, Koen (Nokia - BE/Antwerp)" <koen.de_schepper@nokia-bell-labs.com> Wed, 19 May 2021 13:58 UTC

Return-Path: <koen.de_schepper@nokia-bell-labs.com>
X-Original-To: tcpprague@ietfa.amsl.com
Delivered-To: tcpprague@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 63F733A10AE for <tcpprague@ietfa.amsl.com>; Wed, 19 May 2021 06:58:46 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.599
X-Spam-Level:
X-Spam-Status: No, score=-2.599 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIMWL_WL_HIGH=-0.698, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, RCVD_IN_MSPIKE_H2=-0.001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (1024-bit key) header.d=nokia.onmicrosoft.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 5KMesfZ1j-bA for <tcpprague@ietfa.amsl.com>; Wed, 19 May 2021 06:58:41 -0700 (PDT)
Received: from EUR02-AM5-obe.outbound.protection.outlook.com (mail-eopbgr00105.outbound.protection.outlook.com [40.107.0.105]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 8773E3A10BA for <tcpprague@ietf.org>; Wed, 19 May 2021 06:58:41 -0700 (PDT)
ARC-Seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=Ic8XT/HlEy7SOEIBzXTusnOiMGRSGdXDEE60O2psR1UDvHyVqFjoQ3Bg4KHDrrGcatn1EXEYmgltPWh680fo2wxGUat5xTuJIRbj3vZO5EjiSUxmxhYU9WLaIP2xeeJTSsbvtXH2NTt0t0fl4d4DJZdOFrgJY+ssVybiKgOwamHvBs05gnAIz2j7L25bUS5qTVHkjyZYvvW5b6UmqX/KoKGijFtV5zgnUzpQRQa0BPf6zfLvt5m4SwiY/grWSf4ilJXBmrD2VbTJqcgWyYQ+TO3XFHzlOgmQqOwqP4MS06g5N1JzUi3SRnFeAd6OemmvJwIFst+5mY4V+h12QGAAfg==
ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=jRcjZVZKaKO4liLEtgbojHHTrK+XwrMRu9V9ELwbFfc=; b=gKPx7P2l1nUXuYBY35h76/lSWyJT45vl7UUNDchBK4tFL1ZrvWGUH/Fsue2g8q7Gss0O7xTont1+VG1eO9h1KzgJGS6gN4tRoR7LdqW8ZM0B5f7vrSqz6QEuaOAzF5ils4yVVJ4OCH0uOV+1pY+Pph4p1lstzPBHsiiKmHMRdmCUKbREDAr5ITqIDsZC7mi2vYDj/d4gBBZucEf10s4j1c+nayzVpXYk/xx1fCM8av0v0vQ9hMKzAiraP3m1rTthnK6zpvhuXLAW9BvP73s6uclQXUBBO1Ig3fGLaPUGwDODQ3sqpprk9VFQ0JPVJuc31grQWGyfR/lfaQjTqzuhDg==
ARC-Authentication-Results: i=1; mx.microsoft.com 1; spf=pass smtp.mailfrom=nokia-bell-labs.com; dmarc=pass action=none header.from=nokia-bell-labs.com; dkim=pass header.d=nokia-bell-labs.com; arc=none
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=nokia.onmicrosoft.com; s=selector1-nokia-onmicrosoft-com; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=jRcjZVZKaKO4liLEtgbojHHTrK+XwrMRu9V9ELwbFfc=; b=NkbuxyC6KDG3bwLTSLK3mhzvtgo0iGtGpoUek+QhCVx/AzkIrRpRe2FG4+FufhEPCZgxONeCOvVXuiBX6arAzm9bux3ReZ739zRBqTVHy4VEgp9tlhi1jeVL5xtAVPmqnPwrituaG5zwu5Bio7W0S5XXDBKwQCNGu8smTFWpRRo=
Received: from AM8PR07MB7476.eurprd07.prod.outlook.com (2603:10a6:20b:24e::12) by AM0PR07MB4785.eurprd07.prod.outlook.com (2603:10a6:208:76::20) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.4150.12; Wed, 19 May 2021 13:58:38 +0000
Received: from AM8PR07MB7476.eurprd07.prod.outlook.com ([fe80::cc5e:1d65:1335:28d3]) by AM8PR07MB7476.eurprd07.prod.outlook.com ([fe80::cc5e:1d65:1335:28d3%5]) with mapi id 15.20.4150.019; Wed, 19 May 2021 13:58:38 +0000
From: "De Schepper, Koen (Nokia - BE/Antwerp)" <koen.de_schepper@nokia-bell-labs.com>
To: Szilveszter Nadas <Szilveszter.Nadas@ericsson.com>, "tcpprague@ietf.org" <tcpprague@ietf.org>
CC: Ferenc Fejes <fejes@inf.elte.hu>, Gombos Gergo <ggombos@inf.elte.hu>, LAKI Sandor <lakis@inf.elte.hu>, "Tilmans, Olivier (Nokia - BE/Antwerp)" <olivier.tilmans@nokia-bell-labs.com>
Thread-Topic: TCP Prague parameters?
Thread-Index: AddL7KyBBGuGdjnfRcGjKpnVxiQgKwArgBew
Date: Wed, 19 May 2021 13:58:38 +0000
Message-ID: <AM8PR07MB7476F66B214CF08A06319FF2B92B9@AM8PR07MB7476.eurprd07.prod.outlook.com>
References: <AM0PR07MB3953246D1737E65E34C939F68B2C9@AM0PR07MB3953.eurprd07.prod.outlook.com>
In-Reply-To: <AM0PR07MB3953246D1737E65E34C939F68B2C9@AM0PR07MB3953.eurprd07.prod.outlook.com>
Accept-Language: nl-BE, en-US
Content-Language: en-US
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
authentication-results: ericsson.com; dkim=none (message not signed) header.d=none;ericsson.com; dmarc=none action=none header.from=nokia-bell-labs.com;
x-originating-ip: [2a02:1810:1e00:cb00:c9cf:e747:fdd2:107]
x-ms-publictraffictype: Email
x-ms-office365-filtering-correlation-id: 67b65c5b-46b9-4bd0-50a7-08d91ace33e9
x-ms-traffictypediagnostic: AM0PR07MB4785:
x-ms-exchange-transport-forked: True
x-microsoft-antispam-prvs: <AM0PR07MB478528C7B3168870AE2F517DB92B9@AM0PR07MB4785.eurprd07.prod.outlook.com>
x-ms-oob-tlc-oobclassifiers: OLM:10000;
x-ms-exchange-senderadcheck: 1
x-microsoft-antispam: BCL:0;
x-microsoft-antispam-message-info: 1U58PDsCbJM+V51kJVq04VAyOKumD4GetGi31Nqei3zS9PniFZAm7oXX0sJRB90Tl02RwnAqL8nGgkeoom1ysejBpRJz+HUSVfIbeDqUop6R48DxQZNsLIudT98tzbneDRapYf46iyE9hWt2zWxZqqrAR4PpVvt4054Tkpkm+FF6H/7xZju4odXolavDd9MoC8WyBQg/gTKJqtJRCTii3RnyDIkR1t9wFAgSoRECYWnvNHuCzpsQIUY3Fx0Ooowbvw7RReL6nVQlx+f8r6aEcMBkVBseM9S+A68Z9R1wRDANFl/s++PgOHX97+I/13TzjcIVWTIrIDWBqoe9ZlE1ohSfJ/QSqwmodKu0eBzVasvHFYkSBObRffvG7E4ty4JGcVgmOMrG42XxyWvWElzRJpbog8tnQ6XmX6ZJP5MbTwVSO2dUniqfK7n+JoXoHvOYNu2s5k0jkCnqxVuyhgTBUYeb5B1jebrgeOjy4Mjn+woorXUtj9rFUaIVcs1RP8PHy5k+H0Sodi2wmrZKNCincfJ7GjcBIZTLtXpHSRXOT7mhwlUJRiUq0rs5jQ3MJBsHxoeiiDA3c+MvIePuclap3JkcV6FOKMFdT/RyUn1IEjBXum+li8fMmjecUXPSFWZJ5SOOj0OyqVkKU0u+TLtsgZ7Mogwn3UG0VKY4Rro4alLjxiy5RVa2iRPh7vsIZimv
x-forefront-antispam-report: CIP:255.255.255.255; CTRY:; LANG:en; SCL:1; SRV:; IPV:NLI; SFV:NSPM; H:AM8PR07MB7476.eurprd07.prod.outlook.com; PTR:; CAT:NONE; SFS:(4636009)(366004)(396003)(39860400002)(136003)(346002)(376002)(86362001)(966005)(7696005)(186003)(83380400001)(66574015)(71200400001)(55016002)(8676002)(3480700007)(66556008)(66446008)(4326008)(76116006)(33656002)(316002)(478600001)(66476007)(107886003)(64756008)(66946007)(52536014)(38100700002)(122000001)(7116003)(53546011)(2906002)(110136005)(5660300002)(54906003)(6506007)(8936002)(9686003); DIR:OUT; SFP:1102;
x-ms-exchange-antispam-messagedata: me/pNimp1neoc+/VN1zFEYEA/7l2T5lMqpt3aOWeIY5cwPscr+tb9reIQ6SHMouzF18u1lIDk3dXZc3Q1vC+TPzzk9LTDE2KhOgb9hzoZ9V5xyNaoUBaA8FLRo/F22QlkXASVW1LbD6Ayea+zoWJMNQQGz1cI5Fgp20am7jEjgvRleNNblfQ3uvpfiU06LZLTNID/5rrnVliCLAAR4ygErHITOo3KE5jR5Fd2BfBU2H2V6k+4n17eTuxgT7Tzs3aO2njMJOoCCQzB0ZbBCOpwKIhTL4nXQ5kRojU1BmDouaiFeOs31DDWblcW2kMVW+UMf9bCHTV8W33rim4R1dr6nqNaZcyltDPu8u2gMnltFPjHwlQPrX9n7WrG0PU5j87ZCdBeCwyeJSvvmhp+QVn0O1mdk0CrXSrXe9jT6jFa7u7X3ZCCMGA8ZWS9azB2iHMuPl3lyOqJ/B5zDdXiPDWmYASn1iZZpL8DeysBJuI7TFfwU14KVOyCKItnbl3jJB0AGCVptenypYmJ7PaBK1v9nF3IuPJfOG5+WFSp5EkCUscSw10jeVmkPMkzQvVppgVeSQU1fIv0fidgajVDf+f6oQFT75bSxNOKytE9+wS8F1bqHDqFlqTzRToExEMUs6D3wQIRzwCDWviG7pIeQi04IVezu0tETQvNaFsm2/Ep8UIUnfEXmepzIWdSAubs2lI3pxbxiRSsYZQ4SFb5UmZv7w+CgeAtynGxS8XfRcC7n4klczFN4eiB+30Wq6YbXOt+g1V3w2yMQ+hFlKK50N4r0E1n/cj7YkdHxyBEmA1hsPjZq0WBQm57TNbK32i6oXOzYN1Yk5y9kNZ4FzCetI4ZpJD6XhdyWpdFpTLcYbiPMLjGkMtzpgrv+PTeWcYEy69NCV4SOgakHAcI5LrR2rx5yrg5zNGxtJj2+t08IIdy5tEU4e/lnGh0/8GotujQNDo91EvIcNZLNl5H3K9qu9Snt8rZC1p3yq4Ms6FqoKh0myBTOW0mInCCbM9Sqjt2W9d4NLGZPq/OM2IJ6Jg/BrGdbHVJsCmdx/FOlSkFMAh+pM6dZlKJd6dWfw2uYW3SRZlmZIwOSgXy5E0Mdp0gWmhwb4gQuBguXcqXy0qRAh+YJyL9uETc4eq9KjFzmQ6ebcW4VYRxiG1bS/aNvoy1rnF8YcHSijDObXuVShHRbGq9G/dCclpezytOPnf5BwoOZcQvuUjF6Ki0m3+IGjNqbQJoYV6oSRSrpibwHXxBNTdQRCPWqewlsHAS55I1Kur1URuctU+DUojVN3Y6jkjwIujUyXXdMaOluFZLLvK5nMDGBrPCdndyP4p7gi5LKUbpRrlBFEFVwm3p7eWJ/hrJ/U+4JLlNFARZQrIJv6mODAytfAENP0UO5QmWneGVfxwBZAf
Content-Type: text/plain; charset="utf-8"
Content-Transfer-Encoding: base64
MIME-Version: 1.0
X-OriginatorOrg: nokia-bell-labs.com
X-MS-Exchange-CrossTenant-AuthAs: Internal
X-MS-Exchange-CrossTenant-AuthSource: AM8PR07MB7476.eurprd07.prod.outlook.com
X-MS-Exchange-CrossTenant-Network-Message-Id: 67b65c5b-46b9-4bd0-50a7-08d91ace33e9
X-MS-Exchange-CrossTenant-originalarrivaltime: 19 May 2021 13:58:38.4616 (UTC)
X-MS-Exchange-CrossTenant-fromentityheader: Hosted
X-MS-Exchange-CrossTenant-id: 5d471751-9675-428d-917b-70f44f9630b0
X-MS-Exchange-CrossTenant-mailboxtype: HOSTED
X-MS-Exchange-CrossTenant-userprincipalname: jh9P7zMPd9P6TJrd6r+DrC6t6njoVpoR1EsUT5vQAWKtzJQtfR3gB+GkeuINniyWLNQzzuzE7TtC8WpTEmfsYlqn0nGikNXCJYOR4wIrJkHfZxy/L1VtqqiKefCICXvX
X-MS-Exchange-Transport-CrossTenantHeadersStamped: AM0PR07MB4785
Archived-At: <https://mailarchive.ietf.org/arch/msg/tcpprague/6e31GjakjWbGVHSK4Gm9yUppTD0>
Subject: Re: [tcpPrague] TCP Prague parameters?
X-BeenThere: tcpprague@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: "To coordinate implementation and standardisation of TCP Prague across platforms. TCP Prague will be an evolution of DCTCP designed to live alongside other TCP variants and derivatives." <tcpprague.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/tcpprague>, <mailto:tcpprague-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/tcpprague/>
List-Post: <mailto:tcpprague@ietf.org>
List-Help: <mailto:tcpprague-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/tcpprague>, <mailto:tcpprague-request@ietf.org?subject=subscribe>
X-List-Received-Date: Wed, 19 May 2021 13:58:56 -0000

Hi Szilveszter,

Indeed currently Prague is not fully optimized, and would need the "performance improvements" to be implemented. I also didn't have the resources up to now, but we are encouraging others to contribute to the open source project. I know some are/have merged the kernel version to the BBRv2 version, so we can combine L4S and BBR techniques easier (in both directions). Clearly the Prague RTT independent code could use an algorithm to fall back to RTT dependent again or even slow start when for "some time" no marks have been seen. Seems BBR has a useful BW detection scheme that might inspire 😉. Similarly BBRv2's too aggressiveness as you mentioned can be tuned to targeting the converged RTT-independent marking rate of 25ms RTT flows. The merged code will also allow to use BBRv2 with accurate ECN. As another improvement, I believe Prague can take the latency signal into account on top of marking signal to better respond. I also saw very nice results at some point also taking the ACK rate into account (which also BBR does). So lots of interesting opportunities for CC experts/researchers that can spend more time on it. Let me check when this combined build would become available.

But this doesn't help you now, right? So maybe useful to put things in current perspective: How many flows are you targeting on the 10G link? It would be good to test as well under "normal internet use" conditions where every flow would get around 5 to 50Mbps, typical minimum and probably extreme max for today's interactive streaming applications. On a 10G link this would boil down on having 200 to 2000 TCP flows active. Is this something you can achieve in your tests? I guess for downloads 500Mbps to 1Gbps per flow would be a reasonable limit per user/flow, and there 10 to 20 flows are minimally needed. In those conditions I would expect to see the results you showed with getting up to speed degradation for default prague settings (fair rate for flows with RTT below 25ms).

So it would be good if you can show better results too (I would expect, let me know if not) for Prague with the 25ms rate fairness enabled (so just the default TCP-Prague settings) with enough flows and for base (unloaded) RTTs that are common on the internet 5ms - 25ms base latencies (for most nearby DC access on both mobile and fixed networks). Of course, also add the bad cases, which show where more work needs to be done on the CCs.

Thanks,
Koen.


-----Original Message-----
From: Szilveszter Nadas <Szilveszter.Nadas@ericsson.com> 
Sent: Tuesday, May 18, 2021 4:33 PM
To: De Schepper, Koen (Nokia - BE/Antwerp) <koen.de_schepper@nokia-bell-labs.com>; tcpprague@ietf.org
Cc: Ferenc Fejes <fejes@inf.elte.hu>; Gombos Gergo <ggombos@inf.elte.hu>; LAKI Sandor <lakis@inf.elte.hu>; Tilmans, Olivier (Nokia - BE/Antwerp) <olivier.tilmans@nokia-bell-labs.com>
Subject: TCP Prague parameters?

Hi Koen,

Thank you for your guidance below.  We did not have resources in the end to investigate this further. 

However, now we are running an experiment where we plan to use TCP Prague as an example for scalable CC. We plan to use Prague, because DCTCP is considered to be flawed in the internet RTT regime, and our experience is that BBRv2 scalable mode is too aggressive.

We use the following code:
https://github.com/L4STeam/linux.git
2021 jan 11
66331636f0dd4930916ee142ad5f505ac8e53e25

We experienced very slow convergence with  prague_rtt_scaling = RTT_CONTROL_RATE so we set it to 0 (RTT_CONTROL_NONE). Other than that we plan to use the default parameters of Prague. Is that OK? Do you recommend to change anything else?

Best Regards,
Szilveszter

FYI - some preliminary results of our experiments are attached. We use a scheduler similar to the one in this article: http://ppv.elte.hu/cc-independent-l4s/ . The attached plots are analogous to Fig 3a, but it is Prague instead of dctcp, and 10 Gbps instead of 1 Gbps



---
From: De Schepper, Koen (Nokia - BE/Antwerp) <koen.de_schepper@nokia-bell-labs.com> 
Sent: December 2, 2020 22:22
To: Szilveszter Nadas <Szilveszter.Nadas@ericsson.com>; tcpprague@ietf.org
Cc: Ferenc Fejes <fejes@inf.elte.hu>; Gombos Gergo <ggombos@inf.elte.hu>; LAKI Sandor <lakis@inf.elte.hu>; Tilmans, Olivier (Nokia - BE/Antwerp) <olivier.tilmans@nokia-bell-labs.com>
Subject: RE: [tcpPrague] A Congestion Control Independent L4S Scheduler - TCP Prague preliminary results

Hi Szilveszter,

First thing is to check that you can reproduce our results. With the previous (non-RTT-independent) Prague, can you confirm you get the following results?

Normally with the right coupling (p_C = p'^2 and P_L = 2*p') and if their total RTT is the same, Cubic (in Reno mode) and Prague get about the same throughput (Prague might get up to twice the throughput max). Total RTT means base RTT + all Queuing delay for each flow's round trip path. Can you confirm this, or do you see a bigger difference?

One way to check this on an equal base RTT setup is to use a singleQ PI2 which is doing the  correct marking or dropping according to the above equations, because the single Q  gives both classes the same latency (single Q = same Q delay). Another way is to use a DualQ and compensate the 15ms Classic latency with a 15ms bigger base RTT for Prague. In both cases you would see them getting around the same throughput. If the BDP (rate and/or RTT) becomes bigger, Cubic will start to get a higher throughput than Prague.

Note that if you install the latest Prague version from our L4STeam git, the default is now to be RTT independent below 25ms. So any Prague with a lower than 25 ms throughput will behave as a 25ms RTT flow (effective after 500 RTTs), and will get the same throughput as a 25ms RTT flow. You could test this with Prague on a DualPi2, when competing with a Cubic flow with a base RTT of 10ms. All RTTs (you can try even a mix of different ones in parallel) below 25ms should now get the same throughput as the Cubic one (because it has also a total RTT of 25ms).

Any other effective RTT combinations different than the equal ones should follow the "throughput ratio equals the inverse RTT ratio" rule. One exception is when non-Rtt-independent Prague flows with different RTTs ares competing on a STEP AQM. In that case the difference is bigger than that ratio, because the lowest RTTs are getting on-off marking, and behave as 1/p², instead of 1/p as they will get a more stable marking probability over their (much bigger) RTTs.

Let me know if you can reproduce the above behavior. If not, maybe try using the Linux DualPI2, and if still not as expected, then we need to dive deeper to find the reason.

Thanks,
Koen.