Re: [tsvwg] Explaining my ect(1) as input to network position

Sebastian Moeller <moeller0@gmx.de> Wed, 13 May 2020 07:32 UTC

Return-Path: <moeller0@gmx.de>
X-Original-To: tsvwg@ietfa.amsl.com
Delivered-To: tsvwg@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id D2C663A0DE4 for <tsvwg@ietfa.amsl.com>; Wed, 13 May 2020 00:32:05 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.649
X-Spam-Level:
X-Spam-Status: No, score=-1.649 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, FREEMAIL_ENVFROM_END_DIGIT=0.25, FREEMAIL_FROM=0.001, RCVD_IN_MSPIKE_H2=-0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=no autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (1024-bit key) header.d=gmx.net
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id lw5JwYhnB-il for <tsvwg@ietfa.amsl.com>; Wed, 13 May 2020 00:32:04 -0700 (PDT)
Received: from mout.gmx.net (mout.gmx.net [212.227.17.22]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 1BDBB3A0DBD for <tsvwg@ietf.org>; Wed, 13 May 2020 00:32:03 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=gmx.net; s=badeba3b8450; t=1589355121; bh=A3o6TtI//2j6p2M7qUWO20D3Vil3eznwG6thWgD2nIQ=; h=X-UI-Sender-Class:Subject:From:In-Reply-To:Date:Cc:References:To; b=gwBwNjvF1Jk+9q5s6y/s62D3L7LLrW8X/+R/lsskxejDJHvYtpZIzMN0xRDy/0IH5 I6VPvzdW8luRq27+1jaz1SvWXvJoyAsg3aolQ/K5yinfWPCIgADicBIuKcoxV2hoMW zGwP8u7XaEVEgUB7rnXBfRm6zktoAyKymCaEWa+I=
X-UI-Sender-Class: 01bb95c1-4bf8-414a-932a-4f6e2808ef9c
Received: from [10.11.12.16] ([134.76.241.253]) by mail.gmx.com (mrgmx104 [212.227.17.168]) with ESMTPSA (Nemesis) id 1MirjY-1iw9rQ4Ay4-00exS2; Wed, 13 May 2020 09:32:01 +0200
Content-Type: text/plain; charset="utf-8"
Mime-Version: 1.0 (Mac OS X Mail 12.4 \(3445.104.14\))
From: Sebastian Moeller <moeller0@gmx.de>
In-Reply-To: <AM4PR07MB3490F956F0DDD670EDED93F6B9BE0@AM4PR07MB3490.eurprd07.prod.outlook.com>
Date: Wed, 13 May 2020 09:32:00 +0200
Cc: "tsvwg@ietf.org" <tsvwg@ietf.org>
Content-Transfer-Encoding: quoted-printable
Message-Id: <648C1541-EB29-4EB4-977A-39755FBA7045@gmx.de>
References: <AM4PR07MB3490F956F0DDD670EDED93F6B9BE0@AM4PR07MB3490.eurprd07.prod.outlook.com>
To: "De Schepper, Koen (Nokia - BE/Antwerp)" <koen.de_schepper@nokia-bell-labs.com>
X-Mailer: Apple Mail (2.3445.104.14)
X-Provags-ID: V03:K1:DakbqGFs3aivCMh+ftnplJo+BEVqYlz39QMtsT6unDrcDRdZXs0 esk6875i90+NOsBY0inLi5yyK3ZodTCQ78oil3ld+TLf58ZtWBt+DSubAihVeJRLr3d1v6u vroiaFqNf9tJwWmfJplyS03Ac+3cXD8bMCYItF8Yk5rb6rMC9OSKYGv8jrYosF6tpQOTnrD W7npm7xPrGN5szoW5DCtg==
X-UI-Out-Filterresults: notjunk:1;V03:K0:t7woCQJA5fI=:BQNAkRns0Z60pkUGhw73N/ KbzsLTKP2ivDg1Iwt6VC3fYi6QT5JQKtG7LwEJWjh/Hqi+0nybQ5ckQCGpuPYL1rboBVzJDb2 +z/ifPnVYqs6ombfhFqPT9V/NzVChYud1BMTwij9ZOvRoFDotT6LdbxZD8fuzZlslPvW+aXTr Cz4Scyx8eWGQrKOEvV5uTNGZPy17+0oErl8YLRk8f0DdgySpRSbQ+tfxiwutWkB5nVwZuNTIb AY63nvcG00v1BYHTMb0iwwh8iAIhowCR4z9tTdd7mjkbNIbsF/oeaMe19d14jlWeyAlAoS7Cv DBhQLWu7YI5eoVsXkZHduh29a0dOojRSQ+7RoEtRAXEfbOVxTrxwnkTCwdFJ1jlcQzZa55QFI VI3V4rLdMoSTmEwTtNDrW2KzfLy1J7unlqT6EWn+VCgDnTMCclybfRmeqe5T/853du+3uDV3s FnKjZJ4FqsR4yJibjgZ591/Et+j6ujXYtCcsCuivSrX5O4FGohNqjsWhwzEEK81G9j5VALbmh 4V2dMaIhf+5MSm2SfYOBzvDSgfVlr6MkrhdpJQS0bb5U1nGaVyOYOId5C6cQTGkgS/dPwZAju SlHpV2ySUwOFzO7PdVSwvs6ZwI9WtyfVv7LNeKp2XlHAYOy0WWfHSgWWXuADBOOuVKZ38tJlp iyZTNIwhv/pMaSh0jfo5nKDTuYLZ7xtzbwqNF5Syqwjk3c88sFiNLEWDIWMem5ItqYrE8m40x 75AaLNuUJ5Nixhz5MDk1saBH/j9qktlZi/p9Akuomg2lEglT1QOCjZOJWCbFrkyvhJi7hYu1b XG+Cq3A5JNTN6PRIYJLhf3YTM0umsB0QiAuB9UfLdjMtshmnmPIUk0/byjm/FahqCqm4bJt/E u6svZf+eUp3QZ2zatTu9glnqUQne0hPaNIAuBv+N5BeaGAf5Pl2r22GUOHR/Geh+cmALrRxQQ FJhN4QAJQOYm/93FnK9YaWAnPJ4AXM9T9A9r1Rb3Z1mpknyrvvYL7AzBj1kerEpaNeO8xSWLm ooOVlAhVOXv/ho/xrGBK6B44s5KdVpdhNLulSt9mWG+9GQDrcAimFMOYaXmjBZOIrGNpfa21f eO13nqEBTDwLJU8njcG6iDwONnku/1iuLFu6B75BE7Yntwd95vNdsOp0SVijiCoeO7BpIUWOV TW/Fkbp6lSpsw3n3PAGDV03kVOTeAxkspse3DZF8LDwLFZo3Kr6bHZlOuc+SZBmXw2mEQ5hlm 2g1RP9gPeN3UgVZ/+
Archived-At: <https://mailarchive.ietf.org/arch/msg/tsvwg/hT3BYVeoo8IwZbGAC-DjUeeiky4>
Subject: Re: [tsvwg] Explaining my ect(1) as input to network position
X-BeenThere: tsvwg@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: Transport Area Working Group <tsvwg.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/tsvwg>, <mailto:tsvwg-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/tsvwg/>
List-Post: <mailto:tsvwg@ietf.org>
List-Help: <mailto:tsvwg-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/tsvwg>, <mailto:tsvwg-request@ietf.org?subject=subscribe>
X-List-Received-Date: Wed, 13 May 2020 07:32:06 -0000

Hi Koen,

I thought I would let this slide, but on second thought this raises more questions than it answers, so more below in-line.

> On May 12, 2020, at 13:18, De Schepper, Koen (Nokia - BE/Antwerp) <koen.de_schepper@nokia-bell-labs.com> wrote:
> [...]
>  
> I noticed that there are quite some “safety due to non-responsive TCP”-concerns raised again which were solved already. We have shown in the past that the Coupled DualQ responds to non-responsive traffic as a normal Single Q Classic bottleneck (with or without ECN). No fast-lane for L4S non-responsive traffic, only lower latency. Try sending 3 non-responsive UDP/CBR flows using each one of the non-ECT, ECT(0) and ECT(1) with each a sending rate of 100% link capacity into a DualQ. They will each get 33% of the link capacity, while pushing away all TCP traffic (no matter which ECT they are). Note that cumulative, L4S gets even only 33%, while Classic gets 66% capacity in this case! It shows again that the coupled AQM disables the scheduling and fully overrules its claimed “priority”-behavior. This is a very unintuitive concept of DualQ, that seems to be easily overlooked. Depending on the overload strategy, the Classic flows can experience 15ms extra latency compared to the ECT(1), but there won’t be throughput benefit. Do the same experiment on a correctly designed Classic ECN bottleneck and the behavior will be exactly the same (except for the latency benefit for ECT(1)). This sensitivity to non-responsiveness has been always the case, even now on the Internet you all are using. It is typically constrained within a single subscriber/user by per user scheduling.
> [...]

	[SM] I find this argument to be questionable on two accounts. Let me break down my issues:

1) Is this actually relevant? You are basically demonstrating fair sharing between multiple DOS streams. I severely doubt that anybody is considering that a priority goal for achieving fairness at. Just consider that all of your streams loose 66% of their packets, so it is quite doubtful that if those are truly data streams the user is going to be happy with the result.
	But what I really would like to see, is what happens if you add an additional ECT((0)-CUBIC and EXT(1)-TCP Prague stream to the mix. If these streams survive passage though your DOSed gateway, you might have more than a purely academic point. But as you concede TCPs will be severely impacted. 
	As an end-user this is exactly the situation where I would wish my upstream AQM to be flow-fair as that has a much higher probability of still delivering reasonable throughput for responsive flows. And yes, this can be fooled by spreading the DOS traffic over multiple 5-tuples, but that same spreading will also ruin the normal dualQ AQM and will also out-smart your proposed queue-protection design so L4S does just as bad as FQ in that situation and worse in the non-spoofed-sprayed context.
	Worse an enduser, on ingress, I will not be able to fix this problem post-hoc, as the non-responsive flows make L4S already evict the TCP streams and no amount of FQing on my side of the bottleneck can bring these already dropped TCP packets back. 

2) You seem to be relaying on the overload protection here, and the fact that your streams are both unresponsive and CBR (aka paced), if I understand the dual queue coupled AQM draft correctly your AQM basically regresses to being a computationally expensive head-drop system under those circumstances, and the fairness you describe above is a direct consequence if your three DOS sources using the same fixed data rates with an equal probability dropping AQM.

Best Regards
	Sebastian