Re: [iccrg] comments about draft-balasubramanian-iccrg-ledbatplusplus-00

Praveen Balasubramanian <pravb@microsoft.com> Mon, 04 November 2019 23:47 UTC

Return-Path: <pravb@microsoft.com>
X-Original-To: iccrg@ietfa.amsl.com
Delivered-To: iccrg@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id B94C5120114 for <iccrg@ietfa.amsl.com>; Mon, 4 Nov 2019 15:47:44 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.001
X-Spam-Level:
X-Spam-Status: No, score=-2.001 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIMWL_WL_HIGH=-0.001, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, RCVD_IN_DNSWL_NONE=-0.0001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (1024-bit key) header.d=microsoft.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id mv1hTZleXd4W for <iccrg@ietfa.amsl.com>; Mon, 4 Nov 2019 15:47:40 -0800 (PST)
Received: from NAM05-DM3-obe.outbound.protection.outlook.com (mail-eopbgr730096.outbound.protection.outlook.com [40.107.73.96]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id D8E6A12099C for <iccrg@irtf.org>; Mon, 4 Nov 2019 15:47:39 -0800 (PST)
ARC-Seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=b/XD+FR0BhtxigsS5MlWnBpIPC0imCmE7yC8PCy1wfWQIGXY2Qj2wGQi/kVbkwuAmuPTwM5Cj0Qr/BYj9Mf1k6VYjKWwtWfhRGdsYjkSaRmmmYwSLLXKiEclWxc/+qvDaY6TIJOPseasrxPMTHKayvOZVZEFetJKTcFz2VNdrYXW+DAr+ojZvpd6H44CY6eaBkLh9i71HUNOA5pSWl+bOR6ysYsbn38XxpquGZapyDVHGKWPo+pp2FuhaFYxwrUakMKuE32hYvbLIiBuFaTHhePD+L/edX9CDFXf86G6G25pYPxOC3WJoEllA19696+e+VZxlXT8NV8AGIzlPbVNEw==
ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=ZLWSElno7NHpN6n9OLaRF1khvGjodNZS9/7ikYo1iKY=; b=nenkqc1xmACux02QwDesA9QmBlwuqecj2CYG+25W5IiqkqpsLk8/wSD/3XheMkfDcbUMRbBLKN6MWxqPGNFbBVHjY8twl2VT4kKLvhYUxgXZcjQdtlHIxj/MyVPCuOO3iWOKEkJrO4SR/USlkWgH4oV0ASbTKbDb0BMmj6Pjw0IpUsRglBXpwKXg/qfeLRQtIDwEfT+kFrB+ZtwryOr7yFRX77yj06weRjS0PNnArE+rJKpSnv+UP18pHP12JgSvWhgHS0N6z+vpPXAxGLpPfyVBgcRUxQGt11ykTDx5UPFH8L/uZm/vgG47TsZKzBn/cYx1R5Sc2HYXx4v4x33BSQ==
ARC-Authentication-Results: i=1; mx.microsoft.com 1; spf=pass smtp.mailfrom=microsoft.com; dmarc=pass action=none header.from=microsoft.com; dkim=pass header.d=microsoft.com; arc=none
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=selector2; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=ZLWSElno7NHpN6n9OLaRF1khvGjodNZS9/7ikYo1iKY=; b=dzvq82rtzP+Iz1Y7cjL7gcFzpmao1vjMNOS9QLxNBvBKh3hzxicZiSaIuYXs0dh2gVbIffAvAcs8FqVwRV9ojJs5x1YUHurlDpXVaIhz4BNqmFT0c+PmskXQGN1s4P5/ZN+r9JdcFRINzHQqQ7HqkMrEBdQuKk4Qcpnb4/UHD9Q=
Received: from MW2PR2101MB1049.namprd21.prod.outlook.com (52.132.149.13) by MW2PR2101MB1035.namprd21.prod.outlook.com (52.132.149.11) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.2430.20; Mon, 4 Nov 2019 23:47:38 +0000
Received: from MW2PR2101MB1049.namprd21.prod.outlook.com ([fe80::d90d:509a:e6c8:9c19]) by MW2PR2101MB1049.namprd21.prod.outlook.com ([fe80::d90d:509a:e6c8:9c19%7]) with mapi id 15.20.2430.014; Mon, 4 Nov 2019 23:47:38 +0000
From: Praveen Balasubramanian <pravb@microsoft.com>
To: marcelo bagnulo braun <marcelo@it.uc3m.es>, iccrg IRTF list <iccrg@irtf.org>
Thread-Topic: comments about draft-balasubramanian-iccrg-ledbatplusplus-00
Thread-Index: AQHVYj5yLCcCeV2nVkC9rO+w/ZQkyqd7w6fg
Date: Mon, 04 Nov 2019 23:47:38 +0000
Message-ID: <MW2PR2101MB1049340A6A785B8F85E546AFB67F0@MW2PR2101MB1049.namprd21.prod.outlook.com>
References: <85c3b2e6-ea10-ebc2-bd00-df9ede7d20bd@it.uc3m.es>
In-Reply-To: <85c3b2e6-ea10-ebc2-bd00-df9ede7d20bd@it.uc3m.es>
Accept-Language: en-US
Content-Language: en-US
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
msip_labels: MSIP_Label_f42aa342-8706-4288-bd11-ebb85995028c_Enabled=True; MSIP_Label_f42aa342-8706-4288-bd11-ebb85995028c_SiteId=72f988bf-86f1-41af-91ab-2d7cd011db47; MSIP_Label_f42aa342-8706-4288-bd11-ebb85995028c_Owner=huanyi@microsoft.com; MSIP_Label_f42aa342-8706-4288-bd11-ebb85995028c_SetDate=2019-11-04T18:30:27.2179901Z; MSIP_Label_f42aa342-8706-4288-bd11-ebb85995028c_Name=General; MSIP_Label_f42aa342-8706-4288-bd11-ebb85995028c_Application=Microsoft Azure Information Protection; MSIP_Label_f42aa342-8706-4288-bd11-ebb85995028c_ActionId=23c254fa-1f54-494e-a5fd-ef50a14c8a1c; MSIP_Label_f42aa342-8706-4288-bd11-ebb85995028c_Extended_MSFT_Method=Automatic
authentication-results: spf=none (sender IP is ) smtp.mailfrom=pravb@microsoft.com;
x-originating-ip: [2001:4898:80e8:0:6c7e:cda3:12a0:f9f8]
x-ms-publictraffictype: Email
x-ms-office365-filtering-ht: Tenant
x-ms-office365-filtering-correlation-id: 3f0a18b8-ba0e-46b2-cec6-08d761815fdc
x-ms-traffictypediagnostic: MW2PR2101MB1035:
x-microsoft-antispam-prvs: <MW2PR2101MB103590F5013307C97C9B3CEEB67F0@MW2PR2101MB1035.namprd21.prod.outlook.com>
x-ms-oob-tlc-oobclassifiers: OLM:8882;
x-forefront-prvs: 0211965D06
x-forefront-antispam-report: SFV:NSPM; SFS:(10019020)(366004)(376002)(136003)(346002)(396003)(39860400002)(13464003)(189003)(51914003)(199004)(110136005)(8990500004)(66446008)(66556008)(66476007)(46003)(52536014)(71200400001)(10090500001)(86362001)(71190400001)(64756008)(6246003)(9686003)(76116006)(66946007)(6436002)(10290500003)(478600001)(14444005)(256004)(55016002)(2906002)(25786009)(6116002)(74316002)(186003)(7696005)(305945005)(76176011)(22452003)(316002)(14454004)(81156014)(81166006)(99286004)(33656002)(486006)(8936002)(8676002)(476003)(11346002)(446003)(7736002)(229853002)(5660300002)(102836004)(53546011)(6506007); DIR:OUT; SFP:1102; SCL:1; SRVR:MW2PR2101MB1035; H:MW2PR2101MB1049.namprd21.prod.outlook.com; FPR:; SPF:None; LANG:en; PTR:InfoNoRecords; MX:1; A:1;
received-spf: None (protection.outlook.com: microsoft.com does not designate permitted sender hosts)
x-ms-exchange-senderadcheck: 1
x-microsoft-antispam: BCL:0;
x-microsoft-antispam-message-info: gisPPiagjTDofJn/NAvknd+nkrwGEdeLzfAcg0hrDW9YnSgvwcX77GMNhDY/6dEDCHssrlJNTd5WpZi9H0kZV6ydlksJ7JP6Y7BUIXS7YMYqCeeHiLL9xrR8fUe+LFg4jdNI66OufBeHJTD87ORkMrpYfERDZbJL2xSUYaa9SUI/8nOW6Qxawrf/QxRZt/4PJdC0tNAUWOpJH/n8OrDH4rZiOX3/i90QwHIJmxSEX3VGOlMxpMdxYQSv34+27mwrkThUhtmXT12qNFWipHqxlaoLCT/oyfAdjbfhr54+rCVdhihgyTVUeCM4q9htbDDDaVF5FjLvMAWV7/MZCVyPtnBKOTOWlHQo01B3Qsiz5XxtHsqcm9Ld1MMWiLtxisM5AOnrmHTrGphw/0E+qJwN6XYpLSLLDpOwKJHjHBxfDdrldEdYD3MgLjhZIN5gmk9P
x-ms-exchange-transport-forked: True
Content-Type: text/plain; charset="utf-8"
Content-Transfer-Encoding: base64
MIME-Version: 1.0
X-OriginatorOrg: microsoft.com
X-MS-Exchange-CrossTenant-Network-Message-Id: 3f0a18b8-ba0e-46b2-cec6-08d761815fdc
X-MS-Exchange-CrossTenant-originalarrivaltime: 04 Nov 2019 23:47:38.2037 (UTC)
X-MS-Exchange-CrossTenant-fromentityheader: Hosted
X-MS-Exchange-CrossTenant-id: 72f988bf-86f1-41af-91ab-2d7cd011db47
X-MS-Exchange-CrossTenant-mailboxtype: HOSTED
X-MS-Exchange-CrossTenant-userprincipalname: 1S+gaM5HYuJpqpETu2dryug/XWfBF8V+XPfigh/e6QpJ4mz+Fvr7wfQdYnHeVShFUWlQBvFyeh10950jBA6XMNa2ixX9Ng4S5GIuSGVPoCs=
X-MS-Exchange-Transport-CrossTenantHeadersStamped: MW2PR2101MB1035
Archived-At: <https://mailarchive.ietf.org/arch/msg/iccrg/Dfcl7Fjou3h-1-0Z-TbG1DWMQO0>
Subject: Re: [iccrg] comments about draft-balasubramanian-iccrg-ledbatplusplus-00
X-BeenThere: iccrg@irtf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: "Discussions of Internet Congestion Control Research Group \(ICCRG\)" <iccrg.irtf.org>
List-Unsubscribe: <https://www.irtf.org/mailman/options/iccrg>, <mailto:iccrg-request@irtf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/iccrg/>
List-Post: <mailto:iccrg@irtf.org>
List-Help: <mailto:iccrg-request@irtf.org?subject=help>
List-Subscribe: <https://www.irtf.org/mailman/listinfo/iccrg>, <mailto:iccrg-request@irtf.org?subject=subscribe>
X-List-Received-Date: Mon, 04 Nov 2019 23:47:48 -0000

Thanks for the feedback Marcelo. Inline prefixed by [praveen]. Submitted draft version 01 to address this feedback. I'll try to submit 02 with pseudo code soon. 

-----Original Message-----
From: marcelo bagnulo braun <marcelo@it.uc3m.es> 
Sent: Tuesday, September 3, 2019 3:01 AM
To: Praveen Balasubramanian <pravb@microsoft.com>; iccrg IRTF list <iccrg@irtf.org>
Subject: comments about draft-balasubramanian-iccrg-ledbatplusplus-00

Hi Praveen,

Thanks for writing this up. I have some comments about the document, please find them below.

On section 3.1 I am uncertain if, as currently phrased, this is a problem that LEDBAT has or that LEDBAT has made a design choice to solve a different problem. This is related to the answer that I wrote to Neal's comment in the other email. I will repeat myself a bit.

RFC6817 described the goals the goals of LEDBAT as:

" LEDBAT congestion control seeks to achieve the following goals:

    1.  to utilize end-to-end available bandwidth and to maintain low
        queueing delay when no other traffic is present,

    2.  to add limited queuing delay to that induced by concurrent flows,
        and

    3.  to yield quickly to standard TCP flows that share the same
        bottleneck link."

When there is other traffic in the bottleneck, the goal is defined in terms of the added delay, not in term of the absolute delay.

The text in this section, describes the problem in terms of the difficulty of LEDBAT to actually measure the "real" minimum/base delay. 
I would argue that this is not what LEDBAT has been designed for and we should probably stay away from that. I would suggest removing section
3.1 altogether while keeping sections 3.2 and 3.3, that deal with the effects of the wrong estimation of the base delay limited to the errors caused by the LEDBAT traffic itself. If this is done, probably there should be some rewording in section 3.2 and 3.3 to accommodate the change. I am happy to suggest text, if needed.

[praveen] Correct, LEDBAT aims to keep the queuing delays added by itself below the target. So while measuring base delay is useful, its not critical. For a sufficient number of LEDBAT++ flows the initial and periodic slowdown mechanism works well in practice. I do however think that measurement comparison with a BBR like slowdown mechanism is important in future.

About section 3.4.

I am uncertain I understand the problem this section is describing.

The way i see this, the problem is when there is a mismatch between the size of the buffer of the bottleneck and the amount of packets that result in the LEDBAT target.

If the amount of packets that result in the LEDBAT target is larger than the size of the buffer, then LEDBAT never reacts to queuing delay signal and only reacts to losses. If LEDBAT uses the same parameters than TCP New Reno, it will be competing equally with it defeating LEDBAT purpose.

The problem that i have in the way the section is currently written is that it focusses on the bandwidth of the network, and leaves out the size of the buffer its relation with the target T. In particular, it fails to explain why the target will never be reached (it doesnt mentions that the network starts dropping packets before it reaches the target, which would be important for me to understand this). I mean, i agree with the problem (and the proposed solution) just that I would phrase it differently.


[praveen] You are correct. There is incorrect focus on bandwidth. I fixed the text to refer to bottleneck buffer size while also mentioning that at high speeds the problem becomes worse.

About section 4.1

I would suggest to eliminate the F parameter and just keep Gain. This is more consistent with the original LEDBAT spec and also with section 4.2 that refers to Gain and not to F.

[praveen] Agree this makes the reading easier. Switched to just using dynamic GAIN instead of F.

About section 4.2.

First, the algorithm in section 4.1 is expressed as reaction to packet loss and acknowledgement while section 4.2 is expressed as reaction per RTT. I think it would be better if both sections are described in the same terms either per RTT or per packet. (or both if you prefer).

[praveen]  Rewrote these sections to make things more clear. I will add pseudo code to make things more clear in another version soon. 

Second, the draft currently states:

"This suffices if all competing connections have measured the same base delay.  However, this change by itself does not suffice if the connections have different estimates of the base delay.  In such conditions, picking the constant value of 1 and capping the multiplicative decrease coefficient to be at least 0.5 is required."

But this is exactly the latecomer problem we are trying to solve, isnt it?

So, my suggestion is actually include a description of the final algorithm being proposed here.

I also unclear what do you mean by the constant value and the multiplicative decrease coefficient, as the formula in the draft has two parameters, Gain and Constant, which i dont think are the ones you are referring to?

[praveen] Agreed this was poorly worded. The Constant value is set to 1. And the total decrease is capped to at most 0.5 of the cwnd to ensure reaction to delay signal is akin to loss-based congestion signal (where cwnd is halved). Fixed this text.

Regards, marcelo