Re: [iccrg] LEDBAT++, rLEDBAT, and slowdowns

"Bless, Roland (TM)" <roland.bless@kit.edu> Mon, 09 September 2019 13:36 UTC

Return-Path: <roland.bless@kit.edu>
X-Original-To: iccrg@ietfa.amsl.com
Delivered-To: iccrg@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 83556120047 for <iccrg@ietfa.amsl.com>; Mon, 9 Sep 2019 06:36:07 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.899
X-Spam-Level:
X-Spam-Status: No, score=-1.899 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=ham autolearn_force=no
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id GWjhbhAxlnqS for <iccrg@ietfa.amsl.com>; Mon, 9 Sep 2019 06:36:04 -0700 (PDT)
Received: from iramx2.ira.uni-karlsruhe.de (iramx2.ira.uni-karlsruhe.de [IPv6:2a00:1398:2::10:81]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id AF0DA120274 for <iccrg@irtf.org>; Mon, 9 Sep 2019 06:36:04 -0700 (PDT)
Received: from [2a00:1398:2:4006:184d:c9e5:860a:7e6a] (helo=i72vorta.tm.kit.edu) by iramx2.ira.uni-karlsruhe.de with esmtpsa port 25 iface 2a00:1398:2::10:8 id 1i7Jpq-0005Dq-Jx; Mon, 09 Sep 2019 15:36:02 +0200
Received: from [IPv6:::1] (ip6-localhost [IPv6:::1]) by i72vorta.tm.kit.edu (Postfix) with ESMTPS id 864B74202D8; Mon, 9 Sep 2019 15:36:01 +0200 (CEST)
To: marcelo bagnulo braun <marcelo@it.uc3m.es>, Neal Cardwell <ncardwell@google.com>
Cc: Praveen Balasubramanian <pravb@microsoft.com>, iccrg IRTF list <iccrg@irtf.org>, Yuchung Cheng <ycheng@google.com>
References: <CADVnQynJY8xkqkghhCWhPpbF+4Ev_3c7OZf_tDEb_J5xr0FV9A@mail.gmail.com> <c4b76af5-abbe-3184-24ce-03a2c0b9544b@it.uc3m.es> <CADVnQy=3FqEjqipX6thgjcjN8YPTOqiduYKU2GccXHwS+a3wVA@mail.gmail.com> <be98e323-506f-bdc8-a128-72c9f4aa5ead@it.uc3m.es> <3862e86e-e588-0cca-38e8-4ea23ef2b4c7@it.uc3m.es> <CADVnQymqHHg4cF94fu81opaBibmShVrsZ3cLyVPcHir0p=Kv3Q@mail.gmail.com> <30f3abb2-a78f-6da6-e67a-1532d342ce9e@it.uc3m.es>
From: "Bless, Roland (TM)" <roland.bless@kit.edu>
Openpgp: preference=signencrypt
Autocrypt: addr=roland.bless@kit.edu; prefer-encrypt=mutual; keydata= xsFNBFi0OxABEACy2VohJ7VhSu/xPCt4/6qCrw4Pw2nSklWPfAYEk1QgrbiwgvLAP9WEhAIU w45cojBaDxytIGg8eaYeIKSmsXjHGbV/ZTfo8r11LX8yPYR0WHiMWZpl0SHUd/CZIkv2pChO 88vF/2FKN95HDcp24pwONF4VhxJoSFk6c0mDNf8Em/Glt9BcWX2AAvizTmpQDshaPje18WH3 4++KwPZDd/sJ/hHSXiPg1Gdhs/OG/C0CJguOAlqbgSVAe3qKOr1M4K5M+wVpsk373pXRfxd7 ZAmZ05iBTn+LfgVcz+AfaKKcsWri5CdTT+7JDL6QNQpox+b5FXZFSHnEIST+/qzfG7G2LqqY mml6TYY8XbaNyXZP0QKncfSpRx8uTRWReHUa1YbSuOxXYh6bXpcugD25mlC/Lu0g7tz4ijiK iIwq9+P2H1KfAAfYyYZh6nOoE6ET0TjOjUSa+mA8cqjPWX99kEEgf1Xo+P9fx9QLCLWIY7zc mSM+vjQKgdUFpMSCKcYEKOuwlPuOz8bVECafxaEtJJHjCOK8zowe2eC9OM+G+bmtAO3qYcYZ hQ/PV3sztt/PjgdtnFAYPFLc9189rHRxKsWSOb4xPkRw/YQAI9l15OlUEpsyOehxmAmTsesn tSViCz++PCdeXrQc1BCgl8nDytrxW+n5w1aaE8aL3hn8M0tonQARAQABzShSb2xhbmQgQmxl c3MgKFRNKSA8cm9sYW5kLmJsZXNzQGtpdC5lZHU+wsGABBMBCAAqAhsDBQkSzAMABQsJCAcC BhUICQoLAgQWAgMBAh4BAheABQJYtYdHAhkBAAoJEKON2tlkOJXuzWkP+wfjUnDNzRm4r34a AMWepcQziTgqf4I1crcL6VD44767HhyFsjcKH31E5G5gTDxbpsM4pmkghKeLrpPo30YK3qb7 E9ifIkpJTvMu0StSUmcXq0zPyHZ+HxHeMWkosljG3g/4YekCqgWwrB62T7NMYq0ATQe1MGCZ TAPwSPGCUZT3ioq50800FMI8okkGTXS3h2U922em7k8rv7E349uydv19YEcS7tI78pggMdap ASoP3QWB03tzPKwjqQqSevy64uKDEa0UgvAM3PRbJxOYZlX1c3q/CdWwpwgUiAhMtPWvavWW Tcw6Kkk6e0gw4oFlDQ+hZooLv5rlYR3egdV4DPZ1ugL51u0wQCQG9qKIMXslAdmKbRDkEcWG Oi2bWAdYyIHhhQF5LSuaaxC2P2vOYRHnE5yv5KTV3V7piFgPFjKDW+giCRd7VGfod6DY2b2y zwidCMve1Qsm8+NErH6U+hMpMLeCJDMu1OOvXYbFnTkqjeg5sKipUoSdgXsIo4kl+oArZlpK qComSTPhij7rMyeu/1iOwbNCjtiqgb55ZE7Ekd84mr9sbq4Jm/4QGnVI30q4U2vdGSeNbVjo d1nqjf3UNzP2ZC+H9xjsCFuKYbCX6Yy4SSuEcubtdmdBqm13pxua4ZqPSI0DQST2CHC7nxL1 AaRGRYYh5zo2vRg3ipkEzsFNBFi0OxABEAC2CJNp0/Ivkv4KOiXxitsMXZeK9fI0NU2JU1rW 04dMLF63JF8AFiJ6qeSL2mPHoMiL+fG5jlxy050xMdpMKxnhDVdMxwPtMiGxbByfvrXu18/M B7h+E1DHYVRdFFPaL2jiw+Bvn6wTT31MiuG9Wh0WAhoW8jY8IXxKQrUn7QUOKsWhzNlvVpOo SjMiW4WXksUA0EQVbmlskS/MnFOgCr8q/FqwC81KPy+VLHPB9K/B65uQdpaw78fjAgQVQqpx H7gUF1EYpdZWyojN+V8HtLJx+9yWAZjSFO593OF3/r0nDHEycuOjhefCrqr0DDgTYUNthOdU KO2CzT7MtweRtAf0n27zbwoYvkTviIbR+1lV1vNkxaUtZ6e1rtOxvonRM1O3ddFIzRp/Qufu HfPe0YqhEsrBIGW1aE/pZW8khNQlB6qt20snL9cFDrnB6+8kDG3e//OjK1ICQj9Y/yyrJVaX KfPbdHhLpsgh8TMDPoH+XXQlDJljMD0++/o7ckO3Sfa8Zsyh1WabyKQDYXDmDgi9lCoaQ7Lf uLUpoMvJV+EWo0jE4RW/wBGQbLJp5usy5i0fhBKuDwsKdLG3qOCf4depIcNuja6ZmZHRT+3R FFjvZ/dAhrCWpRTxZANlWlLZz6htToJulAZQJD6lcpVr7EVgDX/y4cNwKF79egWXPDPOvQAR AQABwsFlBBgBCAAPBQJYtDsQAhsMBQkSzAMAAAoJEKON2tlkOJXukMoP/jNeiglj8fenH2We 7SJuyBp8+5L3n8eNwfwY5C5G+etD0E6/lkt/Jj9UddTazxeB154rVFXRzmcN3+hGCOZgGAyV 1N7d8xM6dBqRtHmRMPu5fUxfSqrM9pmqAw2gmzAe0eztVvaM+x5x5xID2WZOiOq8dx9KOKrp Zorekjs3GEA3V1wlZ7Nksx/o8KZ04hLeKcR1r06zEDLN/yA+Fz8IPa0KqpuhrL010bQDgAhe 9o5TA0/cMJpxpLqHhX2As+5cQAhKDDsWJu3oBzZRkN7Hh/HTpWurmTQRRniLGSeiL0zdtilX fowyxGXH6QWi3MZYmpOq+etr7o4EGGbm2inxpVbM+NYmaJs+MAi/z5bsO/rABwdM5ysm8hwb CGt+1oEMORyMcUk/uRjclgTZM1NhGoXm1Un67+Rehu04i7DA6b8dd1H8AFgZSO2H4IKi+5yA Ldmo+ftCJS83Nf6Wi6hJnKG9aWQjKL+qmZqBEct/D2uRJGWAERU5+D0RwNV/i9lQFCYNjG9X Tew0BPYYnBtHFlz9rJTqGhDu4ubulSkbxAK3TIk8XzKdMvef3tV/7mJCmcaVbJ2YoNUtkdKJ goOigJTMBXMRu4Ibyq1Ei+d90lxhojKKlf9yguzpxk5KYFGUizp0dtvdNuXRBtYrwzykS6vB zTlLqHZ0pvGjNfTSvuuN
Organization: Institute of Telematics, Karlsruhe Institute of Technology
Message-ID: <cd9db905-436f-4cd0-bbdb-eda67033e5ae@kit.edu>
Date: Mon, 09 Sep 2019 15:36:01 +0200
User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:60.0) Gecko/20100101 Thunderbird/60.8.0
MIME-Version: 1.0
In-Reply-To: <30f3abb2-a78f-6da6-e67a-1532d342ce9e@it.uc3m.es>
Content-Type: text/plain; charset="utf-8"
Content-Language: en-GB
Content-Transfer-Encoding: 8bit
X-ATIS-AV: ClamAV (iramx2.ira.uni-karlsruhe.de)
X-ATIS-Timestamp: iramx2.ira.uni-karlsruhe.de esmtpsa 1568036162.677028956
Archived-At: <https://mailarchive.ietf.org/arch/msg/iccrg/iK7rCCJTwS9MeXICzxNE5QHETKU>
Subject: Re: [iccrg] LEDBAT++, rLEDBAT, and slowdowns
X-BeenThere: iccrg@irtf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: "Discussions of Internet Congestion Control Research Group \(ICCRG\)" <iccrg.irtf.org>
List-Unsubscribe: <https://www.irtf.org/mailman/options/iccrg>, <mailto:iccrg-request@irtf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/iccrg/>
List-Post: <mailto:iccrg@irtf.org>
List-Help: <mailto:iccrg-request@irtf.org?subject=help>
List-Subscribe: <https://www.irtf.org/mailman/listinfo/iccrg>, <mailto:iccrg-request@irtf.org?subject=subscribe>
X-List-Received-Date: Mon, 09 Sep 2019 13:36:08 -0000

Hi Marcelo,

Just some quick remarks. My colleague Mario Hock did some LEDBAT
research, too (yet unpublished), that eventually evolved into TCP LoLa
[*].
There are often dependencies between performance (e.g., keeping
utilization high and overall delay bounded) and fairness goals
and their respective mechanisms.
However, the results for using LEDBAT with AIMD are somewhat sensitive
to the \beta reduction factor and also show delay jitter.

In contrast to LEDBAT and Vegas, TCP LoLa tries to bound the
overall queuing delay (not the per-flow queueing delay).
TCP LoLa has a special "Fair Flow Balancing" mechanism that
uses Queueing Delay measurements for synchronization: if the flows
measure a queueing delay above Q_target, they will multiplicatively
decrease their CWnd after some fixed waiting period (this allows all
LoLa flows to quite reliably detect the exceedance of Q_target).
This "Tailored Decrease" reduction action uses a CWnd value that will
likely result in a light underutilization and thus allows all flows to
measure the current RTT_min.

[*]  Mario Hock, Felix Neumeister, Martina Zitterbart, Roland Bless: TCP
LoLa: Congestion Control for Low Latencies and High Throughput,
IEEE LCN 2017, http://ieeexplore.ieee.org/document/8109356/

Am 09.09.19 um 10:35 schrieb marcelo bagnulo braun:
> Thanks for the explanation. I can see now that uncoordinated slow downs
> is not enough to provide a bounded delay.
> I have an additional question related to this, that maybe you can
> provide some insight.
> Some papers, like [1], have proposed an AIMD variation of LEDBAT to
> achieve interledbat fairness. Since laedbat late comer advantage is
> essentially the problem of a late comer misestimating the base delay,
> and bloating the queueing delay, seems essentially the same problem. So
> the question is wouldnt AIMD solve the problem? I mean, wouldnt AIMD
> provide fairness and keep the queueing delay added by the ledbat flows
> bounded as well?
> Thanks, marcelo
> 
> 
> El 06/09/19 a las 17:25, Neal Cardwell escribió:
>> However, if we apply the slowdown described in LEDBAT++
>>> (unsynchronized), then f1 would reduce its window and because f1 is the
>>> one expelling the other flows out, this would allow f1 to have a more
>>> accurate measure of the base delay, enabling the AIMD fairness to kick
>>> in and preventing the increase of the queueing delay.
>> As noted above, if the fair share of inflight data is less than the
>> standing queue, when f1 reduces its window it will not drain the queue
>> fully, so its base delay estimate will ratchet up to a value that
>> includes the queueing delay from all the other flows queued at
>> the bottleneck, even if all the others are LEDBAT++ flows.
>>
>>
>>> In summary, because errors in estimating the queuing delay causes one
>>> flow to grow more than the others, when this particular flow slows down,
>>> will allow a more accurate measure of the base delay, enabling AIMD
>>> fairness mechanisms to reconverge to a fair split.
>>>
>>> So, I am not convinced that it is not enough with the slow down proposed
>>> in LEDBAT++.
>> When you say a slow-down "will allow a more accurate measure of the
>> base delay," I think a key question is: more accurate than what, exactly?
>> An uncoordinated slow-down will allow a more accurate measure
>> of the base delay than the most recent RTT samples, but it will not
>> allow a measure that is more accurate than (or even matching)
>> the current base delay estimate.That's because uncoordinated
>> slow-downs can cause feedback loops, where flows measure
>> ever-increasing RTTs from the queues created by other flows
>> that are also unable to "anchor" their base delay estimates, so
>> are also allowing ever-increasing amounts of data in flight.
>>
>>> I am not sure this is true for all combinations of parameter values, i
>>> need to look into this.
>> Yes, I think further testing of LEDBAT++/rLEDBAT in multi-flow scenarios
>> would be worthwhile.
>>
>> A simple scenario that should illustrate this issue:
>>   o RTT of 100ms
>>   o bottleneck with 10*BDP of buffer space (allowing 1 sec of queuing)
>>   o 3 bulk LEDBAT++/rLEDBAT flows
>>
>> Per the dynamics discussed above, my theory is that in such a scenario
>> the queuing delay will grow until the buffer is oscillating near full and
>> there is ~1sec of queuing in the steady state. That is, rather than
>> "Low Extra Delay" there will be 1sec of extra delay. :-)
>>
>> best,
>> neal
>>