Re: [iccrg] LEDBAT++, rLEDBAT, and slowdowns

marcelo bagnulo braun <marcelo@it.uc3m.es> Mon, 09 September 2019 08:57 UTC

Return-Path: <marcelo@it.uc3m.es>
X-Original-To: iccrg@ietfa.amsl.com
Delivered-To: iccrg@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id C4F67120144 for <iccrg@ietfa.amsl.com>; Mon, 9 Sep 2019 01:57:20 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2
X-Spam-Level:
X-Spam-Status: No, score=-2 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, RCVD_IN_DNSWL_NONE=-0.0001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=it.uc3m.es
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id Wtp-aSkgiMYw for <iccrg@ietfa.amsl.com>; Mon, 9 Sep 2019 01:57:17 -0700 (PDT)
Received: from mail-wm1-x330.google.com (mail-wm1-x330.google.com [IPv6:2a00:1450:4864:20::330]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 4B357120142 for <iccrg@irtf.org>; Mon, 9 Sep 2019 01:57:17 -0700 (PDT)
Received: by mail-wm1-x330.google.com with SMTP id n10so13665810wmj.0 for <iccrg@irtf.org>; Mon, 09 Sep 2019 01:57:17 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=it.uc3m.es; s=google; h=subject:to:cc:references:from:message-id:date:user-agent :mime-version:in-reply-to:content-transfer-encoding:content-language; bh=Ic/cz+b/641ioU1zqGCQ+6JSftbcBvkdJndLaMswo5Q=; b=LLKRGshQK5UeVkdVQ+PvYVV0POVVzrbkkHpHI1iwLpktgNSmcTPsyPptg95942jQ1u x+1cyjVdGr2XfefcZV7mO4vA4YX02sDNl/aKqxSeKoC7e3W4RwXN0YJaizZjPbTbvuvJ 7yBbozFcyW4WNySeJeQ+Fh5vDA1TvMeNblPGVMasxTXzfDB1aYsHjDnRuIa8P77JB5Dr /Nyj5dEbxTyQzRn8dsLLTYhaMY/4KfGTK2O25cCbHF+GJw58qBIDPE6IrpVHeGsdtLkD /qYisB9AMGTPj13YnbU5nCPQgI6wTEuYnQ0tdIVNn0q0xxpN4h4ek8uD0Ej/NtEvtppq Gv2Q==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:subject:to:cc:references:from:message-id:date :user-agent:mime-version:in-reply-to:content-transfer-encoding :content-language; bh=Ic/cz+b/641ioU1zqGCQ+6JSftbcBvkdJndLaMswo5Q=; b=ri5dZ1KYjyaGLGvQAgWOeAeTuCNB7lD0rYprHoQ1OlJlVv8NheaPiHtDvA+x+rJCLP SS9QriHlmSLOXisFBxylUGdY6xc0TehDlOmlEjavKDKSIRPACpvhqfXtl4q+y0d3OjAK peGDOqhPQ3DsSVtz97UkXmCF1URMZJLnGZRnemqfCFNOBb0el/fIN3USNleLYbslV5ai 2+s35sKWGLORnNPTV5Fe1bSyLwrmJbNGnKG/2A0XbOgTpuNby2JOasyPohJNdZU8aG0Y jw4DiEuJid+vtYEh4tI+rnVgB9C/gq2wl95khP8ztAtat+tfUZ2DAczjoM+EqeFfq8Zm m+VA==
X-Gm-Message-State: APjAAAXaX9/URXFpLVSBdGsoExYUrGIj1pnZpf+jl0+ZUgEodfXoGNvu zT4QJbP55GSbV/V+7LX3/GoCbA==
X-Google-Smtp-Source: APXvYqwm7Iurabi/e1Vb3VrTJW5Z/kItnOFZicyz4Dp1SfkketftP1To8KfzyHV71Zt+FGVqRx8x8A==
X-Received: by 2002:a7b:cc82:: with SMTP id p2mr17374743wma.165.1568019435671; Mon, 09 Sep 2019 01:57:15 -0700 (PDT)
Received: from Macintosh-6.local ([163.117.139.228]) by smtp.gmail.com with ESMTPSA id a190sm16844704wme.8.2019.09.09.01.57.14 (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Mon, 09 Sep 2019 01:57:15 -0700 (PDT)
To: Neal Cardwell <ncardwell@google.com>
Cc: iccrg IRTF list <iccrg@irtf.org>, Praveen Balasubramanian <pravb@microsoft.com>, Yuchung Cheng <ycheng@google.com>
References: <CADVnQynJY8xkqkghhCWhPpbF+4Ev_3c7OZf_tDEb_J5xr0FV9A@mail.gmail.com> <c4b76af5-abbe-3184-24ce-03a2c0b9544b@it.uc3m.es> <CADVnQy=3FqEjqipX6thgjcjN8YPTOqiduYKU2GccXHwS+a3wVA@mail.gmail.com> <be98e323-506f-bdc8-a128-72c9f4aa5ead@it.uc3m.es> <3862e86e-e588-0cca-38e8-4ea23ef2b4c7@it.uc3m.es> <CADVnQymqHHg4cF94fu81opaBibmShVrsZ3cLyVPcHir0p=Kv3Q@mail.gmail.com> <30f3abb2-a78f-6da6-e67a-1532d342ce9e@it.uc3m.es>
From: marcelo bagnulo braun <marcelo@it.uc3m.es>
Message-ID: <f4646c07-9872-8980-44d4-8b231a24d355@it.uc3m.es>
Date: Mon, 09 Sep 2019 10:57:15 +0200
User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.9; rv:60.0) Gecko/20100101 Thunderbird/60.8.0
MIME-Version: 1.0
In-Reply-To: <30f3abb2-a78f-6da6-e67a-1532d342ce9e@it.uc3m.es>
Content-Type: text/plain; charset="utf-8"; format="flowed"
Content-Transfer-Encoding: 8bit
Content-Language: en-GB
Archived-At: <https://mailarchive.ietf.org/arch/msg/iccrg/jtYBt4gBp3Z9qOkpVZSaHUC91_s>
Subject: Re: [iccrg] LEDBAT++, rLEDBAT, and slowdowns
X-BeenThere: iccrg@irtf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: "Discussions of Internet Congestion Control Research Group \(ICCRG\)" <iccrg.irtf.org>
List-Unsubscribe: <https://www.irtf.org/mailman/options/iccrg>, <mailto:iccrg-request@irtf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/iccrg/>
List-Post: <mailto:iccrg@irtf.org>
List-Help: <mailto:iccrg-request@irtf.org?subject=help>
List-Subscribe: <https://www.irtf.org/mailman/listinfo/iccrg>, <mailto:iccrg-request@irtf.org?subject=subscribe>
X-List-Received-Date: Mon, 09 Sep 2019 08:57:21 -0000

Forgot the reference:
[1] Rethinking the Low Extra Delay Background Transport (LEDBAT) Protocol
Carofiglio, Giovanna; Muscariello, Luca; Rossi, Dario; Testa, Claudio; 
Valenti, Silvio
ISSN: 1389-1286 , 1872-7069; DOI: 10.1016/j.comnet.2013.02.020
Computer networks. , 2013, Vol.57(8), p.1838-1852

El 09/09/19 a las 10:35, marcelo bagnulo braun escribió:
> Hi Neal,
>
> Thanks for the explanation. I can see now that uncoordinated slow 
> downs is not enough to provide a bounded delay.
> I have an additional question related to this, that maybe you can 
> provide some insight.
> Some papers, like [1], have proposed an AIMD variation of LEDBAT to 
> achieve interledbat fairness. Since laedbat late comer advantage is 
> essentially the problem of a late comer misestimating the base delay, 
> and bloating the queueing delay, seems essentially the same problem. 
> So the question is wouldnt AIMD solve the problem? I mean, wouldnt 
> AIMD provide fairness and keep the queueing delay added by the ledbat 
> flows bounded as well?
> Thanks, marcelo
>
>
> El 06/09/19 a las 17:25, Neal Cardwell escribió:
>> However, if we apply the slowdown described in LEDBAT++
>>> (unsynchronized), then f1 would reduce its window and because f1 is the
>>> one expelling the other flows out, this would allow f1 to have a more
>>> accurate measure of the base delay, enabling the AIMD fairness to kick
>>> in and preventing the increase of the queueing delay.
>> As noted above, if the fair share of inflight data is less than the
>> standing queue, when f1 reduces its window it will not drain the queue
>> fully, so its base delay estimate will ratchet up to a value that
>> includes the queueing delay from all the other flows queued at
>> the bottleneck, even if all the others are LEDBAT++ flows.
>>
>>
>>> In summary, because errors in estimating the queuing delay causes one
>>> flow to grow more than the others, when this particular flow slows 
>>> down,
>>> will allow a more accurate measure of the base delay, enabling AIMD
>>> fairness mechanisms to reconverge to a fair split.
>>>
>>> So, I am not convinced that it is not enough with the slow down 
>>> proposed
>>> in LEDBAT++.
>> When you say a slow-down "will allow a more accurate measure of the
>> base delay," I think a key question is: more accurate than what, 
>> exactly?
>> An uncoordinated slow-down will allow a more accurate measure
>> of the base delay than the most recent RTT samples, but it will not
>> allow a measure that is more accurate than (or even matching)
>> the current base delay estimate.That's because uncoordinated
>> slow-downs can cause feedback loops, where flows measure
>> ever-increasing RTTs from the queues created by other flows
>> that are also unable to "anchor" their base delay estimates, so
>> are also allowing ever-increasing amounts of data in flight.
>>
>>> I am not sure this is true for all combinations of parameter values, i
>>> need to look into this.
>> Yes, I think further testing of LEDBAT++/rLEDBAT in multi-flow scenarios
>> would be worthwhile.
>>
>> A simple scenario that should illustrate this issue:
>>   o RTT of 100ms
>>   o bottleneck with 10*BDP of buffer space (allowing 1 sec of queuing)
>>   o 3 bulk LEDBAT++/rLEDBAT flows
>>
>> Per the dynamics discussed above, my theory is that in such a scenario
>> the queuing delay will grow until the buffer is oscillating near full 
>> and
>> there is ~1sec of queuing in the steady state. That is, rather than
>> "Low Extra Delay" there will be 1sec of extra delay. :-)
>>
>> best,
>> neal
>>
>