Re: [iccrg] LEDBAT++, rLEDBAT, and slowdowns

Neal Cardwell <ncardwell@google.com> Mon, 09 September 2019 14:10 UTC

Return-Path: <ncardwell@google.com>
X-Original-To: iccrg@ietfa.amsl.com
Delivered-To: iccrg@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 073021202DD for <iccrg@ietfa.amsl.com>; Mon, 9 Sep 2019 07:10:36 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -17.499
X-Spam-Level:
X-Spam-Status: No, score=-17.499 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIMWL_WL_MED=-0.001, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, ENV_AND_HDR_SPF_MATCH=-0.5, HTML_MESSAGE=0.001, RCVD_IN_DNSWL_NONE=-0.0001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001, USER_IN_DEF_DKIM_WL=-7.5, USER_IN_DEF_SPF_WL=-7.5] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=google.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id RYeiB3uYivfS for <iccrg@ietfa.amsl.com>; Mon, 9 Sep 2019 07:10:32 -0700 (PDT)
Received: from mail-ot1-x331.google.com (mail-ot1-x331.google.com [IPv6:2607:f8b0:4864:20::331]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id D0732120047 for <iccrg@irtf.org>; Mon, 9 Sep 2019 07:10:32 -0700 (PDT)
Received: by mail-ot1-x331.google.com with SMTP id 67so12547015oto.3 for <iccrg@irtf.org>; Mon, 09 Sep 2019 07:10:32 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20161025; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=lDGOg0DN9pCTPb5WNfKRpIEs2WUydQ/N/OiWg4vy2t4=; b=cjvlf+O/P1oy8MNJzn1l0f9GDzMhkHflsfCygn/mQoV0DdiFd046YMiByGxM1HhBF+ xHtRW1O9A7dGa9oEehOdR2lUkYlYyWoHN1G4z68RnMIvQ5GU13DNTl33Ujr/cABilPx8 csw4m1yhkGuZhidGO8AD5RNDb3+YzAyasYaG04Smm/EGbvfQtFDxCSCvgJuRUubVi2us 2fzZvtF3BTDiJvZCNog66BFxC4J2PJzAKkAKIArkp/3iv0vm0RnjlyV2NQ6wMhppJkeo Nae1ZT2qwKWMT/Lq41L5D4hKKNQqobLrq9+MovuukLy+KG/gXkHs/CqxD5wswKKjVbvV GTxw==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=lDGOg0DN9pCTPb5WNfKRpIEs2WUydQ/N/OiWg4vy2t4=; b=OP32LiNaDbQip9tJSg8ZwNwC9DFIGSahlVZvcCpu8rUQBdtK92wGRS1tDg4OjS9V3W fFdYD5OxnC14GlQlM8DrxjjqdYI/MMtoCKjHL39/O5440FM14jct4bHZxdnDCAkrWTBi QH/pvndbyev0Rwp3XAfM50k5RBPGk0Lo34fW1bD/GUM8KnGUSk55Trp+D1F2etS2P8oz RvmlQdB0KFgeLupp0qxHMAiB5whaXykA+VJaraXD2MEoZdsVRt1Zq4ftYfYbmNzRIttw ZeSCUotJ73rV/QPI6ev7H8THX0iVyMlZ4+XIHpzG3L2ac9sTI8h/Xm7bTJLVLpRxyDyI tCpQ==
X-Gm-Message-State: APjAAAXE74BlIKgAXCBoFopGpcJQ+H+4bE2SDcFkOuwermdSYien1oSu YamRpQXI0dZ8IVFGS6LX4swWZ0pK3LZ20iaBh/elIw==
X-Google-Smtp-Source: APXvYqzKGvk7ng2PHaHT/NPl9mHygn+diI80KX0uAe2p17SHQ+OEgeKXajHvB9XHuKUnUQQJONkFPUEDnryBDvJfPb0=
X-Received: by 2002:a05:6830:1f12:: with SMTP id u18mr20896660otg.255.1568038231712; Mon, 09 Sep 2019 07:10:31 -0700 (PDT)
MIME-Version: 1.0
References: <CADVnQynJY8xkqkghhCWhPpbF+4Ev_3c7OZf_tDEb_J5xr0FV9A@mail.gmail.com> <c4b76af5-abbe-3184-24ce-03a2c0b9544b@it.uc3m.es> <CADVnQy=3FqEjqipX6thgjcjN8YPTOqiduYKU2GccXHwS+a3wVA@mail.gmail.com> <be98e323-506f-bdc8-a128-72c9f4aa5ead@it.uc3m.es> <3862e86e-e588-0cca-38e8-4ea23ef2b4c7@it.uc3m.es> <CADVnQymqHHg4cF94fu81opaBibmShVrsZ3cLyVPcHir0p=Kv3Q@mail.gmail.com> <30f3abb2-a78f-6da6-e67a-1532d342ce9e@it.uc3m.es>
In-Reply-To: <30f3abb2-a78f-6da6-e67a-1532d342ce9e@it.uc3m.es>
From: Neal Cardwell <ncardwell@google.com>
Date: Mon, 09 Sep 2019 10:10:15 -0400
Message-ID: <CADVnQynkVTc+vD7bi1NhRTYQ85m_MV4PMEbfO9Gz=atzg7PX0w@mail.gmail.com>
To: marcelo bagnulo braun <marcelo@it.uc3m.es>
Cc: iccrg IRTF list <iccrg@irtf.org>, Praveen Balasubramanian <pravb@microsoft.com>, Yuchung Cheng <ycheng@google.com>
Content-Type: multipart/alternative; boundary="000000000000cc169505921f59d5"
Archived-At: <https://mailarchive.ietf.org/arch/msg/iccrg/t1qNlUEytcXLl6Vo_9-qvCjbsyI>
Subject: Re: [iccrg] LEDBAT++, rLEDBAT, and slowdowns
X-BeenThere: iccrg@irtf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: "Discussions of Internet Congestion Control Research Group \(ICCRG\)" <iccrg.irtf.org>
List-Unsubscribe: <https://www.irtf.org/mailman/options/iccrg>, <mailto:iccrg-request@irtf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/iccrg/>
List-Post: <mailto:iccrg@irtf.org>
List-Help: <mailto:iccrg-request@irtf.org?subject=help>
List-Subscribe: <https://www.irtf.org/mailman/listinfo/iccrg>, <mailto:iccrg-request@irtf.org?subject=subscribe>
X-List-Received-Date: Mon, 09 Sep 2019 14:10:36 -0000

On Mon, Sep 9, 2019 at 4:35 AM marcelo bagnulo braun <marcelo@it.uc3m.es>
wrote:

> Hi Neal,
>
> Thanks for the explanation. I can see now that uncoordinated slow downs
> is not enough to provide a bounded delay.
> I have an additional question related to this, that maybe you can
> provide some insight.
> Some papers, like [1], have proposed an AIMD variation of LEDBAT to
> achieve interledbat fairness. Since laedbat late comer advantage is
> essentially the problem of a late comer misestimating the base delay,
> and bloating the queueing delay, seems essentially the same problem. So
> the question is wouldnt AIMD solve the problem? I mean, wouldnt AIMD
> provide fairness and keep the queueing delay added by the ledbat flows
> bounded as well?
>

I'm pretty sure AIMD by itself would not solve the problem of unbounded
queuing delay for LEDBAT.

AIMD can increase fairness. But with enough flows the multiplicative
decrease aspect of AIMD, if it is uncoordinated, will fail to drain the
bottleneck queue, and thus will allow the queuing delays and base delay
estimates to keep ratcheting upward.

neal



> Thanks, marcelo
>
>
> El 06/09/19 a las 17:25, Neal Cardwell escribió:
> > However, if we apply the slowdown described in LEDBAT++
> >> (unsynchronized), then f1 would reduce its window and because f1 is the
> >> one expelling the other flows out, this would allow f1 to have a more
> >> accurate measure of the base delay, enabling the AIMD fairness to kick
> >> in and preventing the increase of the queueing delay.
> > As noted above, if the fair share of inflight data is less than the
> > standing queue, when f1 reduces its window it will not drain the queue
> > fully, so its base delay estimate will ratchet up to a value that
> > includes the queueing delay from all the other flows queued at
> > the bottleneck, even if all the others are LEDBAT++ flows.
> >
> >
> >> In summary, because errors in estimating the queuing delay causes one
> >> flow to grow more than the others, when this particular flow slows down,
> >> will allow a more accurate measure of the base delay, enabling AIMD
> >> fairness mechanisms to reconverge to a fair split.
> >>
> >> So, I am not convinced that it is not enough with the slow down proposed
> >> in LEDBAT++.
> > When you say a slow-down "will allow a more accurate measure of the
> > base delay," I think a key question is: more accurate than what, exactly?
> > An uncoordinated slow-down will allow a more accurate measure
> > of the base delay than the most recent RTT samples, but it will not
> > allow a measure that is more accurate than (or even matching)
> > the current base delay estimate.That's because uncoordinated
> > slow-downs can cause feedback loops, where flows measure
> > ever-increasing RTTs from the queues created by other flows
> > that are also unable to "anchor" their base delay estimates, so
> > are also allowing ever-increasing amounts of data in flight.
> >
> >> I am not sure this is true for all combinations of parameter values, i
> >> need to look into this.
> > Yes, I think further testing of LEDBAT++/rLEDBAT in multi-flow scenarios
> > would be worthwhile.
> >
> > A simple scenario that should illustrate this issue:
> >   o RTT of 100ms
> >   o bottleneck with 10*BDP of buffer space (allowing 1 sec of queuing)
> >   o 3 bulk LEDBAT++/rLEDBAT flows
> >
> > Per the dynamics discussed above, my theory is that in such a scenario
> > the queuing delay will grow until the buffer is oscillating near full and
> > there is ~1sec of queuing in the steady state. That is, rather than
> > "Low Extra Delay" there will be 1sec of extra delay. :-)
> >
> > best,
> > neal
> >
>
>