Re: [aqm] FQ-PIE kernel module implementation

"Francini, Andrea (Andrea)" <andrea.francini@alcatel-lucent.com> Tue, 07 July 2015 19:04 UTC

From: "Francini, Andrea (Andrea)" <andrea.francini@alcatel-lucent.com>
To: "Agarwal, Anil" <Anil.Agarwal@viasat.com>, Polina Goltsman <polina.goltsman@student.kit.edu>, "Bless, Roland (TM)" <roland.bless@kit.edu>, "Fred Baker (fred)" <fred@cisco.com>, Toke Høiland-Jørgensen <toke@toke.dk>
Date: Tue, 07 Jul 2015 19:04:48 +0000
Archived-At: <http://mailarchive.ietf.org/arch/msg/aqm/Ayewf_jzTIF0nNkeWSzw2VKf9_I>
Cc: "draft-ietf-aqm-pie@tools.ietf.org" <draft-ietf-aqm-pie@tools.ietf.org>, "Hironori Okano -X (hokano - AAP3 INC at Cisco)" <hokano@cisco.com>, AQM IETF list <aqm@ietf.org>
Subject: Re: [aqm] FQ-PIE kernel module implementation

Hi Anil,

One comment about the first point of your summary:

While FQ-PIE does reduce TCP throughput below the fair share, a single-queue AQM does even worse in the same scenario, where the input rate of the UDP flow exceeds the output rate of the queue (no TCP throughput at all). I also suspect that, if the FQ-PIE experiment is repeated with a smaller RTT, closer to the PIE delay target, we may see some improvement for TCP (and more so with CUBIC than with Reno).

FQ-AQM with per-queue state (including the case of a fixed tail-drop threshold per queue) does succeed in enforcing the fair share, but if the drop threshold is oversized compared to the flow RTT, the price to pay is a large self-inflicted queuing delay.
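As a rough back-of-the-envelope sketch in Python (the threshold, packet size, and drain rate below are assumptions for illustration, not values from any experiment in this thread): a per-flow tail-drop threshold that is generous relative to the fair-share rate quickly translates into hundreds of milliseconds of self-inflicted delay.

    # Illustrative arithmetic only; threshold, packet size, and drain rate are assumptions.
    limit_pkts = 1000         # per-queue tail-drop threshold, in packets
    pkt_bytes = 1500          # MTU-sized packets
    fair_share_bps = 50e6     # the flow's fair-share drain rate

    delay_s = limit_pkts * pkt_bytes * 8 / fair_share_bps
    print(delay_s * 1e3)      # 240.0 ms -- far above typical AQM delay targets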

It is true that any scheme that uses aggregate state (typically the overall buffer occupancy or queuing delay) to make drop decisions will lose flow isolation/protection to some extent. However, there are important quantitative differences that may emerge depending on the way the FQ-AQM uses the aggregate state.

Regards,

Andrea

From: aqm [mailto:aqm-bounces@ietf.org] On Behalf Of Agarwal, Anil
Sent: Tuesday, July 07, 2015 1:31 PM
To: Polina Goltsman; Bless, Roland (TM); Fred Baker (fred); Toke Høiland-Jørgensen
Cc: draft-ietf-aqm-pie@tools.ietf.org; Hironori Okano -X (hokano - AAP3 INC at Cisco); AQM IETF list
Subject: Re: [aqm] FQ-PIE kernel module implementation

Polina, Roland,

This is good info.
So, here is a short summary of our analysis for FQ-PIE with aggregate-queue AQM:


1. In the presence of unresponsive flows, FQ-PIE has properties similar to single-queue AQMs: the responsive flows are squeezed down to whatever bandwidth is left over, if any. FQ-AQM with per-queue AQM performs better.

2. In the presence of flows that do not use their fair share (temporarily or permanently), FQ-PIE has properties similar to single-queue AQMs: the flows that do not use their fair share still experience non-zero packet drops. FQ-AQM with per-queue AQM performs better.

3. In the presence of flows that do not use their fair share (temporarily or permanently), the queue size and queuing delay of the flows that do use their fair share can grow above the desired target value.

#2 and #3 are probably not major issues, especially at a network bottleneck with a large number of diverse flows.
But it is worth pointing out and documenting these properties (somewhere).

Regards,
Anil



From: Polina Goltsman [mailto:polina.goltsman@student.kit.edu]
Sent: Tuesday, July 07, 2015 5:09 AM
To: Bless, Roland (TM); Agarwal, Anil; Fred Baker (fred); Toke Høiland-Jørgensen
Cc: draft-ietf-aqm-pie@tools.ietf.org; Hironori Okano -X (hokano - AAP3 INC at Cisco); AQM IETF list
Subject: Re: [aqm] FQ-PIE kernel module implementation

Hello all,

Here are my thoughts about the interaction of an AQM and a fair-queueing system.

I think I will start with a figure. I started a TCP flow with netperf and, 15 seconds later, an unresponsive UDP flow with iperf, with a send rate slightly above the bottleneck link capacity. Both flows run together for 50 seconds.
The figure plots the throughput of the UDP flow as reported by the iperf server. (Apparently netperf doesn't produce any output if the throughput is below some value, so I can't plot the TCP flow.) The bottleneck is 100 Mb/s and the RTT is 100 ms. All AQMs were configured with their default values and the noecn flag.
[Figure: UDP flow throughput over time, as reported by the iperf server, for each AQM under test]

Here is my example in theory. A link with capacity C is shared between two flows: a non-application-limited TCP flow and an unresponsive UDP flow with a send rate of 105% of C. Both flows send max-sized packets, so round robin can be used instead of a fair-queueing scheduler.

By the definition of max-min fair share, each flow is supposed to get 50% of the link capacity.

(1) Taildrop queues:
UDP packets will be dropped when the UDP queue is full, and TCP packets will be dropped when the TCP queue is full. As long as there are packets in the TCP flow's queue, TCP should receive its fair share. (As far as I understand, this depends on the size of the queue.)

(2) AQM with state per queue:
The drop probability of the UDP flow will always be non-zero and should stabilize around 0.5.
The drop probability of the TCP flow will be non-zero only when it starts sending above 50% of C. Thus, while TCP recovers from packet drops, it should not receive another drop.

(3) AQM with state per aggregate:
The UDP flow always creates a standing queue, so the drop probability of the aggregate is always non-zero. Let's call it p_aqm.
The share of TCP packets in the aggregate is p_tcp = TCP send rate / (TCP send rate + UDP send rate), and the probability of dropping a TCP packet is p_aqm * p_tcp. This probability is non-zero unless TCP does not send at all.

In (3) the drop probability is at least different. I assume that it is larger than in (2), which will cause more packet drops for the TCP flow; as a result, the flow will reduce its sending rate below its fair share.
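To put rough numbers on this, here is a minimal Python restatement of cases (2) and (3). The link and flow rates are the ones from the example above; the steady-state value of p_aqm is an assumption (chosen so that the aggregate AQM drops just enough to fit the offered load into the link), not a measured value.

    # Minimal restatement of cases (2) and (3); the aggregate drop probability is an assumption.
    C = 100e6                   # link capacity, bit/s
    udp_send = 1.05 * C         # unresponsive UDP send rate
    tcp_send = 0.50 * C         # suppose TCP currently sends exactly its fair share

    # (2) AQM state per queue: each queue only has to cut its own flow down to C/2.
    p_udp_per_queue = 1 - (C / 2) / udp_send             # ~0.52, close to the 0.5 above
    p_tcp_per_queue = max(0.0, 1 - (C / 2) / tcp_send)   # 0.0 while TCP stays within its share

    # (3) AQM state on the aggregate: one drop probability p_aqm for all arrivals.
    # Assume p_aqm settles where it just cuts 1.55*C of arrivals down to C.
    p_aqm = 1 - C / (tcp_send + udp_send)                 # ~0.35
    p_tcp = tcp_send / (tcp_send + udp_send)              # TCP's share of arriving packets, ~0.32
    dropped_tcp_share = p_aqm * p_tcp                     # arrivals that are dropped TCP packets

    print(round(p_udp_per_queue, 2), p_tcp_per_queue)     # 0.52 0.0
    print(round(p_aqm, 2), round(dropped_tcp_share, 2))   # 0.35 0.11 -> non-zero, unlike case (2)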

Regards,
Polina
On 07/07/2015 10:06 AM, Bless, Roland (TM) wrote:

Hi,

thanks for your analysis. Indeed, Polina came up with a similar analysis for an unresponsive UDP flow and a TCP flow. Flow queueing can achieve link share fairness despite the presence of unresponsive flows, but is ineffective if the AQM is applied to the aggregate and not to the individual flow queue. Polina used the FQ-PIE implementation to verify this behavior (post will follow).

Regards,
 Roland





On 04.07.2015 at 22:12, Agarwal, Anil wrote:

Roland, Fred,

Here is a simple example to illustrate the differences between FQ-AQM with AQM per queue vs AQM per aggregate queue.

Let's take 2 flows, each mapped to separate queues in a FQ-AQM system.
   Link rate = 100 Mbps
   Flow 1 rate = 50 Mbps, source rate does not go over 50 Mbps
   Flow 2 rate >= 50 Mbps, adapts based on AQM.



FQ-Codel, AQM per queue:
   Flow 1 delay is minimal
   Flow 1 packet drops = 0
   Flow 2 delay is close to target value



FQ-Codel, AQM for aggregate queue:
   Does not work at all
   Packets are dequeued alternately from queue 1 and queue 2
   Packets from queue 1 experience very small queuing delay
   Hence, CoDel does not enter the dropping state and queue 2 is not controlled :(
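A minimal Python sketch of why this happens, using only a simplified version of CoDel's trigger condition (the 5 ms target and 100 ms interval are CoDel's usual defaults; the sojourn times and packet spacing are assumptions): CoDel starts dropping only after the sojourn time has stayed above target for a whole interval, and every near-zero-delay packet from queue 1 resets that condition.

    # Simplified sketch of CoDel's trigger condition on the *aggregate* sojourn time.
    TARGET = 0.005      # s, CoDel target (usual default)
    INTERVAL = 0.100    # s, CoDel interval (usual default)

    def would_enter_dropping(sojourns, spacing=0.0001):
        """True if the sojourn time stays above TARGET for a full INTERVAL.
        sojourns: per-dequeued-packet sojourn times; spacing: time between dequeues."""
        first_above = None
        now = 0.0
        for s in sojourns:
            now += spacing
            if s < TARGET:
                first_above = None              # below target: the interval timer resets
            elif first_above is None:
                first_above = now + INTERVAL    # start waiting for one full interval
            elif now >= first_above:
                return True                     # above target for a whole interval -> drop state
        return False

    # Aggregate queue: dequeues alternate between queue 1 (~0 delay) and queue 2 (~30 ms).
    alternating = [0.0002, 0.030] * 5000
    # What a per-queue CoDel instance on queue 2 alone would see.
    only_queue2 = [0.030] * 10000

    print(would_enter_dropping(alternating))   # False: queue-1 packets keep resetting the timer
    print(would_enter_dropping(only_queue2))   # True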



FQ-PIE, AQM per queue:
   Flow 1 delay is minimal
   Flow 1 packet drops = 0
   Flow 2 delay is close to target value



FQ-PIE, AQM for aggregate queue:
   Flow 1 delay and queue 1 length are close to zero.
   Flow 2 delay is close to 2 * target_del :(
           qlen2 = target_del * aggregate_depart_rate
   Flow 1 experiences almost the same number of drops or ECNs as flow 2 :(
           Same drop probability and almost same packet rate for both flows
   (If flow 1 drops its rate because of packet drops or ECNs, the analysis gets slightly more complicated.)
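A quick numeric check of the "2 * target_del" claim, in Python; the 15 ms target is an assumed PIE setting, and the rates are the ones from the example.

    # Worked numbers for "Flow 2 delay is close to 2 * target_del"; values are assumptions.
    target_del = 0.015            # s, assumed PIE delay target
    agg_depart_rate = 100e6       # bit/s, aggregate departure rate (the whole link)
    flow2_rate = 50e6             # bit/s, flow 2's round-robin share of the link

    # PIE on the aggregate steers the *aggregate* delay estimate (backlog / departure
    # rate) to target_del, and that backlog sits almost entirely in queue 2.
    qlen2 = target_del * agg_depart_rate      # standing queue, in bits
    flow2_delay = qlen2 / flow2_rate          # queue 2 drains at only half the link rate

    print(qlen2 / 8 / 1500)    # ~125 packets of 1500 bytes
    print(flow2_delay)         # 0.03 s = 2 * target_del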



See if this makes sense.

If the analysis is correct, then it illustrates that flow behaviors are quite different between AQM per queue and AQM per aggregate queue schemes.

In FQ-PIE for aggregate queue,
   - The total number of queued bytes will slosh between queues depending on the nature and data rates of the flows.
   - Flows with data rates within their fair share value will experience non-zero packet drops (or ECN marks).
   - Flows that experience no queuing delay will increase the queuing delay of other flows.
   - In general, the queuing delay for any given flow will not be close to target_delay and can be much higher.