Re: [AVTCORE] [rtcweb] [tsvwg] WG Last Call on changes: draft-ietf-avtcore-rtp-circuit-breakers-16

Colin Perkins <csp@csperkins.org> Mon, 27 June 2016 22:02 UTC

Return-Path: <csp@csperkins.org>
X-Original-To: avt@ietfa.amsl.com
Delivered-To: avt@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 1499512D8B1; Mon, 27 Jun 2016 15:02:33 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -4.2
X-Spam-Level:
X-Spam-Status: No, score=-4.2 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, RCVD_IN_DNSWL_MED=-2.3] autolearn=ham autolearn_force=no
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id c90lEi2M0KD5; Mon, 27 Jun 2016 15:02:30 -0700 (PDT)
Received: from balrog.mythic-beasts.com (balrog.mythic-beasts.com [46.235.227.24]) (using TLSv1.2 with cipher DHE-RSA-AES128-SHA (128/128 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id E2FE812D9B5; Mon, 27 Jun 2016 15:02:29 -0700 (PDT)
Received: from [81.187.2.149] (port=39596 helo=[192.168.0.91]) by balrog.mythic-beasts.com with esmtpsa (TLS1.0:DHE_RSA_AES_256_CBC_SHA1:256) (Exim 4.80) (envelope-from <csp@csperkins.org>) id 1bHebq-0004LA-92; Mon, 27 Jun 2016 23:02:26 +0100
Content-Type: text/plain; charset=utf-8
Mime-Version: 1.0 (Mac OS X Mail 9.3 \(3124\))
From: Colin Perkins <csp@csperkins.org>
In-Reply-To: <6E35FB6C-CA98-413C-B7AE-75402A968017@ifi.uio.no>
Date: Mon, 27 Jun 2016 23:02:17 +0100
Content-Transfer-Encoding: quoted-printable
Message-Id: <3FD27BBF-8E2D-4A42-86A0-C4C0692FF8C9@csperkins.org>
References: <ccf9f2d7-2694-4336-0ec9-ccfebfeb0120@ericsson.com> <CE03DB3D7B45C245BCA0D243277949362F585D3E@MX307CL04.corp.emc.com> <d97e30a7-70f5-26d0-c3a4-0497c669f5f6@ericsson.com> <CE03DB3D7B45C245BCA0D243277949362F586054@MX307CL04.corp.emc.com> <D19E595F-7C66-4AE9-92B4-D550A93F634D@csperkins.org> <CE03DB3D7B45C245BCA0D243277949362F589335@MX307CL04.corp.emc.com> <20160616222548.GB77166@verdi> <0643E158-BF26-4692-8167-B7A959CB20CE@csperkins.org> <CE03DB3D7B45C245BCA0D243277949362F596DBC@MX307CL04.corp.emc.com> <E16BEA87-1D0F-48F1-A9AC-2729079D581D@tik.ee.ethz.ch> <8C16F1C6-B4A7-4BB4-B215-D7E7EAF308F8@erg.abdn.ac.uk> <CE03DB3D7B45C245BCA0D243277949362F59C41D@MX307CL04.corp.emc.com> <3E053A65-2698-4749-8E3D-E0451DF84011@ifi.uio.no> <BF6B00CC65FD2D45A326E74492B2C19FB76A6433@FR711WXCHMBA05.zeu.alcatel-lucent.com> <32a23d69d22062669f78df806a4eb6b8.squirrel@erg.abdn.ac.uk> <BF6B00CC65FD2D45A326E74492B2C19FB76A659B@FR711WXCHMBA05.zeu.alcatel-lucent.com> <CE03DB3D7B45C245BCA0D243277949362F5 AEE02@MX307CL04.corp.emc.com> <6E35FB6C-CA98-413C-B7AE-75402A968017@ifi.uio.no>
To: Michael Welzl <michawe@ifi.uio.no>
X-Mailer: Apple Mail (2.3124)
X-BlackCat-Spam-Score: -28
X-Mythic-Debug: Threshold = On =
Archived-At: <https://mailarchive.ietf.org/arch/msg/avt/M7sxof5JtGQzUokfNOrZpBFmHp4>
Cc: "Black, David" <david.black@emc.com>, "De Schepper, Koen \(Nokia - BE\)" <koen.de_schepper@nokia-bell-labs.com>, "rtcweb@ietf.org" <rtcweb@ietf.org>, tsvwg <tsvwg@ietf.org>, IETF AVTCore WG <avt@ietf.org>
Subject: Re: [AVTCORE] [rtcweb] [tsvwg] WG Last Call on changes: draft-ietf-avtcore-rtp-circuit-breakers-16
X-BeenThere: avt@ietf.org
X-Mailman-Version: 2.1.17
Precedence: list
List-Id: Audio/Video Transport Core Maintenance <avt.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/avt>, <mailto:avt-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/avt/>
List-Post: <mailto:avt@ietf.org>
List-Help: <mailto:avt-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/avt>, <mailto:avt-request@ietf.org?subject=subscribe>
X-List-Received-Date: Mon, 27 Jun 2016 22:02:33 -0000

> On 27 Jun 2016, at 21:52, Michael Welzl <michawe@ifi.uio.no> wrote:
> 
> David,
> 
> 
>> On 27. jun. 2016, at 22.09, Black, David <david.black@emc.com> wrote:
>> 
>>> As long as an AQM is marking at the same rate as dropping
>> 
>> That's an interesting assumption - it should be true for AQMs vetted
>> here in the past, but there are easy ways for it not to hold (e.g., if dropping
>> or marking is based on queue occupancy, it is possible that dropping
>> reduces queue occupancy in a fashion that marking does not).
>> 
>> For ECN "classic" (i.e., see RFC 3168) where ECN-CE markings are treated
>> as drop-equivalent, that is for congestion control purposes, which is similar
>> to, (but not the same as) the throughput estimation usage for the RTP circuit
>> breaker.    I'll note that ECN "classic" was designed congestion control
>> algorithms for react to ECN-CE marks once per RTT, independent of how
>> many ECN-CE marks are observed in an RTT.
>> 
>> Gorry wrote:
>> 
>>>> in this context we should use ECN to drive a CC algorithm and we should be
>>>> cautious to avoid requiring its use within a Circuit Breaker - optional
>>>> use, if you understand how to interpret a reaction to many CE-marks as
>>>> excessive congestion, are permitted.
>> 
>> Something like that may be workable, starting with a clear distinction between
>> the use of ECN by CC (routine, active at all times) and ECN by a circuit
>> breaker (monitors for evidence that things have gotten bad, only activated
>> when things get bad).   This would baseline the RTP circuit breaker on drops
>> and allow use of ECN as additional evidence of problems, in contrast to
>> congestion control where ECN-CE is effectively treated as drop-equivalent.
>> 
>> I'm not quite sure how to specify "use of ECN as additional evidence" of
>> "excessive congestion" as drop-equivalence is about the best we have
>> for current guidance.
> 
> I fail to parse that sentence, so maybe I’m getting you wrong, but anyway I wonder: what’s even the point of this?
> Why even bother considering CE-marks as information for a circuit breaker?

Because the alternative is that we only break the circuit once the queue has been driven into overflow, and packets have been lost. We want to avoid that, since it causes latency, and too much latency is very bad for the user experience. 

> CE-marks may *not* indicate *excessive* congestion - and since you say “additional evidence”: I don’t think that a combination of loss and CE-marks makes this any better? CE-marks may be produced by a shallow queue, which can be rather “mild” congestion, at least in the light of what a circuit breaker should consider…

Surely this is just arguing for a different threshold for a circuit breaker triggered by ECN-CE marks (using a modern, small queue, AQM) than for one triggered by loss (or ECN marks considered equivalent to loss)? 

If I understand the L4S proposal correctly, that would be treat ECN-CE marks on ECT(0) marked flows as equivalent to loss, but treat ECN-CE marks on ECT(1) marked flows with a (much) higher threshold. 

Assuming, in all cases, that there’s a parallel congestion control algorithm running (and RMCAT has figured out the right congestion response for that; the proposals now treat ECN-CE and loss very similarly).

-- 
Colin Perkins
https://csperkins.org/