Re: [AVTCORE] Mirja Kühlewind's Discuss on draft-ietf-avtcore-rtp-circuit-breakers-15: (with DISCUSS and COMMENT)

Colin Perkins <csp@csperkins.org> Tue, 10 May 2016 22:49 UTC

Return-Path: <csp@csperkins.org>
X-Original-To: avt@ietfa.amsl.com
Delivered-To: avt@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 60C2B12D102; Tue, 10 May 2016 15:49:25 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.6
X-Spam-Level:
X-Spam-Status: No, score=-2.6 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, RCVD_IN_DNSWL_LOW=-0.7] autolearn=ham autolearn_force=no
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id lWokCpuvcdQ0; Tue, 10 May 2016 15:49:23 -0700 (PDT)
Received: from haggis.mythic-beasts.com (haggis.mythic-beasts.com [IPv6:2a00:1098:0:86:1000:0:2:1]) (using TLSv1.2 with cipher DHE-RSA-AES128-SHA (128/128 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id AA35012B00A; Tue, 10 May 2016 15:49:23 -0700 (PDT)
Received: from [81.187.2.149] (port=48808 helo=[192.168.0.91]) by haggis.mythic-beasts.com with esmtpsa (TLS1.0:DHE_RSA_AES_256_CBC_SHA1:256) (Exim 4.80) (envelope-from <csp@csperkins.org>) id 1b0GSu-0006a5-0s; Tue, 10 May 2016 23:49:20 +0100
Content-Type: text/plain; charset="utf-8"
Mime-Version: 1.0 (Mac OS X Mail 9.3 \(3124\))
From: Colin Perkins <csp@csperkins.org>
In-Reply-To: <1B501087-267A-4DBF-A08C-40245EC1200D@kuehlewind.net>
Date: Tue, 10 May 2016 23:49:17 +0100
Content-Transfer-Encoding: quoted-printable
Message-Id: <96261ECE-C99A-442F-BA22-7EF1CF200164@csperkins.org>
References: <20160503120321.7534.26562.idtracker@ietfa.amsl.com> <32C2F93E-9CB9-4645-9C42-320AA8B24ED5@csperkins.org> <2EC09917-2E05-407D-AA7F-38C73B179C4B@kuehlewind.net> <219AC00D-9E2E-4AF7-85F0-924EAE5F6164@kuehlewind.net> <BD5D6E34-3C35-4221-86A1-AB3150EBFFE1@csperkins.org> <8D786065-7DCD-43A2-BB69-5D7B1D44546B@kuehlewind.net> <953AB9F5-6220-4C9E-80B6-FE72BF80B648@csperkins.org> <762DC4EA-F5E8-49EB-BFEC-5DFA569B9444@kuehlewind.net> <12182E39-A26B-431F-9CCA-7E602EBFAB7E@csperkins.org> <30E6734A-0595-404E-8823-F27603B6CFC1@kuehlewind.net> <F3648EEF-5BBD-4F2F-BE9E-552D27EE3C2F@csperkins.org> <8CF47513-8E62-4EA0-B938-E9829EC4B774@kuehlewind.net> <9824AE79-0C56-4087-8C3C-E36812909B0E@csperkins.org> <1B501087-267A-4DBF-A08C-40245EC1200D@kuehlewind.net>
To: "Mirja Kuehlewind (IETF)" <ietf@kuehlewind.net>
X-Mailer: Apple Mail (2.3124)
X-BlackCat-Spam-Score: -28
X-Mythic-Debug: Threshold = On =
Archived-At: <http://mailarchive.ietf.org/arch/msg/avt/dSvJ6DF-Vvsn1VddT7NSYtMwFvg>
Cc: avtcore-chairs@ietf.org, Magnus Westerlund <magnus.westerlund@ericsson.com>, draft-ietf-avtcore-rtp-circuit-breakers@ietf.org, The IESG <iesg@ietf.org>, avt@ietf.org
Subject: Re: [AVTCORE] Mirja Kühlewind's Discuss on draft-ietf-avtcore-rtp-circuit-breakers-15: (with DISCUSS and COMMENT)
X-BeenThere: avt@ietf.org
X-Mailman-Version: 2.1.17
Precedence: list
List-Id: Audio/Video Transport Core Maintenance <avt.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/avt>, <mailto:avt-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/avt/>
List-Post: <mailto:avt@ietf.org>
List-Help: <mailto:avt-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/avt>, <mailto:avt-request@ietf.org?subject=subscribe>
X-List-Received-Date: Tue, 10 May 2016 22:49:25 -0000

> On 10 May 2016, at 23:26, Mirja Kuehlewind (IETF) <ietf@kuehlewind.net> wrote:
> 
> Hi Colin,
> 
>> Am 10.05.2016 um 18:08 schrieb Colin Perkins <csp@csperkins.org>:
>> 
>> Hi,
>> 
>>> On 10 May 2016, at 22:52, Mirja Kuehlewind (IETF) <ietf@kuehlewind.net> wrote:
>>> 
>>> I thought that rfc6679 was already written or at least reviewed with the notion that CE marks and loss are not equivalent (even though threaten like this most often). A quick look at the doc  also gives me the impression that there is no update of rfc6679 needed. However, this should probably be discussed on the tsvwg list. Feel free to raise the issue there.
>> 
>> Section 7.3.3 of RFC 6679 says:
>> 
>>  The reception of RTP packets with ECN-CE marks in the IP header is a
>>  notification that congestion is being experienced.  The default
>>  reaction on the reception of these ECN-CE-marked packets MUST be to
>>  provide the congestion control algorithm with a congestion
>>  notification that triggers the algorithm to react as if packet loss
>>  had occurred.  There should be no difference in congestion response
>>  if ECN-CE marks or packet drops are detected.
>> 
>>  Other reactions to ECN-CE may be specified in the future, following
>>  IETF Review.  Detailed designs of such alternative reactions MUST be
>>  specified in a Standards Track RFC and be reviewed to ensure they are
>>  safe for deployment under any restrictions specified.  A potential
>>  example for an alternative reaction could be emergency communications
>>  (such as that generated by first responders, as opposed to the
>>  general public) in networks where the user has been authorised.  A
>>  more detailed description of these other reactions, as well as the
>>  types of congestion control algorithms used by end-nodes, is outside
>>  the scope of this document.
>> 
>> I read that as allowing other reactions, but such need to be documented and updated RFC 6679.
> 
> An update might be needed or not (also depends on what we finally end up with). However, this needs to go on the tsvwg list.
> 
>> 
>>> I still think, actually independent of what we do with rfc3168, that "SHOULD BE treated as lost“ is the wrong thing to say. Not because congestion control might change in future but because the way AQMs signal CE is even less well defined. Further as I said, there is a different between circuit breakers and low time-scale congestion control and for circuit breaker and therefore there must be a difference between loss and CE marks (as reflected correctly in ietf-tsvwg-circuit-breaker).
>> 
>> This forces queue overflows and high latency before the circuit breaker triggers, rather than allowing it to trigger on persistent congestion signals from the AQM that can avoid inducing latency for other flows. I’m not sure that’s the right trade-off, especially if the AQM behaviour is not well defined. 
> 
> The term we ended up using in ietf-tsvwg-circuit-breaker is "persistent excessive congestion“ and further "This is a safety measure to prevent starvation of network resources
>   denying other flows from access to the Internet.  [...]  Avoiding persistent excessive
>   congestion is important to reduce the potential for "Congestion
>   Collapse” [RFC2914].“

I well understand this.

> So persistent congestion in not the problem but excessive congestion (that might even have an increasing trend).

This is clearly stated in the circuit breaker draft.

> And further for ECN the persistent congestion level can be quite high, even without causing the queue to fill up but maintaining a low queuing delay level. Therefore ECN needs clearly to be treated differently than loss.

If you require the circuit breaker to only trigger on loss, not on ECN marks, this forces the queue to overflow before the problematic flow is stopped. That disrupts other traffic sharing the queue. The point of using ECN is to be able to stop flows causing persistent excessive congestion without the latency penalty of the queue overflow.

Colin