Re: [Dime] [dime] #46: Bad normative advice on not letting overload reports expire

Steve Donovan <srdonovan@usdonovans.com> Wed, 26 March 2014 01:57 UTC

Return-Path: <srdonovan@usdonovans.com>
X-Original-To: dime@ietfa.amsl.com
Delivered-To: dime@ietfa.amsl.com
Received: from localhost (ietfa.amsl.com [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 30D321A0069 for <dime@ietfa.amsl.com>; Tue, 25 Mar 2014 18:57:21 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: 0.279
X-Spam-Level:
X-Spam-Status: No, score=0.279 tagged_above=-999 required=5 tests=[BAYES_05=-0.5, SPF_NEUTRAL=0.779] autolearn=no
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id MaLOc17L093V for <dime@ietfa.amsl.com>; Tue, 25 Mar 2014 18:57:19 -0700 (PDT)
Received: from biz131.inmotionhosting.com (biz131.inmotionhosting.com [23.235.209.16]) by ietfa.amsl.com (Postfix) with ESMTP id 96D361A005F for <dime@ietf.org>; Tue, 25 Mar 2014 18:57:19 -0700 (PDT)
Received: from cpe-76-187-100-94.tx.res.rr.com ([76.187.100.94]:50144 helo=Steves-MacBook-Air-2.local) by biz131.inmotionhosting.com with esmtpsa (TLSv1:DHE-RSA-AES128-SHA:128) (Exim 4.82) (envelope-from <srdonovan@usdonovans.com>) id 1WSd5Z-0005Nf-UG; Tue, 25 Mar 2014 18:57:17 -0700
Message-ID: <533233F6.9080406@usdonovans.com>
Date: Tue, 25 Mar 2014 20:57:10 -0500
From: Steve Donovan <srdonovan@usdonovans.com>
User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.9; rv:24.0) Gecko/20100101 Thunderbird/24.4.0
MIME-Version: 1.0
To: Ben Campbell <ben@nostrum.com>
References: <057.8b248d3cb5db23879c2730b80d4657d7@trac.tools.ietf.org> <B08CCDA3-4E2B-444A-AE27-9DE2D9C0B458@gmail.com> <4B803326-40A9-4E98-AC12-7DDF46BD101B@nostrum.com> <A9CA33BB78081F478946E4F34BF9AAA014D6979E@xmb-rcd-x10.cisco.com> <087A34937E64E74E848732CFF8354B9209772E9C@ESESSMB101.ericsson.se> <58574389-BAEB-49DA-A07E-B6648905C291@gmail.com> <533097D1.3090803@usdonovans.com> <D24C5BAB-C9CD-4AA1-8F1D-AB21D25EDB01@nostrum.com>
In-Reply-To: <D24C5BAB-C9CD-4AA1-8F1D-AB21D25EDB01@nostrum.com>
Content-Type: text/plain; charset="ISO-8859-1"; format="flowed"
Content-Transfer-Encoding: 7bit
X-OutGoing-Spam-Status: No, score=-2.9
X-AntiAbuse: This header was added to track abuse, please include it with any abuse report
X-AntiAbuse: Primary Hostname - biz131.inmotionhosting.com
X-AntiAbuse: Original Domain - ietf.org
X-AntiAbuse: Originator/Caller UID/GID - [47 12] / [47 12]
X-AntiAbuse: Sender Address Domain - usdonovans.com
X-Get-Message-Sender-Via: biz131.inmotionhosting.com: authenticated_id: srd+usdonovans.com/only user confirmed/virtual account not confirmed
Archived-At: http://mailarchive.ietf.org/arch/msg/dime/cYHxrFqc4QpCFyStqgV2WYBz62w
Cc: dime@ietf.org
Subject: Re: [Dime] [dime] #46: Bad normative advice on not letting overload reports expire
X-BeenThere: dime@ietf.org
X-Mailman-Version: 2.1.15
Precedence: list
List-Id: Diameter Maintanence and Extentions Working Group <dime.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/dime>, <mailto:dime-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/dime/>
List-Post: <mailto:dime@ietf.org>
List-Help: <mailto:dime-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/dime>, <mailto:dime-request@ietf.org?subject=subscribe>
X-List-Received-Date: Wed, 26 Mar 2014 01:57:21 -0000

I'm ok with Ben's suggested wording.

Steve

On 3/25/14, 6:16 PM, Ben Campbell wrote:
> I do not agree. While this fixes a related problem of using a zero validity-duration to signal the end of an overload condition, it still implies that one SHOULD NOT let a report "just expire". As I've argued before, I believe there are time when it is just as good, if not better, to let an overload condition expire naturally.
>
> Here's a quote of my argument to that effect from further down the thread:
>
>> I think it's reasonable to say that a reporting node should terminate an overload condition in a timely manner. But if it's about to expire anyway, then expiration might be just as timely as an explicit report.
>>
>> And of course, the definition of "timely" is somewhat a matter of policy. For example, I can imagine an deployment that had a large number of clients using fairly short validity durations, and _never_ explicitly signaling an end to an overload condition. This adds a bit of a "slow-start" to the recovery, since different clients will expire the overload condition at different times, and the load will ramp up gradually. I don't see anything wrong with that. Of course, it wouldn't work if one chose long validity durations, or if the signaling of overload to different clients happened in close synchronization.
> So, here's a different proposal for your first paragraph:
>
>     "When a reporting node has recovered from overload, it SHOULD invalidate any existing overload reports in a timely matter. This can be achieved by sending an updated overload report (meaning the OLR contains a new sequence number) with the OC-Validity-Duration AVP value set to zero ("0"). If the overload report is about to expire naturally, the reporting node MAY choose to simply let it do so."
>
>
> On Mar 24, 2014, at 3:38 PM, Steve Donovan <srdonovan@usdonovans.com> wrote:
>
>> Here's some proposed wording that will hopefully let us close this issue:
>>
>> Regards,
>>
>> Steve
>>
>> -----
>>
>> Section 4.5., paragraph 3 -
>>
>> Current -02 wording:
>>
>> As a general guidance for implementations it is RECOMMENDED never to
>>     let any overload report to timeout.  Following to this rule, an
>>     overload endpoint should explicitly signal the end of overload
>>     condition and not rely on the expiration of the validity time of the
>>     overload report in the reacting node.  This is achieved by sending an
>>     updated overload report (meaning it must contain a new sequence
>>     number) with the OC-Validity-Duration AVP value set to zero ("0").
>>
>> Proposed wording:
>>
>>     A reporting node SHOULD explicitly signal the end of overload
>>     condition in a timely manner.  This is achieved by sending an
>>     updated overload report (meaning the OLR contains a new sequence
>>     number) with the OC-Validity-Duration AVP value set to zero ("0").
>>
>>    A reacting node MUST invalidate and remove an overload report that
>>    expires without an explicit overload report containing an OC-Validity-Duration
>>    value set to zero ("0").
>>
>>
>> On 2/11/14 4:31 PM, Jouni Korhonen wrote:
>>> Fine with me.
>>>
>>> - Jouni
>>>
>>> On Feb 11, 2014, at 12:24 PM, Maria Cruz Bartolome
>>> <maria.cruz.bartolome@ericsson.com>
>>>   wrote:
>>>
>>>
>>>> Ben, Nirav,
>>>>
>>>> I follow same argumentation.
>>>> Regards
>>>> /MCruz
>>>>
>>>> -----Original Message-----
>>>> From: DiME [
>>>> mailto:dime-bounces@ietf.org
>>>> ] On Behalf Of Nirav Salot (nsalot)
>>>> Sent: martes, 11 de febrero de 2014 11:23
>>>> To: Ben Campbell; Jouni Korhonen
>>>> Cc:
>>>> dime@ietf.org list; draft-docdt-dime-ovli@tools.ietf.org
>>>>
>>>> Subject: Re: [Dime] [dime] #46: Bad normative advice on not letting overload reports expire
>>>>
>>>> Ben,
>>>>
>>>> I resonate with your thinking below.
>>>>
>>>> Regards,
>>>> Nirav.
>>>>
>>>> -----Original Message-----
>>>> From: DiME [
>>>> mailto:dime-bounces@ietf.org
>>>> ] On Behalf Of Ben Campbell
>>>> Sent: Monday, February 10, 2014 9:54 PM
>>>> To: Jouni Korhonen
>>>> Cc:
>>>> dime@ietf.org list; draft-docdt-dime-ovli@tools.ietf.org
>>>>
>>>> Subject: Re: [Dime] [dime] #46: Bad normative advice on not letting overload reports expire
>>>>
>>>>
>>>> On Feb 10, 2014, at 5:16 AM, Jouni Korhonen
>>>> <jouni.nospam@gmail.com>
>>>>   wrote:
>>>>
>>>>
>>>>> My reasoning for explicit termination was that knowing the
>>>>> implementation folks they will let overload conditions expire unless advised otherwise.
>>>>> And having unnecessary stuff hanging around waiting for a cleanup is
>>>>> not a good thing in general. But I am open here for other options..
>>>>>
>>>>>
>>>> I think it's reasonable to say that a reporting node should terminate an overload condition in a timely manner. But if it's about to expire anyway, then expiration might be just as timely as an explicit report.
>>>>
>>>> And of course, the definition of "timely" is somewhat a matter of policy. For example, I can imagine an deployment that had a large number of clients using fairly short validity durations, and _never_ explicitly signaling an end to an overload condition. This adds a bit of a "slow-start" to the recovery, since different clients will expire the overload condition at different times, and the load will ramp up gradually. I don't see anything wrong with that. Of course, it wouldn't work if one chose long validity durations, or if the signaling of overload to different clients happened in close synchronization.
>>>>
>>>> _______________________________________________
>>>> DiME mailing list
>>>>
>>>> DiME@ietf.org
>>>> https://www.ietf.org/mailman/listinfo/dime
>>>>
>>>>
>>>> _______________________________________________
>>>> DiME mailing list
>>>>
>>>> DiME@ietf.org
>>>> https://www.ietf.org/mailman/listinfo/dime
>>>>
>>>>
>>>> _______________________________________________
>>>> DiME mailing list
>>>>
>>>> DiME@ietf.org
>>>> https://www.ietf.org/mailman/listinfo/dime
>>> _______________________________________________
>>> DiME mailing list
>>>
>>> DiME@ietf.org
>>> https://www.ietf.org/mailman/listinfo/dime
>>>
>>>
>>>
>> _______________________________________________
>> DiME mailing list
>> DiME@ietf.org
>> https://www.ietf.org/mailman/listinfo/dime
>