Re: [Dime] [dime] #46: Bad normative advice on not letting overload reports expire

"DOLLY, MARTIN C" <md3135@att.com> Tue, 25 March 2014 23:52 UTC

Return-Path: <md3135@att.com>
X-Original-To: dime@ietfa.amsl.com
Delivered-To: dime@ietfa.amsl.com
Received: from localhost (ietfa.amsl.com [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 1EE7B1A0259 for <dime@ietfa.amsl.com>; Tue, 25 Mar 2014 16:52:02 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -4.21
X-Spam-Level:
X-Spam-Status: No, score=-4.21 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, RCVD_IN_DNSWL_MED=-2.3, T_RP_MATCHES_RCVD=-0.01] autolearn=ham
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id Y5Q1n91QmxxA for <dime@ietfa.amsl.com>; Tue, 25 Mar 2014 16:51:59 -0700 (PDT)
Received: from nbfkord-smmo06.seg.att.com (nbfkord-smmo06.seg.att.com [209.65.160.94]) by ietfa.amsl.com (Postfix) with ESMTP id 5A6561A021E for <dime@ietf.org>; Tue, 25 Mar 2014 16:51:58 -0700 (PDT)
Received: from unknown [144.160.229.24] (EHLO alpi155.enaf.aldc.att.com) by nbfkord-smmo06.seg.att.com(mxl_mta-7.2.1-0) over TLS secured channel with ESMTP id c9612335.0.1920279.00-2365.5377232.nbfkord-smmo06.seg.att.com (envelope-from <md3135@att.com>); Tue, 25 Mar 2014 23:51:57 +0000 (UTC)
X-MXL-Hash: 5332169d2cba091d-f21debf50cafd76fe5b0f94230d994faf0c12dd7
Received: from enaf.aldc.att.com (localhost [127.0.0.1]) by alpi155.enaf.aldc.att.com (8.14.5/8.14.5) with ESMTP id s2PNpu8A013896; Tue, 25 Mar 2014 19:51:56 -0400
Received: from mlpi407.sfdc.sbc.com (mlpi407.sfdc.sbc.com [130.9.128.239]) by alpi155.enaf.aldc.att.com (8.14.5/8.14.5) with ESMTP id s2PNpnTa013848 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=NO); Tue, 25 Mar 2014 19:51:52 -0400
Received: from MISOUT7MSGHUB9B.ITServices.sbc.com (MISOUT7MSGHUB9B.itservices.sbc.com [144.151.223.72]) by mlpi407.sfdc.sbc.com (RSA Interceptor); Tue, 25 Mar 2014 23:51:41 GMT
Received: from MISOUT7MSGUSR9I.ITServices.sbc.com ([144.151.223.56]) by MISOUT7MSGHUB9B.ITServices.sbc.com ([144.151.223.72]) with mapi id 14.03.0174.001; Tue, 25 Mar 2014 19:51:41 -0400
From: "DOLLY, MARTIN C" <md3135@att.com>
To: Ben Campbell <ben@nostrum.com>
Thread-Topic: [Dime] [dime] #46: Bad normative advice on not letting overload reports expire
Thread-Index: AQHPSIBMt0fDkcbCrUWGV1ADOW2pqZryeVVY
Date: Tue, 25 Mar 2014 23:51:41 +0000
Message-ID: <4CAA0308-08B6-4F7D-9A51-7A4AAA833404@att.com>
References: <057.8b248d3cb5db23879c2730b80d4657d7@trac.tools.ietf.org> <B08CCDA3-4E2B-444A-AE27-9DE2D9C0B458@gmail.com> <4B803326-40A9-4E98-AC12-7DDF46BD101B@nostrum.com> <A9CA33BB78081F478946E4F34BF9AAA014D6979E@xmb-rcd-x10.cisco.com> <087A34937E64E74E848732CFF8354B9209772E9C@ESESSMB101.ericsson.se> <58574389-BAEB-49DA-A07E-B6648905C291@gmail.com> <533097D1.3090803@usdonovans.com>, <D24C5BAB-C9CD-4AA1-8F1D-AB21D25EDB01@nostrum.com>
In-Reply-To: <D24C5BAB-C9CD-4AA1-8F1D-AB21D25EDB01@nostrum.com>
Accept-Language: en-US
Content-Language: en-US
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
Content-Type: text/plain; charset="us-ascii"
Content-Transfer-Encoding: quoted-printable
MIME-Version: 1.0
X-RSA-Inspected: yes
X-RSA-Classifications: public
X-AnalysisOut: [v=2.0 cv=eY6Kic4H c=1 sm=1 a=dhB6nF3YHL5t/Ixux6cINA==:17 a]
X-AnalysisOut: [=LrBFAd3DcSIA:10 a=ofMgfj31e3cA:10 a=BLceEmwcHowA:10 a=kj9]
X-AnalysisOut: [zAlcOel0A:10 a=zQP7CpKOAAAA:8 a=Z80JlwQ0AAAA:8 a=yakATiurA]
X-AnalysisOut: [AAA:8 a=0FD05c-RAAAA:8 a=48vgC7mUAAAA:8 a=pGLkceISAAAA:8 a]
X-AnalysisOut: [=iWO5QD6-H_qhiD_E6h0A:9 a=CjuIK1q_8ugA:10 a=qM39cor4HRgA:1]
X-AnalysisOut: [0 a=Hz7IrDYlS0cA:10 a=0MAqpqVwYqEA:10 a=BMZHtSBzE-QA:10 a=]
X-AnalysisOut: [f7GxY0FH8QIA:10 a=lZB815dzVvQA:10 a=MSl-tDqOz04A:10 a=tqwV]
X-AnalysisOut: [kdtsOKSkepFu:21 a=fabrLpjNNEN_gIKe:21]
X-Spam: [F=0.2000000000; CM=0.500; S=0.200(2010122901)]
X-MAIL-FROM: <md3135@att.com>
X-SOURCE-IP: [144.160.229.24]
Archived-At: http://mailarchive.ietf.org/arch/msg/dime/XXbCh0CPQmOgDX05QhnvZW5h6TU
Cc: "dime@ietf.org" <dime@ietf.org>
Subject: Re: [Dime] [dime] #46: Bad normative advice on not letting overload reports expire
X-BeenThere: dime@ietf.org
X-Mailman-Version: 2.1.15
Precedence: list
List-Id: Diameter Maintanence and Extentions Working Group <dime.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/dime>, <mailto:dime-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/dime/>
List-Post: <mailto:dime@ietf.org>
List-Help: <mailto:dime-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/dime>, <mailto:dime-request@ietf.org?subject=subscribe>
X-List-Received-Date: Tue, 25 Mar 2014 23:52:02 -0000

Why, sorry but please repeat or I go with I believe consensus

Thanks

Martin Dolly
Lead Member of Technical Staff
Core Network & Gov't/Regulatory Standards  
AT&T Standards and Industry Alliances
+1-609-903-3360
md3135@att.com

> On Mar 25, 2014, at 7:17 PM, "Ben Campbell" <ben@nostrum.com> wrote:
> 
> I do not agree. While this fixes a related problem of using a zero validity-duration to signal the end of an overload condition, it still implies that one SHOULD NOT let a report "just expire". As I've argued before, I believe there are time when it is just as good, if not better, to let an overload condition expire naturally.
> 
> Here's a quote of my argument to that effect from further down the thread:
> 
>> I think it's reasonable to say that a reporting node should terminate an overload condition in a timely manner. But if it's about to expire anyway, then expiration might be just as timely as an explicit report. 
>> 
>> And of course, the definition of "timely" is somewhat a matter of policy. For example, I can imagine an deployment that had a large number of clients using fairly short validity durations, and _never_ explicitly signaling an end to an overload condition. This adds a bit of a "slow-start" to the recovery, since different clients will expire the overload condition at different times, and the load will ramp up gradually. I don't see anything wrong with that. Of course, it wouldn't work if one chose long validity durations, or if the signaling of overload to different clients happened in close synchronization.
> 
> So, here's a different proposal for your first paragraph:
> 
>   "When a reporting node has recovered from overload, it SHOULD invalidate any existing overload reports in a timely matter. This can be achieved by sending an updated overload report (meaning the OLR contains a new sequence number) with the OC-Validity-Duration AVP value set to zero ("0"). If the overload report is about to expire naturally, the reporting node MAY choose to simply let it do so."
> 
> 
>> On Mar 24, 2014, at 3:38 PM, Steve Donovan <srdonovan@usdonovans.com> wrote:
>> 
>> Here's some proposed wording that will hopefully let us close this issue:
>> 
>> Regards,
>> 
>> Steve
>> 
>> -----
>> 
>> Section 4.5., paragraph 3 - 
>> 
>> Current -02 wording:
>> 
>> As a general guidance for implementations it is RECOMMENDED never to
>>   let any overload report to timeout.  Following to this rule, an
>>   overload endpoint should explicitly signal the end of overload
>>   condition and not rely on the expiration of the validity time of the
>>   overload report in the reacting node.  This is achieved by sending an
>>   updated overload report (meaning it must contain a new sequence
>>   number) with the OC-Validity-Duration AVP value set to zero ("0").
>> 
>> Proposed wording:
>> 
>>   A reporting node SHOULD explicitly signal the end of overload
>>   condition in a timely manner.  This is achieved by sending an
>>   updated overload report (meaning the OLR contains a new sequence
>>   number) with the OC-Validity-Duration AVP value set to zero ("0").
>> 
>>  A reacting node MUST invalidate and remove an overload report that
>>  expires without an explicit overload report containing an OC-Validity-Duration
>>  value set to zero ("0").
>> 
>> 
>>> On 2/11/14 4:31 PM, Jouni Korhonen wrote:
>>> Fine with me.
>>> 
>>> - Jouni
>>> 
>>> On Feb 11, 2014, at 12:24 PM, Maria Cruz Bartolome 
>>> <maria.cruz.bartolome@ericsson.com>
>>> wrote:
>>> 
>>> 
>>>> Ben, Nirav,
>>>> 
>>>> I follow same argumentation.
>>>> Regards
>>>> /MCruz
>>>> 
>>>> -----Original Message-----
>>>> From: DiME [
>>>> mailto:dime-bounces@ietf.org
>>>> ] On Behalf Of Nirav Salot (nsalot)
>>>> Sent: martes, 11 de febrero de 2014 11:23
>>>> To: Ben Campbell; Jouni Korhonen
>>>> Cc: 
>>>> dime@ietf.org list; draft-docdt-dime-ovli@tools.ietf.org
>>>> 
>>>> Subject: Re: [Dime] [dime] #46: Bad normative advice on not letting overload reports expire
>>>> 
>>>> Ben,
>>>> 
>>>> I resonate with your thinking below.
>>>> 
>>>> Regards,
>>>> Nirav.
>>>> 
>>>> -----Original Message-----
>>>> From: DiME [
>>>> mailto:dime-bounces@ietf.org
>>>> ] On Behalf Of Ben Campbell
>>>> Sent: Monday, February 10, 2014 9:54 PM
>>>> To: Jouni Korhonen
>>>> Cc: 
>>>> dime@ietf.org list; draft-docdt-dime-ovli@tools.ietf.org
>>>> 
>>>> Subject: Re: [Dime] [dime] #46: Bad normative advice on not letting overload reports expire
>>>> 
>>>> 
>>>> On Feb 10, 2014, at 5:16 AM, Jouni Korhonen 
>>>> <jouni.nospam@gmail.com>
>>>> wrote:
>>>> 
>>>> 
>>>>> My reasoning for explicit termination was that knowing the 
>>>>> implementation folks they will let overload conditions expire unless advised otherwise.
>>>>> And having unnecessary stuff hanging around waiting for a cleanup is 
>>>>> not a good thing in general. But I am open here for other options..
>>>> I think it's reasonable to say that a reporting node should terminate an overload condition in a timely manner. But if it's about to expire anyway, then expiration might be just as timely as an explicit report. 
>>>> 
>>>> And of course, the definition of "timely" is somewhat a matter of policy. For example, I can imagine an deployment that had a large number of clients using fairly short validity durations, and _never_ explicitly signaling an end to an overload condition. This adds a bit of a "slow-start" to the recovery, since different clients will expire the overload condition at different times, and the load will ramp up gradually. I don't see anything wrong with that. Of course, it wouldn't work if one chose long validity durations, or if the signaling of overload to different clients happened in close synchronization.
>>>> 
>>>> _______________________________________________
>>>> DiME mailing list
>>>> 
>>>> DiME@ietf.org
>>>> https://www.ietf.org/mailman/listinfo/dime
>>>> 
>>>> 
>>>> _______________________________________________
>>>> DiME mailing list
>>>> 
>>>> DiME@ietf.org
>>>> https://www.ietf.org/mailman/listinfo/dime
>>>> 
>>>> 
>>>> _______________________________________________
>>>> DiME mailing list
>>>> 
>>>> DiME@ietf.org
>>>> https://www.ietf.org/mailman/listinfo/dime
>>> _______________________________________________
>>> DiME mailing list
>>> 
>>> DiME@ietf.org
>>> https://www.ietf.org/mailman/listinfo/dime
>> 
>> _______________________________________________
>> DiME mailing list
>> DiME@ietf.org
>> https://www.ietf.org/mailman/listinfo/dime
> 
> _______________________________________________
> DiME mailing list
> DiME@ietf.org
> https://www.ietf.org/mailman/listinfo/dime