Re: [Idr] draft-spaghetti-idr-bgp-sendholdtimer - Feedback requested

Enke Chen <enchen@paloaltonetworks.com> Fri, 23 April 2021 20:25 UTC

Return-Path: <enchen@paloaltonetworks.com>
X-Original-To: idr@ietfa.amsl.com
Delivered-To: idr@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id CDF4D3A0913 for <idr@ietfa.amsl.com>; Fri, 23 Apr 2021 13:25:21 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.017
X-Spam-Level:
X-Spam-Status: No, score=-2.017 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, HTML_MESSAGE=0.001, HTTPS_HTTP_MISMATCH=0.1, RCVD_IN_MSPIKE_H4=-0.01, RCVD_IN_MSPIKE_WL=-0.01, SPF_HELO_NONE=0.001, SPF_NONE=0.001] autolearn=unavailable autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=paloaltonetworks.com header.b=PoXXl80D; dkim=pass (2048-bit key) header.d=paloaltonetworks-com.20150623.gappssmtp.com header.b=VX8K323x
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 3HgcGs6V_BQY for <idr@ietfa.amsl.com>; Fri, 23 Apr 2021 13:25:17 -0700 (PDT)
Received: from mx0b-00169c01.pphosted.com (mx0a-00169c01.pphosted.com [67.231.148.124]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 6AF163A090D for <idr@ietf.org>; Fri, 23 Apr 2021 13:25:17 -0700 (PDT)
Received: from pps.filterd (m0048493.ppops.net [127.0.0.1]) by mx0a-00169c01.pphosted.com (8.16.0.43/8.16.0.43) with SMTP id 13NIp3dA007146 for <idr@ietf.org>; Fri, 23 Apr 2021 11:54:35 -0700
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=paloaltonetworks.com; h=mime-version : references : in-reply-to : from : date : message-id : subject : to : cc : content-type; s=PPS12012017; bh=IpXKnUtWZ0Ci8jUz6SKaH8wgUdKPrWKUhK7inzoqwwE=; b=PoXXl80DF9tVCbANCGJp4KnKfKbKBlOwWBELd3FGRq6mR0AzhXndTBkXG14DmPm7gwaN jxIms6liXQ4ZF22KDsID9h9Z7LWsH20uwhajZ53BT8m0XO/ImeBZJVEpZwuTYyKtmFnc bKpQwdyGi4hTQNFlJnR9/XC+plFMd/qoMarwx9jU1OiYfFxJM+LpE6dXgrqAJZ6WduVH YcapipqPNh2/Q00gS8BJhst1wHHDw4sMDIJ1qbHiNgsnslZ6H31CxkHwF42WMzxV+LOy riG9ejC0sN2EyPAHVyfdH33La+KRuVqO2qAbpjKsvwSWL40br8+yArp/BxsJqyuo37QW oQ==
Received: from mail-lj1-f200.google.com (mail-lj1-f200.google.com [209.85.208.200]) by mx0a-00169c01.pphosted.com with ESMTP id 383nbtdsxu-1 (version=TLSv1.2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128 verify=NOT) for <idr@ietf.org>; Fri, 23 Apr 2021 11:54:34 -0700
Received: by mail-lj1-f200.google.com with SMTP id u24-20020a2e14180000b02900bf2bea127dso4846818ljd.16 for <idr@ietf.org>; Fri, 23 Apr 2021 11:54:34 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=paloaltonetworks-com.20150623.gappssmtp.com; s=20150623; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=IpXKnUtWZ0Ci8jUz6SKaH8wgUdKPrWKUhK7inzoqwwE=; b=VX8K323xFRFQGSspqJSC2r03L9i0op54zF2ciyYHY/rQu1nykhYQK1Nst5ZTupLeB5 Qvx8Rifoyx5NdrGTQsNbt9GFdsjzPmhkig1tbC5VQzB3TWCtE/RgzJx9x15SFbeidIxp Knd5BwBBHBWJc8a9lVFIA0rSHB5tlmj6RDCyGJUQlKZsQN2Xk/c0RSkAEPsVkTsoRj/w CUf7N5fFRdPlUKYGO3yESpMdri1huxcVz4T79sB36bXfjBwDSC7RwPKc7q+wx1uinGzq myT3yZPD3oqFB9TWE5/N68afTqm/9vUBKa4O5oSZRd4QkG7FQAkem6eSBtn8SYr8bEnL 4rYg==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=IpXKnUtWZ0Ci8jUz6SKaH8wgUdKPrWKUhK7inzoqwwE=; b=azQPFrCytpykFwI0ve9O5XDdf6tDriiHWkG0Ihp+K+EgPKoPQcqcEQl7/Cyobbtrja aQZJ02eVdqallS575Szdki7+9ic4qxMkJUeRI/KxxX+yczTRL0Zy1RzrYHqRODXRbuBZ M3LfGKuS5foIkHFWzlBPjYu2a4upGtwyfdv9V7F0t1OrcHav2KwnUNsNW4FttP3BDRJR /Gz8a2mxIUIRpc8MXfsdILPbU+CCeL622mILhBFBdpKqsAifwcjjIPPV5XA8aAXCK7qu lsrsrp1bXQusmf/SXnk72rLAnaltypkw6wnUNk2Pu1h9Ly6oUkdT0m9ekGmcUAFVJIrk SR7Q==
X-Gm-Message-State: AOAM53079eFH5dOlgznjBFPfyoUZH9lQxLY3E+NcmS+hKoYM8HZrGjJf ho3+cOuPDAJhJ2H4M4FhqIauyo08vxGHXO8jIXGt/Rp69Aj81J5w6AtxBeNOElbJudwKqwD2AQ0 0PiOC8ExSQZJLb7aUkow=
X-Received: by 2002:a2e:86d9:: with SMTP id n25mr3753619ljj.470.1619204072536; Fri, 23 Apr 2021 11:54:32 -0700 (PDT)
X-Google-Smtp-Source: ABdhPJwyrYqSqfnoSs40ch+Df9O0sBshGAtMsMTKW2JJoe3r7dyzIgQUhE1o8lJe6HBo+YoBB5GMj0zSZtBPNzNcbkE=
X-Received: by 2002:a2e:86d9:: with SMTP id n25mr3753609ljj.470.1619204072292; Fri, 23 Apr 2021 11:54:32 -0700 (PDT)
MIME-Version: 1.0
References: <CAL=9YSVy+mvxvAv+maxkUSzPbe0bfnUy-XJJTtcVhi3S3bm=WQ@mail.gmail.com> <BYAPR11MB32074D8142CCE2C2B50B2C32C0459@BYAPR11MB3207.namprd11.prod.outlook.com> <YILA1sLuzMd47LYR@snel> <CANJ8pZ8PTudSx9ie-eEHbPKEHDeTNJ1-C_u116NbWXh5d7kyMg@mail.gmail.com> <CAH1iCioXZXmYmwg-4nvDqgnKtSPeNFufXxgZ7cmbMVijc_QBgA@mail.gmail.com>
In-Reply-To: <CAH1iCioXZXmYmwg-4nvDqgnKtSPeNFufXxgZ7cmbMVijc_QBgA@mail.gmail.com>
From: Enke Chen <enchen@paloaltonetworks.com>
Date: Fri, 23 Apr 2021 11:54:21 -0700
Message-ID: <CANJ8pZ_VK5FWT-stvNdBCyVKKnyKpKKyiWk4Bw5gZeSipsxiVA@mail.gmail.com>
To: Brian Dickson <brian.peter.dickson@gmail.com>
Cc: Job Snijders <job=40fastly.com@dmarc.ietf.org>, "Jakob Heitz (jheitz)" <jheitz=40cisco.com@dmarc.ietf.org>, Ben Cox <ben=40benjojo.co.uk@dmarc.ietf.org>, "idr@ietf.org" <idr@ietf.org>
Content-Type: multipart/alternative; boundary="0000000000008c5c7405c0a853ea"
X-Proofpoint-GUID: NwEvlSndXSdHId2A0H78ehAxdyhqVJ7L
X-Proofpoint-ORIG-GUID: NwEvlSndXSdHId2A0H78ehAxdyhqVJ7L
X-Proofpoint-Virus-Version: vendor=fsecure engine=2.50.10434:6.0.391, 18.0.761 definitions=2021-04-23_07:2021-04-23, 2021-04-23 signatures=0
X-Proofpoint-Spam-Details: rule=outbound_spam_notspam policy=outbound_spam score=0 phishscore=0 suspectscore=0 lowpriorityscore=0 spamscore=0 bulkscore=0 mlxlogscore=999 clxscore=1015 priorityscore=1501 impostorscore=0 mlxscore=0 adultscore=0 malwarescore=0 classifier=spam adjust=0 reason=mlx scancount=1 engine=8.12.0-2104060000 definitions=main-2104230122
Archived-At: <https://mailarchive.ietf.org/arch/msg/idr/7sB8D7nKWzbZ7Y1hP6QaToXpUbw>
Subject: Re: [Idr] draft-spaghetti-idr-bgp-sendholdtimer - Feedback requested
X-BeenThere: idr@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: Inter-Domain Routing <idr.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/idr>, <mailto:idr-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/idr/>
List-Post: <mailto:idr@ietf.org>
List-Help: <mailto:idr-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/idr>, <mailto:idr-request@ietf.org?subject=subscribe>
X-List-Received-Date: Fri, 23 Apr 2021 20:25:22 -0000

Hi, Brian:

Isn't the holdtimer already per-neighbor based?

Regards,  -- Enke

On Fri, Apr 23, 2021 at 11:51 AM Brian Dickson <
brian.peter.dickson@gmail.com> wrote:

>
>
> On Fri, Apr 23, 2021 at 10:31 AM Enke Chen <enchen@paloaltonetworks.com>
> wrote:
>
>> Hi, Folks:
>>
>> The holdtimer already conveys whether the session should be timed out
>> sooner or later. It makes sense to tie the new timer with it, perhaps with
>> a configurable multiplier. The default could be 1, or 2.
>>
>>
> This is a bad idea (having the only "knob" be a multiplier).
>
> It might be okay for the default to be derived from some multiplier over
> the default for holdtimer, but it is not okay for the knob to be ONLY a
> multiplier.
>
> Here's why:
>
> A router with a large number of BGP sessions, where those sessions have
> different characteristics, such as RTT, loss, bandwidth, etc., are very
> likely to behave quite differently.
> (There are any number of operational reasons a router might have
> connections of this sort, it's really not worth detailing all the reasons,
> but some examples would include a router with both local LAN peers
> (customers) and satellite peers; peering extension providers over
> multi-city links; underprovisioned peer routers; software router peers,
> etc.)
>
> The operator should (MUST) have exposed controls to adjust timers on a
> per-neighbor basis, and/or per-router basis (when managing large numbers of
> routers).
>
> The purpose of this proposal is not met if that level of granularity is
> unavailable.
>
> The operator has the ability to understand why individual peering sessions
> may be behaving this way, and will need to adjust the parameters on a
> per-peer basis.
> Fixing a problem for one peer, should not introduce problems for another
> peer.
>
> This isn't 1995.
>
> Respectfully,
> Brian
>
>
>> Thanks.   -- Enke
>>
>> On Fri, Apr 23, 2021 at 5:43 AM Job Snijders <job=
>> 40fastly.com@dmarc.ietf.org> wrote:
>>
>>> Hi Jakob, group,
>>>
>>> Thanks for the feedback, a -02 has been submitted.
>>>
>>> On Fri, Apr 23, 2021 at 12:56:40AM +0000, Jakob Heitz (jheitz) wrote:
>>> > I propose to add this text:
>>> >
>>> > It is possible that a BGP speaker presents a TCP zero receive window
>>> > after receiving a large batch of updates. Resetting the session to
>>> > such a speaker may just repeat the large batch, whereas waiting a
>>> little
>>> > longer may have given it a chance to recover. The possibility of this
>>> > occurrence makes it difficult to determine a good value for the
>>> > timer. The default value of the timer is therefore recommended
>>> > to be 15 minutes.
>>>
>>> In this email John Scudder appears to suggest using a separate new
>>> timer, and populate the timer with the default HoldTimer value (and the
>>> -02 draft attempts to describe exactly that).
>>>
>>> https://urldefense.proofpoint.com/v2/url?u=https-3A__mailarchive.ietf.org_arch_msg_idr_E5pqxIObPS7e9A2IqIApTHVSfig_&d=DwICAg&c=V9IgWpI5PvzTw83UyHGVSoW3Uc1MFWe5J8PTfkrzVSo&r=OPLTTSu-451-QhDoSINhI2xYdwiMmfF5A2l8luvN11E&m=bQFpOg20Jvvi5Fdn0FbsPn3umJ_yRN4nNcXfCsfl5ek&s=ic9GqvTL_AVNp--whyEU4xnDPEpswdHPBNt_XoBG99g&e=
>>>
>>> you propose 15 minutes; so it seems there is some convergent thinking in
>>> the group the proposed timers at hand is in the order of 'multiple
>>> minutes'.
>>>
>>> How long should a BGP speaker be willing to buffer on behalf of another
>>> BGP speaker's inability to progress message processing?
>>>
>>> If the oldest message in the queue is a KEEPALIVE message, and its been
>>> stuck there for more than HoldTimer, wouldn't we have expected the
>>> remote side to have killed the BGP session anyway?
>>>
>>> Would you mind sharing your thinking on why a default of 15 minutes is
>>> better than for example 180 seconds? What do others think?
>>>
>>> Kind regards,
>>>
>>> Job
>>>
>>> _______________________________________________
>>> Idr mailing list
>>> Idr@ietf.org
>>>
>>> https://urldefense.proofpoint.com/v2/url?u=https-3A__www.ietf.org_mailman_listinfo_idr&d=DwICAg&c=V9IgWpI5PvzTw83UyHGVSoW3Uc1MFWe5J8PTfkrzVSo&r=OPLTTSu-451-QhDoSINhI2xYdwiMmfF5A2l8luvN11E&m=bQFpOg20Jvvi5Fdn0FbsPn3umJ_yRN4nNcXfCsfl5ek&s=PHKZKmQoCjqpWGGBhDIpOnAaSlDjeRsLdEPlOwChzyw&e=
>>>
>> _______________________________________________
>> Idr mailing list
>> Idr@ietf.org
>> https://www.ietf.org/mailman/listinfo/idr
>> <https://urldefense.proofpoint.com/v2/url?u=https-3A__www.ietf.org_mailman_listinfo_idr&d=DwMFaQ&c=V9IgWpI5PvzTw83UyHGVSoW3Uc1MFWe5J8PTfkrzVSo&r=OPLTTSu-451-QhDoSINhI2xYdwiMmfF5A2l8luvN11E&m=ecQzdLStIaoAFzguVP-QtyJpUCwi8K4lXMCt2ey-mZE&s=GquAlygyileCKhzvo6AEjoKhUu6R2Z3sSmft31rxH9s&e=>
>>
>