Re: [Idr] TCP & BGP: Some don't send terminate BGP when holdtimer expired, because TCP recv window is 0

Gyan Mishra <hayabusagsm@gmail.com> Sat, 19 December 2020 23:20 UTC

Return-Path: <hayabusagsm@gmail.com>
X-Original-To: idr@ietfa.amsl.com
Delivered-To: idr@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 898DF3A0AB4 for <idr@ietfa.amsl.com>; Sat, 19 Dec 2020 15:20:34 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.087
X-Spam-Level:
X-Spam-Status: No, score=-2.087 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, FREEMAIL_FROM=0.001, HTML_MESSAGE=0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, T_REMOTE_IMAGE=0.01, URIBL_BLOCKED=0.001] autolearn=unavailable autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id t_U7CPpa0sYt for <idr@ietfa.amsl.com>; Sat, 19 Dec 2020 15:20:32 -0800 (PST)
Received: from mail-pl1-x632.google.com (mail-pl1-x632.google.com [IPv6:2607:f8b0:4864:20::632]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 3DE663A0C38 for <idr@ietf.org>; Sat, 19 Dec 2020 15:20:32 -0800 (PST)
Received: by mail-pl1-x632.google.com with SMTP id r4so3445588pls.11 for <idr@ietf.org>; Sat, 19 Dec 2020 15:20:32 -0800 (PST)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=wqaZfgsnRc2BFMD5n2peYmw1fRSDvaWDMNtS89CUsT8=; b=IqKa1caGmAsAt6+qphvDxF35zqRlmbp0J/49/CGQROD9LvHnK593jVQJ8lTs9uJ+e4 4SxkKA7Z2wDWNlat8pHcI1Q4wIGe8ovN3HWF3xu+jZMgKCaTIaQWRp3cTkMzFg26sMAj GXkzXEVPyfYnU4uz9G8V5EUtetSTJc+w2oiupJOy8ZEWROE9UY0PyxwJEfdpiRS4AwAY nrJfcoSOiWaL/oygUzlSC/eFSQkn17/VDJk+TiD3n00OXPL5c9NkcvMwmQ8GpJN9maDg 7TcXS+pPBBHjuI16H7LKnrZiY5DkkHgBcvP1VX5/vDPOKq5qwd6d7XfOscmIr/63B7fG yFjg==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=wqaZfgsnRc2BFMD5n2peYmw1fRSDvaWDMNtS89CUsT8=; b=MMQBn9NGzwUo20icJi93zDHPkaMf2Kawy3HBkPq5jDMOS0t1iCkMGiMgodwM+RlmHb nxDxXGzxNpqlVoVZKBI9n35OYxuVs7685u78lpFZOCJyAzdvDR1VAfAICTdxQnXflQGi BqI1zVRKvyJ6xiYF7c0+AnvrSsUL3XazIICE9/ZKqace3Q4hOhiIE9cdFn57aHdPCDrV 0izlF1LYyWAUMnF5lhhI7WgranegSkJ1r0kSonDnQ/ovQ29lsHaZt0eiYvt8fmFvJjaZ 7JKIm6FSViZztl/squYtqkzkt6h41IIZzWfAy16skzFgdwo8bnBI773JIoOxcBCtryyl 5uQw==
X-Gm-Message-State: AOAM533uz5b+OyIPtfp6lxHYBZxdzOZIwoEJ3fYNgLEUFIM7FP//3GZv PEn8xIR7n/H/kO2pG/vp9o5q2KphNupVqRe5ORg=
X-Google-Smtp-Source: ABdhPJzryOxPDHvTTOak0GZXqSS+z0COIUE8fIIpXQ6WtqTtPTGKkjvoCN0llx3O0hdGhTF4njhX8v9IkIFIZfIw7O4=
X-Received: by 2002:a17:902:b717:b029:d9:e816:fd0b with SMTP id d23-20020a170902b717b02900d9e816fd0bmr10620867pls.50.1608420031562; Sat, 19 Dec 2020 15:20:31 -0800 (PST)
MIME-Version: 1.0
References: <CANJ8pZ-WMDotkQvhN-NuP7ivZkPRR-9S2KJSar=6463U0VKkow@mail.gmail.com> <EFC56A31-1276-4DAB-9526-9C2F24814D2C@pfrc.org> <CANJ8pZ_LnDna_jtipcLJq9rrS3MM32rLdxRW8ntC2aEi9VvzMg@mail.gmail.com> <722A787A-5B83-4802-A9F4-AB2957BB3305@juniper.net> <CA+eZshBse4g6jUBMxs4bJiE+uvWScwv7ggLNOMJbUiL1YsaisQ@mail.gmail.com> <CABNhwV1ikHAknsfNDw6GJ8BngHDNjNdCxmgipJvJ7G3rxmnZVA@mail.gmail.com> <CAOj+MMHM0bHHL9UfVZC2QWy6=W5F7QtEq9v-rndcUG0u7CLi1Q@mail.gmail.com>
In-Reply-To: <CAOj+MMHM0bHHL9UfVZC2QWy6=W5F7QtEq9v-rndcUG0u7CLi1Q@mail.gmail.com>
From: Gyan Mishra <hayabusagsm@gmail.com>
Date: Sat, 19 Dec 2020 18:20:21 -0500
Message-ID: <CABNhwV3CaWn5gsFGr4HNi_qoE4V1N1CA44KN+fFFvVCYr1YMgw@mail.gmail.com>
To: Robert Raszuk <robert@raszuk.net>
Cc: Enke Chen <enchen@paloaltonetworks.com>, John Scudder <jgs=40juniper.net@dmarc.ietf.org>, William McCall <william.mccall@gmail.com>, "idr@ietf. org" <idr@ietf.org>
Content-Type: multipart/alternative; boundary="000000000000a187c505b6d978fa"
Archived-At: <https://mailarchive.ietf.org/arch/msg/idr/CZOriMdPmHMk6Eadh3d-Wt_FxZY>
Subject: Re: [Idr] TCP & BGP: Some don't send terminate BGP when holdtimer expired, because TCP recv window is 0
X-BeenThere: idr@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: Inter-Domain Routing <idr.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/idr>, <mailto:idr-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/idr/>
List-Post: <mailto:idr@ietf.org>
List-Help: <mailto:idr-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/idr>, <mailto:idr-request@ietf.org?subject=subscribe>
X-List-Received-Date: Sat, 19 Dec 2020 23:20:35 -0000

Hi Robert

On the other thread it was not quite clear.

So if this scenario is completely devoid of link congestion and purely a
management plane TCP control plane processing BGP socket processing issue
then I agree BFD won’t help at all.

I agree with the poor RP design of management plane that either lead to RP
being overwhelmed high cpu and memory and or possibly memory leak or bug.

Do we know which vendor?

Something simple otter then messing with TCP parameters, if instead of
using the default 90 second BGP dead timer,  if that was reduced down a bit
to like 10 / 30, that could limit the time traffic is black hole and not
rerouted to alternate path until the hold timer expires.


Kind Regards

Gyan

On Sat, Dec 19, 2020 at 5:18 PM Robert Raszuk <robert@raszuk.net> wrote:

> Hi Gyan,
>
> > Going down this path of does seem a lot more complicated and risker then
> using BFD.
>
> But BFD is not going to help at all to the problem at hand.
>
> BFD is in the vast majority of cases distributed (and that is feature not
> a bug) and responses are handled by line cards.
>
> Here we are dealing with RE/RP based subsystems bugs regardless if those
> are in TCP or BGP layer.
>
> Thx,
> R.
>
>
>
>
>
>
> On Sat, Dec 19, 2020 at 10:36 PM Gyan Mishra <hayabusagsm@gmail.com>
> wrote:
>
>>
>> Here is the RFC 5482 TCP User timeout options from TCPM WG.
>>
>> https://tools.ietf.org/html/rfc5482
>>
>> TCPM has a bis draft update to 793 that has more info then the original.
>>
>> https://datatracker.ietf.org/wg/tcpm/documents/
>>
>> https://tools.ietf.org/html/draft-ietf-tcpm-rfc793bis-19#page-42
>>
>>
>> From quick read there are caveats with devices supporting or not
>> supporting the option.
>>
>> Also I guess setting the value is tricky as well not too low or too high
>> that either could make matters worse with instability.
>>
>> Going down this path of does seem a lot more complicated and risker then
>> using BFD.
>>
>>
>> Kind Regards
>>
>> Gyan
>>
>>
>> On Sat, Dec 19, 2020 at 5:38 AM William McCall <william.mccall@gmail.com>
>> wrote:
>>
>>> On Fri, Dec 18, 2020 at 10:33 PM John Scudder
>>> <jgs=40juniper.net@dmarc.ietf.org> wrote:
>>> >
>>> > On Dec 18, 2020, at 1:09 PM, Enke Chen <enchen@paloaltonetworks.com>
>>> wrote:
>>> > >
>>> > > No, I am not assuming that packets are getting somewhere. The
>>> TCP_USER_TIMEOUT would work as long as there is "pending data" (either
>>> unacked, or locally queued). The data can be from the local BGP Keepalives
>>> or the TCP_KEEPALIVE.
>>> >
>>> > Apart from the other objections to relying on TCP_USER_TIMEOUT, which
>>> I think are sufficient, it’s not clear to me that implementations will
>>> provide the desired semantics. RFC 793 seems like it specifies the right
>>> semantics (“get this data to the peer within N seconds or close”):
>>> >
>>> >         The timeout, if present, permits the caller to set up a timeout
>>> >         for all data submitted to TCP.  If data is not successfully
>>> >         delivered to the destination within the timeout period, the TCP
>>> >         will abort the connection.  The present global default is five
>>> >         minutes.
>>> >
>>> > However the Linux man page documents different semantics:
>>> >
>>> >        TCP_USER_TIMEOUT (since Linux 2.6.37)
>>> >               This option takes an unsigned int as an argument.  When
>>> the
>>> >               value is greater than 0, it specifies the maximum amount
>>> of
>>> >               time in milliseconds that transmitted data may remain
>>> >               unacknowledged before TCP will forcibly close the
>>> >               corresponding connection and return ETIMEDOUT to the
>>> >               application.  If the option value is specified as 0, TCP
>>> will
>>> >               use the system default.
>>> >
>>> > The important difference being that whereas 793 implies data written
>>> to the socket, the Linux man page says “transmitted” data, which seems like
>>> it must mean data TCP has written to the network. These are two very
>>> different things! If Linux (or another stack) implements what the man page
>>> seems to say, it’s not useful for our purposes.
>>> >
>>> > —John
>>> > _______________________________________________
>>> > Idr mailing list
>>> > Idr@ietf.org
>>> > https://www.ietf.org/mailman/listinfo/idr
>>>
>>> I was curious too. I read the manpage, relevant linux kernel code, the
>>> RFC, and hacked up a test case (unicast me if you want the code).
>>> Also, Cloudflare published a relevant blog entry[0]. For this specific
>>> scenario, see under the sub-heading "Zero window ESTAB is...
>>> forever?".
>>>
>>> TCP_USER_TIMEOUT doesn't appear to kick in until there is unACKed
>>> data, meaning that it has already been transmitted from TCP's
>>> perspective. Stuff hanging around in the buffers due to persist state
>>> doesn't seem to count, per the test results and the docs. Confirms
>>> your thoughts from the reading I think.
>>>
>>> [0] https://blog.cloudflare.com/when-tcp-sockets-refuse-to-die/
>>>
>>> --
>>> William McCall
>>>
>>> _______________________________________________
>>> Idr mailing list
>>> Idr@ietf.org
>>> https://www.ietf.org/mailman/listinfo/idr
>>>
>> --
>>
>> <http://www.verizon.com/>
>>
>> *Gyan Mishra*
>>
>> *Network Solutions A**rchitect *
>>
>>
>>
>> *M 301 502-134713101 Columbia Pike
>> <https://www.google.com/maps/search/13101+Columbia+Pike%C2%A0+Silver+Spring,+MD?entry=gmail&source=g>*Silver
>> Spring, MD
>> <https://www.google.com/maps/search/13101+Columbia+Pike%C2%A0+Silver+Spring,+MD?entry=gmail&source=g>
>>
>> _______________________________________________
>> Idr mailing list
>> Idr@ietf.org
>> https://www.ietf.org/mailman/listinfo/idr
>>
> --

<http://www.verizon.com/>

*Gyan Mishra*

*Network Solutions A**rchitect *



*M 301 502-134713101 Columbia Pike *Silver Spring, MD