Re: [Idr] TCP & BGP: Some don't send terminate BGP when holdtimer expired, because TCP recv window is 0

Tony Li <tony.li@tony.li> Fri, 11 December 2020 22:31 UTC

Sender: Tony Li <tony1athome@gmail.com>
Content-Type: text/plain; charset="utf-8"
Mime-Version: 1.0 (Mac OS X Mail 13.4 \(3608.120.23.2.4\))
From: Tony Li <tony.li@tony.li>
In-Reply-To: <CAOj+MMEGRLw9cRXJR4VgOYtoj+tRyeY4WhWsdkMuYktGh6THag@mail.gmail.com>
Date: Fri, 11 Dec 2020 14:31:04 -0800
Cc: Job Snijders <job@sobornost.net>, "idr@ietf.org" <idr@ietf.org>
Content-Transfer-Encoding: quoted-printable
Message-Id: <0F61A27E-935C-4B95-9761-0D454D0F66A8@tony.li>
References: <X9PHRuGndvsFzQrG@bench.sobornost.net> <FCB1ADB7-AD8C-447E-82FE-2EC15B8C3FB9@juniper.net> <CAOj+MMEGRLw9cRXJR4VgOYtoj+tRyeY4WhWsdkMuYktGh6THag@mail.gmail.com>
To: Robert Raszuk <robert@raszuk.net>
Archived-At: <https://mailarchive.ietf.org/arch/msg/idr/HTFPMSp_S1ILerFDMYh0bf5PGFA>
Subject: Re: [Idr] TCP & BGP: Some don't send terminate BGP when holdtimer expired, because TCP recv window is 0
Precedence: list

Hi Robert,


> * Is the "unable to send" only possible under Window = 0 ? What if there is a local NIC buffer full and we keep dropping it locally ? If we are going there perhaps we could say "unable to successfully send" meaning send and get an ACK for it ? 


There are many, many reasons why we might not be able to exchange bits. The specifics aren’t particularly relevant. The point is that we’re not able to make progress, so the session is clearly broken.


> * The proposal is about reusing the HOLD TIME value to bring BGP down when you are still receiving keepalives however peer sent ACK for the last segment indicating zero window - is this right ? 


More generally, the proposal is that we apply the HOLD TIME on the transmit side as well as the receive side. If we are not able to transmit for that period of time, the receiver should give up and so should the transmitter. The session is broken, updates cannot flow, and we no longer have (eventual) consistency.


> * What happens if our side is stuck and not able to process subsequent ACKs which potentially increase the window on the peer ? 


Then our side has some kind of bug. But the result is the same: the session is stuck and is not helpful.  Tearing down the session and starting over is probably the best that we can do.


> * Most deployments use BFD to make sure peer is reachable. As BFD is often offloaded from CPU this is not an indication of health of TCP or BGP path. But what if some deployments can not use BFD (say not supported by the peer) ? Then those typically reduce BGP HOLD TIME. Would it not be too fragile in some cases to bring a TCP down in such cases too fast ? Aren't we overloading HOLD TIME value here a bit ? 


The HOLD TIME is already enforced by the receiver.  The only clarification that this makes is that the transmitter should also enforce it.  


> * Assume we bring TCP down ... when does it attempt to go up again ? Are we ok to bring it up only after manual/script action from such a state ? 


That is (and has always been) at the discretion of the implementation. Automatic periodic recovery would be preferable as a means of minimizing managerial overhead.


> * As proposed it seems that the change will affect all AFI/SAFIs using given single TCP session. Even if perhaps all are perfectly healthy and running on separate cores. If each SAFI would run on a separate TCP session this would not have such an impact. 


Creating more TCP sessions is not likely to improve the behavior of a TCP receiver.


> * The change seems applicable to both iBGP and eBGP right ? 


Yes.  It’s fundamental.


> In summary the attempt here is to fix application issues by cutting the transport. Sure half broken transport for whatever reason may not be a good thing to keep in UP state. Especially when redundancy is in place. But my main concerns are that we are only trying to focus on a single low level trigger to detect it. 


The point here is simply a clarification for robustness. If a (half) session is not making progress, then the receiver should terminate the session. With this clarification, we make it explicit that the transmitter may do so as well. The likely scenarios where this would come into play are serious software bugs where the transmitter or receiver is not able to make progress. This could be due to transport issues or infrastructure issues. As you note, trying to continue to work with the session is unlikely to be beneficial.


> Wouldn't per AFI/SAFI heartbeat be a better option to detect if a peer's BGP stack is still up and running fine for all applications ? 


That would add considerable complexity and still not address the stuck transmitter.

Regards,
Tony

[Idr] TCP & BGP: Some don't send terminate BGP wh… Job Snijders
Re: [Idr] TCP & BGP: Some don't send terminate BG… Tony Li
Re: [Idr] TCP & BGP: Some don't send terminate BG… John Scudder
Re: [Idr] TCP & BGP: Some don't send terminate BG… Jeff Tantsura
Re: [Idr] TCP & BGP: Some don't send terminate BG… Robert Raszuk
Re: [Idr] TCP & BGP: Some don't send terminate BG… Robert Raszuk
Re: [Idr] TCP & BGP: Some don't send terminate BG… Jakob Heitz (jheitz)
Re: [Idr] TCP & BGP: Some don't send terminate BG… Job Snijders
Re: [Idr] TCP & BGP: Some don't send terminate BG… Tony Li
Re: [Idr] TCP & BGP: Some don't send terminate BG… Keyur Patel
Re: [Idr] TCP & BGP: Some don't send terminate BG… Jeff Tantsura
Re: [Idr] TCP & BGP: Some don't send terminate BG… Robert Raszuk
Re: [Idr] TCP & BGP: Some don't send terminate BG… Keyur Patel
Re: [Idr] TCP & BGP: Some don't send terminate BG… Jakob Heitz (jheitz)
Re: [Idr] TCP & BGP: Some don't send terminate BG… Enke Chen
Re: [Idr] TCP & BGP: Some don't send terminate BG… Jakob Heitz (jheitz)
Re: [Idr] TCP & BGP: Some don't send terminate BG… Robert Raszuk
Re: [Idr] TCP & BGP: Some don't send terminate BG… Christoph Loibl
Re: [Idr] TCP & BGP: Some don't send terminate BG… Jakob Heitz (jheitz)
Re: [Idr] TCP & BGP: Some don't send terminate BG… Christoph Loibl
Re: [Idr] TCP & BGP: Some don't send terminate BG… Jared Mauch
Re: [Idr] TCP & BGP: Some don't send terminate BG… Jared Mauch
Re: [Idr] TCP & BGP: Some don't send terminate BG… William McCall
Re: [Idr] TCP & BGP: Some don't send terminate BG… Job Snijders
Re: [Idr] TCP & BGP: Some don't send terminate BG… Jared Mauch
Re: [Idr] TCP & BGP: Some don't send terminate BG… Enke Chen
Re: [Idr] TCP & BGP: Some don't send terminate BG… Randy Bush
Re: [Idr] TCP & BGP: Some don't send terminate BG… Jared Mauch
Re: [Idr] TCP & BGP: Some don't send terminate BG… John Scudder
Re: [Idr] TCP & BGP: Some don't send terminate BG… Christoph Loibl
Re: [Idr] TCP & BGP: Some don't send terminate BG… Jakob Heitz (jheitz)
Re: [Idr] TCP & BGP: Some don't send terminate BG… John Scudder
Re: [Idr] TCP & BGP: Some don't send terminate BG… Job Snijders
Re: [Idr] TCP & BGP: Some don't send terminate BG… John Scudder
Re: [Idr] TCP & BGP: Some don't send terminate BG… Robert Raszuk
Re: [Idr] TCP & BGP: Some don't send terminate BG… Jakob Heitz (jheitz)
Re: [Idr] TCP & BGP: Some don't send terminate BG… john heasley
Re: [Idr] TCP & BGP: Some don't send terminate BG… Tony Li
Re: [Idr] TCP & BGP: Some don't send terminate BG… Keyur Patel
Re: [Idr] TCP & BGP: Some don't send terminate BG… Keyur Patel
Re: [Idr] TCP & BGP: Some don't send terminate BG… Brian Dickson
Re: [Idr] TCP & BGP: Some don't send terminate BG… Claudio Jeker
Re: [Idr] TCP & BGP: Some don't send terminate BG… Claudio Jeker
Re: [Idr] TCP & BGP: Some don't send terminate BG… Jakob Heitz (jheitz)
Re: [Idr] TCP & BGP: Some don't send terminate BG… Job Snijders
Re: [Idr] TCP & BGP: Some don't send terminate BG… John Heasley
Re: [Idr] TCP & BGP: Some don't send terminate BG… Jakob Heitz (jheitz)
Re: [Idr] TCP & BGP: Some don't send terminate BG… Claudio Jeker
Re: [Idr] TCP & BGP: Some don't send terminate BG… Jakob Heitz (jheitz)
Re: [Idr] TCP & BGP: Some don't send terminate BG… Brian Dickson
Re: [Idr] TCP & BGP: Some don't send terminate BG… Jakob Heitz (jheitz)
Re: [Idr] TCP & BGP: Some don't send terminate BG… Job Snijders
Re: [Idr] TCP & BGP: Some don't send terminate BG… Jakob Heitz (jheitz)
Re: [Idr] TCP & BGP: Some don't send terminate BG… Robert Raszuk
Re: [Idr] TCP & BGP: Some don't send terminate BG… Job Snijders
Re: [Idr] TCP & BGP: Some don't send terminate BG… Job Snijders
Re: [Idr] TCP & BGP: Some don't send terminate BG… Brian Dickson
Re: [Idr] TCP & BGP: Some don't send terminate BG… Jeffrey Haas
Re: [Idr] TCP & BGP: Some don't send terminate BG… Jeffrey Haas
Re: [Idr] TCP & BGP: Some don't send terminate BG… Jeffrey Haas
Re: [Idr] TCP & BGP: Some don't send terminate BG… Jeffrey Haas
Re: [Idr] TCP & BGP: Some don't send terminate BG… Brian Dickson
Re: [Idr] TCP & BGP: Some don't send terminate BG… Enke Chen
Re: [Idr] TCP & BGP: Some don't send terminate BG… Robert Raszuk
Re: [Idr] TCP & BGP: Some don't send terminate BG… Gert Doering
Re: [Idr] TCP & BGP: Some don't send terminate BG… Claudio Jeker
Re: [Idr] TCP & BGP: Some don't send terminate BG… Jeffrey Haas
Re: [Idr] TCP & BGP: Some don't send terminate BG… Jeffrey Haas
Re: [Idr] TCP & BGP: Some don't send terminate BG… Enke Chen
Re: [Idr] TCP & BGP: Some don't send terminate BG… Robert Raszuk
Re: [Idr] TCP & BGP: Some don't send terminate BG… Brian Dickson
Re: [Idr] TCP & BGP: Some don't send terminate BG… Jakob Heitz (jheitz)
Re: [Idr] TCP & BGP: Some don't send terminate BG… Enke Chen
Re: [Idr] TCP & BGP: Some don't send terminate BG… Jeffrey Haas
Re: [Idr] TCP & BGP: Some don't send terminate BG… Enke Chen
Re: [Idr] TCP & BGP: Some don't send terminate BG… Brian Dickson
Re: [Idr] TCP & BGP: Some don't send terminate BG… Jeffrey Haas
Re: [Idr] TCP & BGP: Some don't send terminate BG… Jakob Heitz (jheitz)
Re: [Idr] TCP & BGP: Some don't send terminate BG… Gyan Mishra
Re: [Idr] TCP & BGP: Some don't send terminate BG… Jeffrey Haas
Re: [Idr] TCP & BGP: Some don't send terminate BG… John Scudder
Re: [Idr] TCP & BGP: Some don't send terminate BG… Gyan Mishra
Re: [Idr] TCP & BGP: Some don't send terminate BG… William McCall
Re: [Idr] TCP & BGP: Some don't send terminate BG… Gyan Mishra
Re: [Idr] TCP & BGP: Some don't send terminate BG… Robert Raszuk
Re: [Idr] TCP & BGP: Some don't send terminate BG… Gyan Mishra
Re: [Idr] TCP & BGP: Some don't send terminate BG… Gyan Mishra
Re: [Idr] TCP & BGP: Some don't send terminate BG… Robert Raszuk
Re: [Idr] TCP & BGP: Some don't send terminate BG… Jeffrey Haas
Re: [Idr] TCP & BGP: Some don't send terminate BG… Gyan Mishra
Re: [Idr] TCP & BGP: Some don't send terminate BG… Robert Raszuk
Re: [Idr] TCP & BGP: Some don't send terminate BG… Gyan Mishra
Re: [Idr] TCP & BGP: Some don't send terminate BG… Gyan Mishra
Re: [Idr] TCP & BGP: Some don't send terminate BG… Enke Chen
Re: [Idr] TCP & BGP: Some don't send terminate BG… Enke Chen
Re: [Idr] TCP & BGP: Some don't send terminate BG… Enke Chen
Re: [Idr] TCP & BGP: Some don't send terminate BG… Enke Chen
Re: [Idr] TCP & BGP: Some don't send terminate BG… Enke Chen
Re: [Idr] TCP & BGP: Some don't send terminate BG… Job Snijders
Re: [Idr] TCP & BGP: Some don't send terminate BG… Enke Chen