Re: [Rift] Device restart problem

Tony Przygienda <tonysietf@gmail.com> Thu, 24 October 2019 15:59 UTC

MIME-Version: 1.0
References: <201910241652249011192@zte.com.cn>
In-Reply-To: <201910241652249011192@zte.com.cn>
From: Tony Przygienda <tonysietf@gmail.com>
Date: Thu, 24 Oct 2019 08:58:35 -0700
Message-ID: <CA+wi2hNN9JrRft2_n0eHmWq4+p2KHdBH3dwQ6pat8Ri02FTrHQ@mail.gmail.com>
To: xu.benchong@zte.com.cn
Cc: rift@ietf.org
Content-Type: multipart/alternative; boundary="000000000000a29f740595aa1d16"
Archived-At: <https://mailarchive.ietf.org/arch/msg/rift/w-JTBT4KJH8rJDzRXLQRb_Np074>
Subject: Re: [Rift] Device restart problem
Precedence: list

hey xu, I see deeper and deepr into implementation, you just found first
layer of the onion here BTW ;-)  The seqnr# handling is since times
immemorial one of the trickier parts of IGP implementation (but not only,
same problems exists in  other places but there, the information is not
persistent so problem is not as pressing).

Multiple mechanisms kick in here

a) the seqnr# is circular which is a very important piece of the puzzle.
you cannot generate a "biggest" number no'one can override. math explained
in appendix in lots detail. BTW, not my invention, smarter people than me
worked stuff out long time ago but there was never a full, detailed, easy
to implement writedown AFAIK.
b) yes, the fact that we flood only northbound prevents via normal "flat
flooding"  Leaf111 "getting" its old TIE with a higher sequence number.
Using flat flooding south would of course kill largely the scalability of
the protocol and make it equivalent to OSPF or ISIS  or any other "normal"
link-state approach in terms of flooding complexity (well, flood reduction
would still work ;-)
c) However, observe that Table 3 holds the key to the solution. TIDE/South
tells you what you need to do to describe your database to the neighbor
south. The description from Spine111 includes the description of N-TIEs of
Leaf111 and with that Leaf111 can realize that there is a stale N-TIE it
originated before reboot and re-issue with a higher sequence number (that's
where a] comes into play)

When you keep on implementing and testing you'll find another very
interesting, far more complex case that we solved but I will keep the
suspension going ;-)

The observation on the one week is also correct. Done very purposefully.
Let's say RIFT runs on 0.5M devices (scale we aim at given
multi-homed/overlay originating servers can run it as well). If you assume
5 TIEs per device that's 2.5M TIEs @ the top of the fabric (large but not a
scary number compared to what we do with BGP and add/path on daily basis in
world's most scalable implementations ;-). If we'd have something like 1hr
reorigination we talk  2.5M/24 = 100K re-originations per hour. That gives
you a flooding rate into ToFs of 30 TIEs/sec (assuming perfect flood
reduction). All that disregarding things like server rebooting or container
architectures which will possibly inject lots prefixes on moves/boots and
so on. So refresh often is churn that is unnecessary. With 1 week lifetime
we're talking 15K TIEs per hour refresh which is a manageable number given
we're talking 0.5M devices.

Observe however that you can issue with any lifetime you choose as a device
and RIFT will work (and when emptying TIEs you are supposed to originate
with 300secs only). So the 1 week is basically a protocol constant that can
be knobbed.

Let us know when you got first pieces inter'oped with Bruno's open source
BTW. Things always become much more clear when implementations are bashed
against each other ;-)

--- tony

On Thu, Oct 24, 2019 at 1:52 AM <xu.benchong@zte.com.cn> wrote:

> Hi, Tony
>
> There is a device restart problem
>
> In draft-ietf-rift-rift-08 Figure 2, N-TIE of Leaf111 flooded to ToF21 via
> Spine111. Seq NR may be larger.
>
> Leaf111 restarts and regenerates N-TIE. The random seq NR may be small, so
> that when Spine111 receives it, it will compare the seq NR and discard the
> new message.
>
> According to the behavior of Appendix c.3.4 b.3, it is hoped that Spine111
> sends DBTIE to Leaf111 to update seq NR.
>
> However, according to the flooding range of N-TIE, this message cannot be
> sent out.
>
> In this way, there will be a large number of invalid N-TIEs in the network
> for a long time (the default expire time of the protocol is 1 week)
>
> Is this understanding correct? How does rift solve this problem?
>
>
> Thank you!
>
> Benchong
>
>
>
>
>

[Rift] Device restart problem xu.benchong
Re: [Rift] Device restart problem Tony Przygienda
Re: [Rift] Device restart problem xu.benchong
Re: [Rift] Device restart problem Tony Przygienda