Re: [Idr] WG Adoption call for draft-wang-idr-rd-orf-05.txt (2/4/2021 to 2/18/2021)

Robert Raszuk <> Sat, 13 February 2021 11:42 UTC

Return-Path: <>
Received: from localhost (localhost []) by (Postfix) with ESMTP id 3E0203A0FED for <>; Sat, 13 Feb 2021 03:42:16 -0800 (PST)
X-Virus-Scanned: amavisd-new at
X-Spam-Flag: NO
X-Spam-Score: -2.098
X-Spam-Status: No, score=-2.098 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, HTML_MESSAGE=0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=ham autolearn_force=no
Authentication-Results: (amavisd-new); dkim=pass (2048-bit key)
Received: from ([]) by localhost ( []) (amavisd-new, port 10024) with ESMTP id 9j7sxiR0KDwr for <>; Sat, 13 Feb 2021 03:42:13 -0800 (PST)
Received: from ( [IPv6:2a00:1450:4864:20::132]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by (Postfix) with ESMTPS id 886033A0FEC for <>; Sat, 13 Feb 2021 03:42:12 -0800 (PST)
Received: by with SMTP id u25so3271121lfc.2 for <>; Sat, 13 Feb 2021 03:42:12 -0800 (PST)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed;; s=google; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=TONgxPYMYbW3jVx2ur6CnXY/BuMssXkEfnifYCUMxIw=; b=SY/F/p4qKUJybHWpw8FNrMrhBuC/KnKGwcDfk41AlHuNjXgQUjABHHtgti+RJTGS+R Fp2WdVB+SuCJv8zNnrNPmHKWG1BTgFOwzS0tZGzmeKDfxmw73UvgeJV3pxZHuiydMWC0 7mOKEoISIu1Zk5TEAJYB+3NeAi72Gv4NbnjdasJYImFER8RGEfVqub1mImm8K/CmnjDg EG7jiSLR/j1R5Eks0BNDmO7lBII8zfW4NxVPaHlh7Wwu200QSRlAV9fXikQa9zv9x5XY y//GVcNnG0utQKzZFgIWjQi5awbnX3SKn+tiadDpIwcPfpegFsWOCS0PsVSD0rIWC2s7 fddg==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed;; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=TONgxPYMYbW3jVx2ur6CnXY/BuMssXkEfnifYCUMxIw=; b=Jx3UhXib++/OtThoGyn0bRQnrgJIuUyjiZYINuf6DCCoGqCDXB+prV73X82HGey9ti NbTT4Ti/Ze1dYJD8QfuYxy1sFcieBiIQSt2p3WvvU7SRmsLlFZVWPMoWrc2xrzwRdvjR PpDKSywx6d6EUKV6ToMV3hwvTQIcKOpPMbZOppE+FZ1Ckm+/s58c5wK4NiWNNE6PNPpU lQWnj25+B01UCw7derkL+1B5apl4i48ttAn3J0ZR2tcyS9AZ0cv76BUhOYiK6aOw2ru+ EitHh0W0Uay04ztT945cjdEacfX+BFsI2B+tTts+N/27QXmNjbV9fkj2iRipmvANgOhm D7aw==
X-Gm-Message-State: AOAM532eZuyDukG8w+wBhY7emuV/08rUkvT+Cz0vctHK9lTD2R2Hn2Yf 8JbZnDS0c+KgtzmOkT3kuV4VU8XmiUHoTRug0XGJlg==
X-Google-Smtp-Source: ABdhPJymJtLpK0CNOpUKqpJ3+Uu1z8n4XOeiPdTPDPKCRCsDIMTQN7Ybt0OQtgPv1vU5w0CbxiUTwLoBC5O7VAfJBlQ=
X-Received: by 2002:a19:3817:: with SMTP id f23mr3757436lfa.591.1613216530503; Sat, 13 Feb 2021 03:42:10 -0800 (PST)
MIME-Version: 1.0
References: <> <> <>
In-Reply-To: <>
From: Robert Raszuk <>
Date: Sat, 13 Feb 2021 12:42:01 +0100
Message-ID: <>
To: Gyan Mishra <>
Cc: Aijun Wang <>, "Jakob Heitz (jheitz)" <>, Susan Hares <>, "idr@ietf. org" <>
Content-Type: multipart/alternative; boundary="0000000000003f2fd805bb363e54"
Archived-At: <>
Subject: Re: [Idr] WG Adoption call for draft-wang-idr-rd-orf-05.txt (2/4/2021 to 2/18/2021)
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: Inter-Domain Routing <>
List-Unsubscribe: <>, <>
List-Archive: <>
List-Post: <>
List-Help: <>
List-Subscribe: <>, <>
X-List-Received-Date: Sat, 13 Feb 2021 11:42:16 -0000


>   The problem we are trying to solve is a scenario where you have an
offending PE that is flooding routes and a weak PE that is overwhelmed by a
flood of routes.

The problem is a valid problem. But the proposed solution to the problem is
not. And moreover solutions to this very problem are widely known and used
for years.

*The problem as a matter of fact has nothing to do with VPNs of any sort.*

If you are ISP offering Internet transport unless you apply proper
protection you would be badly exposed. Your clients or peers or upstreams
injecting millions of routes and melting your network and perhaps even
global Internet (if they would use a registered block).

And yes BGP ingress policy here is used to filter out junk before it enters
any network. The same policy must be used in VPN cases too.

Indeed years have passed and I think I have only seen in a very few cases
that operators offering L3VPNs are doing prefix ingress filtering ... Max
prefix on ingress is used much more often. Those are the right tools here
to work on. I am not saying we should not invent more ... we should.


* Customise secure BGP to work in VPN cases as example.

* Augment RRs to be a bit more intelligent with pitch of ML - if number of
routes with given RD for time Tx is R moment you receive 100*R you suspend
those and raise NOC alarm before spraying everywhere

* Do not put VPN customer routes into your data plane ... just handle next
hops. Redefine RFC4364 all together and use IP transport for it.

etc ...

Focus on not allowing the meltdown is the proper solution space ... But
here instead we do nothing to prevent the fire to start and instead focus
on tools to extinguish it. Wrong approach. And specifically this tool
(RD-ORF) does way too much damage when used.


On Sat, Feb 13, 2021 at 4:16 AM Gyan Mishra <> wrote:

> All
> From Susan Hares summary of where we are at with the adoption call let’s
> start with the problem this draft is trying to solve and gaining
> consensus.  Once we gain consensus we can get back to RD-ORF solution.  See
> w
> a) the problem this draft is drafting to solve relating to BGP routes,
> The problem we are trying to solve is a scenario where you have an
> offending PE that is flooding routes and a weak PE that is overwhelmed by a
> flood of routes.  This is not a normal situation and is an outage situation
> where the weak PE being overwhelmed by a flood of routes.  Do we all agree
> to the problem statement?
> Why and why not?
> b) the need for additional mechanisms to solve the problem,
> Do other methods exist that can solve the problem and if not do we need a
> new mechanism to solve this?
> RTC, Peer maximum prefix, VPN maximum prefix
> c) a clear description of the technology to solve the problem.
> Do we all agree that in a normal situation we would never filter on RD as
> that would partition the VPN which is unwanted and what Robert mentioned.
> As this is not a normal situation but a unique situation where a weak PE is
> overwhelmed by a flood of routes.  How best can this be solved?
> On Fri, Feb 12, 2021 at 10:32 AM Aijun Wang <>
> wrote:
>> Hi, Robert:
>> Yes, the behavior of the device should be determined. There maybe several
>> factors to be considered for this local behavior, we should describe it
>> more clearly in this section later.
>> We have discussed the differences between RTC and RD-ORF a lot. As Haibo
>> mentioned, they are not exclusive to each other, and can be used together
>> in some situations. But they are different and can’t replace each other.
>> Aijun Wang
>> China Telecom
>> On Feb 12, 2021, at 23:04, Robert Raszuk <> wrote:
>> Sorry Aijun,
>> What you say is just handwaving. There is no room for it in any spec.
>> When code is written PE must deterministically behave so the RR or any
>> other network element.
>> Statements "decisions of PE2 to judge" are not acceptable in protocol
>> design.
>> Just imagine that each PE does what it feels like in a distributed
>> network .... Same for BGP same for IGP etc ....
>> And all of this is not needed if on ingress between PE1 and HQ1 you apply
>> max prefix of 2 or even 100. It is also not needed if you enable  RTC to
>> send RT:TO_HUB from PE2 to RR.
>> But I understand - no matter what we say or how much we spend time to
>> explain why this idea is a bad idea you are still going to push this fwd.
>> Oh well ...   If I were you I would spend this time to redefine L3VPN such
>> that customer routes are never needed to be sent to SP core routers.
>> Thx,
>> R.
>> On Fri, Feb 12, 2021 at 3:47 PM Aijun Wang <>
>> wrote:
>>> Hi, Robert:
>>> has
>>> described such situations, which will require the additional local
>>> decisions of PE2 to judge whether to send the RD-ORF message out.
>>> In your example, if only the HUB VRF exceed but the resources of PE2 is
>>> not exhausted, then the PE2 will not send the RD-ORF message. It may just
>>> discard the excessive 100000/32 routes.
>>> If the resources of PE2 is nearly exhausted, it must send the RD-ORF
>>> message out. Or else not only the Spoke VRF, but also other VPNs on this
>>> device can’t be used.
>>> Regarding to RR, it is the same principle: if RR can cope with such
>>> flooding, it need not send out RD-ORF to PE1. If RR can’t cope with, it
>>> must send out the RD-ORF message, or else not only the VPN that import RD
>>> X1 routes can’t work, but also other VPNs that don’t import RD x1 routes.
>>> RD-ORF mechanism just keep the influences as small as possible.
>>> Wish the above explanation can refresh your review of this draft.
>>> We are also hopeful to invite you join us to make RD-ORF mechanism more
>>> robust and meet the critical challenges.
>>> Aijun Wang
>>> China Telecom
>>> On Feb 12, 2021, at 19:30, Robert Raszuk <> wrote:
>>> Aijun & Gyan,
>>> Let me try one more (hopefully last time) to explain to both of you -
>>> and for that matter to anyone how supported this adoption.
>>> Let's consider very typical Hub and Spoke scenario as illustrated below:
>>> <image.png>
>>> HQ1 is advertising two routes:
>>> - one default with RDX1 with RT TO_SPOKE
>>> - one or more specifics with RDX1 to the other HUBs
>>> Now imagine HQ1 bought a new BGP "Optimizer" and suddenly is starting to
>>> advertise 100000 /32 routes just to the other HUB with RT: TO_HUB.
>>> <image.png>
>>> So PE2 detects this as VRF with RDX2 on it got overwhelmed during import
>>> with RT TO_HUB and starts pushing RDX1 (original RD) to RR to stop getting
>>> those routes.
>>> Well all great except now you are throwing baby with the water as all
>>> spokes attached to PE2 which just import default route to HUB HQ1 also can
>>> no longer reach their hub site as their default route will be removed.
>>> Therefor they will have nothing to import with RT:TO_SPOKE
>>> Further if RR "independently" decided ... oh let's push this ORF to PE1
>>> then all of the spokes attached to perhaps even much more powerful PE3 can
>>> also no longer reach their headquarters.
>>> - - -
>>> *Summary: *
>>> The above clearly illustrates why the proposed solution to use RD for
>>> filtering is in fact harmful.
>>> See when you design new protocol extensions the difficulty is to not
>>> break any existing protocols and deployments.
>>> Hope this puts this long thread to rest now.
>>> Thx,
>>> Robert
>>> --
> <>
> *Gyan Mishra*
> *Network Solutions A**rchitect *
> *M 301 502-134713101 Columbia Pike *Silver Spring, MD