Re: [Idr] Review Request for draft BGP Overload

amit bhagat <scet.amit@gmail.com> Thu, 07 July 2016 17:33 UTC

Return-Path: <scet.amit@gmail.com>
X-Original-To: idr@ietfa.amsl.com
Delivered-To: idr@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id D4E8512D1A7 for <idr@ietfa.amsl.com>; Thu, 7 Jul 2016 10:33:35 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.699
X-Spam-Level:
X-Spam-Status: No, score=-2.699 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, FREEMAIL_FROM=0.001, HTML_MESSAGE=0.001, RCVD_IN_DNSWL_LOW=-0.7, SPF_PASS=-0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id evKzRPfJzQdy for <idr@ietfa.amsl.com>; Thu, 7 Jul 2016 10:33:29 -0700 (PDT)
Received: from mail-qt0-x235.google.com (mail-qt0-x235.google.com [IPv6:2607:f8b0:400d:c0d::235]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 4158212D572 for <idr@ietf.org>; Thu, 7 Jul 2016 10:33:29 -0700 (PDT)
Received: by mail-qt0-x235.google.com with SMTP id m2so12008609qtd.1 for <idr@ietf.org>; Thu, 07 Jul 2016 10:33:29 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :cc; bh=swkEyJaX/0JPwy+Up/9VXQgaBV6Of8McrjS29zRIdzk=; b=p06SxB5V71Y26MOsW0fgsKO7Ep3bxeX/prSDwhVW4Y7A/GzHLPjW8DKjTdNgyQpah4 S0JXzGoxsWxqohBw/EKkD1ZvGLr7Cs5rZhzeXx49q5K7Umhpfs5mkzyLiqummGwzIVhS R4XBVQgvwIBCAwnIT/9uS5fWlcalqsyE3wk4CToHnQ5JEW1aFrzVjninWOeiJufhH4zt 26owKDNMaTuFSBGIRf0FIxONnCnxnH67QMjcQEcziGEOoEkC446ATwRRUnSde1DdSYR4 rOVdRSVyPNkhiKMnrUkvfuphrM81neHVng5WDqUFvEPOW3qCCgoheELw6qMiCVx3m1n5 F1/g==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:mime-version:in-reply-to:references:from:date :message-id:subject:to:cc; bh=swkEyJaX/0JPwy+Up/9VXQgaBV6Of8McrjS29zRIdzk=; b=Db8egqA/WoPQSWMhK+rDQxtvAXXazJO0ZEe7F0Fsa3kRcc3AeslhT2w9j95+WJksyl 1bq2inQhh+iM2t9EQu5ef5z2YoLc10qO03jILW9uJZ7wii4g0SK0ghN7M/ibmMfYR2Yl uyv1as1ciuHv+Yi1LYWc0yUvYWjrf0MfqG48aO4DgKv2vjNhqLaYNhGPGQA8MPJOt/A7 /5Zb+lvvfW1kLydUa4c3zQRx7TrknEFarEVxNXFNPdvGTtoyMwAUtYXJzZJ5wHr5d47B xqA9todd/wEGeiJG0z2g4MuEQW3OUhgRlZxJB3/Us/2XMERTbetqf15sDPD5BguBvjja jG7A==
X-Gm-Message-State: ALyK8tJlXOvsBlvjW98elKYzVJXCr9FK8F1KZPTF+xzULcBeOxwEH5UlcQbN0A7d7spMgdRc1h9AwjCgdbssoQ==
X-Received: by 10.200.37.150 with SMTP id e22mr2003543qte.37.1467912808399; Thu, 07 Jul 2016 10:33:28 -0700 (PDT)
MIME-Version: 1.0
Received: by 10.55.43.232 with HTTP; Thu, 7 Jul 2016 10:33:27 -0700 (PDT)
In-Reply-To: <CA+b+ERnZkCiNOFBgy3boZuURWGkPHrXN_2cYsSWdFSHNctFxRQ@mail.gmail.com>
References: <CADJmATHTYEW3_S+8=wLmjYL35oPF24J+Tu9O+m7w_7uLUMghjQ@mail.gmail.com> <CA+b+ERnOgiq3g+oCp-3vx1Ncq8QSrf-32Go_RWDeO+P7VHUNRA@mail.gmail.com> <CADJmATGndK0s-8Pao3cp7DfubsBFc1FG+LgMh-5NUJ32HAsvFg@mail.gmail.com> <CA+b+ER=-_omoqvM_=nqhzm_D1HB8-Wp+0ixadwaRa-7URKbHWQ@mail.gmail.com> <CADJmATFqusgXiMAzQrCqMczkcGkNaLiKcOMu3k7-N_pMjoOMGg@mail.gmail.com> <CA+b+ERnyNk1ZSA_mu+VnRTthpCibzGGN8SCeEaSoPZq1wHNzYA@mail.gmail.com> <CADJmATH0jD0eiNewvFsGhpHGJ=LRfi=hAN-9f17TXBoLVCB1nw@mail.gmail.com> <CA+b+ERnZkCiNOFBgy3boZuURWGkPHrXN_2cYsSWdFSHNctFxRQ@mail.gmail.com>
From: amit bhagat <scet.amit@gmail.com>
Date: Thu, 07 Jul 2016 10:33:27 -0700
Message-ID: <CADJmATGtWNdLzUxDzBxC_ZyXMymkZxSLDxfYBgQ9Td8-uhoL1Q@mail.gmail.com>
To: Robert Raszuk <robert@raszuk.net>
Content-Type: multipart/alternative; boundary="001a11c04ca881896205370f154f"
Archived-At: <https://mailarchive.ietf.org/arch/msg/idr/vnPX4Qnqiw02rFr-VGpocU4oZpw>
Cc: idr wg <idr@ietf.org>
Subject: Re: [Idr] Review Request for draft BGP Overload
X-BeenThere: idr@ietf.org
X-Mailman-Version: 2.1.17
Precedence: list
List-Id: Inter-Domain Routing <idr.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/idr>, <mailto:idr-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/idr/>
List-Post: <mailto:idr@ietf.org>
List-Help: <mailto:idr-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/idr>, <mailto:idr-request@ietf.org?subject=subscribe>
X-List-Received-Date: Thu, 07 Jul 2016 17:33:36 -0000

Hi Robert,

I uploaded v01 of the draft to address some of your concerns. Sorry, I
misunderstood what you meant by leaking. Agreed, even though
MP_(UN)REACH_NLRI are non-transitive, they will be advertised to
neighboring AS. To stop this flooding, I have added that the UPDATE message
must be sent with NO_ADVERTISE community. Not sure if this is "hacky".

It also includes the usecase I am referring to - neighbor-count threshold.

I appreciate your feedback.

Thanks.
Amit

On Wed, Jul 6, 2016 at 10:41 AM, Robert Raszuk <robert@raszuk.net> wrote:

> Hi Amit,
>
> BGP Capability is completely orthogonal. It only indicates who support
> this new SAFI. I am talking about the case where  as this new SAFI which is
> needed only between *directly connected* peers will use generic BGP
> mechanism and will propage NLRIs across entire DC. It would be completely
> unnecessary yet your draft does not discuss this point. This is what I call
> "leaking".
>
> As far as signalling number of active peers to a peer I do not see how you
> encode this in the currently posted proposal nor it is clear what action
> such router should automatically do with that information. Honestly I would
> rather see the operator action here.
>
> Thx,
> R.
>
>
>
> On Wed, Jul 6, 2016 at 7:34 PM, amit bhagat <scet.amit@gmail.com> wrote:
>
>> Hi Robert,
>>
>> There is no risk of leaking as the Capability will be exchanged with
>> peers upon session establishment.
>>
>> What this feature also gives is monitoring Established neighbor-count.
>> Consider a router connected to 10 peers and all 10 peers advertise exactly
>> same prefixes in a CLOS fabric. Now, if 5 peer sessions go down, the
>> capacity is effectively reduced to 50% while this router is still in the
>> transit. This can cause congestion. Instead, the router can declare itself
>> overloaded and the downstream routers then rely on other paths. The update
>> advertisement delay can be an implementation detail.
>>
>> Thanks.
>> Amit
>>
>> On Wed, Jul 6, 2016 at 12:34 AM, Robert Raszuk <robert@raszuk.net> wrote:
>>
>>> Hi Amit,
>>>
>>> I agree that recently BGP is being stretched towards DC-fabric side to
>>>> encompass various use cases.
>>>>
>>>> However, the way I see this draft, it can be useful for both, Internet
>>>> and DC-fabrics. The main reason is to keep traffic drain procedure BGP
>>>> topology agnostic, exactly same as ISIS Overload-bit. Agree, BGP has a
>>>> bigger blast radius in Internet compared to ISIS but appropriate
>>>> implementation of the feature can provide good benefits.
>>>>
>>>
>>>
>>> ​I don't think ​this is about "blast radius". ISIS or OSPF are link
>>> state protocols and each node in the given flooding scope computes its
>>> independent SPF hence flooding such information is a necessity for
>>> consistent forwarding.
>>>
>>> Contrary to the above BGP is path/distance-vector where each BGP speaker
>>> recomputes its bRIB and re-advertises it. Therefor for the use case you
>>> have in mind all what is required is to signal in some way to a bgp peer(s)
>>> that you may not want or want again to be in forwarding for a given SAFI.
>>>
>>> You defined a new SAFI as well as new NLRI format to use for point to
>>> point signalling .. I am not convinced this is the right level of
>>> signalling choice for this purpose. How do you stop propagation of such
>>> NLRIs around ? It would be pretty harmful if one router in Clos fabric will
>>> leak it and it breaks entire fabric - wouldn't you agree ? You at very min
>>> MUST enforce the NO-EXPORT/NO-ADVERTISE semantics for such SAFI which
>>> currently your draft seems to be missing.
>>>
>>> There are couple of alternatives to accomplish the same though:
>>>
>>> - using flag in dynamic capabilities
>>> - local poisoning of next hops
>>> - use of local pref/med (same as OSPF max metric:)
>>> - use of G-shut community
>>>
>>> or simply shutting the SAFIs. When you shut SAFI in one shot all paths
>>> learned are removed and best path (or multipath) recomputed. The potential
>>> "hit" would be on re-enabling it such that you need readvertise your
>>> underlay again.
>>>
>>> It is gracefull (no packet drops) as local forwarding can continue till
>>> everyone around stops sending you packets in a given table_id
>>> (corresponding to SAFI which has been shut down).
>>>
>>> Many thx,
>>> R.
>>>
>>>
>>
>