Re: [Idr] IETF LC for IDR-ish document <draft-ietf-grow-bgp-reject-05.txt> (Default EBGP Route Propagation Behavior Without Policies) to Proposed Standard

Enke Chen <enkechen@cisco.com> Thu, 20 April 2017 21:03 UTC

Return-Path: <enkechen@cisco.com>
X-Original-To: idr@ietfa.amsl.com
Delivered-To: idr@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id E01521316B0 for <idr@ietfa.amsl.com>; Thu, 20 Apr 2017 14:03:26 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -14.523
X-Spam-Level:
X-Spam-Status: No, score=-14.523 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, RCVD_IN_DNSWL_HI=-5, RCVD_IN_MSPIKE_H3=-0.01, RCVD_IN_MSPIKE_WL=-0.01, RP_MATCHES_RCVD=-0.001, SPF_HELO_PASS=-0.001, SPF_PASS=-0.001, USER_IN_DEF_DKIM_WL=-7.5] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (1024-bit key) header.d=cisco.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id b24cjn6hqtJm for <idr@ietfa.amsl.com>; Thu, 20 Apr 2017 14:03:25 -0700 (PDT)
Received: from alln-iport-2.cisco.com (alln-iport-2.cisco.com [173.37.142.89]) (using TLSv1.2 with cipher DHE-RSA-SEED-SHA (128/128 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 21A361316B5 for <idr@ietf.org>; Thu, 20 Apr 2017 14:03:25 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=cisco.com; i=@cisco.com; l=4373; q=dns/txt; s=iport; t=1492722205; x=1493931805; h=subject:to:references:cc:from:message-id:date: mime-version:in-reply-to:content-transfer-encoding; bh=5o+hApi6jnFob/VvkCUdEeNnb9mdqLNWNZdAG7q9Y/o=; b=HLeT3msJyxFK5iceCT9vrD/ByZ/fJwIYEalmB2LK3oc2+/FRntnD3c1I thVgn9kBHrN3JXBtgVCD/FNrWIXXiEgJ9GOA9oF6i4AktCniblKlG+AcX Y5xElZo61sZ18X8RbZyq0FNvPdz1UwDpwYJ2BE2HYqY+cqiozpj17S+4C g=;
X-IronPort-AV: E=Sophos;i="5.37,227,1488844800"; d="scan'208";a="410306070"
Received: from rcdn-core-3.cisco.com ([173.37.93.154]) by alln-iport-2.cisco.com with ESMTP/TLS/DHE-RSA-AES256-GCM-SHA384; 20 Apr 2017 21:03:24 +0000
Received: from [10.41.56.234] ([10.41.56.234]) by rcdn-core-3.cisco.com (8.14.5/8.14.5) with ESMTP id v3KL3NPA004707; Thu, 20 Apr 2017 21:03:24 GMT
To: "John G. Scudder" <jgs@juniper.net>
References: <D4E812E8-AA7B-4EA2-A0AC-034AA8922306@juniper.net> <abe393d3-d1e4-7841-4620-38dab751765b@cisco.com> <68B29403-9AD9-4F06-9FE4-3F077E793D9F@puck.nether.net> <275cf744-1f64-bcbc-dabe-a47479921230@cisco.com> <20170420154142.lacvtplusepy3qcf@hanna.meerval.net> <b57162ec-f806-6e86-7713-58608f72c468@cisco.com> <32C0B4EE-6241-49F9-97F2-7107AC68678D@juniper.net>
Cc: Job Snijders <job@ntt.net>, idr@ietf.org, Hares Susan <shares@ndzh.com>, Enke Chen <enkechen@cisco.com>
From: Enke Chen <enkechen@cisco.com>
Message-ID: <e513849d-f895-0499-7bf4-5ecb24cadab7@cisco.com>
Date: Thu, 20 Apr 2017 14:03:23 -0700
User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.10; rv:45.0) Gecko/20100101 Thunderbird/45.8.0
MIME-Version: 1.0
In-Reply-To: <32C0B4EE-6241-49F9-97F2-7107AC68678D@juniper.net>
Content-Type: text/plain; charset="utf-8"
Content-Transfer-Encoding: 8bit
Archived-At: <https://mailarchive.ietf.org/arch/msg/idr/3hP9jTkYvMJSqf0ZnhrELzG5HpM>
Subject: Re: [Idr] IETF LC for IDR-ish document <draft-ietf-grow-bgp-reject-05.txt> (Default EBGP Route Propagation Behavior Without Policies) to Proposed Standard
X-BeenThere: idr@ietf.org
X-Mailman-Version: 2.1.22
Precedence: list
List-Id: Inter-Domain Routing <idr.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/idr>, <mailto:idr-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/idr/>
List-Post: <mailto:idr@ietf.org>
List-Help: <mailto:idr-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/idr>, <mailto:idr-request@ietf.org?subject=subscribe>
X-List-Received-Date: Thu, 20 Apr 2017 21:03:27 -0000

Hi, John:

It's not like the issues with software update are new :-)

As we all know, there is a complete lack of uniformity (on a global scale) with
the procedures for software update. As a result, a proposal that assumes certain
procedures is bound to be incomplete, and will not work for some.

As Eric summarized,

---
Phasing in a change of behavior over several releases is not a practical solution, because:

a) Customers will still be surprised when the default behavior finally changes, and
b) Many customers won't deploy all the releases anyway.
---

Thanks.  -- Enke

On 4/20/17 11:40 AM, John G. Scudder wrote:
> Enke,
> 
> On Apr 20, 2017, at 11:57 AM, Enke Chen <enkechen@cisco.com> wrote:
> 
>> It depends on the customer base and also how long the software has been deployed.
>> Just think about the scenario that a large number of customers would lose network
>> connectivity unexpectedly due to a default behavior change in the code. Such outages
>> could keep happening to different customers for years to come.
>>
>> Perhaps, changing "impossible" to "impractical" :-)
> 
> Various people have provided worked examples of how they think this change could be effected without causing the disruptions you warn of. I've pasted one example (from Jared) below. I haven't seen any response or acknowledgement from you, to those suggestions, just the repetition of your initial concern. I for one don't understand why you think a scheme such as Jared describes would be either "impossible" or "impractical".
> 
> If you are unconvinced about the practicality of such a scheme, it would be great if you could describe why. 
> 
> Thanks,
> 
> --John
> 
> Jared's message:
>> On Apr 20, 2017, at 9:40 AM, Jared Mauch <jared@puck.nether.net> wrote:
>>
>>
>>> On Apr 19, 2017, at 6:26 PM, Robert Raszuk <robert@raszuk.net> wrote:
>>>
>>> Keyur,
>>>
>>> You can not set "insecure mode" before you reload the OS as current OS does not have such knob. Unless you delay the deployment across N releases and enforce sequenced upgrade.
>>
>> Infact, this is the recommendation that I’ve provided to vendors that have expressed concerns.  There are many defaults that have not always been displayed, but things like IOS have “show run all” so you can see these.  
>>
>> Something like the ‘bgp unsafe-ebgp-policy’ could be generated on their respective implementations.  I didn’t think that GROW/IDR needed to tell implementors this level of how to manage their release, so this does seem somewhat out of scope, but a concern I can see needs to be thought about.
>>
>>> The only way to prevent massive reachability failure upon reload due to complete silent bgp prefix drop is to configure inbound policy for all EBGP sessions before the reload and run with new image. 
>>
>> Since we’re talking about how to operate a network:
>> - People who are taking advantage of an undefined behavior will always be surprised
>> - Vendors can take the N+1.x and N+2.x release strategy, where in N and N+1 they generate their equivalent of IOS-XR and the "bgp unsafe-ebgp-policy” policy to prevent their customers from breaking
>> - In a release N+2(or more) that would become the “default”.
>>
>>> Of course this is all assuming that someone will read carefully the release notes :) 
>>
>> Most people don’t, and I’ve always suggested to vendors they implement some sort of incremental approach to resolving this.
>>
>> What worries me is that there is a major incumbent provider who doesn’t see this as the serious (and well-documented) security issue that it is for those operating large networks.
>>
>>> If they do not the troubleshooting of this will be really painful ! CE will see EBGP session as UP, will get all the routes and will send his routes. CE will have no clue if PE dropped or accepted his routes. Likewise on the other end .. Only imagine a network which has 10s of thousands of VPN CEs as Bruno mentioned and their provider not following all releases CEs are running. 
>>
>>
>> If they don’t know how to troubleshoot BGP, that isn’t the vendors fault.
>>
>>> At least doing it as part of OPEN msg will be immediately indicated to both ends. 
>>
>> This is you promoting a different draft, I recommend another thread for that draft.
>>
>> - Jared
> ...
>