Re: Neighbor Unreachability Detection is too impatient

Mikael Abrahamsson <swmike@swm.pp.se> Wed, 25 May 2011 09:20 UTC

Return-Path: <swmike@swm.pp.se>
X-Original-To: ipv6@ietfa.amsl.com
Delivered-To: ipv6@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id B4B34E0679 for <ipv6@ietfa.amsl.com>; Wed, 25 May 2011 02:20:53 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.599
X-Spam-Level:
X-Spam-Status: No, score=-2.599 tagged_above=-999 required=5 tests=[BAYES_00=-2.599]
Received: from mail.ietf.org ([64.170.98.30]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id KZ-diLb0ddpF for <ipv6@ietfa.amsl.com>; Wed, 25 May 2011 02:20:53 -0700 (PDT)
Received: from uplift.swm.pp.se (ipv6.swm.pp.se [IPv6:2a00:801::f]) by ietfa.amsl.com (Postfix) with ESMTP id C585EE06DF for <ipv6@ietf.org>; Wed, 25 May 2011 02:20:52 -0700 (PDT)
Received: by uplift.swm.pp.se (Postfix, from userid 501) id BE2B19C; Wed, 25 May 2011 11:20:49 +0200 (CEST)
Received: from localhost (localhost [127.0.0.1]) by uplift.swm.pp.se (Postfix) with ESMTP id BBDE59A for <ipv6@ietf.org>; Wed, 25 May 2011 11:20:49 +0200 (CEST)
Date: Wed, 25 May 2011 11:20:49 +0200
From: Mikael Abrahamsson <swmike@swm.pp.se>
To: ipv6@ietf.org
Subject: Re: Neighbor Unreachability Detection is too impatient
In-Reply-To: <m1QPA6W-0001hIC@stereo.hq.phicoh.net>
Message-ID: <alpine.DEB.2.00.1105251114100.13754@uplift.swm.pp.se>
References: <m1QOaVe-0001pxC@stereo.hq.phicoh.net> <4DDBDE7F.5070706@acm.org> <m1QOv9u-0001hgC@stereo.hq.phicoh.net> <4DDBE941.8030007@acm.org> <m1QPA6W-0001hIC@stereo.hq.phicoh.net>
User-Agent: Alpine 2.00 (DEB 1167 2008-08-23)
Organization: People's Front Against WWW
MIME-Version: 1.0
Content-Type: TEXT/PLAIN; charset="US-ASCII"; format="flowed"
X-BeenThere: ipv6@ietf.org
X-Mailman-Version: 2.1.12
Precedence: list
List-Id: "IPv6 Maintenance Working Group \(6man\)" <ipv6.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/ipv6>, <mailto:ipv6-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/ipv6>
List-Post: <mailto:ipv6@ietf.org>
List-Help: <mailto:ipv6-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/ipv6>, <mailto:ipv6-request@ietf.org?subject=subscribe>
X-List-Received-Date: Wed, 25 May 2011 09:20:53 -0000

On Wed, 25 May 2011, Philip Homburg wrote:

> So you are saying (with a bit exaggeration from my side) that some links 
> in data centers have so many hosts that a network disruption of a few 
> seconds will cause a multicast storm big enough to meltdown the network.
>
> With modern CPUs and a Gbit/s network that seems very odd to me.

I have an example from reality:

At AMSIX, there are a few hundred routers on the same L2 network. 
Connected to this we have a router from a major well known vendor. This 
vendor does not filter IPv6 multicast messages on the linecard, but 
instead punts all of it to RP, and AMSIX L2 infrastructure treats all 
multicast as broadcast, so all multicast (mainly ND traffic) is sent out 
on all ports.

This traffic to the RP was enough to cause IPv6 BGP sessions to go down 
because it was continous ND messages coming in all the time (few hundred 
PPS).

ND is chatty, and it does cause problems on large networks. One might call 
this multiple faults on multiple levels (bad linecard code, bad L2 
platform, bad low-performing CPU on the RP), but it does cause problems in 
real life regardless.

-- 
Mikael Abrahamsson    email: swmike@swm.pp.se