Re: [rrg] an example of convergence
Zied BEN HOUIDI <zied.benhouidi@gmail.com> Tue, 16 March 2010 13:33 UTC
Return-Path: <zied.benhouidi@gmail.com>
X-Original-To: rrg@core3.amsl.com
Delivered-To: rrg@core3.amsl.com
Received: from localhost (localhost [127.0.0.1]) by core3.amsl.com (Postfix) with ESMTP id 59DB73A6816 for <rrg@core3.amsl.com>; Tue, 16 Mar 2010 06:33:29 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: 0
X-Spam-Level:
X-Spam-Status: No, score=x tagged_above=-999 required=5 tests=[]
Received: from mail.ietf.org ([64.170.98.32]) by localhost (core3.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 6RYNbL0a98+W for <rrg@core3.amsl.com>; Tue, 16 Mar 2010 06:33:29 -0700 (PDT)
Received: from mail-ew0-f220.google.com (mail-ew0-f220.google.com [209.85.219.220]) by core3.amsl.com (Postfix) with ESMTP id CDEC13A68E9 for <rrg@irtf.org>; Tue, 16 Mar 2010 06:33:22 -0700 (PDT)
Received: by ewy20 with SMTP id 20so1699325ewy.21 for <rrg@irtf.org>; Tue, 16 Mar 2010 06:33:28 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:in-reply-to:references :date:message-id:subject:from:to:cc:content-type; bh=v0HeS4uqxX+BYi1apmMbF+4r+7m19WaMw/inIOMm1IQ=; b=SErKpX2R7lZGQ0HcEveVuYiOlCbMfnYMuJrlO6x/HK6rthD8p533yiwtpCXOF2F6E9 U9nwVmmHUtuzigI1rcI7uvB38VxUNyABEBTqWYNggVaPrQV6mOyctSt1TsL/0pxBLm0n VOXZ3+c3K8ZiYNT72TOmrIRvYtRTMT815sb1A=
DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :cc:content-type; b=GmnHd9xnDvgeUpNM/UUNiFoaSLb7Ok7fkL+ZKqu17bnLhtfzGRYd9ZVbsbzvb18Dg2 31H+DigMBs5IOquVo62WKPRz7xPJ9EJ0LyvkMz6vDiuGy6akVEUDlDzMKeO1Y6RdCkVn xqnVOGP3jclrSUkHlCEUxvd1LhcsDjItFeq1M=
MIME-Version: 1.0
Received: by 10.216.157.149 with SMTP id o21mr58334wek.4.1268746407647; Tue, 16 Mar 2010 06:33:27 -0700 (PDT)
In-Reply-To: <alpine.LRH.2.00.1003161007360.26291@netcore.fi>
References: <C7C3EEFA.604E%tony.li@tony.li> <alpine.LRH.2.00.1003161007360.26291@netcore.fi>
Date: Tue, 16 Mar 2010 14:33:27 +0100
Message-ID: <8a5aa23c1003160633p248cbdf7hc0685bc01f8f5247@mail.gmail.com>
From: Zied BEN HOUIDI <zied.benhouidi@gmail.com>
To: Pekka Savola <pekkas@netcore.fi>
Content-Type: multipart/mixed; boundary="0016e649c820f01ffe0481eb0a79"
Cc: RRG <rrg@irtf.org>, BEN-HOUIDI Zied RD-CORE <zied.benhouidi@orange-ftgroup.com>
Subject: Re: [rrg] an example of convergence
X-BeenThere: rrg@irtf.org
X-Mailman-Version: 2.1.9
Precedence: list
List-Id: IRTF Routing Research Group <rrg.irtf.org>
List-Unsubscribe: <http://www.irtf.org/mailman/listinfo/rrg>, <mailto:rrg-request@irtf.org?subject=unsubscribe>
List-Archive: <http://www.irtf.org/mail-archive/web/rrg>
List-Post: <mailto:rrg@irtf.org>
List-Help: <mailto:rrg-request@irtf.org?subject=help>
List-Subscribe: <http://www.irtf.org/mailman/listinfo/rrg>, <mailto:rrg-request@irtf.org?subject=subscribe>
X-List-Received-Date: Tue, 16 Mar 2010 13:33:29 -0000
Hi, I see two distinct issues in your post. The time to receive/send routes and the time to install them back to FIB. The first post is about the FIB installing time (krt queue blocking). In the older posts (three years ago), it was probably more about route sending rate. http://puck.nether.net/pipermail/juniper-nsp/2007-February/007655.html http://puck.nether.net/pipermail/juniper-nsp/2007-February/007575.html "7 minutes seems rather long as the links between peers are between 2.5G to 10G" This looks like an issue with the slow BGP route sending rate that we discovered on routers from three different vendors. In fact, the sending of routes is one order of magnitude slower than it should be (despite the high link capacity, the table transfer was slow and both routers, the sender and the receiver, were almost idle during the table transfer). We found that it is due to the timer driven implementation of the interaction between BGP and TCP. We showed, using a software BGP speaker (sbgp) that an event driven implementation can reduce transfer times from 4 minutes to 7 seconds (680,000 routes). We also opened a case with one of the router vendors. The vendor confirmed that it was due to the timer driven implementation and modified its BGP implementation to be event driven. Convergence time in the vendor's set up went from 12minutes to 47 seconds with the new implementation/fix. Here is an experimental study that we've made on this issue and on BGP routing table transfers in general (it appeared in the last Internet Measurement Conference IMC2009). http://delivery.acm.org/10.1145/1650000/1644935/p350-benhouidi.pdf?key1=1644935&key2=8304478621&coll=GUIDE&dl=GUIDE&CFID=82157635&CFTOKEN=81293083 You may find more details about different factors that impact BGP routing table transfers (route sending, table size, route installing, number of BGP sessions, router CPU). Kind regards, --- Zied Ben Houidi Phd Candidate at Orange Labs 2010/3/16 Pekka Savola <pekkas@netcore.fi> > On Mon, 15 Mar 2010, Tony Li wrote: > >> <employer hat off> >> Any operator who would like to stand up and embarrass their favorite >> router >> vendor by showing a graph of router boot convergence times is welcome to >> do >> so. ;-) >> </employer hat off> >> > > Ok, I'll jump on my soap box. In worst case, Juniper high-end routers have > been demonstrated to take up to 10 minutes to install RIB to FIB. During > this time, packets usually get blackholed (aka "krt queue blocking"). See > thread in j-nsp in Dec 2009: > > http://www.mail-archive.com/juniper-nsp@puck.nether.net/msg07663.html > > A bit of continuation here: > > http://www.mail-archive.com/juniper-nsp@puck.nether.net/msg07833.html > > This is not a new issue; It has started appearing over 3 years ago: > > http://puck.nether.net/pipermail/juniper-nsp/2007-February/007655.html > http://puck.nether.net/pipermail/juniper-nsp/2007-February/007575.html > > I've personally pursued a case with Juniper TAC on this for a long time but > eventually gave up. One known architectural bottleneck is the slowness of > RE-PFE IPC (unix socket), but I don't really belive in this being the root > cause. I know others have also tried to work with the vendor to get this > fixed. > > Semi-relevant may also be a graph of BGP update sizes during a BGP DFZ flap > event that I sent to idr list three years back, showing that of 87K BGP > updates due to a single event, almost all of them are below 100 bytes in > size. > > http://www.ietf.org/mail-archive/web/idr/current/msg02398.html > > -- > Pekka Savola "You each name yourselves king, yet the > Netcore Oy kingdom bleeds." > Systems. Networks. Security. -- George R.R. Martin: A Clash of Kings > _______________________________________________ > rrg mailing list > rrg@irtf.org > http://www.irtf.org/mailman/listinfo/rrg >
- [rrg] draft-narten-radir-problem-statement-05.txt Thomas Narten
- Re: [rrg] draft-narten-radir-problem-statement-05… Joel M. Halpern
- Re: [rrg] draft-narten-radir-problem-statement-05… Danny McPherson
- Re: [rrg] draft-narten-radir-problem-statement-05… Robin Whittle
- Re: [rrg] draft-narten-radir-problem-statement-05… Thomas Narten
- Re: [rrg] draft-narten-radir-problem-statement-05… Thomas Narten
- Re: [rrg] draft-narten-radir-problem-statement-05… HeinerHummel
- Re: [rrg] draft-narten-radir-problem-statement-05… Joel M. Halpern
- Re: [rrg] draft-narten-radir-problem-statement-05… Thomas Narten
- Re: [rrg] draft-narten-radir-problem-statement-05… Brian E Carpenter
- Re: [rrg] draft-narten-radir-problem-statement-05… Joel M. Halpern
- Re: [rrg] draft-narten-radir-problem-statement-05… Noel Chiappa
- Re: [rrg] draft-narten-radir-problem-statement-05… Robin Whittle
- Re: [rrg] draft-narten-radir-problem-statement-05… Geoff Huston
- Re: [rrg] draft-narten-radir-problem-statement-05… HeinerHummel
- Re: [rrg] draft-narten-radir-problem-statement-05… Brian E Carpenter
- Re: [rrg] draft-narten-radir-problem-statement-05… HeinerHummel
- Re: [rrg] draft-narten-radir-problem-statement-05… Flinck, Hannu (NSN - FI/Espoo)
- Re: [rrg] draft-narten-radir-problem-statement-05… HeinerHummel
- [rrg] Geoff Huston's BGP/DFZ research Robin Whittle
- Re: [rrg] Geoff Huston's BGP/DFZ research - 300k … Robin Whittle
- Re: [rrg] Geoff Huston's BGP/DFZ research - 300k … Christopher Morrow
- Re: [rrg] Geoff Huston's BGP/DFZ research - 300k … Robin Whittle
- Re: [rrg] Geoff Huston's BGP/DFZ research - 300k … Paul Jakma
- Re: [rrg] Geoff Huston's BGP/DFZ research - 300k … Amund Kvalbein
- Re: [rrg] Geoff Huston's BGP/DFZ research - 300k … Robin Whittle
- Re: [rrg] Geoff Huston's BGP/DFZ research - 300k … Marshall Eubanks
- Re: [rrg] Geoff Huston's BGP/DFZ research - 300k … Paul Jakma
- Re: [rrg] Geoff Huston's BGP/DFZ research - 300k … Robin Whittle
- Re: [rrg] Geoff Huston's BGP/DFZ research - 300k … Paul Jakma
- Re: [rrg] Geoff Huston's BGP/DFZ research - 300k … Tony Li
- Re: [rrg] Geoff Huston's BGP/DFZ research - 300k … Lixia Zhang
- Re: [rrg] Geoff Huston's BGP/DFZ research - 300k … Lixia Zhang
- Re: [rrg] Geoff Huston's BGP/DFZ research - 300k … Tony Li
- Re: [rrg] Geoff Huston's BGP/DFZ research - 300k … Tom Vest
- Re: [rrg] Geoff Huston's BGP/DFZ research - 300k … Paul Jakma
- Re: [rrg] Geoff Huston's BGP/DFZ research - 300k … Paul Jakma
- Re: [rrg] Geoff Huston's BGP/DFZ research - 300k … Paul Jakma
- Re: [rrg] Geoff Huston's BGP/DFZ research - 300k … Tony Li
- Re: [rrg] Geoff Huston's BGP/DFZ research - 300k … Tony Li
- Re: [rrg] Geoff Huston's BGP/DFZ research - 300k … Christopher Morrow
- Re: [rrg] Geoff Huston's BGP/DFZ research - 300k … Lixia Zhang
- [rrg] an example of convergence Pekka Savola
- Re: [rrg] an example of convergence Zied BEN HOUIDI
- Re: [rrg] Geoff Huston's BGP/DFZ research - 300k … Constantine Dovrolis
- Re: [rrg] Geoff Huston's BGP/DFZ research - 300k … Danny McPherson
- Re: [rrg] Geoff Huston's BGP/DFZ research - 300k … Geoff Huston
- Re: [rrg] Geoff Huston's BGP/DFZ research - 300k … Tony Li
- Re: [rrg] Geoff Huston's BGP/DFZ research - 300k … Geoff Huston
- Re: [rrg] Geoff Huston's BGP/DFZ research - 300k … Tony Li
- Re: [rrg] Geoff Huston's BGP/DFZ research - 300k … Amund Kvalbein