Re: [Idr] I-D Action: draft-ietf-idr-error-handling-03.txt

John Leslie <john@jlc.net> Thu, 17 January 2013 16:48 UTC

Return-Path: <john@jlc.net>
X-Original-To: idr@ietfa.amsl.com
Delivered-To: idr@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 546D521F8512 for <idr@ietfa.amsl.com>; Thu, 17 Jan 2013 08:48:48 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -106.161
X-Spam-Level:
X-Spam-Status: No, score=-106.161 tagged_above=-999 required=5 tests=[AWL=0.438, BAYES_00=-2.599, RCVD_IN_DNSWL_MED=-4, USER_IN_WHITELIST=-100]
Received: from mail.ietf.org ([64.170.98.30]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 6iANz-LFiPYH for <idr@ietfa.amsl.com>; Thu, 17 Jan 2013 08:48:47 -0800 (PST)
Received: from mailhost.jlc.net (mailhost.jlc.net [199.201.159.4]) by ietfa.amsl.com (Postfix) with ESMTP id 9430E21F84EB for <idr@ietf.org>; Thu, 17 Jan 2013 08:48:45 -0800 (PST)
Received: by mailhost.jlc.net (Postfix, from userid 104) id 9B7FD33D5A; Thu, 17 Jan 2013 11:48:44 -0500 (EST)
Date: Thu, 17 Jan 2013 11:48:44 -0500
From: John Leslie <john@jlc.net>
To: Brian Dickson <brian.peter.dickson@gmail.com>
Message-ID: <20130117164844.GF41685@verdi>
References: <CAEGVVtBy-zdLz8hVajLnuAqgzfgQHrseK4r-N9=pOZGtqV7LbA@mail.gmail.com> <074d01cdd536$173f5830$45be0890$@highwayman.com> <9474D8DC-30FF-4C52-9504-15CBCC47E7D8@ericsson.com> <07df01cdd661$f28ef7c0$d7ace740$@highwayman.com> <36E98AE5-3EF8-4738-9982-42B9CA0BAAF5@rob.sh> <005001cdd6da$099f1e90$1cdd5bb0$@highwayman.com> <828AAFF5-0260-4AA6-BBDC-6C1F69919837@ericsson.com> <009001cdd6ff$1c982530$55c86f90$@highwayman.com> <CAPWAtb+JZaJejebL0qd8LzC+3zhGYcagnz9m7gqq=AMC9T=mhw@mail.gmail.com> <CAH1iCirraR6BcRvupMN-+t=HXv9s2giTsOKzsDa=GhyCNj_THQ@mail.gmail.com>
Mime-Version: 1.0
Content-Type: text/plain; charset="us-ascii"
Content-Disposition: inline
In-Reply-To: <CAH1iCirraR6BcRvupMN-+t=HXv9s2giTsOKzsDa=GhyCNj_THQ@mail.gmail.com>
User-Agent: Mutt/1.4.1i
Cc: idr@ietf.org
Subject: Re: [Idr] I-D Action: draft-ietf-idr-error-handling-03.txt
X-BeenThere: idr@ietf.org
X-Mailman-Version: 2.1.12
Precedence: list
List-Id: Inter-Domain Routing <idr.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/idr>, <mailto:idr-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/idr>
List-Post: <mailto:idr@ietf.org>
List-Help: <mailto:idr-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/idr>, <mailto:idr-request@ietf.org?subject=subscribe>
X-List-Received-Date: Thu, 17 Jan 2013 16:48:48 -0000

Brian Dickson <brian.peter.dickson@gmail.com> wrote:
> 
> It is unfortunate that UPDATE messages do not have any sort of identifier.

   Indeed! One hopes BGP5 won't suffer from this...

> It would be nice to be able to have the receiver send a NOTIFICATION that
> says, in effect, UPDATE #foo was garbled, please send JUST the afi/safi &
> NLRI so I know what to ignore.

   The downside, of course, is that the sender needs to agree to do so.

   But IMHO, we're not going to get fully "safe" error recovery without
involving the sender. (There is a sign in my office, which everyone
who has worked here has had to "write 100 times," saying, "Error
correction is the most error-prone operation in computing.")

> Maybe this could be handled via another UPDATE Attribute - the
> identification for an UPDATE itself?
> Then refer to this in the appropriate NOTIFICATION, type 3, new subtype,
> with parameter identifying the problem UPDATE?

   There are various ways to do this. I, being a bear of small brain,
chose to do it with a new message type, thus entirely removing any
question of pulling it out of an UPDATE message. (In BGP5, it should
be a required field early in the UPDATE!)

> Then there is the question of the original UPDATE (generally on-the-fly
> data). If the error happens soon enough, maybe the sender still has the
> UPDATE in his/her outgoing TCP buffer?

   Actually, this wouldn't be hard, provided the receiver gives a prompt
enough signal that an UPDATE has been "successfully" received.

> Otherwise, this would mean having to hold onto UPDATEs for possibly longer
> than desired.
> (LRU for buffer management is so very convenient when it is available, of
> course. Pack rat coding rules.)

   But all the sender really needs to save is the NLRI(s) deserving to
be withdrawn. That's a smaller amount of data...

> And of course, this pushes the "ACK" further up the stack, from TCP to the
> parent application (BGP). Again, maybe not that bad an idea, but definitely
> non-trivial.

   IMHO, the ACK needs to be further up the stack. Sooner or later we'll
run into a case where the error is only apparent in "context", but the
proper correction is to back out the UPDATE and ask the sender to
generate a "safer" version of it.

> (I think this is more reliable, but again requires changes to both ends.
> The net benefit can be substantial, in terms of not having to "just guess"
> when the UPDATE is already known to be busted.)

   Exactly!

--
John Leslie <john@jlc.net>