Re: [lisp] Questions about draft-saucez-lisp-itr-graceful-03

Damien Saucez <damien.saucez@inria.fr> Wed, 19 February 2014 20:29 UTC

Return-Path: <damien.saucez@inria.fr>
X-Original-To: lisp@ietfa.amsl.com
Delivered-To: lisp@ietfa.amsl.com
Received: from localhost (ietfa.amsl.com [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 5583F1A057F for <lisp@ietfa.amsl.com>; Wed, 19 Feb 2014 12:29:15 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -7.097
X-Spam-Level:
X-Spam-Status: No, score=-7.097 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, HELO_EQ_FR=0.35, MIME_QP_LONG_LINE=0.001, RCVD_IN_DNSWL_HI=-5, RP_MATCHES_RCVD=-0.548] autolearn=ham
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id hLxvT-1xkwmh for <lisp@ietfa.amsl.com>; Wed, 19 Feb 2014 12:29:12 -0800 (PST)
Received: from mail2-relais-roc.national.inria.fr (mail2-relais-roc.national.inria.fr [192.134.164.83]) by ietfa.amsl.com (Postfix) with ESMTP id 7DAB71A04F4 for <lisp@ietf.org>; Wed, 19 Feb 2014 12:29:12 -0800 (PST)
X-IronPort-AV: E=Sophos;i="4.97,507,1389740400"; d="scan'208";a="59303855"
Received: from lvelizy-156-46-22-251.w80-11.abo.wanadoo.fr (HELO [192.168.1.237]) ([80.11.231.251]) by mail2-relais-roc.national.inria.fr with ESMTP/TLS/AES128-SHA; 19 Feb 2014 21:29:08 +0100
References: <20140218144825842648.087ffc67@sniff.de> <7DFCF6EA-9F05-468D-B51F-7AB7DEC149C8@inria.fr> <20140219111747519985.d46b87a8@sniff.de>
Mime-Version: 1.0 (1.0)
In-Reply-To: <20140219111747519985.d46b87a8@sniff.de>
Content-Type: text/plain; charset="us-ascii"
Content-Transfer-Encoding: quoted-printable
Message-Id: <A9214C0E-E6AB-4DCF-A493-77C71F190AA0@inria.fr>
X-Mailer: iPod touch Mail (11B554a)
From: Damien Saucez <damien.saucez@inria.fr>
Date: Wed, 19 Feb 2014 21:29:05 +0100
To: Marc Binderberger <marc@sniff.de>
Archived-At: http://mailarchive.ietf.org/arch/msg/lisp/LCbJ6IpWggUVEqlKG5g9jxe9jPQ
Cc: Clarence Filsfils <cf@cisco.com>, Luigi Iannone <luigi.iannone@telecom-paristech.fr>, LISP mailing list list <lisp@ietf.org>
Subject: Re: [lisp] Questions about draft-saucez-lisp-itr-graceful-03
X-BeenThere: lisp@ietf.org
X-Mailman-Version: 2.1.15
Precedence: list
List-Id: List for the discussion of the Locator/ID Separation Protocol <lisp.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/lisp>, <mailto:lisp-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/lisp/>
List-Post: <mailto:lisp@ietf.org>
List-Help: <mailto:lisp-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/lisp>, <mailto:lisp-request@ietf.org?subject=subscribe>
X-List-Received-Date: Wed, 19 Feb 2014 20:29:15 -0000

> On 19 Feb 2014, at 20:17, Marc Binderberger <marc@sniff.de> wrote:
> 
> Hello Damien,
> 
> thanks for the reply!
> 
>> If you have a solution to continuously synchronise ITRs caches, we
>> would be very happy to look at them and integrate them in the proposed
>> solution.
> 
> And I was curious to see a light-weight protocol extension from you :-)
> Seriously, was wondering if you see an elegant, light way to implement 
> this in the LISP protocol (?). 

Well directly using LISP maybe we could imagine something with map-notify and multicast to keep caches synchronized but I have to think more about that.

Damien Saucez
> 
>> the purpose of the document is to deal with planned restart of routers
>> meaning that we know exactly when the routeur will get down then up
>> (it is controlled by the operator).
> 
> but then the "Traffic deflection to other ITRs (or a PxTR)" could be 
> used to fill the cache of the 2nd ITR (the one that is not reloaded). 
> You turn it on on ITR2 (off on ITR1), change your IGP to send all LISP 
> data to remote sites to ITR2, "wait a bit", then ITR2 should be ready, 
> you turn off deflection on ITR2 and reload ITR1. Then turning on 
> deflection on ITR1 and bring the IGP routing back to active-active (or 
> whatever the setup was before).
> 
> 
> Regards, Marc
> 
> 
> 
> 
> 
> 
>> On Wed, 19 Feb 2014 09:38:54 +0100, Damien Saucez wrote:
>> Hello Marc,
>> 
>>> On 18 Feb 2014, at 23:48, Marc Binderberger <marc@sniff.de> wrote:
>>> 
>>> Hello Damien/Olivier/Luigi/Clarence & LISP experts,
>>> 
>>> had a look at draft-saucez-lisp-itr-graceful-03. And wonder if there is 
>>> more to come?
>> 
>> Thank you for the interest.  We are indeed thinking on ways to extend
>> the document and provide more details on the ways the solutions could
>> be implemented.
>> 
>> 
>>> Somehow section 4 feels a bit "short".
>>> 
>>> What I mean: if you try to solve the problem of the _two_ cache-miss 
>>> storms - first on the 2nd ITR (ITR2) when your restarting ITR (ITR1) 
>>> goes down, then on the restarting ITR1 when it picks up traffic again - 
>>> then section 4 would probably need to talk about a permanent cache 
>>> synchronization (?). Unless you want to solve a planned restart only. 
>>> But for a failure of the ITR1 I don't see how the solution you describe 
>>> would work
>>> 
>>>  o  ITR cache synchronization: upon startup, the ITR synchronizes its
>>>     cache with the other ITRs in its synchronization set.  The ITR is
>>>     marked as available only after the cache is synchronized.
>>> 
>>> as ITR2 would trigger the cache-miss storm for the traffic after ITR1 
>>> failure.
>>> 
>>> Or if you want to solve only the cache-miss storm when ITR1 comes back 
>>> into the traffic stream then the ITR deflection has the advantage to 
>>> not require any cache-synchronization protocol, IMHO. The rate of 
>>> Map-Requests could be throttled to turn the storm into a breeze. The 
>>> method how to transport traffic to ITR2 could be one of many - a direct 
>>> LAN, GRE, Lisp.
>>> 
>>> 
>>> So my question in short: are you planning to add some words about a 
>>> permanent cache synchronization?
>> 
>> For now we don't have acceptable techniques to keep caches
>> synchronised in a permanent way but I don't think it is a big issue as
>> the purpose of the document is to deal with planned restart of routers
>> meaning that we know exactly when the routeur will get down then up
>> (it is controlled by the operator).
>> 
>> If you have a solution to continuously synchronise ITRs caches, we
>> would be very happy to look at them and integrate them in the proposed
>> solution.
>> 
>> Thank you,
>> 
>> Damien Saucez
>> 
>>> 
>>> Thanks & Regards,
>>> Marc
>>