Re: [i2rs] Provisions for Client Redundancy/Fault Tolerance in draft-atlas-i2rs-architecture-01

Alia Atlas <akatlas@gmail.com> Tue, 13 August 2013 21:00 UTC

Return-Path: <akatlas@gmail.com>
X-Original-To: i2rs@ietfa.amsl.com
Delivered-To: i2rs@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id C1EC221E812A for <i2rs@ietfa.amsl.com>; Tue, 13 Aug 2013 14:00:49 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.562
X-Spam-Level:
X-Spam-Status: No, score=-2.562 tagged_above=-999 required=5 tests=[AWL=0.038, BAYES_00=-2.599, HTML_MESSAGE=0.001, NO_RELAYS=-0.001]
Received: from mail.ietf.org ([12.22.58.30]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id J2g0B2VxuTTg for <i2rs@ietfa.amsl.com>; Tue, 13 Aug 2013 14:00:42 -0700 (PDT)
Received: from mail-ob0-x22b.google.com (mail-ob0-x22b.google.com [IPv6:2607:f8b0:4003:c01::22b]) by ietfa.amsl.com (Postfix) with ESMTP id 94CAE21E80C3 for <i2rs@ietf.org>; Tue, 13 Aug 2013 14:00:39 -0700 (PDT)
Received: by mail-ob0-f171.google.com with SMTP id tb18so11042673obb.16 for <i2rs@ietf.org>; Tue, 13 Aug 2013 14:00:38 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :cc:content-type; bh=DZpg0oPgz3KQdhutPi/M9BHenHjPw/g4e6B14BfxXgQ=; b=l6NfouDUQrAIoEWMk0M/69f+5I7gnPS6YzuXBwq2KEU2aBzo9ayISytD3nciX6XTtj Edve2SSdF54Rd1ixN6gp1uKd8Vsafz6ZhEew7yuBb/cuY5IZC7ccmhnkNe2LN7vxyy1b DJEYFUlbmjjrfvdzlqmfimgIcGDGEfcEIX5BcMWU9tO4OnLczPeEacffJxNmT6ysNZHI MT/8UoE7v5ONqNy3Vqq45oM5qcOmhGOV39B8g2CKlZp4LdH9Hw0Wb+kgmPOMuW5/tjvI +BAh3HhzDjUWhLdTwqI1VtybBXWxN5f9DK4Vr5N+8qY6UVa0xrPT52Y+cJckAnp9ZtCE FqzA==
MIME-Version: 1.0
X-Received: by 10.182.61.44 with SMTP id m12mr8365520obr.52.1376427638593; Tue, 13 Aug 2013 14:00:38 -0700 (PDT)
Received: by 10.182.221.98 with HTTP; Tue, 13 Aug 2013 14:00:38 -0700 (PDT)
In-Reply-To: <031001ce8f76$a1d07830$e5716890$@riw.us>
References: <51FA492D.3070402@labn.net> <CAG4d1rd33Xu-zOQWKeK2ZT7qE3r6kddKk_=VqSaGp5E31usjqQ@mail.gmail.com> <51FA4FF0.4060902@labn.net> <031001ce8f76$a1d07830$e5716890$@riw.us>
Date: Tue, 13 Aug 2013 17:00:38 -0400
Message-ID: <CAG4d1rfWccm9UoNks0_XVhXAw2_Dk5soVFzKNaVXOH3RvWA9rw@mail.gmail.com>
From: Alia Atlas <akatlas@gmail.com>
To: Russ White <russw@riw.us>
Content-Type: multipart/alternative; boundary="e89a8fb1f218751f5904e3da8817"
Cc: "i2rs@ietf.org" <i2rs@ietf.org>, Joel Halpern <Joel.Halpern@ericsson.com>, Lou Berger <lberger@labn.net>
Subject: Re: [i2rs] Provisions for Client Redundancy/Fault Tolerance in draft-atlas-i2rs-architecture-01
X-BeenThere: i2rs@ietf.org
X-Mailman-Version: 2.1.12
Precedence: list
List-Id: "Interface to The Internet Routing System \(IRS\)" <i2rs.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/i2rs>, <mailto:i2rs-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/i2rs>
List-Post: <mailto:i2rs@ietf.org>
List-Help: <mailto:i2rs-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/i2rs>, <mailto:i2rs-request@ietf.org?subject=subscribe>
X-List-Received-Date: Tue, 13 Aug 2013 21:00:50 -0000

Hi Russ,

On Fri, Aug 2, 2013 at 7:51 AM, Russ White <russw@riw.us> wrote:

>
> > A bit.  The discussion of treating multiple writers for the same state as
> errors
> > got me worried about this case - from a redundancy perspective, and to a
> > lesser degree, accountability standpoint.
> >
> > Given the importance of reliable operation even in the case of client
> failures,
> > I think the topic should be covered in the architecture document. Of
> course
> > this topic/comment can be addressed after the document is adopted.
>
> Wouldn't much of this problem be alleviated if we went to a RESTful style
> atomic interface? It seems, to me, that multipart "commit/rollback"
> procedures always end up adding a lot of complexity, etc. I know there are
> situations where it's simpler to have the client hold partial state while
> telling the controller, "this part will succeed, please send me the next
> part," but then we've also fallen into dealing with latency of operations,
> locking forwarding plane tables across a remote connection (which routing
> events are still occurring!), and this entire error management mess in
> terms
> of resiliency.
>

[Alia] I don't think that we are suggesting supporting multipart
commit/rollback procedures.  What operations come in a single message can
be handled but so far we are avoiding transaction semantics and certainly
locking any tables across messages!!


> Routing protocols produce atomic updates today --we should stick with the
> pattern set there as much as possible.
>

[Alia] Yup - my understanding of their concern is the traceability of
whether the backup has taken over or not.  I'm sure I'm missing nuances
that I will be shortly enlightened about.  :-)

Regards,
Alia


>
> :-)
>
> Russ
>
>