Re: [Ila] [5gangip] Scaling mapping systems (was Re: BOF Description)

Tom Herbert <tom@quantonium.net> Thu, 08 February 2018 04:20 UTC

Return-Path: <tom@quantonium.net>
X-Original-To: ila@ietfa.amsl.com
Delivered-To: ila@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 79F8C12D848 for <ila@ietfa.amsl.com>; Wed, 7 Feb 2018 20:20:00 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.899
X-Spam-Level:
X-Spam-Status: No, score=-1.899 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, HTML_MESSAGE=0.001, RCVD_IN_DNSWL_NONE=-0.0001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=quantonium-net.20150623.gappssmtp.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id o7G_Y_QvlPMM for <ila@ietfa.amsl.com>; Wed, 7 Feb 2018 20:19:58 -0800 (PST)
Received: from mail-wr0-x230.google.com (mail-wr0-x230.google.com [IPv6:2a00:1450:400c:c0c::230]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 005D51241F8 for <ila@ietf.org>; Wed, 7 Feb 2018 20:19:57 -0800 (PST)
Received: by mail-wr0-x230.google.com with SMTP id w50so3315001wrc.2 for <ila@ietf.org>; Wed, 07 Feb 2018 20:19:57 -0800 (PST)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=quantonium-net.20150623.gappssmtp.com; s=20150623; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :cc; bh=y0yMpnCK3fSydfTjvWVxd1IQN3pE79msVoMGNJJiJGA=; b=aqo/2x+I+BLWmIP4AWsFvUZtVmQb5Od6aHf/AFg0opv83P/FY6P9vbTyQ1qlfVHqV0 afPJahIWaYb54AjasZxDPGx2efbZV7zRw5FTJGWZqpYNrNrsYhxCQcZvPG4eLQ/W/ykC OorclY/hZjquT2wCTNpAy8cQvA09DB8r+9zjfCPvwRvB8V5R6rWq420An5wjl1D8PG18 toP1SulVCbkpYC+sRn05W8c09S78omIuXdz6KZcMfz1yvvpvHXTjyesG1aVmBdAQryyl a5iyAtKvWB1YD6W2uDJQD4DKiLxx+xy/P353k6sxnWcnNLxGR3OMuMhK4ypd+oVt8RF0 yKug==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:in-reply-to:references:from:date :message-id:subject:to:cc; bh=y0yMpnCK3fSydfTjvWVxd1IQN3pE79msVoMGNJJiJGA=; b=i2jsFj54zGpAI1/dUeNZEPBXNb2D8mUBsAruRpjWr5cqNQmOuQPVEX/hwJCOtMwj4P kIGwCnOVGrQReNmDBZTGaZ/W7vteK07iGRuL1XxtaBNN5ntO1PygaldQt401+7OhoGeg tQQ2E2Qa/rHIZot2kHj8hk+OVBAUqcOl56lfnkJj8Gz5seLvy08LUI3WUjmmG0xQn++8 pQYRl1P8u68YIjnKkXQLsRvsAZ7Fih3YuPx8XFEOBglR4wbhf0vrr0yuE02xGg+uIbMr 1v92O6OK6NWDLkYAdTPu5Tr5M+nsrv75s1i8mDwMxiOoKwBdZjOClsu9x+XurBfGxDEt WRaw==
X-Gm-Message-State: APf1xPC/o53z4pf/aXK7ZU4+hFBJJGSCN0TZ5AwVcHoAywiQa9WEEVLq 3k8sEt273tZ5m/AAFTQccpqvX++lboKszSeJzeXWCg==
X-Google-Smtp-Source: AH8x225v/0X9UJG3zhDM/MdMDovDDMoA9i93HL87KyUA4OcqTaKCX4CgJteke2pYlUw9rawbMy9Q2UUz93r8SqW/EGU=
X-Received: by 10.223.134.163 with SMTP id 32mr7781465wrx.250.1518063596382; Wed, 07 Feb 2018 20:19:56 -0800 (PST)
MIME-Version: 1.0
Received: by 10.223.173.66 with HTTP; Wed, 7 Feb 2018 20:19:55 -0800 (PST)
In-Reply-To: <D518818A-B2DF-40EE-8880-1D1B8B67FADC@gmail.com>
References: <CALx6S37+1PK3ET7g+XsCHt6-CJrABLko0YbWgE12xFX=vL5OPA@mail.gmail.com> <5067FA81-B416-4A19-9F11-A08B35BB8B6D@gmail.com> <CAPDqMeqNkiOWHOVU0AsUzFfPH60pTdS2x9CePhvZDhVLGJoGmw@mail.gmail.com> <9C425F56-738C-4600-9DFF-9D30FC3872DC@gmail.com> <CAPDqMeoLPSGbFg_H-_yOPBguXhOmx8fXjzYd_ax1Qds56KibZQ@mail.gmail.com> <EF6D1220-510C-4A4A-A15E-CA7029B300F7@gmail.com> <CAPDqMeq9yWoEY7WvtX7v-p0UN01-BjERVUF0HNFTEqwD=P1X=g@mail.gmail.com> <D518818A-B2DF-40EE-8880-1D1B8B67FADC@gmail.com>
From: Tom Herbert <tom@quantonium.net>
Date: Wed, 07 Feb 2018 20:19:55 -0800
Message-ID: <CAPDqMeoHxRCB2mr5Oc4XVG+RjKF+3mBx7APVj6y8uCzpOfn0BA@mail.gmail.com>
To: Dino Farinacci <farinacci@gmail.com>
Cc: Tom Herbert <tom@herbertland.com>, 5GANGIP <5gangip@ietf.org>, Behcet Sarikaya <sarikaya@ieee.org>, ila@ietf.org, Alexandre Petrescu <alexandre.petrescu@gmail.com>, Lorenzo Colitti <lorenzo@google.com>
Content-Type: multipart/alternative; boundary="001a1146a22068bcdf0564abb9ee"
Archived-At: <https://mailarchive.ietf.org/arch/msg/ila/qNSA9AmCA8d68hjzbl1QuM0aX0U>
Subject: Re: [Ila] [5gangip] Scaling mapping systems (was Re: BOF Description)
X-BeenThere: ila@ietf.org
X-Mailman-Version: 2.1.22
Precedence: list
List-Id: Identifier Locator Addressing <ila.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/ila>, <mailto:ila-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/ila/>
List-Post: <mailto:ila@ietf.org>
List-Help: <mailto:ila-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/ila>, <mailto:ila-request@ietf.org?subject=subscribe>
X-List-Received-Date: Thu, 08 Feb 2018 04:20:00 -0000

On Wed, Feb 7, 2018 at 6:48 PM, Dino Farinacci <farinacci@gmail.com> wrote:

> > Yes. OOO packets are possible during a transition period. In order
> delivery is not a requirement of IP and the window for OOO packets is
> relatively small anyway.
>
> Right but frequent out-of-order delivery does not make the system as a
> whole work well. And the “transition period” is for *every new destination”
> that is not cached. So this can be a steady state situation.
>
> Hi Dino,

I'm not sure I understand. OOO packets most often happen when packets for
the same flow take two different paths through the network at the same time
so that packets sent later beat earlier ones to the destination. W.r.t. to
the cache, the window for OOO delivery is when cache is entry is
instantiated, but there are already packets inflight that presumably are
taking non-cached scenic route. Once the non-cached packets have been
received, all the packets for the flow go through the cache taking the same
path so there is no more OOO. It's a small window. Also, if multiple
connections are simultaneously suffer OOO from this I doubt this would have
a material impact-- congestion window collapse is unlikely without packet
loss. Of course, a stateful firewall in the path might not be so tolerant
of OOO packets...

> > trianglular routing. Redirects must be secure so that they cannot be
> spoofed, so for that reason (and some others) the protocol is over TCP. The
> mapping cache is only an optimization and packets are never dropped or
> queued for pending cache resolution. If the cache weren't present,
> communications would
> >
> > But the ILA router in the network needs to have the mapping. And if it
> does not, what happens then?
> >
> > The ILA router is considered authoritative for its shard. If it doesn't
> have a mapping then the packet is silently dropped as having an unreachable
> destination.
>
> Does the ILA router for the shard only route to destinations inside of the
> shard?


Yes.


> And if so, it is kept small so you can push the mappings and make it scale
> at that level?
>

Small is relative. The ILA router first needs to being able to forward the
load, so provisioning would need to consider the worse case where no
entries hit the cache. Redirects would likely be rate limited (probably
capped by throughput). In a network, design I would probably target a cache
hit rate of at least 95%.


>
> > > still be viable but sub-optimal; that characteristic establishes a
> bound for the worst case DOS attack.
> > >
> > > Any thoughs on this approach?
> >
> > You’ll really need to put signatures in the Redirect messages if you
> want robust authentication.
> >
> > The redirects are sent over an established TCP connection to an ILA-N so
> that deters message spoofing. For stronger security, TCP authentication
> option or TLS can be used.
>
> Is it worth the state to hold millions of TCP connections only the
> occasion redirect message?
>

It might be millions of TCP connections spread across a large network, but
for ILA-Ns it's at most a couple of hundred connections and for a mapping
resolution ILA-R it's probably no more than 50K. Assuming 2K memory per TCP
connection ctx (and TLS if in use) that's a 100M on the ILA-Rs. In addition
to security, the other reasons to use TCP is it's reliable transport and
provides congestion control and avoidance. There's also the option of using
TFO if the amount of state were to become an issue.

Tom