Re: [Int-area] Where to aggregate, where to drop

Dino Farinacci <farinacci@gmail.com> Tue, 05 April 2022 23:40 UTC

Return-Path: <farinacci@gmail.com>
X-Original-To: int-area@ietfa.amsl.com
Delivered-To: int-area@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 9A4BE3A1553 for <int-area@ietfa.amsl.com>; Tue, 5 Apr 2022 16:40:22 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -7.109
X-Spam-Level:
X-Spam-Status: No, score=-7.109 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, FREEMAIL_FROM=0.001, RCVD_IN_DNSWL_HI=-5, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, T_SCC_BODY_TEXT_LINE=-0.01] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 32wJsH7IaR-T for <int-area@ietfa.amsl.com>; Tue, 5 Apr 2022 16:40:20 -0700 (PDT)
Received: from mail-pl1-x630.google.com (mail-pl1-x630.google.com [IPv6:2607:f8b0:4864:20::630]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 311C33A15FD for <int-area@ietf.org>; Tue, 5 Apr 2022 16:40:20 -0700 (PDT)
Received: by mail-pl1-x630.google.com with SMTP id n8so493370plh.1 for <int-area@ietf.org>; Tue, 05 Apr 2022 16:40:20 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20210112; h=mime-version:subject:from:in-reply-to:date:cc :content-transfer-encoding:message-id:references:to; bh=WulA62SjJ3IpnrpITuJUld+e3qjEsOdJkxPvi+i578E=; b=h34+4IxsjHLxG+IhsG3VDthBXP09O3R/DRmc8+WGZytN/9YWIf3K1+5LsrFcBr+NQu qmGDR/jZm4D1aycbUtWDU4BZP4WoxjTHEZ168le4lVYfzPqpWI35TIqbu6A6QaOyYvYl 0yCIucvtReB49WXAYCr67P7vGSqxpidKSzyzEjBpXdKMUEsQ8TeQYrwTjO3XAO1+uAyq 1ctXhEP+SULR0EonIE1pkAsh4aL0KEX+8F4SJg4GdfOpS7j7R2Tr6a6wayf+Y7jfFCjX IiAaAt1ILR3VUZYdoUE7a/C/wDRl1Kz7241es5J6E0nVIeqW0xUUHzkL/4ADhtlgxdT8 RJeg==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=x-gm-message-state:mime-version:subject:from:in-reply-to:date:cc :content-transfer-encoding:message-id:references:to; bh=WulA62SjJ3IpnrpITuJUld+e3qjEsOdJkxPvi+i578E=; b=vnYNhaaI04IFNK8v9bKqR9aMmcd7rGsJNLSxuIwAVAXR2j5w3G4sUjMiwCsWO5huby /3Ps5WqVmoABL+pr7I6yv1pWmAx+6TTGawCXpXCL7LB67n+2bT+S+D2br8CnHpRQ5JBm wI3F1RYtsakPbneHHF4OShYj1MZDmDzizTFjEDR2MBTzWtnkZ/PDo2mHqe2lFl8sK4rE Z5EDB3am2g9UCAOZJqeDJYyPcksdYIZhGYibFEF5cphkGV4r1uTxiCx4TuAYS+/iOZ8V 72qvlfuM45QKpKDfnoQjGHOT8SZVcVECUgg79wNDtza7zLIW30GtcrFZUDIQxaWB4/sL XhcA==
X-Gm-Message-State: AOAM531muVgihkZWo2UZ+A9O5i/lzzj+L7+ertgtQk5309bR+VPNVp6B wYolWrFg2D5HKJkON8CaW3hfkyEHT2E=
X-Google-Smtp-Source: ABdhPJyzZ7wbroLa0NF4OGdNq70dHd5uB3CBR7FpYWSDE8nP64Qkr33YEZ5GyrmsjeURaALExgjseg==
X-Received: by 2002:a17:90b:1d08:b0:1c7:5523:6a1f with SMTP id on8-20020a17090b1d0800b001c755236a1fmr6902020pjb.225.1649202019070; Tue, 05 Apr 2022 16:40:19 -0700 (PDT)
Received: from smtpclient.apple (c-98-234-33-188.hsd1.ca.comcast.net. [98.234.33.188]) by smtp.gmail.com with ESMTPSA id d16-20020a17090ad99000b001bcbc4247a0sm3395286pjv.57.2022.04.05.16.40.18 (version=TLS1_2 cipher=ECDHE-ECDSA-AES128-GCM-SHA256 bits=128/128); Tue, 05 Apr 2022 16:40:18 -0700 (PDT)
Content-Type: text/plain; charset="utf-8"
Mime-Version: 1.0 (Mac OS X Mail 15.0 \(3693.60.0.1.1\))
From: Dino Farinacci <farinacci@gmail.com>
In-Reply-To: <4D42E193-5BD7-4159-9A72-54CD013FA7D3@comcast.net>
Date: Tue, 05 Apr 2022 16:40:17 -0700
Cc: int-area <int-area@ietf.org>
Content-Transfer-Encoding: quoted-printable
Message-Id: <7FCD5415-7516-4372-8A95-7B0E847C0489@gmail.com>
References: <D9CC0098-7D52-4E91-B154-41BCA59DC82A@gmail.com> <316508E1-2AC9-4974-8C47-1351088445D2@comcast.net> <A0A71F59-1AB0-4905-8203-7A50918207FA@gmail.com> <4D42E193-5BD7-4159-9A72-54CD013FA7D3@comcast.net>
To: Tony Li <li.tony@comcast.net>
X-Mailer: Apple Mail (2.3693.60.0.1.1)
Archived-At: <https://mailarchive.ietf.org/arch/msg/int-area/uZirGDTZc88glieXxcfc23_T-T0>
Subject: Re: [Int-area] Where to aggregate, where to drop
X-BeenThere: int-area@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: IETF Internet Area WG Mailing List <int-area.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/int-area>, <mailto:int-area-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/int-area/>
List-Post: <mailto:int-area@ietf.org>
List-Help: <mailto:int-area-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/int-area>, <mailto:int-area-request@ietf.org?subject=subscribe>
X-List-Received-Date: Tue, 05 Apr 2022 23:40:23 -0000

> Hi Dino,
> 
>>>> So here are the options:
>>>> 
>>>> (1) ISP-A advertises P to ISP-B (and may also advertise more specifics to other peers for policy reasons).
>>>> (2) ISP-A advertises P.1, P.2, and P.3 to ISP-B and ISP-B advertises P to its peers.
>>>> 
>>>> The questions is *where is the best place to aggregate*. Its a tradeoff on routing table savings and how far a packet can travel to either (1) get delivered to the destination or (2) get dropped by a router that doesn't have a more-specific for the destination (thereby wasting resources from the source to the drop point).
>>>> 
>>>> (1) If ISP-B aggregates P to the links you see above to stub peers, then the stubs can load-split on P and ISP-B can drop packets for say P.4 and deliver packets for P.3. The drop can happen at the PE router relatively soon.
>>> 
>>> 
>>> If they are stub peers, then sending them anything more than default is inefficient. Yes, ISP B will drop anything for P.4.  There’s no way to deliver that any way.
>> 
>> Well the stub AS could be multi-homed and want to load-spread traffic for different prefixes. The a set of advertised prefixes are exceptions for not following the default path.
> 
> 
> ??? If the AS is multi-homed, then it’s not a stub.

It's not transit if it's a multi-homed stub. Like most multi-national enterprise networks. That's what I mean.

> Yes, if this is a third AS (ISP-C) that has other connectivity, then it would see longer prefixes as well as the aggregate.  Traffic would be diverted to the longer prefixes and away from ISP-B, as noted.

That's what I meant by exceptions.

>> If the peers are not stubs and ISP B advertises P, then yes, it will attract P.4 traffic.  Presumably, the peers are not learning alternate paths for this traffic, so there’s also no way of delivering this traffic and ISP B would get to drop it.  
>>> 
>>> I’m not sure why we’re trying to optimize drop traffic. Hopefully, there isn’t a large volume of it. :)
>> 
>> If you can, you want to drop traffic early as possible so you optimize network resources.
> 
> 
> As always, it’s a trade-off.  In this case, do you want to optimize your routing resources or a small amount of bandwidth.  You can either carry more specifics or drop P.4 traffic.  IMHO, that’s an easy call.

Not at all clear that its a small amount of bandwidth.

> 
>>>> All things not considered and looking at this specific topology, I would vote for (1).
>>> 
>>> 
>>> Of course, but what we’ve found is that prefix owners are not willing to just aggregate. They want traffic engineering so they also advertise more specifics.  Thus, the question is (2) or (3) let more specifics propagate throughout the network.
>> 
>> Right, but you are trying to fix this, right? You are trying to do better.
> 
> 
> That is indeed the point.  If aggregation happens at the right place, you get traffic engineering close to the destination and you get abstraction farther away. IMHO, that’s ideal.

That is, if where traffic engineering is required or desired. 

> 
>>>> Another question is, if all more-specfics are not stored (due to major link failure), is the aggregate withdrawn from the routing system. That is, if you want less route flapping, you may just want to keep P advertised. That optimizes FIB add/delete entropy everywhere that wants to store P. I would rather have hardware routers drop packets fast, then to have route oscillation.
>>> 
>>> 
>>> Yes, if ISP B isn’t getting all of the more specifics and continues to aggregate, it could attract traffic that it can’t deliver. Presumably this is a transient until it can get the more specifics.
>> 
>> So what is your conclusion? Is your draft saying to optimize traffic engineering or saving routing table space? Or simply documenting the tradeoffs?
> 
> 
> I’m saying do both: aggregate at the right place and it’s a win-win.  But figuring out where to aggregate does require some thought.

Definitely agree.

Dino