Re: [Idr] Part 3 of CAR/CT Adoption call (7/14/2022 to 7/27/2022) - Operational Differences

Robert Raszuk <robert@raszuk.net> Mon, 25 July 2022 14:00 UTC

Return-Path: <robert@raszuk.net>
X-Original-To: idr@ietfa.amsl.com
Delivered-To: idr@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 7976FC14F730 for <idr@ietfa.amsl.com>; Mon, 25 Jul 2022 07:00:50 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.005
X-Spam-Level:
X-Spam-Status: No, score=-2.005 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, HTML_MESSAGE=0.001, HTTPS_HTTP_MISMATCH=0.1, RCVD_IN_DNSWL_BLOCKED=0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, T_SCC_BODY_TEXT_LINE=-0.01, URIBL_BLOCKED=0.001, URIBL_DBL_BLOCKED_OPENDNS=0.001, URIBL_ZEN_BLOCKED_OPENDNS=0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=raszuk.net
Received: from mail.ietf.org ([50.223.129.194]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id IIwLvNi24xMy for <idr@ietfa.amsl.com>; Mon, 25 Jul 2022 07:00:46 -0700 (PDT)
Received: from mail-vs1-xe32.google.com (mail-vs1-xe32.google.com [IPv6:2607:f8b0:4864:20::e32]) (using TLSv1.3 with cipher TLS_AES_128_GCM_SHA256 (128/128 bits) key-exchange X25519 server-signature RSA-PSS (2048 bits) server-digest SHA256) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 7AAA0C15A73B for <idr@ietf.org>; Mon, 25 Jul 2022 07:00:46 -0700 (PDT)
Received: by mail-vs1-xe32.google.com with SMTP id l68so4697261vsc.0 for <idr@ietf.org>; Mon, 25 Jul 2022 07:00:46 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=raszuk.net; s=google; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=o6vBLpqQzNXxWMR3XA7XHMnBOfN+aETtK5AQj3BxtyI=; b=Enzd1Us1xtCU9o/Rq5R+MzVAHrkXBrDctJ43p50vqO/8aQQSSCft3r2bWzuNAsIc2N gppDx8kOPFJOjYsd8IocWjMMSwIdAarOJrgcKOHvnQme1e/ntOYoPCJfva7DCUzNF4um EVqF2agJcQT/SDGAumeuqCvU/rUqGDGEldEbZfkQYU+rkIlYF5DbgvdesRjQ1nLFOv+e ifh3NWcIqcdvYfyfzPIyh+4tGoflYp8oo7DRfxWFcF7xn6k5KxfL3HmqctIy42LRw1xb CU1oC164wfqNKDXzfz9Iw6g8SfSTn+Ahmk/3ia/GNsl9turp56WpFmXbmd+rWGUjCy9M 7UXg==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=o6vBLpqQzNXxWMR3XA7XHMnBOfN+aETtK5AQj3BxtyI=; b=S/s4BJqk1zN6Hfg6lT8lmZ5sWlx3F+XJmEw9QN6ooDTcF8ObaML8z4zPQnlfzNtkcS Mlqp6FlBNu4EllSyzWNbciIvfxjxufPaqTb++ftYmAtiqYuREUjnvMxaiiXp8usd32Qn K2lTNeny91+xJZzPrI3t62UesudtGweNaACQEaE7m2exYatQp9oCGUyvGsNiehiBbRFH FHuhe/eMvlY4ubgJNjBTAKNXTXOCXCrAh8YNxrODr54zvQnLnHm4wVNuFJft+U4exCrb Oxnrur+PkKfFAdazQZgaO6ZVuQccuYtbK8XlS98f9kcIiRhUHHTRf+7vPWB+tK6heeP+ 1OrQ==
X-Gm-Message-State: AJIora/nBEvNwC8X0y47HGN1HV9hMCjB0hB0Jiv53kxiynFbFShxsgkJ i9IOjp8HCFmE2TamKirq9pQlObwzXdwCdP/HiTEE65LqmEfNXw==
X-Google-Smtp-Source: AGRyM1u1RQlwj/0VRwIhkhjY5A5EsaYNGzb9PUlC5iU9c3oCDI2XajT+d0SJvBm71nUCeeAXTcW2x1cLihOscfEJ3C4=
X-Received: by 2002:a67:c803:0:b0:358:5a65:fd27 with SMTP id u3-20020a67c803000000b003585a65fd27mr1517247vsk.15.1658757644837; Mon, 25 Jul 2022 07:00:44 -0700 (PDT)
MIME-Version: 1.0
References: <CAOj+MMGBG8GNTXp_LT+euqKFq4vx88bRbYbhFCLyC4AGrxs52w@mail.gmail.com> <E35FF3FE-C1AA-46EF-841A-C0658C4A93D5@pfrc.org> <CAOj+MMHY2L978mH=Sv2VWwjt_O0yzznfV4CQER192_6As9g8yg@mail.gmail.com> <CAOj+MMGsmatkjb=dkCOLANTK2ZN=gUehhpg-+YmErJYDhkvFCQ@mail.gmail.com> <PH0PR05MB7749438C321ECF62400137ACB9959@PH0PR05MB7749.namprd05.prod.outlook.com>
In-Reply-To: <PH0PR05MB7749438C321ECF62400137ACB9959@PH0PR05MB7749.namprd05.prod.outlook.com>
From: Robert Raszuk <robert@raszuk.net>
Date: Mon, 25 Jul 2022 16:00:32 +0200
Message-ID: <CAOj+MMGpva1jz4r6s8W5TfAn4P8dBaDiA=NgdcxXQrXHxF4WMg@mail.gmail.com>
To: Srihari Sangli <ssangli@juniper.net>
Cc: Jeffrey Haas <jhaas@pfrc.org>, Sue Hares <shares@ndzh.com>, "idr@ietf. org" <idr@ietf.org>
Content-Type: multipart/alternative; boundary="000000000000307c9805e4a19cb7"
Archived-At: <https://mailarchive.ietf.org/arch/msg/idr/S9vv_3uFc0wOn4F_y__xGhidcHQ>
Subject: Re: [Idr] Part 3 of CAR/CT Adoption call (7/14/2022 to 7/27/2022) - Operational Differences
X-BeenThere: idr@ietf.org
X-Mailman-Version: 2.1.39
Precedence: list
List-Id: Inter-Domain Routing <idr.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/idr>, <mailto:idr-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/idr/>
List-Post: <mailto:idr@ietf.org>
List-Help: <mailto:idr-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/idr>, <mailto:idr-request@ietf.org?subject=subscribe>
X-List-Received-Date: Mon, 25 Jul 2022 14:00:50 -0000

Hi Srihari,

> Also, what are we optimizing by doing this.

2.5 min of BGP propagation time for 1.5 million routes (as Jeff stated). Do
you think it is not worth to consider an aggregation in such a case ?

Imagine I have POD or Cluster with 1000 PEs (compute as PEs) and I see no
value in advertising all 1000s each with 5 colors especially considering
that such 5 colors will share identical links and paths to those compute.

Same if you replace compute by PEs.

> Trolling will result in your posts being put into the moderation queue.

Part 3 is about operational differences. Not every difference requires a
topology - some do some do not.

I quoted sections from both drafts ?

I do think this fear of moderation - when just pointing out operational
differences and asking clarification questions - simply inappropriate.

Thx,
R.





On Mon, Jul 25, 2022 at 3:07 PM Srihari Sangli <ssangli@juniper.net> wrote:

> Robert,
>
>
>
> Pls comments inline.
>
>
>
> Thanks & Regards,
>
>
>
> srihari…
>
>
>
>
>
> *From: *Idr <idr-bounces@ietf.org> on behalf of Robert Raszuk <
> robert@raszuk.net>
> *Date: *Monday, 25 July 2022 at 7:02 AM
> *To: *Jeffrey Haas <jhaas@pfrc.org>
> *Cc: *Sue Hares <shares@ndzh.com>, idr@ietf. org <idr@ietf.org>
> *Subject: *Re: [Idr] Part 3 of CAR/CT Adoption call (7/14/2022 to
> 7/27/2022) - Operational Differences
>
> *[External Email. Be cautious of content]*
>
>
>
> All,
>
>
>
> Relative to the discussion about scale and stability of CAR vs CT proposal
> I would like to bring a very important difference.
>
>
>
> As part of NLRI, CAR defines a prefix as a real IPv4 or IPv6 prefix with
> length.  See section 2.9.2 of CAR draft
>
>
>
> CT however in its NLRI defines prefix as address of 32 or 128 bits and
> there is no length. See section 7 of CT draft.
>
>
>
> That means that even if the operator wishes to aggregate all 300K PEs into
> one prefix (per ASN or per POP or per pod/cluster etc ...) for a given
> color before it sends it over BGP with CT it is not an option. While in CAR
> it is.
>
>
>
> Srihari> Could you please describe the use case why would a customer want
> to do this. Also, what are we optimizing by doing this. Please also
> consider it is not just the sender that needs to do this work. Are you
> trying to optimize the bits on the wire ? Given different attributes these
> prefixes may have, I would say this can be a theoretical exercise.
>
>
>
>
>
>
>
>
>
> Leave alone that adding RD to the CT NLRI makes it even more impossible to
> aggregate.
>
>
>
> This is a fundamental difference especially when /32s or /128s are not
> needed to be sprayed to other ASNs.
>
>
>
> Till that is fixed I recommend that CT draft goes back to the drawing
> board and would not be even accepted as experimental. Its experiment has
> just concluded as a failure.
>
>
>
> Cheers,
>
> Robert
>
>
>
>
>
> On Mon, Jul 25, 2022 at 9:26 AM Robert Raszuk <robert@raszuk.net> wrote:
>
> Hi,
>
>
>
> My point is that this is the first time we are facing the introduction of
> BGP invalidation (as you stated no resolution) by performance (or
> under-perfomance) of data plane metric.
>
>
>
> I think this has new consequences to the protocol which are nowhere near
> SAFI 4.
>
>
>
> Perhaps it could work just fine for reasonable scale. But the numbers
> being quoted of 1.5M color routes seems way too excessive and rather
> suggest different protocol encoding or one more layer of
> hierarchy/indirection needed.
>
>
>
> Many thx,
>
> R.
>
>
>
>
>
>
>
>
>
>
>
> On Mon, Jul 25, 2022 at 3:13 AM Jeffrey Haas <jhaas@pfrc.org> wrote:
>
> Robert,
>
>
>
> I make no comment on how it is intended to be deployed. Only that the
> consequences protocol-wise are the same.
>
> Jeff
>
>
>
> On Jul 24, 2022, at 7:22 PM, Robert Raszuk <robert@raszuk.net> wrote:
>
> 
>
> Hi Jeff,
>
>
>
> > The stability dynamics and impact of service route re-resolution are
> largely the same as BGP labeled unicast.
>
>
>
> I would quite not agree with the above.
>
>
>
> Reason being that labeled unicast is about reachability.
>
>
>
> Here we are talking about real promises of data plane "performance" hence
> we are dealing with completely different set of triggers for various data
> plane issues.
>
>
>
> Many thx,
>
> R.
>
>
>
>
>
>
>
>
>
>
>
> On Mon, Jul 25, 2022 at 1:10 AM Jeffrey Haas <jhaas@pfrc.org> wrote:
>
> Robert,
>
>
>
> A partial comment from my mobile device.
>
>
>
> Withdraw encoding will pack much denser. On a total withdraw you likely
> could pack 200 or more prefixes per update.
>
>
>
> Implicit withdraw via replacement is clearly same speed as initial
> advertisement.
>
>
>
> The stability dynamics and impact of service route re-resolution are
> largely the same as BGP labeled unicast. Thus, beware churning your
> transport routes.
>
>
>
>
>
>
>
> Jeff
>
>
>
> On Jul 24, 2022, at 12:28 PM, Robert Raszuk <robert@raszuk.net> wrote:
>
> 
>
> Hi Jeff,
>
>
>
> Sure 300k times 5 colors makes it 1.5M ...
>
>
>
> So I have a few different questions here.
>
>
>
> Assume in CAR/CT enabled domain one color has transport problems ... say
> low latency is becoming not so low due to interface queuing is transiently
> congesting for whatever reason between P1 and P2 nodes (not even running
> any BGP).
>
>
>
> Q1 - How (by what exact protocol) and how fast such issue with
> forwarding a given color via this domain will be visible at the CAR/CT
> layer ?
>
>
>
> Q2 - Assume Q1 is done - do we now need to withdraw 300K routes based on
> one color brownout ?
>
>
>
> Q3 - According to your math such CAR/CT reaction will take 30 sec. What if
> transport problem is transient and occurs for say 5-10 sec every 40 sec ?
>
>
>
> Q4 - Is there in any document an analysis on dynamics of CAR/CT signalling
> needed to make this at all practical in real deployments vs ppts ?
>
>
>
> We keep burning energy on encoding, but apologies if I missed it but I am
> not seeing the full picture here.
>
>
>
> Why not advertise just 5 colors between those domains in 5 NLRIs and
> define a new attribute to carry all the interdomain color mappings in it ?
>
>
>
> 5 being an example from the section 6.3.2 ... but realistically we could
> perhaps vastly simplify this if we define day one set of well-known colors
> instead of each domain inventing their own definition :)
>
>
>
> Maybe I am just too practical here - but your math inspired those
> questions :)
>
>
>
> Many thx,
>
> R.
>
>
>
>
>
> On Sun, Jul 24, 2022 at 4:47 PM Jeffrey Haas <jhaas@pfrc.org> wrote:
>
> On Sun, Jul 24, 2022 at 10:44:49AM +0530, Ketan Talaulikar wrote:
> > The scalability requirements are captured here:
> >
> https://datatracker.ietf.org/doc/html/draft-hr-spring-intentaware-routing-using-color-00#section-6.3.2
> <https://urldefense.com/v3/__https:/datatracker.ietf.org/doc/html/draft-hr-spring-intentaware-routing-using-color-00*section-6.3.2__;Iw!!NEt6yMaO-gk!CsQrDLldHnl4JxBSFvLAV5NjuafJxhdCiqWRL71OLt1IsarmAEPYSFOnUFJZgdAHr0jBtaZH1acgU8M$>
> >
> > This is the merged document that, I believe, captures the consensus that
> > both the CAR and CT solutions aim to address.
>
> Thanks, Ketan.
>
> Roughly 1.5 million routes.
>
> Presuming an example 10k update per second handling, roughly 2.5 minutes of
> convergence time without packing optimizations.
>
> -- Jeff
>
>
> Juniper Business Use Only
>