Re: [Idr] Part 3 of CAR/CT Adoption call (7/14/2022 to 7/27/2022) - Operational Differences

Robert Raszuk <robert@raszuk.net> Mon, 25 July 2022 11:02 UTC

Return-Path: <robert@raszuk.net>
X-Original-To: idr@ietfa.amsl.com
Delivered-To: idr@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 9360AC06B8C1 for <idr@ietfa.amsl.com>; Mon, 25 Jul 2022 04:02:19 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.104
X-Spam-Level:
X-Spam-Status: No, score=-2.104 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, HTML_MESSAGE=0.001, RCVD_IN_DNSWL_BLOCKED=0.001, RCVD_IN_ZEN_BLOCKED_OPENDNS=0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, T_SCC_BODY_TEXT_LINE=-0.01, URIBL_BLOCKED=0.001, URIBL_DBL_BLOCKED_OPENDNS=0.001, URIBL_ZEN_BLOCKED_OPENDNS=0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=raszuk.net
Received: from mail.ietf.org ([50.223.129.194]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 2NlVSFVNJd12 for <idr@ietfa.amsl.com>; Mon, 25 Jul 2022 04:02:15 -0700 (PDT)
Received: from mail-vs1-xe31.google.com (mail-vs1-xe31.google.com [IPv6:2607:f8b0:4864:20::e31]) (using TLSv1.3 with cipher TLS_AES_128_GCM_SHA256 (128/128 bits) key-exchange X25519 server-signature RSA-PSS (2048 bits) server-digest SHA256) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 21ACFC06B8F8 for <idr@ietf.org>; Mon, 25 Jul 2022 04:02:14 -0700 (PDT)
Received: by mail-vs1-xe31.google.com with SMTP id t28so4517817vsr.11 for <idr@ietf.org>; Mon, 25 Jul 2022 04:02:14 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=raszuk.net; s=google; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=mJXJ1h/lTU1Wiye4K2/bYuQB0H0pIGutdk3dFpXc6es=; b=L42BX0Zf9AhmvITJEKMZ1yLJ01kDgxF/KRui/FhFHCGglWwi21ZH7b8SJ6tWRqCUeg HibIOaLDbtpTHaf0gE4go0v75tgNl/hipCyxKW2hQvgsUtmAw2cK4H+Nd3/2lZaOO8tE M6hQdTePFL7a4N7s/2kT5hw5GIBJVaOXDJ4mRBmo4rNqYkP4Z+oPNagCcnbbOX3U5LTS tl5PkmXBdbD7LT79u5/qlpdmmQXEi8i+9gNO/oWUP+Hr+HXNDLI6r8iwE3+ITKxDNYhR ZdM9/IMGeo9/pT2yz3ijcX+CtwDBK4gVSP83nzK2t+X+jJot9wDQML54MaxmC6ebEdNq 0t4Q==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=mJXJ1h/lTU1Wiye4K2/bYuQB0H0pIGutdk3dFpXc6es=; b=r4UQfvseVgAmcK2DBu07Ty8Wo2RfPi2IQq98UGRokhgQvdh7E0w/iaSX/vWQQ6wq0W 6Chdmr0rw+p9E0gnw2TObbS34z3ZpiALZot7ePori8RB8IOa/TJ82wLwoz5zrfuqP+WW SHJwWc6uI9XSJjVDWe5XmQffRUGeAmSCc/6wAzBEiSr/cVdMysBxG8iMjTFPX+1QOYlR S6lKWNFrYW5CtpjkaSmverRLGYKBx2UMqHBbSUDK9e6MGIY7Sq3hVJtLwNcdm4Sm4sDj 98CkIJbQKO0fxhps4JnUIaMY/M+CMr+G77b81hWp/RQm4DOEvWKKhDSa+y0dyt7bqNAu F1Ew==
X-Gm-Message-State: AJIora95iZSsLp9ejbvlYQ6D3XZK2ljD5M01u6bJcaVEe4i2uEM3sVUB c4zOWNf3j9RBRIPPA3O+ZZeoYXqsYYXHyYzIZIrHkg==
X-Google-Smtp-Source: AGRyM1vWJiN90f7x8bL8KybwFeTX8eoU/NZ82VxwUPWvrzGsKwXX5GlSUG92Ew2HC8yfO2RHwSZBlzUaJG8S2x1myjA=
X-Received: by 2002:a67:c296:0:b0:358:3669:d33e with SMTP id k22-20020a67c296000000b003583669d33emr3050231vsj.27.1658746933604; Mon, 25 Jul 2022 04:02:13 -0700 (PDT)
MIME-Version: 1.0
References: <CAOj+MMGBG8GNTXp_LT+euqKFq4vx88bRbYbhFCLyC4AGrxs52w@mail.gmail.com> <E35FF3FE-C1AA-46EF-841A-C0658C4A93D5@pfrc.org> <CAOj+MMHY2L978mH=Sv2VWwjt_O0yzznfV4CQER192_6As9g8yg@mail.gmail.com>
In-Reply-To: <CAOj+MMHY2L978mH=Sv2VWwjt_O0yzznfV4CQER192_6As9g8yg@mail.gmail.com>
From: Robert Raszuk <robert@raszuk.net>
Date: Mon, 25 Jul 2022 13:02:02 +0200
Message-ID: <CAOj+MMGsmatkjb=dkCOLANTK2ZN=gUehhpg-+YmErJYDhkvFCQ@mail.gmail.com>
To: Jeffrey Haas <jhaas@pfrc.org>
Cc: Ketan Talaulikar <ketant.ietf@gmail.com>, Kaliraj Vairavakkalai <kaliraj@juniper.net>, Sue Hares <shares@ndzh.com>, "idr@ietf. org" <idr@ietf.org>
Content-Type: multipart/alternative; boundary="000000000000c008fd05e49f1dff"
Archived-At: <https://mailarchive.ietf.org/arch/msg/idr/v8kkDGmr3ViPIR4UEmOPJbJ8B44>
Subject: Re: [Idr] Part 3 of CAR/CT Adoption call (7/14/2022 to 7/27/2022) - Operational Differences
X-BeenThere: idr@ietf.org
X-Mailman-Version: 2.1.39
Precedence: list
List-Id: Inter-Domain Routing <idr.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/idr>, <mailto:idr-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/idr/>
List-Post: <mailto:idr@ietf.org>
List-Help: <mailto:idr-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/idr>, <mailto:idr-request@ietf.org?subject=subscribe>
X-List-Received-Date: Mon, 25 Jul 2022 11:02:19 -0000

All,

Relative to the discussion about scale and stability of CAR vs CT proposal
I would like to bring a very important difference.

As part of NLRI, CAR defines a prefix as a real IPv4 or IPv6 prefix with
length.  See section 2.9.2 of CAR draft

CT however in its NLRI defines prefix as address of 32 or 128 bits and
there is no length. See section 7 of CT draft.

That means that even if the operator wishes to aggregate all 300K PEs into
one prefix (per ASN or per POP or per pod/cluster etc ...) for a given
color before it sends it over BGP with CT it is not an option. While in CAR
it is.

Leave alone that adding RD to the CT NLRI makes it even more impossible to
aggregate.

This is a fundamental difference especially when /32s or /128s are not
needed to be sprayed to other ASNs.

Till that is fixed I recommend that CT draft goes back to the drawing board
and would not be even accepted as experimental. Its experiment has just
concluded as a failure.

Cheers,
Robert


On Mon, Jul 25, 2022 at 9:26 AM Robert Raszuk <robert@raszuk.net> wrote:

> Hi,
>
> My point is that this is the first time we are facing the introduction of
> BGP invalidation (as you stated no resolution) by performance (or
> under-perfomance) of data plane metric.
>
> I think this has new consequences to the protocol which are nowhere near
> SAFI 4.
>
> Perhaps it could work just fine for reasonable scale. But the numbers
> being quoted of 1.5M color routes seems way too excessive and rather
> suggest different protocol encoding or one more layer of
> hierarchy/indirection needed.
>
> Many thx,
> R.
>
>
>
>
>
> On Mon, Jul 25, 2022 at 3:13 AM Jeffrey Haas <jhaas@pfrc.org> wrote:
>
>> Robert,
>>
>> I make no comment on how it is intended to be deployed. Only that the
>> consequences protocol-wise are the same.
>>
>> Jeff
>>
>> On Jul 24, 2022, at 7:22 PM, Robert Raszuk <robert@raszuk.net> wrote:
>>
>> 
>> Hi Jeff,
>>
>> > The stability dynamics and impact of service route re-resolution are
>> largely the same as BGP labeled unicast.
>>
>> I would quite not agree with the above.
>>
>> Reason being that labeled unicast is about reachability.
>>
>> Here we are talking about real promises of data plane "performance" hence
>> we are dealing with completely different set of triggers for various data
>> plane issues.
>>
>> Many thx,
>> R.
>>
>>
>>
>>
>>
>> On Mon, Jul 25, 2022 at 1:10 AM Jeffrey Haas <jhaas@pfrc.org> wrote:
>>
>>> Robert,
>>>
>>> A partial comment from my mobile device.
>>>
>>> Withdraw encoding will pack much denser. On a total withdraw you likely
>>> could pack 200 or more prefixes per update.
>>>
>>> Implicit withdraw via replacement is clearly same speed as initial
>>> advertisement.
>>>
>>> The stability dynamics and impact of service route re-resolution are
>>> largely the same as BGP labeled unicast. Thus, beware churning your
>>> transport routes.
>>>
>>>
>>>
>>> Jeff
>>>
>>> On Jul 24, 2022, at 12:28 PM, Robert Raszuk <robert@raszuk.net> wrote:
>>>
>>> 
>>> Hi Jeff,
>>>
>>> Sure 300k times 5 colors makes it 1.5M ...
>>>
>>> So I have a few different questions here.
>>>
>>> Assume in CAR/CT enabled domain one color has transport problems ... say
>>> low latency is becoming not so low due to interface queuing is transiently
>>> congesting for whatever reason between P1 and P2 nodes (not even running
>>> any BGP).
>>>
>>> Q1 - How (by what exact protocol) and how fast such issue with
>>> forwarding a given color via this domain will be visible at the CAR/CT
>>> layer ?
>>>
>>> Q2 - Assume Q1 is done - do we now need to withdraw 300K routes based on
>>> one color brownout ?
>>>
>>> Q3 - According to your math such CAR/CT reaction will take 30 sec. What
>>> if transport problem is transient and occurs for say 5-10 sec every 40 sec
>>> ?
>>>
>>> Q4 - Is there in any document an analysis on dynamics of CAR/CT
>>> signalling needed to make this at all practical in real deployments vs ppts
>>> ?
>>>
>>> We keep burning energy on encoding, but apologies if I missed it but I
>>> am not seeing the full picture here.
>>>
>>> Why not advertise just 5 colors between those domains in 5 NLRIs and
>>> define a new attribute to carry all the interdomain color mappings in it ?
>>>
>>> 5 being an example from the section 6.3.2 ... but realistically we could
>>> perhaps vastly simplify this if we define day one set of well-known colors
>>> instead of each domain inventing their own definition :)
>>>
>>> Maybe I am just too practical here - but your math inspired those
>>> questions :)
>>>
>>> Many thx,
>>> R.
>>>
>>>
>>> On Sun, Jul 24, 2022 at 4:47 PM Jeffrey Haas <jhaas@pfrc.org> wrote:
>>>
>>>> On Sun, Jul 24, 2022 at 10:44:49AM +0530, Ketan Talaulikar wrote:
>>>> > The scalability requirements are captured here:
>>>> >
>>>> https://datatracker.ietf.org/doc/html/draft-hr-spring-intentaware-routing-using-color-00#section-6.3.2
>>>> >
>>>> > This is the merged document that, I believe, captures the consensus
>>>> that
>>>> > both the CAR and CT solutions aim to address.
>>>>
>>>> Thanks, Ketan.
>>>>
>>>> Roughly 1.5 million routes.
>>>>
>>>> Presuming an example 10k update per second handling, roughly 2.5
>>>> minutes of
>>>> convergence time without packing optimizations.
>>>>
>>>> -- Jeff
>>>>
>>>