Re: [Lsr] New Version Notification for draft-xu-lsr-flooding-reduction-in-clos-01.txt

Robert Raszuk <robert@raszuk.net> Mon, 27 November 2023 23:36 UTC

Return-Path: <robert@raszuk.net>
X-Original-To: lsr@ietfa.amsl.com
Delivered-To: lsr@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 58B96C15108F for <lsr@ietfa.amsl.com>; Mon, 27 Nov 2023 15:36:31 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.105
X-Spam-Level:
X-Spam-Status: No, score=-2.105 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, HTML_MESSAGE=0.001, RCVD_IN_ZEN_BLOCKED_OPENDNS=0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, T_SCC_BODY_TEXT_LINE=-0.01, URIBL_BLOCKED=0.001, URIBL_DBL_BLOCKED_OPENDNS=0.001, URIBL_ZEN_BLOCKED_OPENDNS=0.001] autolearn=unavailable autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=raszuk.net
Received: from mail.ietf.org ([50.223.129.194]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 77cYnsAqY-54 for <lsr@ietfa.amsl.com>; Mon, 27 Nov 2023 15:36:26 -0800 (PST)
Received: from mail-ed1-x52f.google.com (mail-ed1-x52f.google.com [IPv6:2a00:1450:4864:20::52f]) (using TLSv1.3 with cipher TLS_AES_128_GCM_SHA256 (128/128 bits) key-exchange X25519 server-signature RSA-PSS (2048 bits) server-digest SHA256) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id C62BAC15152C for <lsr@ietf.org>; Mon, 27 Nov 2023 15:36:26 -0800 (PST)
Received: by mail-ed1-x52f.google.com with SMTP id 4fb4d7f45d1cf-548ce39b101so6451723a12.2 for <lsr@ietf.org>; Mon, 27 Nov 2023 15:36:26 -0800 (PST)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=raszuk.net; s=google; t=1701128185; x=1701732985; darn=ietf.org; h=cc:to:subject:message-id:date:from:in-reply-to:references :mime-version:from:to:cc:subject:date:message-id:reply-to; bh=GWMAwiOkc3KwGBGl0kRC/JH0rahTRf/pmh11lPOAebc=; b=UZ0XZGhy/gIb/0TjZwfgZ4ZqgSmVymGD0aMoyEOMTthocZoVPAhtlG6XuihBdwT6IB yYqmfx12QIAb7kseoJSDtzaQdo2FcU7grGb6usdYAY7zwKdft/dGSnTUeJJJ9gtxoIpB rlJDZ2qys3z5yApoCjcPzd5wbX4nMwwoixOe/WV0eoky/80DjqfqAXHTJXriDU8l0Xkn Y4Bg3JcSQnc35+rXMDgxh2g+DmfdTpHSl5uopIoElr28d5Kjl8+rCs46qAvJ+LTXxIU2 0GKNLTjRUv1NLgw5hDC8rr5D/hy9Esd/SzVeFIbZjdYUaCiMNgOsmoA+o3TaHslFPOeC 57qA==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1701128185; x=1701732985; h=cc:to:subject:message-id:date:from:in-reply-to:references :mime-version:x-gm-message-state:from:to:cc:subject:date:message-id :reply-to; bh=GWMAwiOkc3KwGBGl0kRC/JH0rahTRf/pmh11lPOAebc=; b=E7koLlALaYMy4N9r81wfddMsOfxxM7QK9aPjWtk6bAMyTKrN+fi7qhRjtDfMqPmsTA 6MAFVXCOs7IBGyZGjtlUZeqVUCVp+kFKCjgP3VBPAkzYiNZ7PdztTDeuJwWdnxXEaaOj GSiP30E2+VF9cK0/k1b/k1gkxKfw+m0LkyoZWLBzQDWgh77P+EiP/hzmIC4DNS306A8m mvrNmNOOWMIolzZb0KmT/z3UUILoDFOsf+3r/j6vjvBnW2sBFjqVl7tQiA3m42k1yp8Y yZgZ8H8pbYTQKbOY1qr6s49b0yXy+En0ssIs6P0G33jbwOKYt9et1kXKgClz30smcslz IQUQ==
X-Gm-Message-State: AOJu0YyfoUkfqwfFvtOyDyv4zjMojBGCVuENynfhjg220eURzIi6ul36 yfAKgeXyaeCv8lEm9Jy3sR/y5AOOONXee8YJIXTTMEDqklMKy5Ds
X-Google-Smtp-Source: AGHT+IEJFVrs3gJOReNLOPnY1aSSMUrBAUGUEM+MNs7X5O87Jmaqx5qco5mn6gvRqSzjHR0qqWIhTyqdY2hRhaXKZHI=
X-Received: by 2002:a05:6402:51d4:b0:54b:7f07:9862 with SMTP id r20-20020a05640251d400b0054b7f079862mr2932729edd.31.1701128184664; Mon, 27 Nov 2023 15:36:24 -0800 (PST)
MIME-Version: 1.0
References: <CAOj+MMH_7PSdNRsSzAhN7hbB_QyyrKO3i-4EYt-0e2EKtvT3Tg@mail.gmail.com> <1F10AE52-C87C-4245-A034-81D8110623C6@gmail.com>
In-Reply-To: <1F10AE52-C87C-4245-A034-81D8110623C6@gmail.com>
From: Robert Raszuk <robert@raszuk.net>
Date: Tue, 28 Nov 2023 00:36:13 +0100
Message-ID: <CAOj+MMHaUBYjhCF4C-Zohte2TcruBGCc5OLnDvBJ11gqhMLmHQ@mail.gmail.com>
To: Jeff Tantsura <jefftant.ietf@gmail.com>
Cc: Acee Lindem <acee.ietf@gmail.com>, "Les Ginsberg (ginsberg)" <ginsberg=40cisco.com@dmarc.ietf.org>, Tony Li <tony.li@tony.li>, xuxiaohu_ietf@hotmail.com, lsr@ietf.org
Content-Type: multipart/alternative; boundary="0000000000002a23e4060b2ac5fc"
Archived-At: <https://mailarchive.ietf.org/arch/msg/lsr/wZ4e38Qfa0o_e00O3kvYwMP6oN4>
Subject: Re: [Lsr] New Version Notification for draft-xu-lsr-flooding-reduction-in-clos-01.txt
X-BeenThere: lsr@ietf.org
X-Mailman-Version: 2.1.39
Precedence: list
List-Id: Link State Routing Working Group <lsr.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/lsr>, <mailto:lsr-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/lsr/>
List-Post: <mailto:lsr@ietf.org>
List-Help: <mailto:lsr-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/lsr>, <mailto:lsr-request@ietf.org?subject=subscribe>
X-List-Received-Date: Mon, 27 Nov 2023 23:36:31 -0000

Jeff,

End-points do not need to participate in link state topology. Those are
merely leaves even if all endpoints are L3.

And 8K or even 256K of routes I think would be supported today by cheapest
nodes nodes from any vendor :)

Cheers,
R.





On Tue, Nov 28, 2023 at 12:32 AM Jeff Tantsura <jefftant.ietf@gmail.com>
wrote:

> Robert,
>
> In context of LLM (10% of that for DLRM) training clusters, towards
> 2024/25 we would be looking to up to 8K end-points in a 3 stage leaf-spine
> fabric and up to 64-256K in 5 stages.
> Virtualization and how it is instantiated might significantly change
> amount/distribution of state in underlay/overlay.
>
> Obviously, these are hyperscale size deployments, many will be running
> 10-30 switches fabrics, where anything could work.
> BGP seems to work just fine, some data plane signaling could be used as a
> near real time augmentation to “slow but stable” control plane.
>
> Cheers,
> Jeff
>
> On Nov 26, 2023, at 14:30, Robert Raszuk <robert@raszuk.net> wrote:
>
> 
> Hey Jeff,
>
> Could you be so kind and defined term: "scaled-out leaf-spine fabrics" ?
>
> Specifically folks watching us here would highly appreciate if we state
> max target nodes with vanilla ISIS and max target nodes when your ISIS
> implementation supports draft-ietf-lsr-dynamic-flooding
> <https://datatracker.ietf.org/doc/html/draft-ietf-lsr-dynamic-flooding>
>
> While I am a BGP person I feel pretty strongly that BGP is not a best fit
> for the vast majority of DC fabrics in use today.
>
> Cheers,
> Robert
>
>
> On Sun, Nov 26, 2023 at 10:49 PM Jeff Tantsura <jefftant.ietf@gmail.com>
> wrote:
>
>> I agree with all aforementioned comments.
>>
>> Wrt AI/ML networking - if a controller is used, what is required is link
>> state exposure northbound and not link state protocol  in the fabric. (I
>> could argue for RIFT though ;-))
>> I’d urge you to take a look at Meta’s deployment  in their ML clusters
>> (publicly available) - they use BGP as the routing protocol to exchange
>> reachability (and build ECMP sets) and provide a backup if controller
>> computed next hop goes away/before new one has been computed.
>> Open R is used northbound to expose the topology (in exactly same way -
>> BGP-LS could be used).
>>
>> To summarize: an LS protocol brings no additional value in scaled-out
>> leaf-spine fabrics, without significant modifications -  it doesn’t work in
>> irregular topologies such as DF, etc.
>> Existing proposals - there are shipping implementations and experience in
>> operating it, have proven their relative value in suitable deployments.
>>
>> Cheers,
>> Jeff
>>
>> > On Nov 26, 2023, at 12:20, Acee Lindem <acee.ietf@gmail.com> wrote:
>> >
>> > Speaking as WG member:
>> >
>> > I agree. The whole Data Center IGP flooding discussion went on years
>> ago and the simplistic enhancement proposed in the subject draft is neither
>> relevant or useful now.
>> >
>> > Thanks,
>> > Acee
>> >
>> >> On Nov 24, 2023, at 11:55 PM, Les Ginsberg (ginsberg) <ginsberg=
>> 40cisco.com@dmarc.ietf.org> wrote:
>> >>
>> >> Xiaohu –
>> >> I also point out that there are at least two existing drafts which
>> specifically address IS-IS flooding reduction in CLOS networks and do so in
>> greater detail and with more robustness than what is in your draft:
>> >> https://datatracker.ietf.org/doc/draft-ietf-lsr-distoptflood/
>> >> https://datatracker.ietf.org/doc/draft-ietf-lsr-isis-spine-leaf-ext/
>> >> I do not see a need for yet another draft specifically aimed at CLOS
>> networks.
>> >> Note that work on draft-ietf-lsr-isis-spine-leaf-ext was suspended due
>> to lack of interest in deploying an IGP solution in CLOS networks.
>> >> You are suggesting in draft-xu-lsr-fare that AI is going to change
>> this. Well, maybe, but if so I think we should return to the solutions
>> already available and prioritize work on them.
>> >>    Les
>> >>  From: Lsr <lsr-bounces@ietf.org> On Behalf Of Tony Li
>> >> Sent: Thursday, November 23, 2023 8:39 AM
>> >> To: xuxiaohu_ietf@hotmail.com
>> >> Cc: lsr@ietf.org
>> >> Subject: Re: [Lsr] New Version Notification for
>> draft-xu-lsr-flooding-reduction-in-clos-01.txt
>> >> Hi,
>> >> What you’re proposing is already described in IS-IS Mesh Groups (
>> https://www.rfc-editor.org/rfc/rfc2973.html) and improved upon in
>> Dynamic Flooding (
>> https://datatracker.ietf.org/doc/html/draft-ietf-lsr-dynamic-flooding).
>> >> Regards,
>> >> Tony
>> >>
>> >>
>> >> On Nov 23, 2023, at 8:29 AM, xuxiaohu_ietf@hotmail.com wrote:
>> >> Hi all,
>> >> Any comments or suggestions are welcome.
>> >> Best regards,
>> >> Xiaohu
>> >> 发件人: internet-drafts@ietf.org <internet-drafts@ietf.org>
>> >> 日期: 星期三, 2023年11月22日 11:37
>> >> 收件人: Xiaohu Xu <xuxiaohu_ietf@hotmail.com>
>> >> 主题: New Version Notification for
>> draft-xu-lsr-flooding-reduction-in-clos-01.txt
>> >> A new version of Internet-Draft
>> draft-xu-lsr-flooding-reduction-in-clos-01.txt
>> >> has been successfully submitted by Xiaohu Xu and posted to the
>> >> IETF repository.
>> >>
>> >> Name:     draft-xu-lsr-flooding-reduction-in-clos
>> >> Revision: 01
>> >> Title:    Flooding Reduction in CLOS Networks
>> >> Date:     2023-11-22
>> >> Group:    Individual Submission
>> >> Pages:    6
>> >> URL:
>> https://www.ietf.org/archive/id/draft-xu-lsr-flooding-reduction-in-clos-01.txt
>> >> Status:
>> https://datatracker.ietf.org/doc/draft-xu-lsr-flooding-reduction-in-clos/
>> >> HTMLized:
>> https://datatracker.ietf.org/doc/html/draft-xu-lsr-flooding-reduction-in-clos
>> >> Diff:
>> https://author-tools.ietf.org/iddiff?url2=draft-xu-lsr-flooding-reduction-in-clos-01
>> >>
>> >> Abstract:
>> >>
>> >>   In a CLOS topology, an OSPF (or ISIS) router may receive identical
>> >>   copies of an LSA (or LSP) from multiple OSPF (or ISIS) neighbors.
>> >>   Moreover, two OSPF (or ISIS) neighbors may exchange the same LSA (or
>> >>   LSP) simultaneously.  This results in unnecessary flooding of link-
>> >>   state information, which wastes the precious resources of OSPF (or
>> >>   ISIS) routers.  Therefore, this document proposes extensions to OSPF
>> >>   (or ISIS) to reduce this flooding within CLOS networks.  The
>> >>   reduction of OSPF (or ISIS) flooding is highly beneficial for
>> >>   improving the scalability of CLOS networks.
>> >>
>> >>
>> >>
>> >> The IETF Secretariat
>> >>
>> >> _______________________________________________
>> >> Lsr mailing list
>> >> Lsr@ietf.org
>> >> https://www.ietf.org/mailman/listinfo/lsr
>> >> _______________________________________________
>> >> Lsr mailing list
>> >> Lsr@ietf.org
>> >> https://www.ietf.org/mailman/listinfo/lsr
>> >
>> >
>> > _______________________________________________
>> > Lsr mailing list
>> > Lsr@ietf.org
>> > https://www.ietf.org/mailman/listinfo/lsr
>>
>> _______________________________________________
>> Lsr mailing list
>> Lsr@ietf.org
>> https://www.ietf.org/mailman/listinfo/lsr
>>
>