Re: [Lsr] New Version Notification for draft-xu-lsr-flooding-reduction-in-clos-01.txt

Jeff Tantsura <jefftant.ietf@gmail.com> Mon, 27 November 2023 23:32 UTC

Return-Path: <jefftant.ietf@gmail.com>
X-Original-To: lsr@ietfa.amsl.com
Delivered-To: lsr@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 97413C15152C for <lsr@ietfa.amsl.com>; Mon, 27 Nov 2023 15:32:44 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -0.212
X-Spam-Level:
X-Spam-Status: No, score=-0.212 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, FREEMAIL_FROM=0.001, FREEMAIL_REPLY=1, HTML_MESSAGE=0.001, MIME_HTML_ONLY=0.1, MIME_HTML_ONLY_MULTI=0.001, MIME_QP_LONG_LINE=0.001, MPART_ALT_DIFF=0.79, RCVD_IN_DNSWL_NONE=-0.0001, RCVD_IN_ZEN_BLOCKED_OPENDNS=0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, T_SCC_BODY_TEXT_LINE=-0.01, URIBL_BLOCKED=0.001, URIBL_DBL_BLOCKED_OPENDNS=0.001, URIBL_ZEN_BLOCKED_OPENDNS=0.001] autolearn=no autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com
Received: from mail.ietf.org ([50.223.129.194]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id zH9QEzl3MZ-S for <lsr@ietfa.amsl.com>; Mon, 27 Nov 2023 15:32:41 -0800 (PST)
Received: from mail-pl1-x631.google.com (mail-pl1-x631.google.com [IPv6:2607:f8b0:4864:20::631]) (using TLSv1.3 with cipher TLS_AES_128_GCM_SHA256 (128/128 bits) key-exchange X25519 server-signature RSA-PSS (2048 bits) server-digest SHA256) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 0753FC15108F for <lsr@ietf.org>; Mon, 27 Nov 2023 15:32:40 -0800 (PST)
Received: by mail-pl1-x631.google.com with SMTP id d9443c01a7336-1cf8c462766so33877705ad.1 for <lsr@ietf.org>; Mon, 27 Nov 2023 15:32:40 -0800 (PST)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20230601; t=1701127960; x=1701732760; darn=ietf.org; h=to:in-reply-to:cc:references:message-id:date:subject:mime-version :from:content-transfer-encoding:from:to:cc:subject:date:message-id :reply-to; bh=m6CmeJwK1kdM6qzlAx7W/akiklXKbjY1oEUaVR7/V44=; b=lInKbL91caMGT8r5BCY4BdgY2+uEuJSy9OoFLg2osixZcVo4R6Tso8Dg3CuyNS3HMP GAASZL2mwlAztfOsL7xekJsh7Vr/qztx4XCi5suhRuhr0wAxSAZ+fd7++H4opt9AU3c3 Kh/LPGGQS8GymaikVujnkUxSgmuP31CRm+Vy+Q5/baHHWVnVYD6VSqlSF8TIDOji1KQJ lq0Z1oaTNUb7BfUFpwdz9GUmdtYJK9gzMyRXUDqdCCNIBHzs+vAzvzw9HvtxcR8hgL+A J1/Vc77/HfsAjqjZ3QXDA0jSrQxE07YvS7iPx+MEfBLRwKjXrKn0bhDDK3xDq3HdKtNc +hqw==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1701127960; x=1701732760; h=to:in-reply-to:cc:references:message-id:date:subject:mime-version :from:content-transfer-encoding:x-gm-message-state:from:to:cc :subject:date:message-id:reply-to; bh=m6CmeJwK1kdM6qzlAx7W/akiklXKbjY1oEUaVR7/V44=; b=NBbaeknEULKjr4ZadMQvCd6DCCvCC/hJI73Eg36MufUP9dQYq6gsjrKhCh3ZNu2EIE vQUrQBOC7Erf5KrvxRLDVNqnduDlvU6yB3VoMCCtCRHwftMd7TmbwWActh8Ab6+f2pu9 h1sgHPnw2h5Baxx3DkEqLpfMNwAmvDNRTj35OuOjdFciM9H0H+qwqqJfcOc8b1lnqvhk bdlg5BnlW45CpvLOHGPnWcvjWCvZdVK7ogSCG4eyGSG7ce2nWjyFScL/h5SuuC1VytMx Ln7eILLyDLM++OdMeKAgHs/RbQADL2Qtp8AnnJTN/aUB3LRgG56vC4JESNA+FwLRp85r GTvA==
X-Gm-Message-State: AOJu0YyXOMMqyTV0NxuJOe3gLKv4zdjwIS6klhc/hEZllh918n+iAw13 A8OZuvcJ0lJLF/dE+B+0oWE=
X-Google-Smtp-Source: AGHT+IFPPf5f5qTSI8fWGlopWRKsBr2qnQH4PCTv+UVA/NjWwYLtOm2nQE9hSOp99MRdJNvsY8ClMw==
X-Received: by 2002:a17:903:4282:b0:1c5:befa:d81d with SMTP id ju2-20020a170903428200b001c5befad81dmr11279623plb.10.1701127959861; Mon, 27 Nov 2023 15:32:39 -0800 (PST)
Received: from smtpclient.apple (c-67-164-29-73.hsd1.ca.comcast.net. [67.164.29.73]) by smtp.gmail.com with ESMTPSA id d8-20020a170902654800b001c61acd5bd2sm9072446pln.112.2023.11.27.15.32.38 (version=TLS1_3 cipher=TLS_AES_128_GCM_SHA256 bits=128/128); Mon, 27 Nov 2023 15:32:39 -0800 (PST)
Content-Type: multipart/alternative; boundary="Apple-Mail-628EAB32-C29A-47BE-8767-47A93BF16FC6"
Content-Transfer-Encoding: 7bit
From: Jeff Tantsura <jefftant.ietf@gmail.com>
Mime-Version: 1.0 (1.0)
Date: Mon, 27 Nov 2023 15:32:27 -0800
Message-Id: <1F10AE52-C87C-4245-A034-81D8110623C6@gmail.com>
References: <CAOj+MMH_7PSdNRsSzAhN7hbB_QyyrKO3i-4EYt-0e2EKtvT3Tg@mail.gmail.com>
Cc: Acee Lindem <acee.ietf@gmail.com>, "Les Ginsberg (ginsberg)" <ginsberg=40cisco.com@dmarc.ietf.org>, Tony Li <tony.li@tony.li>, xuxiaohu_ietf@hotmail.com, lsr@ietf.org
In-Reply-To: <CAOj+MMH_7PSdNRsSzAhN7hbB_QyyrKO3i-4EYt-0e2EKtvT3Tg@mail.gmail.com>
To: Robert Raszuk <robert@raszuk.net>
X-Mailer: iPad Mail (21B74)
Archived-At: <https://mailarchive.ietf.org/arch/msg/lsr/auc59DqswrpEPU5eaijpKwYap8g>
Subject: Re: [Lsr] New Version Notification for draft-xu-lsr-flooding-reduction-in-clos-01.txt
X-BeenThere: lsr@ietf.org
X-Mailman-Version: 2.1.39
Precedence: list
List-Id: Link State Routing Working Group <lsr.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/lsr>, <mailto:lsr-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/lsr/>
List-Post: <mailto:lsr@ietf.org>
List-Help: <mailto:lsr-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/lsr>, <mailto:lsr-request@ietf.org?subject=subscribe>
X-List-Received-Date: Mon, 27 Nov 2023 23:32:44 -0000

Robert,

In context of LLM (10% of that for DLRM) training clusters, towards 2024/25 we would be looking to up to 8K end-points in a 3 stage leaf-spine fabric and up to 64-256K in 5 stages.
Virtualization and how it is instantiated might significantly change amount/distribution of state in underlay/overlay.

Obviously, these are hyperscale size deployments, many will be running 10-30 switches fabrics, where anything could work. 
BGP seems to work just fine, some data plane signaling could be used as a near real time augmentation to “slow but stable” control plane.

Cheers,
Jeff

On Nov 26, 2023, at 14:30, Robert Raszuk <robert@raszuk.net> wrote:


Hey Jeff, 

Could you be so kind and defined term: "scaled-out leaf-spine fabrics" ?

Specifically folks watching us here would highly appreciate if we state max target nodes with vanilla ISIS and max target nodes when your ISIS implementation supports https://datatracker.ietf.org/doc/html/draft-ietf-lsr-dynamic-flooding" target="_blank" rel="nofollow">draft-ietf-lsr-dynamic-flooding

While I am a BGP person I feel pretty strongly that BGP is not a best fit for the vast majority of DC fabrics in use today. 

Cheers,
Robert


On Sun, Nov 26, 2023 at 10:49 PM Jeff Tantsura <jefftant.ietf@gmail.com> wrote:
I agree with all aforementioned comments.

Wrt AI/ML networking - if a controller is used, what is required is link state exposure northbound and not link state protocol  in the fabric. (I could argue for RIFT though ;-))
I’d urge you to take a look at Meta’s deployment  in their ML clusters (publicly available) - they use BGP as the routing protocol to exchange reachability (and build ECMP sets) and provide a backup if controller computed next hop goes away/before new one has been computed.
Open R is used northbound to expose the topology (in exactly same way - BGP-LS could be used).

To summarize: an LS protocol brings no additional value in scaled-out leaf-spine fabrics, without significant modifications -  it doesn’t work in irregular topologies such as DF, etc.
Existing proposals - there are shipping implementations and experience in operating it, have proven their relative value in suitable deployments.

Cheers,
Jeff

> On Nov 26, 2023, at 12:20, Acee Lindem <acee.ietf@gmail.com> wrote:
>
> Speaking as WG member:
>
> I agree. The whole Data Center IGP flooding discussion went on years ago and the simplistic enhancement proposed in the subject draft is neither relevant or useful now.
>
> Thanks,
> Acee
>
>> On Nov 24, 2023, at 11:55 PM, Les Ginsberg (ginsberg) <ginsberg=40cisco.com@dmarc.ietf.org> wrote:
>>
>> Xiaohu –
>> I also point out that there are at least two existing drafts which specifically address IS-IS flooding reduction in CLOS networks and do so in greater detail and with more robustness than what is in your draft:
>> https://datatracker.ietf.org/doc/draft-ietf-lsr-distoptflood/" rel="noreferrer nofollow" target="_blank">https://datatracker.ietf.org/doc/draft-ietf-lsr-distoptflood/
>> https://datatracker.ietf.org/doc/draft-ietf-lsr-isis-spine-leaf-ext/" rel="noreferrer nofollow" target="_blank">https://datatracker.ietf.org/doc/draft-ietf-lsr-isis-spine-leaf-ext/
>> I do not see a need for yet another draft specifically aimed at CLOS networks.
>> Note that work on draft-ietf-lsr-isis-spine-leaf-ext was suspended due to lack of interest in deploying an IGP solution in CLOS networks.
>> You are suggesting in draft-xu-lsr-fare that AI is going to change this. Well, maybe, but if so I think we should return to the solutions already available and prioritize work on them.
>>    Les
>>  From: Lsr <lsr-bounces@ietf.org> On Behalf Of Tony Li
>> Sent: Thursday, November 23, 2023 8:39 AM
>> To: xuxiaohu_ietf@hotmail.com
>> Cc: lsr@ietf.org
>> Subject: Re: [Lsr] New Version Notification for draft-xu-lsr-flooding-reduction-in-clos-01.txt
>> Hi,
>> What you’re proposing is already described in IS-IS Mesh Groups (https://www.rfc-editor.org/rfc/rfc2973.html" rel="noreferrer nofollow" target="_blank">https://www.rfc-editor.org/rfc/rfc2973.html) and improved upon in Dynamic Flooding (https://datatracker.ietf.org/doc/html/draft-ietf-lsr-dynamic-flooding" rel="noreferrer nofollow" target="_blank">https://datatracker.ietf.org/doc/html/draft-ietf-lsr-dynamic-flooding).
>> Regards,
>> Tony
>>
>>
>> On Nov 23, 2023, at 8:29 AM, xuxiaohu_ietf@hotmail.com wrote:
>> Hi all,
>> Any comments or suggestions are welcome.
>> Best regards,
>> Xiaohu
>> 发件人: internet-drafts@ietf.org <internet-drafts@ietf.org>
>> 日期: 星期三, 2023年11月22日 11:37
>> 收件人: Xiaohu Xu <xuxiaohu_ietf@hotmail.com>
>> 主题: New Version Notification for draft-xu-lsr-flooding-reduction-in-clos-01.txt
>> A new version of Internet-Draft draft-xu-lsr-flooding-reduction-in-clos-01.txt
>> has been successfully submitted by Xiaohu Xu and posted to the
>> IETF repository.
>>
>> Name:     draft-xu-lsr-flooding-reduction-in-clos
>> Revision: 01
>> Title:    Flooding Reduction in CLOS Networks
>> Date:     2023-11-22
>> Group:    Individual Submission
>> Pages:    6
>> URL:      https://www.ietf.org/archive/id/draft-xu-lsr-flooding-reduction-in-clos-01.txt" rel="noreferrer nofollow" target="_blank">https://www.ietf.org/archive/id/draft-xu-lsr-flooding-reduction-in-clos-01.txt
>> Status:   https://datatracker.ietf.org/doc/draft-xu-lsr-flooding-reduction-in-clos/" rel="noreferrer nofollow" target="_blank">https://datatracker.ietf.org/doc/draft-xu-lsr-flooding-reduction-in-clos/
>> HTMLized: https://datatracker.ietf.org/doc/html/draft-xu-lsr-flooding-reduction-in-clos" rel="noreferrer nofollow" target="_blank">https://datatracker.ietf.org/doc/html/draft-xu-lsr-flooding-reduction-in-clos
>> Diff:     https://author-tools.ietf.org/iddiff?url2=draft-xu-lsr-flooding-reduction-in-clos-01" rel="noreferrer nofollow" target="_blank">https://author-tools.ietf.org/iddiff?url2=draft-xu-lsr-flooding-reduction-in-clos-01
>>
>> Abstract:
>>
>>   In a CLOS topology, an OSPF (or ISIS) router may receive identical
>>   copies of an LSA (or LSP) from multiple OSPF (or ISIS) neighbors.
>>   Moreover, two OSPF (or ISIS) neighbors may exchange the same LSA (or
>>   LSP) simultaneously.  This results in unnecessary flooding of link-
>>   state information, which wastes the precious resources of OSPF (or
>>   ISIS) routers.  Therefore, this document proposes extensions to OSPF
>>   (or ISIS) to reduce this flooding within CLOS networks.  The
>>   reduction of OSPF (or ISIS) flooding is highly beneficial for
>>   improving the scalability of CLOS networks.
>>
>>
>>
>> The IETF Secretariat
>>
>> _______________________________________________
>> Lsr mailing list
>> Lsr@ietf.org
>> https://www.ietf.org/mailman/listinfo/lsr" rel="noreferrer nofollow" target="_blank">https://www.ietf.org/mailman/listinfo/lsr
>> _______________________________________________
>> Lsr mailing list
>> Lsr@ietf.org
>> https://www.ietf.org/mailman/listinfo/lsr" rel="noreferrer nofollow" target="_blank">https://www.ietf.org/mailman/listinfo/lsr
>
>
> _______________________________________________
> Lsr mailing list
> Lsr@ietf.org
> https://www.ietf.org/mailman/listinfo/lsr" rel="noreferrer nofollow" target="_blank">https://www.ietf.org/mailman/listinfo/lsr

_______________________________________________
Lsr mailing list
Lsr@ietf.org
https://www.ietf.org/mailman/listinfo/lsr" rel="noreferrer nofollow" target="_blank">https://www.ietf.org/mailman/listinfo/lsr