[Idr] Re: Fwd: I-D Action: draft-wang-idr-dpf-00.txt

Robert Raszuk <robert@raszuk.net> Tue, 02 December 2025 19:45 UTC

Return-Path: <robert@raszuk.net>
X-Original-To: idr@mail2.ietf.org
Delivered-To: idr@mail2.ietf.org
Received: from localhost (localhost [127.0.0.1]) by mail2.ietf.org (Postfix) with ESMTP id D9B64941FB2F for <idr@mail2.ietf.org>; Tue, 2 Dec 2025 11:45:45 -0800 (PST)
X-Virus-Scanned: amavisd-new at ietf.org
X-Spam-Flag: NO
X-Spam-Score: -1.999
X-Spam-Level:
X-Spam-Status: No, score=-1.999 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, HTML_MESSAGE=0.001, HTTPS_HTTP_MISMATCH=0.1, RCVD_IN_DNSWL_NONE=-0.0001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001] autolearn=unavailable autolearn_force=no
Authentication-Results: mail2.ietf.org (amavisd-new); dkim=pass (2048-bit key) header.d=raszuk.net
Received: from mail2.ietf.org ([166.84.6.31]) by localhost (mail2.ietf.org [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id kpM-nGkCW07I for <idr@mail2.ietf.org>; Tue, 2 Dec 2025 11:45:44 -0800 (PST)
Received: from mail-ed1-x534.google.com (mail-ed1-x534.google.com [IPv6:2a00:1450:4864:20::534]) (using TLSv1.3 with cipher TLS_AES_128_GCM_SHA256 (128/128 bits) key-exchange X25519 server-signature ECDSA (P-256) server-digest SHA256) (No client certificate requested) by mail2.ietf.org (Postfix) with ESMTPS id 4CCBC941F9F3 for <idr@ietf.org>; Tue, 2 Dec 2025 11:45:42 -0800 (PST)
Received: by mail-ed1-x534.google.com with SMTP id 4fb4d7f45d1cf-640c6577120so10446950a12.1 for <idr@ietf.org>; Tue, 02 Dec 2025 11:45:42 -0800 (PST)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=raszuk.net; s=google; t=1764704734; x=1765309534; darn=ietf.org; h=cc:to:subject:message-id:date:from:in-reply-to:references :mime-version:from:to:cc:subject:date:message-id:reply-to; bh=/xL2rWhpWQgcrqY1cg8D2x9Codclo1pVmZBO2NBuyxw=; b=JXskOzie6iP+MP/+jqkv1LxNvfCaBQYR1Bx2KNywF/18cUeCLSrRLtohSaCIFGiWbi EfE7EjfWWoipGp7M2csnYqhtJjDTCx/VyEqXF+ZhsAr56dl86ERzJZf4x0GKX+xghXNT uRBbu5bu3DXZUxLqo2a6E3B/ST09HHJa8OW0TEQLHZmu4vu1EFG8DacbuXvNBqZ5pnqc zl6zEGjKu51eiKypY4b/SQPB4Lsf12bHZ9qVklt4kHtuMS0YgIL/WIo+cP+r5pKfEL7M oNjv/lnKeCGfUs/Aba9luItktxKkeou3tMdYp7LI7MY1M7E+j5B7I0DZ/bXOSkPPoYoO BCBQ==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1764704734; x=1765309534; h=cc:to:subject:message-id:date:from:in-reply-to:references :mime-version:x-gm-gg:x-gm-message-state:from:to:cc:subject:date :message-id:reply-to; bh=/xL2rWhpWQgcrqY1cg8D2x9Codclo1pVmZBO2NBuyxw=; b=JMdoMji4mVQFli92uMrmbmxkzEBJYm5n8L4uQeLoIh6rqAudJZBg5HUTZkIQ55nPDf RWdRHZE6FybXI0+O+qj8XVchpHk688O6Ns5zoXAmgkMuxnAx4hK8pqNNRhhvx79QHAxH DdufhD8uaajomie7Vo9qgJbEBuqIEkt51+t79bEjXkiuJof4dAnw8Wa6/xFLIV550X1O 2XgZu8OKS+eZ9JbNw+4oS+5yMxBtyxbf+RxDUx+QX4DSIglHmY5aUFTlceYkuFmLP6ef GFq9aPnolYcY5C6yIjyWyLfIJFaowvc3isBMNCBg+TG+4BMgPb3qI6SU7rBgWZNUS4Ax 57Vw==
X-Forwarded-Encrypted: i=1; AJvYcCVKNUE6IOePlKBnOU9/pEH6uv+h3ttNtLtaWGypZxvR5G4R6IMrD9itny5hen0N1xT4KlM=@ietf.org
X-Gm-Message-State: AOJu0Yw11FstHsvJNzb3iaJF/Q2cmGoJxSp71dr3v3RNAtFtIEQZhJGb PHuQAN4MzBtJsmczpqVp/1TpLN6VO8Cl+NNtPsRD5GOwKtzVeQf+h/jXE96o4CzF4l0EkyqoIbI gzT1fWXJz0Kgq3K4aweeDT3a7CAO/ceYZd61wMJTKJw==
X-Gm-Gg: ASbGncs5QkXv2o8VMbnT+3CKTJlXcX8bBggaG3af8L9xnt83FtZwSPX2CFPibVBRv89 Bd/E0FYbRbcAbW0d4kAoe2fxIjqJOFi/1zkw6oCD7MtrM4Ve/kwz5sjGQgSfS2tiG2W797E/rKH WBZH0Z2Eo3O/MWS8dqtxYAwNFsQjt+7fWEsv5gqeLrPkDuNqAroQVGzP3f2rFFiawte6s+NOJXD eYJY7TcXVauDpdf5tWErVYr+uvAvcQJIELW4I5+dibsdG/efQ4x7NZTjRJ2ZWTGlWgBGMI=
X-Google-Smtp-Source: AGHT+IHAPrQ+rg4RG1/dEaAj8KTj/Cn3jxGri1/EJIFqOmGcFM9PGw2LkLZi78vOtgItIdAgOvgkLu3CuoSvT36HMU4=
X-Received: by 2002:a05:6402:1d4d:b0:640:ceef:7e44 with SMTP id 4fb4d7f45d1cf-6455468d5b1mr44532606a12.28.1764704734338; Tue, 02 Dec 2025 11:45:34 -0800 (PST)
MIME-Version: 1.0
References: <176462578612.3650528.8915305565733099516@dt-datatracker-5bd94c585b-wk4l4> <CAOj+MMEw4HFJRmJ_=VhVSQCr1Sic6nrixXqFYpT3E47Mk_EaGw@mail.gmail.com> <CABNhwV2XzaTyiETsYr-STKypq9M49YnW8ekj5jTsG5==xmRX5Q@mail.gmail.com> <MW4PR84MB2092F37923BBE432BF32AECF86D8A@MW4PR84MB2092.NAMPRD84.PROD.OUTLOOK.COM>
In-Reply-To: <MW4PR84MB2092F37923BBE432BF32AECF86D8A@MW4PR84MB2092.NAMPRD84.PROD.OUTLOOK.COM>
From: Robert Raszuk <robert@raszuk.net>
Date: Tue, 02 Dec 2025 20:45:22 +0100
X-Gm-Features: AWmQ_bnzXFV57h9yDg9fZoPfMg_pslxaibautuKprDW4cJaWg9BHIlyJsVKab-4
Message-ID: <CAOj+MMHpDy6zfSjC4nuaVwst+Bj+vcDNjz6CXkO5x6N7-Rkx6w@mail.gmail.com>
To: "Wang, Kevin" <kevin.wang@hpe.com>
Content-Type: multipart/alternative; boundary="000000000000d2ef680644fd5699"
Message-ID-Hash: M4N4JNTKIOPSCORZHS6RCQJY56I2ANPP
X-Message-ID-Hash: M4N4JNTKIOPSCORZHS6RCQJY56I2ANPP
X-MailFrom: robert@raszuk.net
X-Mailman-Rule-Misses: dmarc-mitigation; no-senders; approved; emergency; loop; banned-address; member-moderation; header-match-idr.ietf.org-0; nonmember-moderation; administrivia; implicit-dest; max-recipients; max-size; news-moderation; no-subject; digests; suspicious-header
CC: "idr@ietf. org" <idr@ietf.org>, lsr <lsr@ietf.org>
X-Mailman-Version: 3.3.9rc6
Precedence: list
Subject: [Idr] Re: Fwd: I-D Action: draft-wang-idr-dpf-00.txt
List-Id: Inter-Domain Routing <idr.ietf.org>
Archived-At: <https://mailarchive.ietf.org/arch/msg/idr/SHvRzVD53h8pj3Xb2w2AF8MEQjI>
List-Archive: <https://mailarchive.ietf.org/arch/browse/idr>
List-Help: <mailto:idr-request@ietf.org?subject=help>
List-Owner: <mailto:idr-owner@ietf.org>
List-Post: <mailto:idr@ietf.org>
List-Subscribe: <mailto:idr-join@ietf.org>
List-Unsubscribe: <mailto:idr-leave@ietf.org>

Dear Kevin,

I know very well what RFC 7938 says. In fact I did review this document
well before it became an RFC :)

But what happened next is that while RFC7938 make a valid observation on
how one can build MSDCs lots of folks misinterpreted it as the only guide
on how to build even a few racks of DC fabrics.

So yes, using BGP to construct dynamic routing in the DC fabrics has its
use cases that are really applicable to only a handful of deployments. And
I am not aware that any of the MSDCs would be asking you for logical
transport planes within their fabrics.

All other DCs would be much better off using IGP for underlay and BGP for
overlay as a design pattern.

When I constructed 10 full racks of hardware using ISIS folks were shocked
- and pointed out that I am not using an IETF standard approach :). Then
when I demonstrated that connectivity restoration upon any node or link
failure is repaired in less then 50 ms the masks went off.

Maybe what is actually needed is an  informational RFC - just like RFC7938
- simply illustrating that one can construct DC using ISIS. It is obvious
to me, but I admit there is no RFC I am aware of to show operators that
"Large-Scale Data Centers" can be robustly build with IGPs.

Kind regards,
Robert


On Tue, Dec 2, 2025 at 7:24 PM Wang, Kevin <kevin.wang@hpe.com> wrote:

> Hi Robert and Gyan,
>
> Thanks for your feedback! Your observation is correct that IGP Flex Algo
> could achieve the same. BGP DPF can be though as a BGP counterpart of IGP
> Flex Algo to some extent (though not precisely).
>
> As explained in the “Introduction” section of this draft, BGP DPF is
> designed for the current IP fabric environment where EBGP is usually the
> only protocol used for routing. Section 5 of RFC 7938 explains why DC
> fabrics use EBGP as the sole routing protocol.
>
> Thanks,
> Kevin
>
> *From: *Gyan Mishra <hayabusagsm@gmail.com>
> *Date: *Tuesday, December 2, 2025 at 7:43 AM
> *To: *Robert Raszuk <robert@raszuk.net>
> *Cc: *idr@ietf. org <idr@ietf.org>, lsr <lsr@ietf.org>
> *Subject: *[Idr] Re: Fwd: I-D Action: draft-wang-idr-dpf-00.txt
>
> I agree with Robert that you could use RFC 9502 IGP Flex Algo in IP
> networks to build disjoint planes as desired.
>
> You could also use SRv6 with IGP Flex Algo with SR RFC 9350 which uses
> IPv6 data plane and build your disjoint planes.
>
> Thanks
>
> Gyan
>
> On Tue, Dec 2, 2025 at 6:32 AM Robert Raszuk <robert@raszuk.net> wrote:
>
> Hi,
>
> In respect to the subject draft ... why would you not use IGP Flexible
> Algorithm for it ?
>
> Are you going to port now years of work from IGP to BGP to achieve the
> same ?
>
> Besides, in a non-blocking fabric latency is really not a factor. So you
> want to logically partition it to make it blocking them worry about what
> travels on which such logical plane ? Is this a reasonable direction ?
>
> Thx,
> R.
>
> ---------- Forwarded message ---------
> From: <internet-drafts@ietf.org>
> Date: Mon, Dec 1, 2025 at 10:49 PM
> Subject: I-D Action: draft-wang-idr-dpf-00.txt
> To: <i-d-announce@ietf.org>
>
>
> Internet-Draft draft-wang-idr-dpf-00.txt is now available.
>
>    Title:   BGP Deterministic Path Forwarding (DPF)
>    Authors: Kevin Wang
>             Michal Styszynski
>             Wen Lin
>             Mahesh Subramaniam
>             Thomas Kampa
>             Diptanshu Singh
>    Name:    draft-wang-idr-dpf-00.txt
>    Pages:   18
>    Dates:   2025-12-01
>
> Abstract:
>
>    Modern data center (DC) fabrics typically employ Clos topologies with
>    External BGP (EBGP) for plain IPv4/IPv6 routing.  While hop-by-hop
>    EBGP routing is simple and scalable, it provides only a single best-
>    effort forwarding service for all types of traffic.  This single
>    best-effort service might be insufficient for increasingly diverse
>    traffic requirements in modern DC environments.  For example, loss
>    and latency sensitive AI/ML flows may demand stronger Service Level
>    Agreements (SLA) than general purpose traffic.  Duplication schemes
>    which are standardized through protocols such as Parallel Redundancy
>    Protocol (PRP) require disjoint forwarding paths to avoid single
>    points of failure.  Congestion avoidance may require more
>    deterministic forwarding behavior.
>
>    This document introduces BGP Deterministic Path Forwarding (DPF), a
>    mechanism that partitions the physical fabric into multiple logical
>    fabrics.  Flows can be mapped to different logical fabrics based on
>    their specific requirements, enabling deterministic forwarding
>    behavior within the data center.
>
> The IETF datatracker status page for this Internet-Draft is:
> https://datatracker.ietf.org/doc/draft-wang-idr-dpf/
> <https://urldefense.com/v3/__https://datatracker.ietf.org/doc/draft-wang-idr-dpf/__;!!NEt6yMaO-gk!EP_lEYmqbOUApQqqOz-ZuP9CsojS2gbvLvgQfxoYTXPXtS-0yjfv8ElqZwJBCRfOLFY6nymWoR5eJlshPeG9$>
>
> There is also an HTML version available at:
> https://www.ietf.org/archive/id/draft-wang-idr-dpf-00.html
> <https://urldefense.com/v3/__https://www.ietf.org/archive/id/draft-wang-idr-dpf-00.html__;!!NEt6yMaO-gk!EP_lEYmqbOUApQqqOz-ZuP9CsojS2gbvLvgQfxoYTXPXtS-0yjfv8ElqZwJBCRfOLFY6nymWoR5eJjgsy_TY$>
>
> Internet-Drafts are also available by rsync at:
> rsync.ietf.org::internet-drafts
>
>
> _______________________________________________
> I-D-Announce mailing list -- i-d-announce@ietf.org
> To unsubscribe send an email to i-d-announce-leave@ietf.org
> _______________________________________________
> Idr mailing list -- idr@ietf.org
> To unsubscribe send an email to idr-leave@ietf.org
>
>