Re: [Bier] Comments on draft-chen-bier-frr-02.

"Jeffrey (Zhaohui) Zhang" <zzhang@juniper.net> Wed, 17 March 2021 16:39 UTC

Return-Path: <zzhang@juniper.net>
X-Original-To: bier@ietfa.amsl.com
Delivered-To: bier@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 69E0A3A0925 for <bier@ietfa.amsl.com>; Wed, 17 Mar 2021 09:39:38 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.346
X-Spam-Level:
X-Spam-Status: No, score=-2.346 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIMWL_WL_HIGH=-0.248, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, HTML_MESSAGE=0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=juniper.net header.b=xneTK11Q; dkim=pass (1024-bit key) header.d=juniper.net header.b=Ea6RVqC0
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id cKmVr4VN_3Tl for <bier@ietfa.amsl.com>; Wed, 17 Mar 2021 09:39:34 -0700 (PDT)
Received: from mx0a-00273201.pphosted.com (mx0a-00273201.pphosted.com [208.84.65.16]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id E943E3A0912 for <bier@ietf.org>; Wed, 17 Mar 2021 09:39:34 -0700 (PDT)
Received: from pps.filterd (m0108159.ppops.net [127.0.0.1]) by mx0a-00273201.pphosted.com (8.16.0.43/8.16.0.43) with SMTP id 12HGa5a5024127; Wed, 17 Mar 2021 09:39:32 -0700
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=juniper.net; h=from : to : subject : date : message-id : references : in-reply-to : content-type : mime-version; s=PPS1017; bh=8kyFiUSpad9wTfG80QNylds7p3atIfKWiKrNfHY32vY=; b=xneTK11QNPeFOGmgU2Oz4v/RzwH8VBCwB3E7wYw08+5jAK8JMu/FLXY43JHUQQoYEj6u yhTHKfveBN6zork7C2CDZRxDdnscnTM0cct06eCl3wZPqLmBYKK9B1Kn8UFaag6Eow82 JRwWY64g2AcHrr+IBQ7INYvpPnsrN51xS7T0kSbtvFBqCVYbo7kVnRWi9RM4ZVIKvNMU Hc7YoueSZwOJ+ckPoaMrV5XeamHGCCPQwf6yx+5bw4Hu9K1p7u/TIXiv5CQkfqzPtukv 5ltvWA0Bju4WT0wM1MjlojsW0yaBHYqcmvNHnCoTTIdYYUhLjpGAKzbiiboThw9+L8TQ GQ==
Received: from nam10-mw2-obe.outbound.protection.outlook.com (mail-mw2nam10lp2101.outbound.protection.outlook.com [104.47.55.101]) by mx0a-00273201.pphosted.com with ESMTP id 37b00xjdvk-1 (version=TLSv1.2 cipher=ECDHE-RSA-AES256-GCM-SHA384 bits=256 verify=NOT); Wed, 17 Mar 2021 09:39:32 -0700
ARC-Seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=l4p/Yu4k6B8v1tazprIASKftNVLVesRQOA7cNoUbBuvA7LGh1nBCT/DtJhFffbDHkSxG034ESbnUJ+gSHYeQcfmORZm05aL945iPcEIWSxmmiKEW7GjQcWTN174cpJwBQ8VKX2uplA5yie6KPe8AYMwcp5PD7CI7X1IEdKbvYfFmCaWR1HakvO9L792ph6mKqmUwbvr2nAY6OqINMu2eLiBJJz5AInx/p4GofeCU7PYcGvig/nomKdpOWRXLOBAqAXbQrkQbtiQBlnR6EMzgS+JPjyJUe7Etht21f54X6aPlm6neUc0/HrIyz/fKp6fnVHpz5WkfQcaf1/Xk73/hBw==
ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=8kyFiUSpad9wTfG80QNylds7p3atIfKWiKrNfHY32vY=; b=V3o0nNoXXCYDdqpo75NwDGgS2oEAIvhlxs84HE6A6JNEOLXwL1OVMShWne/HnAsOWgvIE10pYhjCfOHRy0zehGCiteVPqa3ultx2Mdzs0PAqdIkCxvsyn+DLkzBJTZC+5bYHdp/0+tA+9SVzuy92P36QFVQkgdv07UH0dGurVgu3jrMDfLaQazGLyAhKFLtyYLGfO2Ktn1k8/xpd7OYq6CEt+/+A2aQsi0a0QDR0GnNXsDcarZjs8uGTWXqKbdtsgYJflmEzjZkBqVxTlRYoP6iNi5jpnaTr6eaGFDjyejdmJy5+2Xzrl8I3TIqATtcmSWbJb/CYLJFd65ebjsRQJw==
ARC-Authentication-Results: i=1; mx.microsoft.com 1; spf=pass smtp.mailfrom=juniper.net; dmarc=pass action=none header.from=juniper.net; dkim=pass header.d=juniper.net; arc=none
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=juniper.net; s=selector1; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=8kyFiUSpad9wTfG80QNylds7p3atIfKWiKrNfHY32vY=; b=Ea6RVqC0DmWPPbnMf9xtjIM5MoLacIsDpiD74SlDOm+tPGwwJvWwXFF/eJ0j9NwRvUtvguS8NCVD+wy6bjgxSTTOIUrqdbp8UFXGRTjABFHyEKeDcXpBtHaGnOs8GRAklS2FS+Lwknb1Du7ecUxsIl4Z2H4o/H1ryD4Q/3/LOGM=
Received: from MN2PR05MB5981.namprd05.prod.outlook.com (2603:10b6:208:c3::15) by MN2PR05MB5981.namprd05.prod.outlook.com (2603:10b6:208:c3::15) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.3955.11; Wed, 17 Mar 2021 16:39:30 +0000
Received: from MN2PR05MB5981.namprd05.prod.outlook.com ([fe80::203e:7f1f:be91:161c]) by MN2PR05MB5981.namprd05.prod.outlook.com ([fe80::203e:7f1f:be91:161c%6]) with mapi id 15.20.3955.012; Wed, 17 Mar 2021 16:39:29 +0000
From: "Jeffrey (Zhaohui) Zhang" <zzhang@juniper.net>
To: Huaimo Chen <huaimo.chen@futurewei.com>, "bier@ietf.org" <bier@ietf.org>
Thread-Topic: Comments on draft-chen-bier-frr-02.
Thread-Index: AdcarQ4JLoZGL76CTimMbvlaYdVumQAi/1wSAAMlCeA=
Date: Wed, 17 Mar 2021 16:39:29 +0000
Message-ID: <MN2PR05MB5981FAFF9B51444DE1813240D46A9@MN2PR05MB5981.namprd05.prod.outlook.com>
References: <MN2PR05MB5981712D40FD05376D55784FD46B9@MN2PR05MB5981.namprd05.prod.outlook.com> <MN2PR13MB4087A2CA3EB20A35E5C2A72BF26A9@MN2PR13MB4087.namprd13.prod.outlook.com>
In-Reply-To: <MN2PR13MB4087A2CA3EB20A35E5C2A72BF26A9@MN2PR13MB4087.namprd13.prod.outlook.com>
Accept-Language: en-US
Content-Language: en-US
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
dlp-product: dlpe-windows
dlp-version: 11.6.0.76
dlp-reaction: no-action
msip_labels: MSIP_Label_0633b888-ae0d-4341-a75f-06e04137d755_ActionId=044035c6-cc93-47d8-b660-1edd3b3f5ee8; MSIP_Label_0633b888-ae0d-4341-a75f-06e04137d755_ContentBits=0; MSIP_Label_0633b888-ae0d-4341-a75f-06e04137d755_Enabled=true; MSIP_Label_0633b888-ae0d-4341-a75f-06e04137d755_Method=Standard; MSIP_Label_0633b888-ae0d-4341-a75f-06e04137d755_Name=0633b888-ae0d-4341-a75f-06e04137d755; MSIP_Label_0633b888-ae0d-4341-a75f-06e04137d755_SetDate=2021-03-16T21:40:09Z; MSIP_Label_0633b888-ae0d-4341-a75f-06e04137d755_SiteId=bea78b3c-4cdb-4130-854a-1d193232e5f4;
authentication-results: futurewei.com; dkim=none (message not signed) header.d=none;futurewei.com; dmarc=none action=none header.from=juniper.net;
x-originating-ip: [71.248.165.31]
x-ms-publictraffictype: Email
x-ms-office365-filtering-ht: Tenant
x-ms-office365-filtering-correlation-id: 3e90622b-5bee-4530-9043-08d8e9633c6a
x-ms-traffictypediagnostic: MN2PR05MB5981:
x-microsoft-antispam-prvs: <MN2PR05MB5981AA9D92DF4CC98B46AD5CD46A9@MN2PR05MB5981.namprd05.prod.outlook.com>
x-ms-oob-tlc-oobclassifiers: OLM:10000;
x-ms-exchange-senderadcheck: 1
x-microsoft-antispam: BCL:0;
x-microsoft-antispam-message-info: 0+oAxhSz6hUjgnjo2xtIbGNI08WFFZsRoAoVTLSsTLVP0qSqo5zitZBGKorSB5YajezgI2Wzl2tby/toOlrKVp4HVu0/YUrG5alHujKAwdmB8CP2pGimxL+m73KBS9kWnMKWXP2gnDJNi0nKdQjShor0jY+yrWkEpKH2Z8uJ5QSJmRrTHB+IZRrsq3WoFMIwskvwnEHNxhnwLr5TjmJj1g6DxDersZQttgMzKcW10KiWYeiHGZXu7gYad7vOJK9o385F18T6jnjjOcClXoLbpqu4mSWKpMB0aMyaZMi8Qf4ufbP7tIP0vQAhGb7rTU5Nsr8AYNIlhdO9thrfCqrO5jVAtKHL08GnBo7KsfbPDgQMngNXQ5bNg5Ad5luTQ2fs6vPf7NOJQtW9tycqdn/ozJS+hczGlgBf6AL3K8GQKgimPVPeAy8K1DjH+QZtWNsalcLV2AAUagEezyF/6OB5y2jTK0MKDr2tffT6wvfeYTS7Ir95xzeDllRuMss7XwZOkh/BjClhaeKyWQpflJvmsPQUYYZjAvYVHfWduBAZ8xaO9Qiy8QhaH+K66fr9bBz+
x-forefront-antispam-report: CIP:255.255.255.255; CTRY:; LANG:en; SCL:1; SRV:; IPV:NLI; SFV:NSPM; H:MN2PR05MB5981.namprd05.prod.outlook.com; PTR:; CAT:NONE; SFS:(4636009)(136003)(396003)(346002)(366004)(376002)(39860400002)(30864003)(110136005)(8936002)(83380400001)(66476007)(26005)(6506007)(9686003)(9326002)(5660300002)(53546011)(64756008)(316002)(86362001)(478600001)(66946007)(52536014)(8676002)(2906002)(76116006)(33656002)(55016002)(66446008)(186003)(7696005)(66556008)(71200400001); DIR:OUT; SFP:1102;
x-ms-exchange-antispam-messagedata: =?us-ascii?Q?ZrVYS0dQq7Su6JiPirNsMFHtUnLjsbv2PKl4ZIOBmy+HlsAo/40O944wHnjZ?= =?us-ascii?Q?Uqa9uO3L8jLJWZ+7V32pSJgSx+mL2H+l2Fl6SY0Y9K60ITC6X9Tnfk2zxh7Z?= =?us-ascii?Q?cUJ3kvrwk+PLh0+VpCqZGqX3YnkcMNxAm5iBGhVE8FplyUNEiy/3Lw+io1wZ?= =?us-ascii?Q?o9aV7344K8nC6cTklDby+U9EoKSTbNj0brdtx+YZYbXAG8+dW1mFzXqeKXMC?= =?us-ascii?Q?UGH3Y3hDQAriwrKKrAhn2sccxxXKnBQeWE6GR4Dhi1D9V8Xw6RYlAx2k9CJr?= =?us-ascii?Q?Pe5+6eYwYp+8wmTqtWaQun+Rfg8jFi3aUJfD7NJFQG7HiUxnMKUH7EfOYT5U?= =?us-ascii?Q?B2Hv6qjM3v0ppZU7JNNLj3CcyDRa3zgGzwoYkGhnVC69xWrvcMeY+VtU44/l?= =?us-ascii?Q?xUE9b2TeDDBnhMSAECgRJ169EnrlIJb9KxdStyPdvS1ruI8DDcHeSJHk4OiQ?= =?us-ascii?Q?cYybAYqk7JPjtSvxmc7T2am0LFSiOnY442uNBurBn2MAwA35lSnWpeeddW+X?= =?us-ascii?Q?/C5R1cp576pKSIkHA5jc8vvP/WZdKr7auvQ8UwWft6pxPX79lmgNG7LdT0cF?= =?us-ascii?Q?jZYu/LjhjU1TtsIGiXrHz1GDat1EdcFeSJAsWcPYbOJDc9hHVO0vIVsXo+c6?= =?us-ascii?Q?R3FIcGCqbZJxok/7ToDpMZU6AB2Ybp0vA76QKbZ2OxveT0h03LIiT+HH52y0?= =?us-ascii?Q?paQ5CCTX7TSj+ZCKSh3l2eFwt0t19RAT3ataEz76OX0ShgXKAgb5C2ZJt7wN?= =?us-ascii?Q?E8+kb9jv2aI5aoahe3xqNMh6bCG9p9VN4NYBICwPzjRFvZOQnp+zIZh+6v0s?= =?us-ascii?Q?zFw+YD8HxiA4dnBMscpJjI8or4C0m4wGhGqIqKsEVNWZSM843XKVVRAvHZYU?= =?us-ascii?Q?YMk+w6B3yJlbGIIm1VMzHMu105XjjKLji3Nn6H6LhoGT4ZWKp9h7ci9Gp069?= =?us-ascii?Q?mfzSnNL/UTuk4I1qWv/vix65d6gxrBVudrh+hohu92JnsAETylDGhwYzRLmH?= =?us-ascii?Q?B90E5H0lN7kKayDzmhXKxgH57SxhY97FSD8/KwLBXpv6q0eAN6UMzTdiFKmp?= =?us-ascii?Q?mnomwoEgzOGu8npeYPdUg0KcngZNejinm7z23qdVl356RsEI8/VjhFVdwRul?= =?us-ascii?Q?H1vb+o9Kmypl6FVIosuBf1eZ6uugZsJvtkiZ9IodWUWfVWOR9GFaAq14iviS?= =?us-ascii?Q?199qfgUobWNzfRl8Y699257UsnpSD7nhaNKV2rUJEH5pO+YNZPCvHzcrqSKS?= =?us-ascii?Q?5ahF5CzHivvje1gMUVtSIxKMBuDNw/WoZG7iCdODVFF88O/6OGodCc0fO8Z0?= =?us-ascii?Q?1Zua5MDh5m3+qhSq3jxDH3MG?=
x-ms-exchange-transport-forked: True
Content-Type: multipart/alternative; boundary="_000_MN2PR05MB5981FAFF9B51444DE1813240D46A9MN2PR05MB5981namp_"
MIME-Version: 1.0
X-OriginatorOrg: juniper.net
X-MS-Exchange-CrossTenant-AuthAs: Internal
X-MS-Exchange-CrossTenant-AuthSource: MN2PR05MB5981.namprd05.prod.outlook.com
X-MS-Exchange-CrossTenant-Network-Message-Id: 3e90622b-5bee-4530-9043-08d8e9633c6a
X-MS-Exchange-CrossTenant-originalarrivaltime: 17 Mar 2021 16:39:29.5737 (UTC)
X-MS-Exchange-CrossTenant-fromentityheader: Hosted
X-MS-Exchange-CrossTenant-id: bea78b3c-4cdb-4130-854a-1d193232e5f4
X-MS-Exchange-CrossTenant-mailboxtype: HOSTED
X-MS-Exchange-CrossTenant-userprincipalname: R9kg+I44xjMOHZqFmxEy1hlgSEsufIugyZzOhhE1K1J+01Z67MCBYloJBRacLxKOqno4VG15r8gKOaGK7cGFaQ==
X-MS-Exchange-Transport-CrossTenantHeadersStamped: MN2PR05MB5981
X-Proofpoint-Virus-Version: vendor=fsecure engine=2.50.10434:6.0.369, 18.0.761 definitions=2021-03-17_10:2021-03-17, 2021-03-17 signatures=0
X-Proofpoint-Spam-Details: rule=outbound_spam_notspam policy=outbound_spam score=0 mlxscore=0 phishscore=0 impostorscore=0 malwarescore=0 lowpriorityscore=0 suspectscore=0 priorityscore=1501 spamscore=0 mlxlogscore=999 adultscore=0 bulkscore=0 clxscore=1015 classifier=spam adjust=0 reason=mlx scancount=1 engine=8.12.0-2009150000 definitions=main-2103170115
Archived-At: <https://mailarchive.ietf.org/arch/msg/bier/_X-U_kbHv5eozoHcCS_1nf9WYHk>
Subject: Re: [Bier] Comments on draft-chen-bier-frr-02.
X-BeenThere: bier@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: "\"Bit Indexed Explicit Replication discussion list\"" <bier.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/bier>, <mailto:bier-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/bier/>
List-Post: <mailto:bier@ietf.org>
List-Help: <mailto:bier-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/bier>, <mailto:bier-request@ietf.org?subject=subscribe>
X-List-Received-Date: Wed, 17 Mar 2021 16:39:39 -0000

Hi Huaimo,

My comments were in three categories:


  1.  Nits - we can skip them now
  2.  About your statement about draft-merling. I need to digest your response so I'll defer that to later and/or to Michael.
  3.  The flaw in multiple per-nbr FRR BIFTs

The first two are not really important to the adoption call and the direction of the BIER FRR document, but the 3rd one is.

As you said:

[HC]: Multiple failures handling is out of scope for now.

In my view, the following does handle multiple failures to maximum extent:

zzh> A better way is to simply have a single BIFT for both normal and FRR
forwarding. Each entry has some ECMP and/or primary/backup branches,
and you simply use another ECMP branch or a backup branch when the
normally used branch fails. A backup branch can go through completely
different nbr, or go through the same neighbor but via a tunnel.

Consider that and the cost associated with building/maintaining multiple BIFTs, I would argue that the multiple per-nbr FRR BIFT approach, while perhaps acceptable for a particular implementation, is not ready yet for WG adoption.

Please note that I am not arguing against LFA based FRR. I am just saying that the reference model of a single BIFT with ECMP/primary/backup branches for an entry is the best baseline.

Jeffrey

From: Huaimo Chen <huaimo.chen@futurewei.com>
Sent: Wednesday, March 17, 2021 11:31 AM
To: Jeffrey (Zhaohui) Zhang <zzhang@juniper.net>et>; bier@ietf.org
Subject: Re: Comments on draft-chen-bier-frr-02.

[External Email. Be cautious of content]

Hi Jeffrey,

    Thanks for your comments.
    My responses are inline below with prefix [HC].

Best Regards,
Huaimo
________________________________
From: Jeffrey (Zhaohui) Zhang <zzhang@juniper.net<mailto:zzhang@juniper.net>>
Sent: Tuesday, March 16, 2021 5:41 PM
To: Huaimo Chen <huaimo.chen@futurewei.com<mailto:huaimo.chen@futurewei.com>>; bier@ietf.org<mailto:bier@ietf.org> <bier@ietf.org<mailto:bier@ietf.org>>
Subject: Comments on draft-chen-bier-frr-02.

Hi Huaimo,

Please see my comments below.

Some are nits, but if I understand it correctly, the idea of using
multiple per-nbr FRR BIFTs is flawed.
[HC]: My statement below is right.

   [I-D.merling-bier-frr] proposes a tunnel-based fast re-route (FRR)
   ... Before the
   primary bit mask is recomputed and updated, some of BIER packets may
   be forwarded incorrectly.

Would like to see elaborations on why some of the packets may be
forwarded incorrectly - especially if you consider the approach
at the end of this mail.

[HC]: There are two forwarding entries for each BFER in the BIFT for FRR.
One is primary forwarding entry with primary bit mask and primary next hop.
The other is the backup forwarding entry with backup bit mask and backup next hop.
The backup next hop is the next hop of the neighbor.
All the primary forwarding entries in the BIFT have thier primary bit masks and
primary next hops that are computed before the failure of a neighbor.
When the failure of a neighbor happens, the BIER packets that
go to the neighbor as its next hop are forwarded to the next hop
of the neighbor using the backup forwarding entry.
All the other BIER packets that go to the other neighbors
as their next hops are forwarded to these next hops (i.e., primary next hops)
using the primary forwarding entries.
These primary forwarding entries with thier primary bit mask and primary next hop
are computed before the failure. After the failure, all the primary forwarding entries
will be recomputed and may be different from those before the failure.
>From the time at which the neighbor fails to the time at which
all the primary forwarding entries are recomputed and updated
based on the updated network topology after failure,
Some of the BIER packets that are forwarded using the old primary
forwarding entries may be lost.


   This document describes a mechanism for fast re-route (FRR)
   protection against the failure of a node or link in the core of a
   BIER domain, which resolves the above issue.  It is based on LFA,
   which is called LFA-based BIER-FRR.

Unicast FRR can be based on many mechanisms, and BIER FRR can simply
follow suit. It does not have to be limited to LFA or tunnel - simple
ECMP can also work if there are ECMP paths (if an ECMP branch is no
longer available you can simply take another one).

[HC]:  The draft about LFA-based BIER FRR does not mean other approaches
for BIER FRR cannot be used.

   In normal operations, the normal BIFT is used to forward BIER
   packets.  When a neighbor fails, the BFR as PLR uses the FRR BIFT for
   the neighbor to forward BIER packets.  For a BIER packet to traverse
   the BFR and the failed neighbor, the BFR reroutes the packet around
   the failed neighbor using the FRR BIFT for the neighbor.  For a BIER
   packet to traverse the BFR and any other neighbors, the BFR forwards
   the packet to its expected next hop neighbors using the forwarding
   entries with these BFR neighbors in the FRR BIFT.

I do not see why there is a need for per-neighbor FRR BIFT. It is also not
clear how the switch between normal BIFT and FRR BIFT is done - in fact
I think it is flawed - more on that later.

[HC]: In one implementation, we can have a BIFT pointer, which points
to the forwarding table that is used. In normal operations, the pointer
points the normal BIFT. When a neighbor fails, we can set the pointer to
the FRR BIFT for the neighbor. After the network convergence,
the normal BIFT is recomputed and updated based on the updated network
topology after the failure, we can set the pointer to the normal BIFT.

   From the BIRT on the BFR, a "Bit Index Forwarding Table" (BIFT) is
   derived.  In addition to having a route to a BFER in each row of the
   BIFT which is the same as the BIRT, it has a "Forwarding Bit Mask"
   (F-BM) in its each row.  For the rows in the BIRT that have the same
   SI and the same BFR-NBR, the F-BM for each of these rows in the BIFT
   is the logical OR of the BitStrings of these rows.

Need to be a bit more strict here. A row in BIFT is about a
particular BFER, which could be reached via two ECMP nbrs.
The F-BM is really for a nbr, not for a row. See Figure 6 in RFC8279.

[HC]: We will update accordingly.

   When a BFR receives a packet, for each BFER k (from the rightmost to
   the leftmost) represented in the SI and BitString of the packet, if
   BFER k is the BFR itself, the BFR copies the packet, sends the copy
   to the multicast flow overlay and clears bit k in the original
   packet; otherwise the BFR finds the row (i.e., forwarding entry) in
   the BIFT for the sub-domain using the SI and BitString as the key or
   say index, and then copies, updates and forwards the packet to the
   BFR-NBR (i.e., the next hop) indicated by the row (i.e., forwarding
   entry).

Strictly speaking, the BIFT is for <sub-domain, SI> (if we ignore
different bitstringLen), and the lookup key is 'k' not the entire bitstring.

[HC]: We may add some text to explain.

   The FRR-BIRT for BFR-NBR X of the BFR considers the failure of X and
   maps the BFR-id (in the sub-domain) of a BFER to the BFR-prefix of

A nit here - strictly speaking it's not that you map the bfr-id to bier
prefix in BIRT. BIRT is built based on how a BIER prefix is reached,
and the mapping is the other way around - you map the prefix to a BFR-id
when you derive the BIFTs from BIRT.

[HC]: It seems that this is based on some RFCs. I will double check.

   The BFR may build the FRR-BIRT for BFR-NBR X by copying its BIRT to
   the FRR-BIRT first, and then change the next hop with value BFR-NBR X
   in the FRR-BIRT to a backup next hop (BNH) to protect against the
   failure of X.  In other wards, for the BFR-id of a BFER in the FRR-
   BIRT for BFR-NBR X, if the next hop BFR-NBR on the path to the BFER
   is X, it is changed to a BNH when there is a BNH on a backup path to
   the BFER without going through X and the link from the BFR to X.

   If there is not any BNH to a BFER to protect against the failure of
   X, the next hop BFR-NBR X to the BFER in the FRR-BIRT for BFR-NBR X
   is changed to NULL.  For a multicast packet having the BFER as one of
   its destinations, if the next hop BFR-NBR to the BFER is NULL, the
   BFR does not send the packet to the next hop BFR-NBR NULL but drops
   it when X fails.

A nit - it's not that the packdet is dropped. Rather, the bit for that
BFER is simply cleared.

[HC]: We will update accordingly.

So for BFERs not affected by X's failure, they're still present in
X's FRR BIFT and are the same as in the normal BIFT. Keeping these
per-nbr FRR BIFTs with those unaffected entries
just does not make sense - you might as well have one BIFT in which
each row includes ECMP/backup branches. Of course, this is implementation
details/preferences, but:

- it's not good to specify this controversial implementation details
- switching to using FRR BIFT is actually flawed - see later comments.

[HC]: Switching between normal and FRR BIFT is described above.

   The forwarding procedure defined in [RFC8279] is updated/enhanced for
   a FRR-BIFT to consider the case where the next hop BFR-NBR to a BFER
   is NULL.  For a multicast packet with the BitString indicating a BFER
   as one of its destinations, the updated forwarding procedure checks
   whether the next hop BFR-NBR to the BFER in the FRR-BIFT is NULL.  If
   it is NULL, the procedure will not send the packet to this next hop
   BFR-NBR NULL but drop the packet.

A nit - not "drop the packet" but simply clear the bit.
Additionally, even when you do not support FRR you also have situations
whwere some BFERs are simply not reachable. In other words, clearing
the bit for those BFERs is the base requirement/assumption in RFC8279,
not a update/enhancement.

[HC]: We will update accordingly.

   The FRR-BIFTs will be pre-computed and installed ready for activation
   when a failure is detected.  Once the BFR detects the failure of its
   BFR-NBR X, it activates the FRR-BIFT for X to forward packets with
   BIER headers and de-activates its BIFT.  After activation of the FRR-
   BIFT, it remains in effect until it is no longer required.

Let's say this BFR has several BFR neighbors (including X and W).
X fails first and W fails instantly after that.
Which FRR BIFT do you use? X's FRR BIFT does not consider W's failure
and W's FRR BIFT does not consider X's failure, so using either one alone
has problems. Notice that this is different from the typical
"multiple failure" scenario - it's not that X and W may depend on each
other but it could be that some BFERs have a common backup neighbor Y, yet
if you use X's FRR BIFT then those BFERs normally reachable via W
are not protected now that W also fails. Similarly, if you use W's
FRR BIFT then those BFERs normally reachable via X are not protected
now that X also fails.

[HC]: We focus on single failure at one time. Multiple failures at the same
should be out of scope now. We may consider it in future.


That's why a single BIFT should be used for both normal and FRR forwarding,
just like in unicast case.

   The number of entries in a FRR BIFT is the number of BFERs.  Each FRR
   BIFT on a BFR can be compressed through combining all the entries
   with the same BFR-BNR and F-BM into one entry.  The number of entries
   in a compressed FRR BIFT is the number of neighbors of the BFR minus
   one.

   For example, the compressed FRR-BIFT for BFR C on BFR B is shown in
   Figure 7.  The number of entries in it is three, which equals the
   number (four) of neighbors of BFR B minus one.

                 +----------------+---------+------------+
                 |     BFR-id     |  F-BM   |  BFR-NBR   |
                 | (SI:BitString) |         | (Next Hop) |
                 +================+=========+============+
                 | 1, 4 (0:01001) |  01001  |     G      |
                 +----------------+---------+------------+
                 | 2, 3 (0:00110) |  00110  |     E      |
                 +----------------+---------+------------+
                 |  5 (0:10000)   |  10000  |     A      |
                 +----------------+---------+------------+

             Figure 7: Compressed FRR BIFT for BFR C on BFR B

   For a BIER packet with a BFR-ID as a destination, the entry
   containing the BFR-ID is used to forward the packet.

Not sure of the benefit of compression here. It makes the lookup more
complicated. If the memory consumption is a concern, just get rid of
per-nbr FRR-BIT and use a single one. Additionally, the F-BM and BFR-NBR
information can be considered forwarding information that can be pointed
at by multiple BIFT entries, so the above compression does not really
give you much savings.

[HC]: Compression is optional.

   Once BFR B detects the failure of its BFR-NBR X, after receiving the
   packet from BFR A, BFR B copies, updates and sends the packet using
   the FRR-BIFT for X on BFR B to avoid X and link from B to X according
   to the forwarding procedure defined in [RFC8279].

   For example, once BFR B detects the failure of its BFR-NBR C, after
   receiving the packet from BFR A, BFR B copies, updates and sends the
   packet to BFR G and BFR E using the FRR-BIFT for BFR C on BFR B to
   avoid C and link from B to C.

See earlier comments about failure of multiple neighbors (who do not
use each other for backup). Using a per-nbr FRR BIFT not only is
unnecessary, but also has problems.

[HC]: Multiple failures handling is out of scope for now.

A better way is to simply have a single BIFT for both normal and FRR
forwarding. Each entry has some ECMP and/or primary/backup branches,
and you simply use another ECMP branch or a backup branch when the
normally used branch fails. A backup branch can go through completely
different nbr, or go through the same neighbor but via a tunnel.

Jeffrey

Juniper Business Use Only


Juniper Business Use Only


Juniper Business Use Only