[Roll] Review of draft-ietf-roll-rnfd-02

"S.V.R.Anand" <anandsvr@iisc.ac.in> Thu, 16 November 2023 09:52 UTC

Return-Path: <anandsvr@iisc.ac.in>
X-Original-To: roll@ietfa.amsl.com
Delivered-To: roll@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 3C4CBC14CF1A for <roll@ietfa.amsl.com>; Thu, 16 Nov 2023 01:52:59 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.109
X-Spam-Level:
X-Spam-Status: No, score=-2.109 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, RCVD_IN_DNSWL_NONE=-0.0001, RCVD_IN_MSPIKE_H2=-0.001, RCVD_IN_ZEN_BLOCKED_OPENDNS=0.001, SPF_PASS=-0.001, T_SCC_BODY_TEXT_LINE=-0.01, URIBL_DBL_BLOCKED_OPENDNS=0.001, URIBL_ZEN_BLOCKED_OPENDNS=0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=iisc.ac.in
Received: from mail.ietf.org ([50.223.129.194]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id hgiSskaNDeEg for <roll@ietfa.amsl.com>; Thu, 16 Nov 2023 01:52:53 -0800 (PST)
Received: from IND01-BMX-obe.outbound.protection.outlook.com (mail-bmxind01on2085.outbound.protection.outlook.com [40.107.239.85]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id A6D74C151075 for <roll@ietf.org>; Thu, 16 Nov 2023 01:52:52 -0800 (PST)
ARC-Seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=MRyjFK3UnBVtjxsne3EC3EtlQFlvvoU5eC/cSjA5HRwBou5kKnhqeEE+iDTRtwfIVntOp+XJBGTa4bFKHEUcOOBpIFMLeC7Rv+brpvYff46TjKQlTO997/lUaccHbmNCuc9uex4TC4JxfYUTGSXjeA4/QG3bk49oRbLRc91hJJwdnNgKyCGMG4/WuMXCcjN2lVPecMARaia9ZaCNXvfMG+tw7/rWvkw4TMMoOrBl4rvnfLwM71AOES62fhae7XX3UB7vJqBDlvdIx4w3Zy7UbFchAyalGa0Wq9NgQHSmAloUUoPe49ufZCMgigvDIgwPbCnX4ptTp/+MpzNDud0zQA==
ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-AntiSpam-MessageData-ChunkCount:X-MS-Exchange-AntiSpam-MessageData-0:X-MS-Exchange-AntiSpam-MessageData-1; bh=MvwE1pefe9tZaePFQHV5+xDUcygsHvaD4LcfcVp7vDw=; b=DuHdQb1un0e/ysCep8HhStMEZDLCGnvsRnNCvOQSShjlPEdXBictlXh76Gb9i60giwHKYk8mEyoNJkAumH9Atq6zCXVuxrVBomBCVb2QdR81JelIuzZo5D6kT9XJDVg1ENU88vIk3FOG/U36uWy//BLVJ6fJ2LLR0YKbnBKmnAtpFwL2dUJYGquaPr0N9KrndrBK3UN7X9kSfzYypnR/BT5BQlDe5fhWQm0ZDJl20amedgGW4oJCpQkHEsm8y+eA02GN9UUYJQXouOTBIzdMwanh4qFynGOaqvOAYtvL5VTdMYaKa1HqiVI9kJ284tA8PS/1oQtz4O1jb0mTX8heeg==
ARC-Authentication-Results: i=1; mx.microsoft.com 1; spf=pass smtp.mailfrom=iisc.ac.in; dmarc=pass action=none header.from=iisc.ac.in; dkim=pass header.d=iisc.ac.in; arc=none
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=iisc.ac.in; s=selector1; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=MvwE1pefe9tZaePFQHV5+xDUcygsHvaD4LcfcVp7vDw=; b=D9xn3tbyoNwrcJPkiNoI3xiewTxSSkMc8mWIKaUPVx3vF0Q8GQdi9TCOqBIKwXHfEf8aUx9lv5xRvKexXmGJa+YmzwaX0rGA/7xOh/RPg/YZHyg7AExLHFHrQTNwnmygwWrR2Uf71GTIL/2OqZut6VIMs7G6r9imxivtyrpgknwiZUacNTbkeCZ3unJjyxLbzbT6+7EETmIhbaTNBi8MXaEqgHlczChCK/ns9cYfMWh2CYq5GXRChsl3Vl8jVvERb5e2449+Eu6VvsbT80PbrwLaJk1nymMOrbv6NUB845GYFjkmgPatnZzIjmjn2a01hc2ZDRKth0Zl6TZvko/lPg==
Authentication-Results: dkim=none (message not signed) header.d=none;dmarc=none action=none header.from=iisc.ac.in;
Received: from MA1PR01MB2218.INDPRD01.PROD.OUTLOOK.COM (2603:1096:a00:37::23) by PN0PR01MB9527.INDPRD01.PROD.OUTLOOK.COM (2603:1096:c01:111::14) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.7002.18; Thu, 16 Nov 2023 09:52:48 +0000
Received: from MA1PR01MB2218.INDPRD01.PROD.OUTLOOK.COM ([fe80::4963:887b:afd8:f4a2]) by MA1PR01MB2218.INDPRD01.PROD.OUTLOOK.COM ([fe80::4963:887b:afd8:f4a2%6]) with mapi id 15.20.7002.019; Thu, 16 Nov 2023 09:52:48 +0000
Date: Thu, 16 Nov 2023 15:22:45 +0530
From: "S.V.R.Anand" <anandsvr@iisc.ac.in>
To: roll@ietf.org
Message-ID: <d6fxpyibwqdb5vt76ekvh6g3vqguvwp7ndfb4m2phyr7zxpmpf@fsxok66lfr27>
Content-Type: text/plain; charset="utf-8"
Content-Disposition: inline
Content-Transfer-Encoding: 8bit
X-ClientProxiedBy: PN2PR01CA0135.INDPRD01.PROD.OUTLOOK.COM (2603:1096:c01:6::20) To MA1PR01MB2218.INDPRD01.PROD.OUTLOOK.COM (2603:1096:a00:37::23)
MIME-Version: 1.0
X-MS-PublicTrafficType: Email
X-MS-TrafficTypeDiagnostic: MA1PR01MB2218:EE_|PN0PR01MB9527:EE_
X-MS-Office365-Filtering-Correlation-Id: cf0374b2-16af-4aca-a271-08dbe689ca1e
X-MS-Exchange-SenderADCheck: 1
X-MS-Exchange-AntiSpam-Relay: 0
X-Microsoft-Antispam: BCL:0;
X-Microsoft-Antispam-Message-Info: rJU3n+batmKf7XI/QWAIYzPZAoblvATW5cPdQPvmq2V0zQeaKD9LQ042LtIoi+PTjw/KCsS7hLywzBPjs0wQ33H24Mm//b9IXcS5KPViH/+Jg0ykpad2Wk50qg+stxGMG9/sxdQwgTRbqx2ZaP3QeQ+v7RbsrfmskWsESIFsAefXRycDemBK2YB4eqPn4vEmusQzEX/z7F/c/kiW7GNJYjc1vtX8TBns50ogn3HcZ9mg3n9DpmhnmI2IFrLoTBQ2VEpyMsGW3aXdGHRZ/35591p6ZqLPGC3frHKUYuw1C93ULQwLU4GOy9Nso/TBrM0kl/OcpB/dbroPaGhmmUPyLYdzBc7URG2Xvby0ztW9sof3fgL4DblvsVF2ly7cJQnIX1wvQ3hZlGQgP+C0EFLh6ML5UmO5uKrjUoWitxzE3Wa8TQLYzveVpJfH4aEaTDz+2loY5P5D9nepTUQpgsKxxATPH/+pa0utpV6Kvl46mMm6vr2DlAb0ZYld6HZwDPx8Z6cWiUN1mFy46ozPWj17x2Z+1K5m0cQ/cKQalNhBf6s=
X-Forefront-Antispam-Report: CIP:255.255.255.255; CTRY:; LANG:en; SCL:1; SRV:; IPV:NLI; SFV:NSPM; H:MA1PR01MB2218.INDPRD01.PROD.OUTLOOK.COM; PTR:; CAT:NONE; SFS:(13230031)(7916004)(136003)(376002)(39860400002)(346002)(396003)(366004)(230922051799003)(1800799009)(64100799003)(186009)(451199024)(6512007)(9686003)(4743002)(66574015)(26005)(33716001)(6486002)(966005)(478600001)(8676002)(86362001)(41300700001)(2906002)(5660300002)(8936002)(6916009)(66556008)(66946007)(66476007)(316002)(786003)(55236004)(6666004)(6506007)(38100700002)(83380400001); DIR:OUT; SFP:1101;
X-MS-Exchange-AntiSpam-MessageData-ChunkCount: 1
X-MS-Exchange-AntiSpam-MessageData-0: BRWLmX05+aEs65Pt9OcadsEn8gXBEq+gM6y1jDi+TZukdrKUsGlOtJBqjeyBvkV33Zbna/JFQZ4ywp1FyeAZJyhnp1LftVi+QFq3TSGh7ZgkaMNDzrDlt2uCa7rqWmqaquedlm9fDVKf4uRLkCeH/Edk5d8JafOzzmwmnMkLtjPO74WHq3Rkdq088FkKAibQCeUpPUCPbgu2f6BElDmh5KlDXXdqRo1QqlC6Zgl/6n1daEgmwwkvl4J9/ns2xA7iP8IYJBtTULI9aIKNN/AhvRJSSxG7FCm+XPQRd7GxXr1ge5PU8j2VIDSxuNOl9IUBS2zzNL9DcUW9XPaLKyELScPY5Z/hLFqNXDmIIFzyT9MQjoe1H6vzHoqsJ12Jw+W9w7//G3ML5SFm3MMX6Wj5fpn3+SI/ITZtDIgfd/MaxsaC+JAWyxbdrpLd+rgDRBS3eCotAj+UlW+5sXa60+OFSTFuCrxfV08L3lMQ+vX1etOw572Gh+JThGk8iPMP0hN6KoPYsWWb1XTBFpaFwqWidRFud3PJJvr1TWNmnUAJYG4gBf0ohlzKw4xLv5PFHJIFZQgBSLWY8kWuDqbDvbf9rJZk18sxvOzefj7YF+TnIFkdktpsqLbwgWOxWfEHq2+PeuRn+5VW1V6Kzh/7qWMVnfdiMUGeCaBj+0EVZQpCbeGv0KvVhpt0AgXx11JmRAWdxbjFk5QPPfoWNP+4IrIKsuxSYJEPVZXsBbbq99EKiR0T/tq3ELF1VemIA43Leu2uJnQ8fHFSeDCj+Z8fGrwnVpEJYF6zyW40QMjbAVp5KJVD6YRX15VFfq7Xs2TU8Cqgifg6NJDkhfGbnkkMhuQ60MjxsvgWK+P5KQ0RyRQSC+8BpfSPI2YRrO5PWIbGulWq2+VOIZxSCUDqp9egqeElBhQSx7RZE4mLJMV2dxL0U+5bc+xd12QucXfoUi5Cr0lPc5E9SIgpGW54JXgSAhb0a0/OzBpqhXdgHSMdt/ZEkVdGzooPZBLMTa5OTbOuUM36j3ZfmxHdWgD2mcicOeyo94VK7E53aQokBLbP7+ZftZxKq5UF+4SQI0r8WQDO7Ek6yepth0bTY1BHEkvyDQSeE1carjmOJBpszxgSmpd7MTeNVnBsF954fvvaiWDgYHJ0FOd2TjLR/XuLpPf+FYHxEJGU9GFGidh1/KTIwJnaT6+FG76g469I6HI/1pTdelc1ee4s6+Mr4AKr1JaHfWpNxLnE9MmqDqPQI+9TUnU+v8pxNOgFfK3mXgbJRA1eNajAop+MIkAPotiiNrWqhlgzDss8c/TEkzzFQtOObgfa9baGL5oQEwpS9zJzhBtOUm6nxG9Ots7XJmTIR/pjM7UdMRta9h7dU5ot6KOqg9qKAUjuAqM8nt2BUxPGujeOQl6VCd8cz8JTeDl1o598CIHPt5ZHt47c3CjGJKZHIrGJGLTNVz9zTGXy3vT6UoIlffbeuHf4yzkgKTuKPDgEaD7gbQgaR+2RRNZBwoiqvXM7j8u7YGq88/BtrTsliMuFGlZtoD0YQqa4oz9GY/geBuBxUrh4KSIzzofPSH0wcOBqWHijydcrhkTvv2ewD4NXqOe1
X-OriginatorOrg: iisc.ac.in
X-MS-Exchange-CrossTenant-Network-Message-Id: cf0374b2-16af-4aca-a271-08dbe689ca1e
X-MS-Exchange-CrossTenant-AuthSource: MA1PR01MB2218.INDPRD01.PROD.OUTLOOK.COM
X-MS-Exchange-CrossTenant-AuthAs: Internal
X-MS-Exchange-CrossTenant-OriginalArrivalTime: 16 Nov 2023 09:52:48.1130 (UTC)
X-MS-Exchange-CrossTenant-FromEntityHeader: Hosted
X-MS-Exchange-CrossTenant-Id: 6f15cd97-f6a7-41e3-b2c5-ad4193976476
X-MS-Exchange-CrossTenant-MailboxType: HOSTED
X-MS-Exchange-CrossTenant-UserPrincipalName: 8Q167fLT5JeBEBojwlbUYwWxRsC8auWwnP5w+vEDx93zBhtmXiNAcjGo5UMwTivRRCV297y6oSUwQWL7EBEYoA==
X-MS-Exchange-Transport-CrossTenantHeadersStamped: PN0PR01MB9527
Archived-At: <https://mailarchive.ietf.org/arch/msg/roll/o28Z1arAFa9FYhTUI0u2rGHN_8E>
Subject: [Roll] Review of draft-ietf-roll-rnfd-02
X-BeenThere: roll@ietf.org
X-Mailman-Version: 2.1.39
Precedence: list
List-Id: Routing Over Low power and Lossy networks <roll.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/roll>, <mailto:roll-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/roll/>
List-Post: <mailto:roll@ietf.org>
List-Help: <mailto:roll-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/roll>, <mailto:roll-request@ietf.org?subject=subscribe>
X-List-Received-Date: Thu, 16 Nov 2023 09:52:59 -0000

Dear Konrad,

Thank you for your nicely written draft that draws attention to the severity of
the LBR failure and proposes a solution for faster early detection of such a
failure. In this review, I will try to summarize my thoughts on the
same which will hopefully help in enhancing the document.

1) After the reading document, I am tempted to ask this question as to how
come such a serious problem which can disrupt the service for several minutes
got unnoticed till today after so many years of large scale RPL deployments
across the world and lot of RPL related research work that had gone in. Surely some of
those real-world deployments might have implemented their own proprietary
methods to minimize the service disruption without any changes to RPL protocol.
Having multiple LBRs for the purpose of combating LBR failures and load
balancing is not new. 

Since LBR is straddling between wired and wireless worlds, proper functioning
of the IoT network is essential for wireless sensor data producer and the wired
sensor data consumer/actuator. This means both these end points can potentially
detect LBR failures. While the RNFD document tackles the problem from the LLN
side, one can use plethora of schema for quickly detecting LBR failure from the
wired side deterministically at negligible cost. In the presence of multiple LBRs
and the use of virtual roots one can reroute the packets to the other LBRs.

I came across an interesting work that has been carried out in Inria research
"RPL cooperation experiments using FIT IoT-LAB testbed". In particular, the
paper titled "Sharing is caring: a cooperation scheme for RPL network resilience
and efficiency". Here coordination happens between LBRs over wired network to
detect LBR failures.  The other LBRs pass on the LBR failure information into
the LLN. (Source: https://inria.hal.science/hal-02095410/file/papier.pdf).

Giving a brief summary of strategies adopted by IoT deployments that bring
resilience to LBR failures may not be out of place. Also, RNFD protocol can
take benefit from other schemes that are available.

2) Assuming the time gap between border router failures due to power outages is
long, it is not apparent from the document the energy consumption and its
impact on the node/network lifetime due to the consensus process which is
always running in the background. I know, it depends :) But it is good to
include experimental results that indicate RNFD performance under different 
network scenarios. 

3) Refer to section 5.2.  Detecting and Verifying Problems with the DODAG Root

- Apart from NUD and Layer 2 triggers, I think "overhearing" technique as a means
  to infer the neighbors' state can help reducing the message exchanges and
  faster LBR failure detection. What are your thoughts on this ? 

- How does the RNFD work in the presence of RDC ? How is the causality 
  of information is maintained in such a situation ?

- What happens if certain conditions result in a state that keeps toggling ?  I
  think RNFD needs to have additional algorithms in place on top of link
  failure detection schemes such as NUD.

4) For minimizing false positives, and false negatives, we would expect the
   Sentinel nodes to be running some internal algorithm that decide on the state
   changes between “SUSPECTED DOWN” and “LOCALLY DOWN”. I suppose this algorithm
   is important for the proper functioning of RNFD protocol. In order to maintain
   consistency and robustness among different implementations, a reference
   algorithm will be useful to be given in a separate section or appendix.

5) Refer to 5.3.  Disseminating Observations and Reaching Agreement

- How does one set RNFD_CONSENSUS_THRESHOLD value ? Is it constant across the
  network or it depends on factors like network topology, traffic pattern and so on ? 
  Also, does this number remain fixed for the entire lifetime of the network or needs
  to be adapted over time ?

- A curious question. I would assume that even if just one node indicates that
  LBR is UP at any time through PositiveCFRC, then LBR can be considered up at
  that time. Isn't it ?  This precise time epoch is what should be considered in
  deciding the liveliness of the LBR. In the absence of the notion of time, the
  question of "latest" information does not arise. Is there a risk of a wrong 
  conclusion, especially if the number of good links are less ?

There is a small typo in 3.2. Counters and Communication. "acros" to be changed to "across"

----------------------------------------------------------------------------------------

Thank you for your time. I hope you found the points that I brought out are 
reasonably good and help in improving the draft.

Regards
Anand