Re: [Roll] draft-rahul-roll-rpl-observations-00 Section 2.1: Wear leveling

"Pascal Thubert (pthubert)" <pthubert@cisco.com> Sat, 12 May 2018 19:59 UTC

Return-Path: <pthubert@cisco.com>
X-Original-To: roll@ietfa.amsl.com
Delivered-To: roll@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 049BC127599 for <roll@ietfa.amsl.com>; Sat, 12 May 2018 12:59:11 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -14.51
X-Spam-Level:
X-Spam-Status: No, score=-14.51 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, RCVD_IN_DNSWL_HI=-5, SPF_PASS=-0.001, T_DKIMWL_WL_MED=-0.01, URIBL_BLOCKED=0.001, USER_IN_DEF_DKIM_WL=-7.5] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (1024-bit key) header.d=cisco.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id CDOj7s3C63ka for <roll@ietfa.amsl.com>; Sat, 12 May 2018 12:59:08 -0700 (PDT)
Received: from alln-iport-7.cisco.com (alln-iport-7.cisco.com [173.37.142.94]) (using TLSv1.2 with cipher DHE-RSA-SEED-SHA (128/128 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id B1BDF1200F1 for <roll@ietf.org>; Sat, 12 May 2018 12:59:08 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=cisco.com; i=@cisco.com; l=4886; q=dns/txt; s=iport; t=1526155148; x=1527364748; h=from:to:subject:date:message-id:references:in-reply-to: content-transfer-encoding:mime-version; bh=BUw7PagYamjQVmkT1UhZguS+YEXNj5+EMXmNviHC5hk=; b=KXzX4a6TmoipoRW71lzEF/1Wh1P2G8zw7QQrUunO4WRgS5qyq4cXa5+q ZWwpXwYV6ao6KfNxR+AOYvzNPqLbPIlsxGn0DTgxgr95Q7gAZGq2oboYV jd/bM5+IGuBPfV61ebmv37pwou7sXhQqGJqamE5fJum7nbeuxfSgT0mov 8=;
X-IronPort-Anti-Spam-Filtered: true
X-IronPort-Anti-Spam-Result: A0C0AgA0Rvda/4sNJK1VBhkBAQEBAQEBAQEBAQEHAQEBAQGDFC9heyiDcpRygXmBD5NGgWQLGAuBVIJ1AhqCbyE3FQECAQEBAQEBAmwcDIUoAQEBAwEBASEROgYKCwIBCBgCAiYCAgIlCxUQAgQTgyMCgXcID6pughyIQIIngQmHHIFUP4EPI4JogxEBAYFJAwYOgwAwgiQChxeRHwkCjk+BNoNlh1SQPAIREwGBJAEyIoFScBU7KgGCGAmFH4VohT5vjlWCRQEB
X-IronPort-AV: E=Sophos;i="5.49,393,1520899200"; d="scan'208";a="113054263"
Received: from alln-core-6.cisco.com ([173.36.13.139]) by alln-iport-7.cisco.com with ESMTP/TLS/DHE-RSA-AES256-GCM-SHA384; 12 May 2018 19:59:07 +0000
Received: from XCH-ALN-004.cisco.com (xch-aln-004.cisco.com [173.36.7.14]) by alln-core-6.cisco.com (8.14.5/8.14.5) with ESMTP id w4CJx7e2019455 (version=TLSv1/SSLv3 cipher=AES256-SHA bits=256 verify=FAIL) for <roll@ietf.org>; Sat, 12 May 2018 19:59:07 GMT
Received: from xch-rcd-001.cisco.com (173.37.102.11) by XCH-ALN-004.cisco.com (173.36.7.14) with Microsoft SMTP Server (TLS) id 15.0.1320.4; Sat, 12 May 2018 14:59:06 -0500
Received: from xch-rcd-001.cisco.com ([173.37.102.11]) by XCH-RCD-001.cisco.com ([173.37.102.11]) with mapi id 15.00.1320.000; Sat, 12 May 2018 14:59:06 -0500
From: "Pascal Thubert (pthubert)" <pthubert@cisco.com>
To: Routing Over Low power and Lossy networks <roll@ietf.org>
Thread-Topic: [Roll] draft-rahul-roll-rpl-observations-00 Section 2.1: Wear leveling
Thread-Index: AQHTwciVQsGzK7ypcECrCDATupfKUqQN4y+QgA8MpwCAAB9OgIAAdwaAgABf3gCAAAH3AIAAPNoAgAW+F5CAASfcgIAAA82AgAVc0ZCAAqsXgP//vUuT
Date: Sat, 12 May 2018 19:59:06 +0000
Message-ID: <5BEB0BFF-8BA1-4D6D-83DA-AC75F8AD4AEC@cisco.com>
References: <8EC2893F-731B-439D-86FE-984505349D8D@tzi.org> <982B626E107E334DBE601D979F31785C5DBCD1B4@BLREML503-MBS.china.huawei.com> <22477.1525301358@localhost> <CAO0Djp1sCSFJSZSbVNL+RLgV2FrjwZQrJ6p-9ExndMbQqua-QA@mail.gmail.com> <0522ea86b4234490bdd56b1428db0762@XCH-RCD-001.cisco.com> <29982.1525354228@localhost> <685a615a79e8442f9f7b19983ef2a36d@XCH-RCD-001.cisco.com> <25461.1525367718@localhost> <982B626E107E334DBE601D979F31785C5DBE374F@BLREML503-MBX.china.huawei.com> <23345.1525709153@localhost> <CAO0Djp3a2AXMC3ORDhTGbyhwr09nw+8poDeQL03V7t2Bksp-bQ@mail.gmail.com> <b3265ca6f74545468c00007dadaa0fb7@XCH-RCD-001.cisco.com>, <17986.1526151471@localhost>
In-Reply-To: <17986.1526151471@localhost>
Accept-Language: fr-FR, en-US
Content-Language: fr-FR
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
x-ms-exchange-transport-fromentityheader: Hosted
Content-Type: text/plain; charset="utf-8"
Content-Transfer-Encoding: base64
MIME-Version: 1.0
Archived-At: <https://mailarchive.ietf.org/arch/msg/roll/WeE56XiDI9pu8j7RNDqoNmSj6NA>
Subject: Re: [Roll] draft-rahul-roll-rpl-observations-00 Section 2.1: Wear leveling
X-BeenThere: roll@ietf.org
X-Mailman-Version: 2.1.22
Precedence: list
List-Id: Routing Over Low power and Lossy networks <roll.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/roll>, <mailto:roll-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/roll/>
List-Post: <mailto:roll@ietf.org>
List-Help: <mailto:roll-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/roll>, <mailto:roll-request@ietf.org?subject=subscribe>
X-List-Received-Date: Sat, 12 May 2018 19:59:11 -0000

Hello Michael 

The default of 16 points in the straight line is there to cover multiple losses in a row; 16 may be too large for a reliable Mac like TSCH in which case a node may start at another value, e.g., 250. Starting at 255 looks optimistic though. In case of a loss numbers will move to the circular part and will be uncomparable.

It is not really that the node is stable that counts but that all other nodes got a chance to see that this node as rebooted so they can reset their reference value and keep numbers comparable.

I’m happy to pursue this discussion to see more clearly than I do now; at this point I still do not see what change we could make - and why - to the sequence counter behavior...

Take care,

Pascal

> Le 12 mai 2018 à 20:58, Michael Richardson <mcr+ietf@sandelman.ca> a écrit :
> 
> 
> Pascal Thubert (pthubert) <pthubert@cisco.com> wrote:
>    mcr> So I think that the errata against 6550 is that it should be a lollipop
>    mcr> counter, and I think that this solves the reboot problem, and eliminates
>    mcr> having to write it to flash.
> 
>    rj> [RJ] this still does not solve the reboot issue... The text in the
>    rj> draft actually considers dtsn to be a lollipop counter and explains
>    rj> the problem when rebooting within the sequence window of lollipop
>    rj> counter. Please refer to section 7.2 of the draft for example. Errata
>    rj> is nonetheless needed for dtsn to be a lollipop counter.
> 
>    pt> Well, yes, but that’s true for any of the sequence counters, not just
>    pt> DTSN, and was not perceived as an “issue”. 
>    pt> The lollipop by design detects one reboot during the straight
>    pt> part. More will cause a delay. There are ways to avoid that like
>    pt> persistent memory or alleviate it like making the straight part
>    pt> shorted on reliable links.  I’m not sure there’s much to add to that
>    pt> or to fix, is there?
> 
> Pascal, I feel that Rahul has some point that we've missed.
> 
>>  behalf of node A).  As per section 7.2. of [RFC6550], when node B
>>  receives this packet it will consider the information to be old
>>  (since 240 < 250).
> 
>>  Based on this figure, there is dead zone (240 to 0) in which if A
>>  operates after reboot then the seqcnt will always be considered
>>  smaller.  Thus node A needs to maintain the seqcnt in persistent
>>  storage and reuse this on reboot.
> 
> My instinct is there are one of two bugs (maybe both documentation and code):
>   a) node A SHOULD rapidly move from 240 to 0, that the numbers from 240 to
>      0 should be considered "setup", and node A was wrong to advance only
>      to 250, and not as soon as it felt "stable", move to 0.
> 
>   b) that node B, should recognize that during the 240 to 0 stage, that
>      the counters should not be compared.  Anything < 0 is "setup", and is
>      always "new", but that 251 > 250 require immediate action.
> 
> I found it unusual that the lollipop should require 16 values in the stem, I
> think that one ('0') would have worked, but maybe there was some reason to
> prefer 16.
> 
> -- 
> ]               Never tell me the odds!                 | ipv6 mesh networks [ 
> ]   Michael Richardson, Sandelman Software Works        | network architect  [ 
> ]     mcr@sandelman.ca  http://www.sandelman.ca/        |   ruby on rails    [ 
>    
> 
>    
> _______________________________________________
> Roll mailing list
> Roll@ietf.org
> https://www.ietf.org/mailman/listinfo/roll