Re: [Roll] Semantics of DAO ACK

"Pascal Thubert (pthubert)" <pthubert@cisco.com> Fri, 02 October 2015 13:05 UTC

From: "Pascal Thubert (pthubert)" <pthubert@cisco.com>
To: Routing Over Low power and Lossy networks <roll@ietf.org>
Date: Fri, 02 Oct 2015 13:04:54 +0000
Archived-At: <http://mailarchive.ietf.org/arch/msg/roll/qdWRTgDZ65YjYD-aNxApvqK4DsE>
Subject: Re: [Roll] Semantics of DAO ACK

Hello Joakim:

What I read here boils down to a limited diffusion algorithm. We tried to avoid that in RPL because any movement during convergence would cause a deadlock. In other words, if every node waits for (all of) its parents to ACK, the wait recursively extends all the way to the root, and if any node in the chain moves or dies, the diffusion never completes.
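
To illustrate the concern, here is a minimal sketch that compares the two ACK semantics on a simple chain. It assumes a single parent per node and models a moved or dead node as simply missing from the `alive` set. The names `parent_of`, `alive`, `end_to_end_ack` and `one_hop_ack` are hypothetical and come neither from RFC 6550 nor from Contiki.

```python
from typing import Dict, Optional, Set


def end_to_end_ack(parent_of: Dict[str, Optional[str]],
                   alive: Set[str], origin: str) -> bool:
    """Return True if `origin` can eventually receive an ACK when every node
    ACKs its child only after its own parent has ACKed."""
    node = origin
    while True:
        parent = parent_of[node]
        if parent is None:
            return True   # reached the root: ACKs can now flow back down
        if parent not in alive:
            return False  # DAO stops here: every node below keeps waiting
        node = parent


def one_hop_ack(parent_of: Dict[str, Optional[str]],
                alive: Set[str], origin: str) -> bool:
    """Return True if `origin` gets an ACK when the parent ACKs as soon as
    it has installed the route locally."""
    parent = parent_of[origin]
    return parent is None or parent in alive


# Assumed topology: c -> b -> a -> root
chain = {"root": None, "a": "root", "b": "a", "c": "b"}
everyone = {"root", "a", "b", "c"}
without_a = everyone - {"a"}   # node "a" moved away or died

print(end_to_end_ack(chain, everyone, "c"))   # all nodes present
print(end_to_end_ack(chain, without_a, "c"))  # one node missing in the chain
print(one_hop_ack(chain, without_a, "c"))     # only the direct parent matters
```

In the sketch, a single missing node anywhere between the originator and the root is enough to leave the whole subtree below it without an ACK, whereas the one-hop variant depends only on the direct parent.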

Cheers,

Pascal

> -----Original Message-----
> From: Roll [mailto:roll-bounces@ietf.org] On Behalf Of Joakim Eriksson
> Sent: Thursday, 1 October 2015 22:11
> To: Routing Over Low power and Lossy networks <roll@ietf.org>
> Subject: Re: [Roll] Semantics of DAO ACK
> 
> 
> > On 01 Oct 2015, at 21:03, Csaba Kiraly <kiraly@fbk.eu> wrote:
> >
> >
> >
> > On 30/09/15 17:55, Joakim Eriksson wrote:
> >>> On 30 Sep 2015, at 06:44, Csaba Kiraly <kiraly@fbk.eu> wrote:
> >>>
> >>> Hello Joakim,
> >>>
> >>> I have also worked on the Contiki DAO-ACK code, enabling ACKs,
> implementing fixes to the DAOSequence handling, and looking into multiple
> targets.
> >> Nice, I have a PR on Contiki now with what we have been doing to get
> Contiki RPL more scalable (but still only single targets / paths).
> >>
> >>> What Cenk is saying sounds like a reasonable hack, but the standard itself
> is, in my opinion, underspecified regarding the semantics of DAO-ACK
> messages in several ways.
> >> Yes, it is underspecified. There is a need for clarification and more detail
> on the DAO / DAO-ACK!
> >>
> >>> My preferred solution for ACKing DAO messages with multiple targets
> would be to have support for the same semantics that you would have had
> with one DAO message per Target, i.e., I would prefer an option field that
> gives an individual status for each Target.
> >> Yes, but that would require more memory in the sending node to keep
> track of things or full specification of the target in the response.
> >> I guess it might be solved by allowing multiple DAO ACKs for the same
> DAO and to have the Target options included in the DAO ACK.
> > This is a compromise to consider for sure. If you look at my
> implementation, you can see that I use the routing table entry to match the
> DAO ACK, and I keep only DAOSequence (SEQ) as extra state. I'm not sure it
> would work in all cases, but it did work for the case I considered. I'm
> keeping only the SEQ, and I think this is a must if you have no aggregation,
> and you want to match in case you forward multiple DAOs before getting
> the ACK or timing out on it. In fact, simply enabling ACKs in the original
> code was messing up the match between DAOs and their ACKs because it
> wasn't even keeping track of the DAO sequence numbers.
> >
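
The matching scheme described above (keeping only the DAOSequence as extra state in the routing table entry) could be sketched roughly as follows. This is not the Contiki code: `RouteEntry`, `DaoForwarder` and their fields are hypothetical names, and the counter simply wraps modulo 256 as a simplification of the RPL sequence counter rules.

```python
from dataclasses import dataclass
from typing import Dict, Optional


@dataclass
class RouteEntry:
    target: str                        # RPL Target prefix/address
    child: str                         # neighbor the DAO was received from
    pending_seq: Optional[int] = None  # DAOSequence of the DAO sent upward


class DaoForwarder:
    """Keeps one pending DAOSequence per route entry (assumed design)."""

    def __init__(self) -> None:
        self.routes: Dict[str, RouteEntry] = {}
        self.next_seq = 0

    def forward_dao(self, target: str, child: str) -> int:
        """Install the route and remember the sequence of the forwarded DAO."""
        seq = self.next_seq
        self.next_seq = (self.next_seq + 1) % 256  # 8-bit field, simplified wrap
        self.routes[target] = RouteEntry(target, child, seq)
        return seq

    def on_dao_ack(self, seq: int) -> Optional[RouteEntry]:
        """Find the route entry that a received DAO-ACK refers to."""
        for entry in self.routes.values():
            if entry.pending_seq == seq:
                entry.pending_seq = None  # the ACK is consumed
                return entry
        return None                       # unknown or stale sequence number


fwd = DaoForwarder()
seq_a = fwd.forward_dao("2001:db8::1", "child-a")
seq_b = fwd.forward_dao("2001:db8::2", "child-b")
matched = fwd.on_dao_ack(seq_b)   # ACKs may arrive out of order
print(matched.target, matched.child)
```

Without the stored sequence number, two DAOs forwarded before either ACK arrives could not be told apart, which is the mismatch described in the paragraph above.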
> > If you do aggregate ACKs, or you have more complex Target+Transit
> scenarios, the game changes, and I don't know what the balance would be
> between state you have to keep anyway and state you keep just to be able
> to interpret a later ACK correctly. I think it is implementation specific, so my
> suggestion would be a flag to indicate whether you request a full ACK
> (sending back all T+T info for rejected entries in the DAO) or a much
> smaller partial ACK with SEQ and similar IDs only. This flag (let's call it F for
> now) could go right next to K. As the DAO propagates, each node could set
> F based on whether it is able to store and/or reproduce the info (F not set) or
> chooses to remain stateless (F set). This would exclude aggregation in the
> DAO-ACK sent down, but still allow for aggregation in the DAO sent up.
> >
> >>
> >>> To give another example of subtle problems with DAO-ACK, it was not
> clear to me whether I want my implementation to ACK when the Target is
> added to the routing table, or only when this node itself receives an ACK for
> the same Target from its parent. Both make sense, with the former giving a
> quick one-hop ACK, while the latter works as an end-to-end ACK, ensuring
> that the path is actually built. Both look conformant to the standard, but I
> suppose the original author was thinking of the former, and I can easily see
> interoperability problems arising between two implementations using
> different semantics.
> >>>
> >> We did go for the end-to-end ACK to achieve better scalability. To me
> >> there is no point in having a route to the parent if it cannot be extended
> all the way to the root. But I totally agree: this is not obvious from the RPL
> RFC either.
> > Do you mean end-to-end ACK as in non-storing, or end-to-end ACK in the
> sense that it is still addressed to the next hop but you delay ACK till you
> know your parent (and all its parents) ACKed?
> >
> 
> I mean end-to-end as in waiting for all the parents to ACK, so that before
> ACKing it is known that the route is properly installed where it needs to be.
> 
> >>
> >> Do you have your Contiki code somewhere in the open-source?
> > I did some cleanup and pushed the clean part of the code to github:
> > https://github.com/cskiraly/contiki/tree/DAO-NACK
> > It is not yet rebased to the latest master, but it should apply. I suppose
> describing it would be too off-topic for the list.
> > There is also the more experimental part of the code in a PR, if interested.
> 
> Ok - I’ll take a look!
> 
> BTW: We have done a few deployments of our implementation using
> Yanzi Networks products. Take a look at this video, where 1000 nodes are
> deployed with Contiki RPL plus our fixes, and 20 neighbors and routes per
> node. Without our fixes it would not work, since Contiki RPL did not
> scale well beyond the number of neighbors and routes per node before
> these fixes.
> 
> http://www.yanzi.se/video.jsp?id=7
> 
> Best regards,
> — Joakim Eriksson
> 
> >
> > Best regards,
> > Csaba
> >>
> >> Best regards,
> >> — Joakim
> >>
> >>> Best regards,
> >>> Csaba
> >>>
> >>> On 29/09/15 23:08, Cenk Gündogan wrote:
> >>>> Hello Joakim,
> >>>>
> >>>> This is an interesting question and I also couldn't find any answers in
> RFC 6550.
> >>>> However, my thoughts on this are as follows:
> >>>> Since a subset of the announced RPL Targets could have been
> >>>> accepted before the routing table (for example) filled up, I would choose a
> status code between 1 and 127.
> >>>> I would expect a node to choose another parent if a more aggressive
> status code ([128-255]) is received.
> >>>> But a full routing table can have free space again by the time the next or any
> subsequent DAO arrives,
> >>>> so I prefer a "mild rejection" with a status code in [1-127].
> >>>>
> >>>> To give some feedback to the originator of the DAO, it might be
> >>>> sensible to copy the rejected RPL Target options from the affected
> >>>> DAO to the DAO-ACK, so that the originator is fully aware of which
> Target prefixes got rejected (and which ones got accepted, implicitly).
> >>>> I would choose this method, because it doesn't require the
> >>>> originator of the DAO to save any extra state about the DAO and its
> contents.
> >>>>
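
The proposal above (a mild status code plus a copy of the rejected Target options in the DAO-ACK) could be sketched as follows. Carrying Target options in a DAO-ACK is the suggestion made in this message, not something RFC 6550 defines. `DaoAck`, `process_dao`, `capacity` and the concrete status value 1 are hypothetical choices for illustration.

```python
from dataclasses import dataclass, field
from typing import List, Set

STATUS_ACCEPTED = 0      # unqualified acceptance
STATUS_MILD_REJECT = 1   # assumed value picked from the [1, 127] range


@dataclass
class DaoAck:
    sequence: int                 # echoes the DAOSequence of the DAO
    status: int
    rejected_targets: List[str] = field(default_factory=list)


def process_dao(routing_table: Set[str], capacity: int,
                sequence: int, targets: List[str]) -> DaoAck:
    """Accept as many Targets as fit and report the rest back in the ACK."""
    rejected: List[str] = []
    for target in targets:
        if target in routing_table or len(routing_table) < capacity:
            routing_table.add(target)   # new route, or refresh of a known one
        else:
            rejected.append(target)     # table full: copy Target into the ACK
    status = STATUS_MILD_REJECT if rejected else STATUS_ACCEPTED
    return DaoAck(sequence, status, rejected)


table = {"2001:db8::a"}
ack = process_dao(table, capacity=2, sequence=7,
                  targets=["2001:db8::b", "2001:db8::c"])
print(ack.status, ack.rejected_targets)
```

Because the rejected Targets travel back in the ACK, the originator can tell which registrations failed without having stored the contents of the DAO it sent.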
> >>>> Nonetheless, everything I wrote is non-conformant and I am also
> >>>> interested in the RPL experts' opinions and solutions.
> >>>>
> >>>> Best,
> >>>> Cenk
> >>>>
> >>>> On 29.09.2015 21:44, Joakim Eriksson wrote:
> >>>>> Hello All,
> >>>>>
> >>>>> I have spent quite some time getting a more stable implementation
> >>>>> of DAO handling for Contiki RPL and I am currently looking into
> >>>>> DAO aggregation. But I realised that it is not 100% clear to me
> >>>>> what a node should do when it receives a DAO with several prefixes to be
> >>>>> registered but can only accept a subset of them. Should it send a
> DAO_NACK in this case, or is there another way to handle it?
> >>>>>
> >>>>> If each had been sent separately, the receiving node could
> >>>>> obviously NACK when the routing table is full, so
> >>>>> fine-grained answers would be possible. But with
> aggregation of DAOs this is not the case.
> >>>>>
> >>>>> Any ideas?
> >>>>>
> >>>>> Best regards,
> >>>>> — Joakim Eriksson, SICS
> >>>>> _______________________________________________
> >>>>> Roll mailing list
> >>>>> Roll@ietf.org
> >>>>> https://www.ietf.org/mailman/listinfo/roll