Re: [Roll] Way forward for draft-clausen-lln-rpl-experiences

Thomas Heide Clausen <IETF@ThomasClausen.org> Mon, 04 June 2012 00:27 UTC

Return-Path: <IETF@ThomasClausen.org>
X-Original-To: roll@ietfa.amsl.com
Delivered-To: roll@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 3355621F87B4 for <roll@ietfa.amsl.com>; Sun, 3 Jun 2012 17:27:36 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.188
X-Spam-Level:
X-Spam-Status: No, score=-1.188 tagged_above=-999 required=5 tests=[AWL=1.077, BAYES_00=-2.599, IP_NOT_FRIENDLY=0.334]
Received: from mail.ietf.org ([12.22.58.30]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 2TfkqLt77mAI for <roll@ietfa.amsl.com>; Sun, 3 Jun 2012 17:27:35 -0700 (PDT)
Received: from morbo.mail.tigertech.net (morbo.mail.tigertech.net [67.131.251.54]) by ietfa.amsl.com (Postfix) with ESMTP id 50DEF21F87B2 for <roll@ietf.org>; Sun, 3 Jun 2012 17:27:35 -0700 (PDT)
Received: from mailc2.tigertech.net (mailc2.tigertech.net [208.80.4.156]) by morbo.tigertech.net (Postfix) with ESMTP id 30696557F16 for <roll@ietf.org>; Sun, 3 Jun 2012 17:27:35 -0700 (PDT)
Received: from localhost (localhost [127.0.0.1]) by mailc2.tigertech.net (Postfix) with ESMTP id EE83A1C01E0A; Sun, 3 Jun 2012 17:27:34 -0700 (PDT)
X-Virus-Scanned: Debian amavisd-new at c2.tigertech.net
Received: from [192.168.1.113] (Cs-136-215.CS.UCLA.EDU [131.179.136.215]) (using TLSv1 with cipher AES128-SHA (128/128 bits)) (No client certificate requested) by mailc2.tigertech.net (Postfix) with ESMTPSA id B19521C01E09; Sun, 3 Jun 2012 17:27:34 -0700 (PDT)
Mime-Version: 1.0 (Apple Message framework v1257)
Content-Type: text/plain; charset="windows-1252"
From: Thomas Heide Clausen <IETF@ThomasClausen.org>
In-Reply-To: <78FB3B50-3150-4729-A089-D9EAF0B02BB6@cs.stanford.edu>
Date: Sun, 03 Jun 2012 17:27:34 -0700
Content-Transfer-Encoding: quoted-printable
Message-Id: <59F4B025-990D-4738-8424-D8078EF9FB7C@ThomasClausen.org>
References: <258D7E2F-F0C7-49EA-B831-81070C86EDB3@thomasclausen.org> <2257A578-B2DF-4145-8393-9BB5D7E1CFBD@cisco.com> <2225986E-992E-43C7-B0CA-9CDA91CE1F3A@thomasclausen.org> <B1B81482-0F7E-4BCE-BBA7-B21949E3C16C@cisco.com> <0958556A-7D9A-4E8B-8091-1D6EC0B813B4@thomasclausen.org> <ACBA7834-F4A1-4D9C-80D6-E76C793A6770@cisco.com> <91E71E23-8797-4C70-A1F8-1CE64BD4ED39@thomasclausen.org> <1D6FEB49-CB62-4FFA-9E34-3FEF82DB644C@cisco.com> <BE51553F-67BE-4652-A8E8-9654BF953A96@thomasclausen.org> <78FB3B50-3150-4729-A089-D9EAF0B02BB6@cs.stanford.edu>
To: Philip Levis <pal@cs.stanford.edu>
X-Mailer: Apple Mail (2.1257)
Cc: roll WG <roll@ietf.org>, Michael Richardson <mcr@sandelman.ca>
Subject: Re: [Roll] Way forward for draft-clausen-lln-rpl-experiences
X-BeenThere: roll@ietf.org
X-Mailman-Version: 2.1.12
Precedence: list
List-Id: Routing Over Low power and Lossy networks <roll.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/roll>, <mailto:roll-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/roll>
List-Post: <mailto:roll@ietf.org>
List-Help: <mailto:roll-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/roll>, <mailto:roll-request@ietf.org?subject=subscribe>
X-List-Received-Date: Mon, 04 Jun 2012 00:27:36 -0000

Dear Phil,

Thank you for your review. Comments below (with apologies for the incurred delay).

On May 16, 2012, at 22:02 , Philip Levis wrote:

> 
> On May 16, 2012, at 10:11 AM, Thomas Heide Clausen wrote:
> 
>> 
>> 
>> For example: 
>> 
>> 	o	the state required for storing/non-storing mode does *not* depend on a 
>> 		specific implementation and deployment, but is an artifact from the RPL design.
>> 
>> 	o	the message-size of RPL is not dependent on a specific implementation and deployment,
>> 		but is an artifact from the RPL design.
>> 
>> 	o	the use of links "upwards" based on receipt of traffic "downwards" (i.e. the unidirectional
>> 		link issue) is not dependent on a specific implementation and deployment,
>> 		but is an artifact from the RPL design.
>> 
>> 	o	the unknown-by-source MTU issues for data-traffic when using non-storing mode 
>> 		is not dependent on a specific implementation and deployment,
>> 		but is an artifact from the RPL design.
>> 
>> I could go on, but then it'd be easier to just paste the whole I-D into this email.
> 
> 
> Thomas,
> 
> I agree that some points in the ID are valid statements about the protocol itself, such as message sizes and the issues caused by floating DODAGs. Those, to me, seem like reasonable points to make.

Thank you, we appreciate this.
> 
> However, some of the points are hypotheses about performance (14), a bit naive about how wireless networks behave (13) or somewhat snippy gripes (11, 12). In my opinion, a document which focused on factual statements of the protocol design not really open to interpretation and cut hypothetical or subjective statements might be ready for the working group to consider.
> 
> As just a start, I object to 10, 11, 12, 13, and 14 being considered inherent to the protocol.
> 

Very good, this gives us precise points to discuss technically, and we want to thank you for having been so precise and constructive in your email on this subject.

> 10 assumes that a node only uses DIO reception to allow a parent; the specification is pretty clear that you should check the parent is usable (section 1.1). You're taking a bad implementation decision and assuming there isn't another way to do things. 
> 

That is definitely a fair point to bring up.

We agree that various external mechanisms can contribute to make a smart, or even very smart, parent selection - but, as section 1.1 points out, as bi-directional links are a minimum requirement, the fact that RPL doesn't specify a minimum, basic mechanism for ensuring that is highly problematic. NUD is the only mechanism explicitly called out in RPL (section 8.2, 9.8, for example), which is why the observations in section 10 concern RPL when using NUD (and not some other "good (but unspecified) design decision").

We do, however, understand (but correct us if we are wrong) that you agree that the mechanisms for this problem, as specified in the RPL RFC, are insufficient for building and operating networks. That is exactly the point that we are trying to make also.

Where we may differ in opinion (do we?)  is, that we think that the IETF should specify such mechanisms: in order to make interoperable implementations, some sort of agreement on what a "good implementation decision" is required, in particular if such requires messages being exchanged.

 

> For 11, there are implementations of RPL smaller than 50kB; they do not implement every feature, but that was kind of the point of the protocol, that it could be implemented on a sliding scale of implementation complexity. The TinyOS implementation, for example, is, I believe, ~20kB, less than half the size. You don't report what architecture the 50kB is for, clearly it would be more for a 32-bit than a 16-bit architecture. 

Phil, would you clarify this for us? 

As the specification reads, there is only the MOP flag that can specify that which the DODAG root expects from the routers in the network; if a router receives a MOP flag that it doesn't support, it won't be able to join as anything as a leaf.

There are no negotiations, or discovery of capabilities, we think, nor is it clear what exactly the "sliding scale"  you refer to is. (sure, we can see the MOP-scale: DIO only, DAO-nonstoring, DAO-storing - but beyond that, what do we miss?)

Short of heavy-handed administrative configuration, the only realistic way of ensuring interoperability of devices from different vendors is to support the full spec.

But, don't just take our words for complexity issues:

	http://comments.gmane.org/gmane.os.contiki.devel/12102

The implementation that we cite (URL is in the draft) is the ContikiRPL implementation, which only supports storing mode, and which consumes these ~50 KB (in the above link, they talk about 44KB being their code size). This on a 16bit MSP 430 tmote sky.

> For 12, "implementations may exhibit a bad performance if not carefully implemented."  I think it is safe to say this is true for almost ANY protocol.

We think we get your point, and this phrase is indeed a little like "You can write Fortran77 in any language" ;)

> A specification is not intended to be a complete statement of efficient implementation, otherwise you give little latitude to future improvements and good engineering.

This is very true, and we agree that a specification should (necessarily) be indicating the most efficient way of doing something. 

Alas, for example, for DAO messages, as the diffusion mechanism is not well specified, this may lead to  not just inefficiency, but actual traffic loss by way of connectivity not being accurately reflected in the information sets on RPL, in very mechanical way. 

It would, of course, be perfectly fine to specify an inefficient - but working - method, and allow future improvements and good engineering to better that. Unfortunately, in this case, no working diffusion method is given (i.e., it may lead to the above issues).

In addition, it is not just about efficient implementation from one vendor that could mitigate these problems, but rather about allowing for interoperable implementations in the same routing domain.

> For 13, this assumes that a wireless network has a stable topology which the protocol can converge to. Wireless networks are often NOT stable: one cannot expect a protocol to converge on a dynamic graph.

That it absolutely correct. This observation absolutely applies to the testbed, in which these trickle experiments were conducted.

The point we want to make here is, that contrary to what we observe in simulations and modeling, when deploying trickle in wireless networks, one should expect more control traffic than seen in theory and simulations. We may be able to do an editorial pass to make this section clearer -- especially as it seems that we do agree on the fundamental property that's being discussed?

> 14 is similarly confused about what a wireless network looks like. How can the state of a distributed system based on a dynamic topology be "consistent?" I think this is a fundamental misunderstanding of how the network works.

We are a little unclear on what you mean here, or what you suggest that we disagree on. Is it the use of "inconsistent" in the penultimate paragraph of  page 24? 

The purpose is to indicate some of the situations where loops can occur in RPL, and what the consequences hereof, and of the proposed detection mechanisms, are - especially the requirement for IPv6-in-IPv6 tunneling at each hop (which, btw., is independent of if wireless or not).

Thank you again, and we hope that you can clarify the issues that we indicated above.

Best,

Thomas, Ulrich, Jiazi, Axel

> That being said, I think point 6 is well taken and should be considered, maybe others. Maybe the constructive way to take this document, if you don't want to take the step of specifying solutions, is at least casting it as a roadmap for ways in which you think the WG should improve RPL?
> 
> Phil
>