Re: [P2PSIP] Alvaro Retana's Discuss on draft-ietf-p2psip-diagnostics-19: (with DISCUSS and COMMENT)

"Songhaibin (A)" <haibin.song@huawei.com> Wed, 09 March 2016 03:04 UTC

Return-Path: <haibin.song@huawei.com>
X-Original-To: p2psip@ietfa.amsl.com
Delivered-To: p2psip@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id B545512D81C; Tue, 8 Mar 2016 19:04:10 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -4.221
X-Spam-Level:
X-Spam-Status: No, score=-4.221 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, HTML_MESSAGE=0.001, RCVD_IN_DNSWL_MED=-2.3, RCVD_IN_MSPIKE_H3=-0.01, RCVD_IN_MSPIKE_WL=-0.01, RP_MATCHES_RCVD=-0.001, SPF_PASS=-0.001] autolearn=ham autolearn_force=no
Received: from mail.ietf.org ([127.0.0.1]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id XAaDjxfLNtJ7; Tue, 8 Mar 2016 19:04:07 -0800 (PST)
Received: from lhrrgout.huawei.com (lhrrgout.huawei.com [194.213.3.17]) (using TLSv1 with cipher RC4-SHA (128/128 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 1194712DDAB; Tue, 8 Mar 2016 19:04:04 -0800 (PST)
Received: from 172.18.7.190 (EHLO lhreml703-cah.china.huawei.com) ([172.18.7.190]) by lhrrg02-dlp.huawei.com (MOS 4.3.7-GA FastPath queued) with ESMTP id CFO80831; Wed, 09 Mar 2016 03:04:02 +0000 (GMT)
Received: from NKGEML403-HUB.china.huawei.com (10.98.56.34) by lhreml703-cah.china.huawei.com (10.201.5.104) with Microsoft SMTP Server (TLS) id 14.3.235.1; Wed, 9 Mar 2016 03:04:01 +0000
Received: from NKGEML513-MBS.china.huawei.com ([169.254.2.35]) by nkgeml403-hub.china.huawei.com ([10.98.56.34]) with mapi id 14.03.0235.001; Wed, 9 Mar 2016 11:03:57 +0800
From: "Songhaibin (A)" <haibin.song@huawei.com>
To: Alissa Cooper <alissa@cooperw.in>, "Alvaro Retana (aretana)" <aretana@cisco.com>
Thread-Topic: Alvaro Retana's Discuss on draft-ietf-p2psip-diagnostics-19: (with DISCUSS and COMMENT)
Thread-Index: AQHROE3xg1hbhCRGu0GftwreM9LEL57uSeMggAJp5wCAWO/agIAHTBxw
Date: Wed, 09 Mar 2016 03:03:56 +0000
Message-ID: <E33E01DFD5BEA24B9F3F18671078951F65DC7CEB@nkgeml513-mbs.china.huawei.com>
References: <20151216220546.7663.70255.idtracker@ietfa.amsl.com> <E33E01DFD5BEA24B9F3F18671078951F65DAC97F@nkgeml513-mbx.china.huawei.com> <D2B42F28.FFE86%aretana@cisco.com> <AFED0F52-7A79-4F70-819D-D6F385ED6B3A@cooperw.in>
In-Reply-To: <AFED0F52-7A79-4F70-819D-D6F385ED6B3A@cooperw.in>
Accept-Language: en-US, zh-CN
Content-Language: zh-CN
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
x-originating-ip: [10.136.79.145]
Content-Type: multipart/alternative; boundary="_000_E33E01DFD5BEA24B9F3F18671078951F65DC7CEBnkgeml513mbschi_"
MIME-Version: 1.0
X-CFilter-Loop: Reflected
X-Mirapoint-Virus-RAPID-Raw: score=unknown(0), refid=str=0001.0A020205.56DF92A3.0063, ss=1, re=0.000, recu=0.000, reip=0.000, cl=1, cld=1, fgs=0, ip=169.254.2.35, so=2013-06-18 04:22:30, dmn=2013-03-21 17:37:32
X-Mirapoint-Loop-Id: b57b51ea44527d2a47ed345b83233b60
Archived-At: <http://mailarchive.ietf.org/arch/msg/p2psip/8YL7pibxfDJaqIA_zDECSKMu69g>
Cc: "p2psip-chairs@ietf.org" <p2psip-chairs@ietf.org>, "p2psip@ietf.org" <p2psip@ietf.org>, IESG <iesg@ietf.org>, "draft-ietf-p2psip-diagnostics@ietf.org" <draft-ietf-p2psip-diagnostics@ietf.org>
Subject: Re: [P2PSIP] Alvaro Retana's Discuss on draft-ietf-p2psip-diagnostics-19: (with DISCUSS and COMMENT)
X-BeenThere: p2psip@ietf.org
X-Mailman-Version: 2.1.17
Precedence: list
List-Id: Peer-to-Peer SIP working group discussion list <p2psip.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/p2psip>, <mailto:p2psip-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/p2psip/>
List-Post: <mailto:p2psip@ietf.org>
List-Help: <mailto:p2psip-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/p2psip>, <mailto:p2psip-request@ietf.org?subject=subscribe>
X-List-Received-Date: Wed, 09 Mar 2016 03:04:10 -0000

And would Jari clear his DISCUSS based on the version 21?

Best Regards!
-Haibin

From: Alissa Cooper [mailto:alissa@cooperw.in]
Sent: Saturday, March 05, 2016 3:37 AM
To: Alvaro Retana (aretana)
Cc: Songhaibin (A); IESG; Roni Even; p2psip-chairs@ietf.org; draft-ietf-p2psip-diagnostics@ietf.org; p2psip@ietf.org; cjbc@it.uc3m.es
Subject: Re: Alvaro Retana's Discuss on draft-ietf-p2psip-diagnostics-19: (with DISCUSS and COMMENT)

Hi Alvaro,

Are you able to clear based on the latest version? https://tools.ietf.org/html/draft-ietf-p2psip-diagnostics-21

Thanks,
Alissa


On Jan 7, 2016, at 12:27 PM, Alvaro Retana (aretana) <aretana@cisco.com<mailto:aretana@cisco.com>> wrote:

On 1/6/16, 4:44 AM, "Songhaibin (A)" <haibin.song@huawei.com<mailto:haibin.song@huawei.com>> wrote:

Haibin:

Hi!

...




----------------------------------------------------------------------
DISCUSS:
----------------------------------------------------------------------

I am balloting a DISCUSS because I am concerned that this document
doesn't
actually do what it set out to achieve.  I would love it if during the
discussion I
was pointed to the places where RELOAD already solves the issues, but
for now
I wasn't able to find them.

According to the text, both extensions are intended to collect
information
"along the path", and the Figures clearly depict what is intended to
happen.
However, I don't think that as specified (or at least as explained) the
behavior is
guaranteed.  Specific points:

1. RFC6940 says (in Section 6.2. (Symmetric Recursive Routing)) that an
"overlay MAY be configured to use alternative routing
algorithmsŠ[or]ŠMAY be
selected on a per-message basis". How is the symmetry enforced if other
routing algorithms are used?  Enforcing that the ping/trace messages use
symmetric routing when other algorithms are in use won't necessarily
help
because the paths may be different.

I think one important thing is that, the draft does not guarantee what it
conveys must be the information that caused the previous failures, but
with the retrieved information from the previous traversed nodes (with
high probability, as it cannot guarantee the exact same path), a user or
machine can analyze and infer what is the problem. Symmetric routing is
achieved by the via list (whatever DHT routing algorithms are used), but
response message could go through not exactly the same path as there
still can be failures during that short period.



2. RFC6940 also (in 6.2.2. (Response Origination)) reads: "the response
traverses the same peers as the request traversed, except in reverse
order
(symmetric routing) and possibly with extra nodes (loose routing)."
In other words, even if symmetric routing is used, there is no
guarantee that
the same path will be followed by the response, unless the originator
builds the
Via List with strict details of all the nodes in the path -- maybe this
is what is
intended, but no explicit mention occurs in the document.

I agree this should be mentioned in the document.

Clarifying the caveats (like what we're discussing above, for example) is
one of the ways to move forward with resolving my DISCUSS.

However, I have to say that the clarifications, which basically point at
no guarantees about the path (or its relevance), leave me very
dissatisfied with a technical solution that is unreliable. :-(





3. In 4.3, what does "directly or via symmetric routing" mean? Is it
directly
connected? If so, then (for the text in 4.3) that would mean that C is
adjacent
to A, and even though it is the next hop after B, the path taken to
reach C with
the PathTrack request doesn't include B < the result is that the
diagnostic
information received from C may not be relevant relative to destination
D.

As each PathTrack request will contain the destination ID, which was the
same destination ID as the previously failed message. So no matter how
the PathTrack request arrives to C, C will have a look at that
destination ID for the next step.

My point above was the the path *to* C (from A) may not actually even go
through B.  In other words, even if the rest of the path (all the way to
D) is congruent, the overall information is still not completely relevant.






Are there implementations available?  What has been the experience? The
Shepherd's write-up didn't mention any, and the TBD codes make me think
that
maybe Experimental might be a better status for this document.

In my memory, there was one several years ago, but not based on RELOAD.

Given the clear inconsistency and the lack of experience, I want to
suggest that Experimental status be considered.







----------------------------------------------------------------------
COMMENT:
----------------------------------------------------------------------

In general, as others have mentioned, I think the text could be a lot
clearer and
not leave some concepts to interpretation.

1. In 4.2.1, the MessageExtension is defined with a Boolean of
"critical", but
the text (a paragraph later) says that this "extension is not
critical". I may be
missing some of the semantics, or there's an error somewhere.

My understanding with RELOAD, if value of "critical" can be true or
false, if it is critical, then every node must support the extension, if
it is not critical, then it is optional to support it. As it explains in
that section, "If a peer does not support the extension, they will simply
ignore the diagnostic portion of the message..."

I looked at the text again -- it is still confusing to me, but I noticed
that Section 6.1. (Message Creation and Transmission) does say that "the
sender MUST...[set] the value of critical to FALSE".  Maybe using
"Critical" (capital C) to describe the state (vs the use of the word as an
adjective) would help.





2. In 4.3..the last paragraph reads: if "...succesive calls to
PathTrack return
different paths, the results should be discarded and the request resent,
ensuring that the second request traverses the appropriate path".
Path changes are a fact of life < the second request may just be
reflecting the
new path, so resending it in an attempt to find the "appropriate path"
may be
futile.

At least in the previous two attempts, the path was "stable".


3. What is a "routing mode"?  Section 4.3.1. (New RELOAD Request:
PathTrack) talks about it when saying that the "PathTrack request can be
routed directly or through the overlay based on the routing mode".
Later
Section 6.2. (Message Processing: Intermediate Peers) says that
"processing
this request according to the overlay specified routing mode from RELOAD
protocol" -- I looked in RFC6940 for "routing mode", but didn't find
anything.
Note that it also looks to me like there's no place in the
DiagnosticsRequest to
indicate a mode..

This draft should add a reference to RFC 7263 for the routing mode.

Ok.




4. Section 5.3. (dMFlags and Diagnostic Kind ID Types) defines an
"UNDERLAY_HOP (8 bits): Indicates the IP layer hops from the
intermediate
peer which receives the diagnostics message to the next hop peer for
this
message."  How is that information derived?

It can use the traceroute tool.


5. I think that the Reference to RFC5226 should be Normative.

Accepted.



I also agree with others that the title should probably be something
around
RELOAD Diagnostics, and that this work doesn't really update RFC6940, it
extends it in independent ways.

It seems we can achieve a consensus here that it extends RELOAD. What
about the title "RELOAD Extension for P2P Overlay Diagnostics"?

I am not opposed to that.

Thanks!

Alvaro.