Re: [tcpm] Heads up on the upcoming WGLC for COCOA

"Scharf, Michael (Nokia - DE/Stuttgart)" <michael.scharf@nokia.com> Sat, 22 April 2017 17:45 UTC

Return-Path: <michael.scharf@nokia.com>
X-Original-To: tcpm@ietfa.amsl.com
Delivered-To: tcpm@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 5F45A1294B9; Sat, 22 Apr 2017 10:45:24 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.922
X-Spam-Level:
X-Spam-Status: No, score=-1.922 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, RCVD_IN_DNSWL_NONE=-0.0001, RCVD_IN_MSPIKE_H4=-0.01, RCVD_IN_MSPIKE_WL=-0.01, SPF_HELO_PASS=-0.001, SPF_PASS=-0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (1024-bit key) header.d=nokia.onmicrosoft.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 5TcRpLWJFges; Sat, 22 Apr 2017 10:45:21 -0700 (PDT)
Received: from EUR03-AM5-obe.outbound.protection.outlook.com (mail-eopbgr30112.outbound.protection.outlook.com [40.107.3.112]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-SHA384 (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id EB6C8128B8F; Sat, 22 Apr 2017 10:45:20 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=nokia.onmicrosoft.com; s=selector1-nokia-com; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version; bh=f0tVLJ/PxTCazr7eKnliBWkLbD4p+Z3iyU+1cnX6iiI=; b=pL8mvLxTHWGT307cF5h4EGGj2Qbhpw3fiCTs8ETs1KSF58eK73zLG9/IKu5/TnwAllxXwCSV7C1jdSgMxrXKTjSfKdxvKTW6ySzQMfexDqR9DgRA4XFhnL9Jt7fDn2t6E7s4SBrVYf+tFCfaHAxeIM0sv310eJetd0J8GpvA6E8=
Received: from AM5PR0701MB2547.eurprd07.prod.outlook.com (10.173.92.15) by AM5PR0701MB2547.eurprd07.prod.outlook.com (10.173.92.15) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_128_CBC_SHA256_P256) id 15.1.1047.6; Sat, 22 Apr 2017 17:45:18 +0000
Received: from AM5PR0701MB2547.eurprd07.prod.outlook.com ([10.173.92.15]) by AM5PR0701MB2547.eurprd07.prod.outlook.com ([10.173.92.15]) with mapi id 15.01.1047.018; Sat, 22 Apr 2017 17:45:18 +0000
From: "Scharf, Michael (Nokia - DE/Stuttgart)" <michael.scharf@nokia.com>
To: "core@ietf.org WG" <core@ietf.org>
CC: "tcpm@ietf.org" <tcpm@ietf.org>, "iccrg@ietf.org" <iccrg@ietf.org>, "michawe@ifi.uio.no" <michawe@ifi.uio.no>
Thread-Topic: Heads up on the upcoming WGLC for COCOA
Thread-Index: AQHSs4k0mnjKYz30k0+Zp7QNidIepKHRhciX
Date: Sat, 22 Apr 2017 17:45:18 +0000
Message-ID: <AM5PR0701MB2547E0C48821A5038375E01B931D0@AM5PR0701MB2547.eurprd07.prod.outlook.com>
References: <23333C37-60FB-41FD-B43B-EDC75F7BBFDB@ericsson.com>
In-Reply-To: <23333C37-60FB-41FD-B43B-EDC75F7BBFDB@ericsson.com>
Accept-Language: en-US, de-DE
Content-Language: en-US
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
authentication-results: ietf.org; dkim=none (message not signed) header.d=none;ietf.org; dmarc=none action=none header.from=nokia.com;
x-originating-ip: [25.163.180.4]
x-microsoft-exchange-diagnostics: 1; AM5PR0701MB2547; 7:TmsjZpbQYd5hTSFMOpyo/8h5hykDiw7eUR6AW4+LdE1mOTSQ1HCPqjA96HRoVwfgGtwD+jJupP1d7OPFW7hvv7TFQx96t9ACtvvL2Ge4WC4LHKOGoiQ16TEMsa/jnrgxn0zLhrMbZyB4wcTl2k4O7LOvKcZ9F8sK1hMlzkXrCEXoUzKB/dQTUjxW4XTHHVEjAuhTuSWyLR1PAm/NhBbrkCIm4ShvyplFkVIz0XZeFmnZ67pw0NXBKTZ/eal6KayiFFtLVQxFBM1S6v3a6I8Wg6Tewpm/FxKRaKedb/ixr/eumDiCsZa1BzWf1V3JRHhnDzFiN1lgoy+gQvk+Wm1/sg==
x-ms-office365-filtering-correlation-id: eed05b5c-7b4c-43b2-1d1a-08d489a75796
x-ms-office365-filtering-ht: Tenant
x-microsoft-antispam: UriScan:; BCL:0; PCL:0; RULEID:(22001)(2017030254075)(48565401081)(201703131423075)(201703031133081)(201702281549075); SRVR:AM5PR0701MB2547;
x-microsoft-antispam-prvs: <AM5PR0701MB25470CB9E594FA32AF80EAA7931D0@AM5PR0701MB2547.eurprd07.prod.outlook.com>
x-exchange-antispam-report-test: UriScan:(37575265505322)(192374486261705);
x-exchange-antispam-report-cfa-test: BCL:0; PCL:0; RULEID:(6040450)(601004)(2401047)(8121501046)(5005006)(93006095)(93001095)(3002001)(10201501046)(6055026)(6041248)(201703131423075)(201702281528075)(201703061421075)(20161123562025)(20161123560025)(20161123555025)(20161123564025)(6072148); SRVR:AM5PR0701MB2547; BCL:0; PCL:0; RULEID:; SRVR:AM5PR0701MB2547;
x-forefront-prvs: 0285201563
x-forefront-antispam-report: SFV:NSPM; SFS:(10019020)(6009001)(39850400002)(39860400002)(39840400002)(39410400002)(39400400002)(39450400003)(54094003)(53754006)(6436002)(53936002)(9686003)(6306002)(102836003)(3846002)(6116002)(99286003)(54906002)(50986999)(54356999)(76176999)(55016002)(3660700001)(3280700002)(77096006)(6506006)(81166006)(2900100001)(110136004)(6246003)(38730400002)(8676002)(33656002)(66066001)(8936002)(2906002)(53546009)(189998001)(25786009)(6916009)(7696004)(229853002)(4326008)(5660300001)(7736002)(305945005)(2950100002)(74316002)(122556002)(86362001)(557034004); DIR:OUT; SFP:1102; SCL:1; SRVR:AM5PR0701MB2547; H:AM5PR0701MB2547.eurprd07.prod.outlook.com; FPR:; SPF:None; MLV:sfv; LANG:en;
spamdiagnosticoutput: 1:99
spamdiagnosticmetadata: NSPM
Content-Type: text/plain; charset="Windows-1252"
Content-Transfer-Encoding: quoted-printable
MIME-Version: 1.0
X-OriginatorOrg: nokia.com
X-MS-Exchange-CrossTenant-originalarrivaltime: 22 Apr 2017 17:45:18.6440 (UTC)
X-MS-Exchange-CrossTenant-fromentityheader: Hosted
X-MS-Exchange-CrossTenant-id: 5d471751-9675-428d-917b-70f44f9630b0
X-MS-Exchange-Transport-CrossTenantHeadersStamped: AM5PR0701MB2547
Archived-At: <https://mailarchive.ietf.org/arch/msg/tcpm/GBzlnb206R5wuSJik-MtHqxXbko>
Subject: Re: [tcpm] Heads up on the upcoming WGLC for COCOA
X-BeenThere: tcpm@ietf.org
X-Mailman-Version: 2.1.22
Precedence: list
List-Id: TCP Maintenance and Minor Extensions Working Group <tcpm.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/tcpm>, <mailto:tcpm-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/tcpm/>
List-Post: <mailto:tcpm@ietf.org>
List-Help: <mailto:tcpm-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/tcpm>, <mailto:tcpm-request@ietf.org?subject=subscribe>
X-List-Received-Date: Sat, 22 Apr 2017 17:45:24 -0000

Hi all,

I have read draft-ietf-core-cocoa-01 as a contributor to TCPM, in particular since TCP documents are mentioned therein. Please find below my comments. Note that I have not followed discussions on this document and I also do not monitor the CoRE mailing list, i.e., I may lack some context. If topics have been discussed already, I apologize for that.

I wonder if this document is already ready for WGLC. There is missing text, and to me the alorithm is not very well motivated. A formal issue is the explicit violation of the BCP RFC 8085 but this is more something for the IESG to care about.

If the document should continue to use an algorithm contradicting RFC 8085, I think Appendix B should be in the document.

I have not read Appendix A. An algorithm like the one mentioned in Appendix A IMHO requires comprehensive simulation and empirical evaluation to ensure it is robust in all possible corner cases. This may be a topic e.g. for ICCRG. In my option, either evidence should be included or referenced in the document, or the Appendix A should be removed. It seems to me that Appendix A could be moved to another draft without impacting the rest of this document.


Major issues:

- Abstract: The abstract does not explain well the actual content of the document. For instance, it does not mention that an RTO algorithm is described in the document.

- Abstract: The abstract should mention that the document is informational.

- Section 3: I think the area of applicability should be better explained. As far as I can see, the document focuses on low-bandwidth and high-latency links. This is probably a relevant application scenario for CoAP. However, I wonder if the mechanism would be equally well applicable to other transports built for constrained network, see e.g. the "IoT" work in 4G/5G. I am not sure if they have the same delay characteristics. Also, the document seems to assume later "naturally lossy networks"; this is not generally true in the Internet. All in all, it really would make sense to explicitly spell out for what environments the algorithms have been designed.

- Section 4: Somewhat related, I think at the beginning of Section 4 a better description is needed for the design rationale of the algorithms, e.g., on underlying assumptions on the absolute values of delays. The current draft basically presents an algorithm without much discussion on the design choices, and whether alternatives would exist. For instance, I believe that the document implicitly assumes that delays/RTT can be larger than 1s. Such assumptions should be spelt out. 

- Section 4, Section 4.2, and Section 4.2.2: Indeed, BCP RFC 8085 defines "Latency samples MUST NOT be derived from ambiguous transactions." As far as I understand, this document explicitly violates the IETF concensus on RFC 8085. Whether this is acceptable from a process perspective probably needs IESG discussion. From a formal perspective, at minimum, I would expect this deviation to be mentioned very explicitly in the introduction and at the beginning of Section 4. I'll comment below on some technical aspects.

- Section 4.2 and in particular 4.2.1: This document should use a terminolgy like "differences to the algorithm in RFC 6298". Using the term "modification" is confusing. This (informational) document does not *modify* RFC 6298 or the algorithms therein.


Minor issues:

 - I find the title "CoAP Simple Congestion Control/Advanced" confusing. To me the terms "simple" and "adanced" don't fit together very well. If the objective is to use the acronym "CoCoA", this could be better explained, e.g., in the abstract.

- Section 1: I think "(See Abstract.)" should be replaced by actual text. As mentioned separately, IMHO the abstract does not well describe the document neither.

- Section 1 and 2: I would avoid that a long-lasting document refers to "minutes of the IEFT 84 CoRE WG meetings". I think a document should be reasonably self-contained, in particular in the introduction. I also think that some content of Section 2 would well fit into the introduction. But this may be a personal preference of me.

- Section 3: As already mentioned, if "aggregate congestion control (Appendix A) is not yet supported by research", I wonder why not to move this to a separate document and publish that other document once there is research. In any case, the term "cloud side" in that paragraph is not well defined and the use case assumed here is not clear to me.

- Section 4: "This is based on the usual algorithms for RTO estimation [RFC6298]". RFC 6298 describes the standardized TCP algorithm; I think "usual" is not a precise characterization as some TCP stacks may not exactly follow RFC 6298.

- Section 4: "One important consideration not relevant for TCP is the fact that a CoAP round-trip may include application processing time". If the document wants to compare to TCP, it may be worthwile here to discuss the TCP delayed acks.

- Section 4.2 (and follow-up): I don't think the use of the weak estimator is well motivated. The document does not give a good explanation why using ambiguous samples is really useful, except for the observation that there have been some (small?) benefits in some studies, without looking into root causes. For instance, the weak estimator can be significantly larger than the actual network delay. Why does it make sense to increase the RTO way beyond the actual network delay in that case? Wouldn't this impact the delivery delay if a follow-up packet gets lost, and thus degrade the application performance more than needed? In what network scenarios does using ambiguous samples really help? And why does it even make sense to calculate the variance of a sample that is ambiguous? Unfortunately, there is no single set of delay and loss characteristics in the Internet that a performance study could use. Therefore, arguments on functional properties of an algorithm (e.g., in simple example scenarios) would be quite helpful.

- Section 4.2 (and follow-ups): I don't think the document addresses well how close or far away the estimates of the strong and weak estimator can be to each other. For instance, in a network without much congestion, would the weak estimator get updated if the latency in the network changes? How does the Aging according to Section 4.3 tie in in that case? Based on the document it is difficult to understand how the various different algorithms with conditional statements finally react altogether, and whether they are safe in all corner cases. It may be simpler to present the complete algorithms as a sequence of processing steps. Alternatively or in addition, some examples for simple scenarios could help to better understand what the algorithms would converge to (e.g., one example with rather low RTT samples and one with rather large ones). Such examples may not be mandatory in an RFC, but this would really be nice-to-have, IMHO.

- Section 4.2: If we assume that the weak estimator is used (see my other comments), it seems that a sender would *either* use (1) or (2), right? If so, this could be better spelt out.

- Section 4.2.1: "For RTO estimations below 1 s, the RTO for a  retransmission is multiplied by 3, while for estimations above 3 s,  the RTO is multiplied only by 1.5 (this updated choice of numbers to  be verified by more simulations)". Will the document be updated with results from more simulations prior to WGLC?

- Section 4.2.2 "In contrast to [RFC6298], this algorithm attempts to make use of ambiguous information from retransmissions.  This is motivated by the high non-congestion loss rates expected in constrained node networks,  and the need to update the RTO estimators even in the presence of loss." I believe this assumption does not generally apply to all network technologies. Several technologies have ARQ or FEC mechanisms below IP to make the rate of non-congestion loss extremely small. I think this should be discussed. It would be worthwile to expand on what benefits (if any) the weak estimator might have in networks without non-congestion loss.

- Section 4.2.2: "However, those samples are not simply combined into the strong estimator, but are used to correct the limited knowledge that can be gained from the strong RTT measurements by employing an additional weak estimator." If there is benefit, it would be useful to add an example that explains that "limited knowledge" and why such a correction could be useful.

- Section 7: Security considerations seem to be missing. I wonder about the scenario that an attacker only has temporary access to the network path. By causing some packets to be dropped, can he increase the RTOs in a large number of senders for a relatively long time, thus degrading the performance significantly?


Thanks

Michael

________________________________________
From: tcpm <tcpm-bounces@ietf.org> on behalf of Jaime Jiménez <jaime.jimenez@ericsson.com>
Sent: Wednesday, April 12, 2017 14:35
To: core@ietf.org WG
Cc: tcpm@ietf.org; iccrg@ietf.org; michawe@ifi.uio.no
Subject: [tcpm] Heads up on the upcoming WGLC for COCOA

Hi CoRE WG,

this is just a heads up for the upcoming WGLC for "CoAP Simple Congestion Control/Advanced”.
https://tools.ietf.org/html/draft-ietf-core-cocoa-01

Note that there is still some discussion about whether to keep the appendixes or not, and that there is time to get other comments before WGLC.
This document has been presented multiple times and I’d like to progress with it soon, it’d be great to have it by Prague.

To ICCRG, as you now already, this was presented quite some time ago, below is the links to the session and the slides.

Session: https://www.ietf.org/proceedings/92/iccrg.html
Presentation: https://www.ietf.org/proceedings/92/slides/slides-92-iccrg-2.pdf

Between the two sessions the main difference is the addition of the appendixes:
https://tools.ietf.org/rfcdiff?url2=draft-ietf-core-cocoa-01.txt

Ciao!
- - Jaime Jiménez