RE: Measurement bit(s) or not

"Lubashev, Igor" <ilubashe@akamai.com> Sat, 10 February 2018 21:41 UTC

Return-Path: <ilubashe@akamai.com>
X-Original-To: quic@ietfa.amsl.com
Delivered-To: quic@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id DE5A61270AE for <quic@ietfa.amsl.com>; Sat, 10 Feb 2018 13:41:46 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.699
X-Spam-Level:
X-Spam-Status: No, score=-2.699 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, HTML_MESSAGE=0.001, RCVD_IN_DNSWL_LOW=-0.7, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=akamai.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id wxoAETvMAm1a for <quic@ietfa.amsl.com>; Sat, 10 Feb 2018 13:41:45 -0800 (PST)
Received: from mx0a-00190b01.pphosted.com (mx0a-00190b01.pphosted.com [IPv6:2620:100:9001:583::1]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 758CE1267BB for <quic@ietf.org>; Sat, 10 Feb 2018 13:41:45 -0800 (PST)
Received: from pps.filterd (m0122332.ppops.net [127.0.0.1]) by mx0a-00190b01.pphosted.com (8.16.0.22/8.16.0.22) with SMTP id w1ALbsZb007209; Sat, 10 Feb 2018 21:41:45 GMT
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=akamai.com; h=from : to : subject : date : message-id : references : in-reply-to : content-type : mime-version; s=jan2016.eng; bh=4BEhnCfBYwSPOzqCm6HxRdmiA4AZ80FKtCip8DOLiiA=; b=WcQF9JLGi/23iYM+K7IX8lF5pRzq++iwtTgYbib0X3AqDkEeH3Ff438tYkrH+cL5pCu3 0tM6S07ggtS1CTz6z7AKMNRx2NVHAQAEZveufVzBxbgQLYJ5VjMlP30uEvDTigDQj8j/ h+HnW7vxMXezJLVhQKxQTpsUY0RoLAk5+tCdfKKO4QnqrgdE9vcpPl3Q6rIARJSmz+dk LLNS9kZl15i0znV2vHPB5g8YJtaZE58WSyWiOmR+fiWo2LfMIDVRFBnohHL4A+MoFKzT WnA3lLSbYrmBKDebD6EX0FVvqoFly2pOMMLaJouSJ9WGFstTQmwauIaLYgC1nfGtH4IL Ig==
Received: from prod-mail-ppoint3 ([96.6.114.86]) by mx0a-00190b01.pphosted.com with ESMTP id 2g1snkht4s-1 (version=TLSv1.2 cipher=ECDHE-RSA-AES256-GCM-SHA384 bits=256 verify=NOT); Sat, 10 Feb 2018 21:41:45 +0000
Received: from pps.filterd (prod-mail-ppoint3.akamai.com [127.0.0.1]) by prod-mail-ppoint3.akamai.com (8.16.0.21/8.16.0.21) with SMTP id w1ALfZLg017535; Sat, 10 Feb 2018 16:41:44 -0500
Received: from email.msg.corp.akamai.com ([172.27.123.30]) by prod-mail-ppoint3.akamai.com with ESMTP id 2g1vy112ax-1 (version=TLSv1.2 cipher=ECDHE-RSA-AES256-SHA384 bits=256 verify=NOT); Sat, 10 Feb 2018 16:41:43 -0500
Received: from USMA1EX-DAG1MB5.msg.corp.akamai.com (172.27.123.105) by usma1ex-dag1mb2.msg.corp.akamai.com (172.27.123.102) with Microsoft SMTP Server (TLS) id 15.0.1263.5; Sat, 10 Feb 2018 16:41:42 -0500
Received: from USMA1EX-DAG1MB5.msg.corp.akamai.com ([172.27.123.105]) by usma1ex-dag1mb5.msg.corp.akamai.com ([172.27.123.105]) with mapi id 15.00.1263.000; Sat, 10 Feb 2018 16:41:42 -0500
From: "Lubashev, Igor" <ilubashe@akamai.com>
To: "quic@ietf.org" <quic@ietf.org>, "alexandre.ferrieux@orange.com" <alexandre.ferrieux@orange.com>
Subject: RE: Measurement bit(s) or not
Thread-Topic: Measurement bit(s) or not
Thread-Index: AQHTopV5Xk/Gq+i72kuhP7it8q6tzaOeKra0
Date: Sat, 10 Feb 2018 21:41:42 +0000
Message-ID: <aa7a56d01f0a41fe9ad0fd9e61c54c50@usma1ex-dag1mb5.msg.corp.akamai.com>
References: <1817_1518284090_5A7F2D3A_1817_79_1_5A7F2D3E.4050806@orange.com>
In-Reply-To: <1817_1518284090_5A7F2D3A_1817_79_1_5A7F2D3E.4050806@orange.com>
Accept-Language: en-US
Content-Language: en-US
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
x-ms-exchange-transport-fromentityheader: Hosted
Content-Type: multipart/alternative; boundary="_000_aa7a56d01f0a41fe9ad0fd9e61c54c50usma1exdag1mb5msgcorpak_"
MIME-Version: 1.0
X-Proofpoint-Virus-Version: vendor=fsecure engine=2.50.10432:, , definitions=2018-02-10_10:, , signatures=0
X-Proofpoint-Spam-Details: rule=notspam policy=default score=0 suspectscore=0 malwarescore=0 phishscore=0 bulkscore=0 spamscore=0 mlxscore=0 mlxlogscore=999 adultscore=0 classifier=spam adjust=0 reason=mlx scancount=1 engine=8.0.1-1711220000 definitions=main-1802100286
X-Proofpoint-Virus-Version: vendor=fsecure engine=2.50.10432:, , definitions=2018-02-10_10:, , signatures=0
X-Proofpoint-Spam-Details: rule=notspam policy=default score=0 priorityscore=1501 malwarescore=0 suspectscore=0 phishscore=0 bulkscore=0 spamscore=0 clxscore=1011 lowpriorityscore=0 mlxscore=0 impostorscore=0 mlxlogscore=999 adultscore=0 classifier=spam adjust=0 reason=mlx scancount=1 engine=8.0.1-1711220000 definitions=main-1802100285
Archived-At: <https://mailarchive.ietf.org/arch/msg/quic/MaIde78PCEzE5gTHIoEoAIr6SsM>
X-BeenThere: quic@ietf.org
X-Mailman-Version: 2.1.22
Precedence: list
List-Id: Main mailing list of the IETF QUIC working group <quic.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/quic>, <mailto:quic-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/quic/>
List-Post: <mailto:quic@ietf.org>
List-Help: <mailto:quic-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/quic>, <mailto:quic-request@ietf.org?subject=subscribe>
X-List-Received-Date: Sat, 10 Feb 2018 21:41:47 -0000

I am a bit sceptical about the square. With TCP, packet trace shows loss of any packet, except for SYN. The "square wave" whould show no loss at all for short connections (fewer than a "wavelength" of packets). Moreover, the square wave can only detect loss after up to the wave length of subsequent packets has been transmitted. If there is an application pause in transmission, loss may not become evident.

Basically, a spin bit or a bit pattern (like a square wave) may help in troubleshooting, but it will be more uncertain and hokey than what people come to expect of tcp. We have be very clear in acknowledging it.

-----Original Message-----
From: alexandre.ferrieux@orange.com [alexandre.ferrieux@orange.com]
Received: Saturday, 10 Feb 2018, 12:34PM
To: quic@ietf.org [quic@ietf.org]
Subject: Measurement bit(s) or not

On 07/02/2018 14:34, Brian Trammell (IETF) wrote:
 > hi Jana,
 >
 >> 3. Some sequencing information -- a few bits of the packet number perhaps
 >> -- should be revealed (for monitoring. Number of bits TBD.)
 >
 > This is the crux of the argument. On one side we have the risk of misuse and
 > ossification (well, not ossification -- these bits are *meant* for the path
 > -- rather the risk that we'll figure out later that we specified the wrong
 > thing), on the other side we have the loss of visibility into how QUIC
 > traffic interacts with the network as compared to TCP, with a side question
 > of whether or not this visibility is really the transport layer's problem
 > despite the evolution the practice of diagnostics and troubleshooting using
 > TCP information.
 >
 > If we can come to agreement on this question, everything else falls into
 > place. I have my arguments here, but as you said, this subthread is not the
 > place for them. :)

The crux indeed. So what about settling it first ?

With the troubleshooting hat, I can only stress the need for measurement bits,
for the benefit of everybody, since s**t happens, networks are imperfect, and
nifty encapsulations-with-seqnum will simply not be where you need them.

Now to the exact nature of these measurement bits:

Thanks to the detailed exchanges on this thread, it is by now clear that a
simple gapless counter, even nonzero-based and XORed, is not acceptable. The
4-bit SSN comes pretty close but is not enough when things go really wrong (and
they will - and that's where we need the tool).

Then Kazuho's square signal and Mikkel's Pi (or any other consensual
self-synchronizing sequence) ramification came up. They are both appealing for
their elegance and low complexity on QUIC endpoints. Beyond their quirks
acknowledged here, here are a few considerations for troubleshooting:

  (1) Since reordering is less of a concern to QUIC than to TCP, it becomes a
secondary goal. This is nice, because the square doesn't see it, and the
self-synchronizing sequence will only tolerate a mild one, and never see its
detail like cycle length etc.

  (2) There's of course a huge difference between them in complexity for the
midpoint: square is trivial, Pi is hefty.

Given these, a benevolent, troubleshooting-minded passive midpoint will clearly
vote for the square. Now the obvious question is: is this acceptable, or deemed
too easy for a Murphy, Inc. active middlebox to see upstream losses and
benevolently wreak havoc by delaying packets ?

_________________________________________________________________________________________________________________________

Ce message et ses pieces jointes peuvent contenir des informations confidentielles ou privilegiees et ne doivent donc
pas etre diffuses, exploites ou copies sans autorisation. Si vous avez recu ce message par erreur, veuillez le signaler
a l'expediteur et le detruire ainsi que les pieces jointes. Les messages electroniques etant susceptibles d'alteration,
Orange decline toute responsabilite si ce message a ete altere, deforme ou falsifie. Merci.

This message and its attachments may contain confidential or privileged information that may be protected by law;
they should not be distributed, used or copied without authorisation.
If you have received this email in error, please notify the sender and delete this message and its attachments.
As emails may be altered, Orange is not liable for messages that have been modified, changed or falsified.
Thank you.