Re: What CSRs Do

Willy Tarreau <w@1wt.eu> Tue, 19 November 2019 04:29 UTC

Return-Path: <w@1wt.eu>
X-Original-To: quic@ietfa.amsl.com
Delivered-To: quic@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 29A5712081A for <quic@ietfa.amsl.com>; Mon, 18 Nov 2019 20:29:13 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.901
X-Spam-Level:
X-Spam-Status: No, score=-1.901 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, SPF_PASS=-0.001] autolearn=ham autolearn_force=no
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id DD5nwds3Lc8a for <quic@ietfa.amsl.com>; Mon, 18 Nov 2019 20:29:11 -0800 (PST)
Received: from 1wt.eu (wtarreau.pck.nerim.net [62.212.114.60]) by ietfa.amsl.com (Postfix) with ESMTP id AC5E212008A for <quic@ietf.org>; Mon, 18 Nov 2019 20:29:10 -0800 (PST)
Received: (from willy@localhost) by pcw.home.local (8.15.2/8.15.2/Submit) id xAJ4T463005686; Tue, 19 Nov 2019 05:29:04 +0100
Date: Tue, 19 Nov 2019 05:29:04 +0100
From: Willy Tarreau <w@1wt.eu>
To: Frode Kileng <frodek@tele.no>
Cc: "Border, John" <John.Border@hughes.com>, "quic@ietf.org" <quic@ietf.org>
Subject: Re: What CSRs Do
Message-ID: <20191119042904.GC5602@1wt.eu>
References: <BL0PR11MB3394D769128DEFEEFBC3CA71904C0@BL0PR11MB3394.namprd11.prod.outlook.com> <0cf4d3ad-0f64-0ae4-2d95-77a39f40639d@tele.no>
MIME-Version: 1.0
Content-Type: text/plain; charset="iso-8859-1"
Content-Disposition: inline
Content-Transfer-Encoding: 8bit
In-Reply-To: <0cf4d3ad-0f64-0ae4-2d95-77a39f40639d@tele.no>
User-Agent: Mutt/1.6.1 (2016-04-27)
Archived-At: <https://mailarchive.ietf.org/arch/msg/quic/2rtHlpuIpjkegE1Dr1o5nt0MkP8>
X-BeenThere: quic@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: Main mailing list of the IETF QUIC working group <quic.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/quic>, <mailto:quic-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/quic/>
List-Post: <mailto:quic@ietf.org>
List-Help: <mailto:quic-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/quic>, <mailto:quic-request@ietf.org?subject=subscribe>
X-List-Received-Date: Tue, 19 Nov 2019 04:29:13 -0000

On Tue, Nov 19, 2019 at 04:51:14AM +0100, Frode Kileng wrote:
> 
> On 19/11/2019 04:34, Border, John wrote:
> > 
> >      I agree that it is too late to solve problems like troubleshooting
> > packet loss for v1.  Plus, we want a good solution (which takes time). 
> > But, we need to solve this problem or it will impact QUICv1
> > deployment...
> > 
> >      When a customer calls reporting that an application does not work,
> > the Customer Service Representative is trained to do whatever it takes
> > to fix the problem as quickly as possible.  Time on the phone equals
> > money.  And, unhappy customers equal churn.  There is a lot of anecdotal
> > learning involved with the troubleshooting process. If the CSRs learn
> > that disabling QUIC fixes the problem (because the application falls
> > back to TCP), eventually this will become the first thing that is tried
> > every time. Eventually, their supervisors will push for disabling it
> > across the board.  Once disabled, it is a lot of work to get it
> > reenabled.
> > 
> I do think this exemplifies that some network operators have operational
> practices they want to preserve and requests "packet loss" signal support in
> v1. Other operators use alternative operational practices.

The only known alternative is to claim "we have no loss here" and point the
finger at the next step in the chain operated by someone else. Actually
this is already what incompetent operators do, hence the common need for
captures to know whether everything is OK at one point or not. Usually
this has to be done within the operator's infrastructure once others
around managed to prove that something is already missing on their side.

> Should the WG 
> make efforts to contribute to preserving current practices of some network
> operators?

It will depend how long some operators will want to troubleshoot the issues
before claiming the protocol doesn't work :-/  Worse, since the network stack
will be implemented in browsers and servers, they will simply point the
fingers at these components, proving that just changing them (hence the
protocol as well) solves the issue. My personal take on this has been
expressed many times, which is that network must be observable if you want
to fix it, otherwise it degrades by itself till a point of poor quality
that "works enough" for most users (e.g. public WiFi access points). Things
as simple as detecting packet reordering caused by multiple paths is trivial
at the TCP level, it's way harder when you cannot say what the packet is
about.

> Frode

Willy