[quicwg/base-drafts] persistent congestion is more aggressive than RTO verified (#2255)

Marten Seemann <notifications@github.com> Tue, 25 December 2018 05:08 UTC

Return-Path: <noreply@github.com>
X-Original-To: quic-issues@ietfa.amsl.com
Delivered-To: quic-issues@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 0A66112F1A6 for <quic-issues@ietfa.amsl.com>; Mon, 24 Dec 2018 21:08:16 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -8.065
X-Spam-Level:
X-Spam-Status: No, score=-8.065 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIMWL_WL_HIGH=-0.065, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, HTML_MESSAGE=0.001, MAILING_LIST_MULTI=-1, RCVD_IN_DNSWL_HI=-5, SPF_PASS=-0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (1024-bit key) header.d=github.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id XkcRj347cFsf for <quic-issues@ietfa.amsl.com>; Mon, 24 Dec 2018 21:08:14 -0800 (PST)
Received: from out-2.smtp.github.com (out-2.smtp.github.com [192.30.252.193]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 555E512F1A5 for <quic-issues@ietf.org>; Mon, 24 Dec 2018 21:08:14 -0800 (PST)
Date: Mon, 24 Dec 2018 21:08:12 -0800
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=github.com; s=pf2014; t=1545714492; bh=nAC98QWoDOgypfgF/96lNdZ9LW7HSS0TuvyzFysnJ4E=; h=Date:From:Reply-To:To:Cc:Subject:List-ID:List-Archive:List-Post: List-Unsubscribe:From; b=XQk9ExP0gXCPXOaS9wmkqWoFpZ3/FIIEEM2ytxZd20cyjBxmlTA5gAK0YwiULivTq ppNeLEBVkSZphnHnEEo18rYjWoHFJQYNinAUKNN3q+sYWYCMhKTuy/38FngBNF8cbv /3+26kMDby4RlKjcvR3du8EUfildBEBX1D9ZZgsI=
From: Marten Seemann <notifications@github.com>
Reply-To: quicwg/base-drafts <reply+0166e4abcb1fbcd756c0ecdf7e04cb4f76a79e22ada7e07c92cf0000000118397d3c92a169ce177b64ce@reply.github.com>
To: quicwg/base-drafts <base-drafts@noreply.github.com>
Cc: Subscribed <subscribed@noreply.github.com>
Message-ID: <quicwg/base-drafts/issues/2255@github.com>
Subject: [quicwg/base-drafts] persistent congestion is more aggressive than RTO verified (#2255)
Mime-Version: 1.0
Content-Type: multipart/alternative; boundary="--==_mimepart_5c21bb3c5243d_6b1a3f7f098d45c41805663"; charset="UTF-8"
Content-Transfer-Encoding: 7bit
Precedence: list
X-GitHub-Sender: marten-seemann
X-GitHub-Recipient: quic-issues
X-GitHub-Reason: subscribed
X-Auto-Response-Suppress: All
X-GitHub-Recipient-Address: quic-issues@ietf.org
Archived-At: <https://mailarchive.ietf.org/arch/msg/quic-issues/JxnVMj-THFcFcCPrgzFJD9bE2Wk>
X-BeenThere: quic-issues@ietf.org
X-Mailman-Version: 2.1.29
List-Id: Notification list for GitHub issues related to the QUIC WG <quic-issues.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/quic-issues>, <mailto:quic-issues-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/quic-issues/>
List-Post: <mailto:quic-issues@ietf.org>
List-Help: <mailto:quic-issues-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/quic-issues>, <mailto:quic-issues-request@ietf.org?subject=subscribe>
X-List-Received-Date: Tue, 25 Dec 2018 05:08:16 -0000

Consider the following sequence of packets being sent

|  | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 |
| -- | -- | -- | -- | -- | -- | -- | -- | -- |
| old: | packet | packet | TLP 1 | TLP 1 | TLP 2 | TLP 2 | RTO | RTO |
| new: | packet | packet | PTO 1 | PTO 1 | PTO 2 | PTO 2 | PTO 3 | PTO 3 |

In our old loss recovery code (before unifying TLP and RTO, #2114), we used the following logic for determining if an RTO was not spurious, which would then lead to collapse of the congestion window:
```
if (rto_count > 0 && smallest_newly_acked > largest_sent_before_rto):
  OnRetransmissionTimeoutVerified(smallest_newly_acked)
```
In the example above `largest_sent_before_rto` would be packet 14. This means that to count as a verified RTO, an ACK must acknowledge 15 or 16, but not *any* packets smaller or equal than 14.

With #2114, we now collapse the congestion window when `pto_count` is larger than 2 and we receive an ACK which leads us to declare *any* packet lost. In the example above this means any ACK received after sending packet 15 which declares a packet lost would suffice. Even worse, this logic already triggers if we receive the ACK *immediately after* sending 15, i.e. when the peer didn't even have a chance to receive it.

This seems  **a lot more aggressive** than what we had before before. Is this intentional?

-- 
You are receiving this because you are subscribed to this thread.
Reply to this email directly or view it on GitHub:
https://github.com/quicwg/base-drafts/issues/2255