Re: [tcpm] [EXTERNAL] Re: Seeking WG opinions on ACKing ACKs with good cause

Bob Briscoe <ietf@bobbriscoe.net> Sun, 11 July 2021 11:43 UTC

Return-Path: <ietf@bobbriscoe.net>
X-Original-To: tcpm@ietfa.amsl.com
Delivered-To: tcpm@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 7E58B3A091F for <tcpm@ietfa.amsl.com>; Sun, 11 Jul 2021 04:43:23 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -0.197
X-Spam-Level:
X-Spam-Status: No, score=-0.197 tagged_above=-999 required=5 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, NICE_REPLY_A=-0.001, RCVD_IN_DNSWL_BLOCKED=0.001, RCVD_IN_MSPIKE_H3=0.001, RCVD_IN_MSPIKE_WL=0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=bobbriscoe.net
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id RKScQQNqS8zu for <tcpm@ietfa.amsl.com>; Sun, 11 Jul 2021 04:43:19 -0700 (PDT)
Received: from mail-ssdrsserver2.hostinginterface.eu (mail-ssdrsserver2.hostinginterface.eu [185.185.85.90]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id CFA513A091D for <tcpm@ietf.org>; Sun, 11 Jul 2021 04:43:18 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=bobbriscoe.net; s=default; h=Content-Transfer-Encoding:Content-Type: In-Reply-To:MIME-Version:Date:Message-ID:From:References:Cc:To:Subject:Sender :Reply-To:Content-ID:Content-Description:Resent-Date:Resent-From: Resent-Sender:Resent-To:Resent-Cc:Resent-Message-ID:List-Id:List-Help: List-Unsubscribe:List-Subscribe:List-Post:List-Owner:List-Archive; bh=I0HVX2uInLbob16zeCrny+m58LAVdgqMHxC44+W1rY4=; b=vsjUDhTnQtFvJ7M3Nl5uTLW3Sv gln18Yi41qV95TVTVCcIUfYUE8W2g0khkk+viL+7VnpYHY+zBdY+Ullo5BTM0S06yDp5tz4R5gxeL moqP9NOII+po3T6MaKUZAjvX84SPuDScL0Jx8vfUxO/KjppzUkkSQb2lu4dZTV4m+d3KsBp4vZUHu HA5uFhotES2+G+cLAZiX+3nTfg1cs3wkGitkhMGmUjBveBwYRc/BAM+rUU2N+UAJTg1W4mS4FE7yV YTonSp9SstDfj8V+ucYmbhyiN8W+ibbYcMknsBR2Szd1rBQsluCboBxaoHoR2pV7uRv0W8h9XCfrQ wxyXyupA==;
Received: from 67.153.238.178.in-addr.arpa ([178.238.153.67]:58474 helo=[192.168.1.10]) by ssdrsserver2.hostinginterface.eu with esmtpsa (TLS1.2) tls TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256 (Exim 4.94.2) (envelope-from <ietf@bobbriscoe.net>) id 1m2Xrf-0007Sp-Ii; Sun, 11 Jul 2021 12:43:15 +0100
To: Christian Huitema <huitema@huitema.net>, Yoshifumi Nishida <nsd.ietf@gmail.com>, "tcpm@ietf.org" <tcpm@ietf.org>, Mirja Kuehlewind <ietf@kuehlewind.net>, "rs.ietf@gmx.at" <rs.ietf@gmx.at>
Cc: "nishida@sfc.wide.ad.jp" <nishida@sfc.wide.ad.jp>
References: <47df9b8b-515e-d40d-3473-599b0a3e3876@bobbriscoe.net> <CAAK044QgF4pz5Wamnxkobthou5ac4_LBxh8=nBYWyOxQUtcW-Q@mail.gmail.com> <8151fdef-ae78-80f3-adfc-d40db878ac8e@gmx.at> <CAAK044RhdAYexcGRj_XDkdY_o6JqB0DDo1X0H2AeFkRcsb0i4A@mail.gmail.com> <48c5910d-5340-acd6-8fd9-fff1b7758310@bobbriscoe.net> <CAAK044QOdi8DzBbLbwTvdcesFK21i1KU+Sj+4J7_odE5UQfmNg@mail.gmail.com> <dc11cd02-616e-93a7-7bcf-1e5112c2e1e1@bobbriscoe.net> <99B71B9A-EC9B-47FD-A149-FBEF9DEEC8DC@kuehlewind.net> <CAAK044SgdHAiBvDPHYOaq7-fgJTrBBoAqZQ70F5X3Q5HXvTEuw@mail.gmail.com> <ce4f09c2-2b50-8b35-3c3d-a01d45acce3b@bobbriscoe.net> <CAAK044SC4+=RBOEky34OzuirM-u_Om58Lm8uqSqkcpExSBwfHA@mail.gmail.com> <c98723ea-8cbe-5141-a127-33975676050c@gmx.at> <CO2PR00MB016662270FCE3E01AC4138D9B67D9@CO2PR00MB0166.namprd00.prod.outlook.com> <CAAK044QxkxxuuRXyRQB4U5q+SOKqUYLBqv7-tRJHybkFQjsO2A@mail.gmail.com> <d9d36273-7407-4e8e-b9e8-cd97f1096d54@bobbriscoe.net> <c862f6e1-9a95-6f37-6bed-0111a16b3cc6@huitema.net>
From: Bob Briscoe <ietf@bobbriscoe.net>
Message-ID: <2482da71-5b79-1933-f975-e46cd7661c39@bobbriscoe.net>
Date: Sun, 11 Jul 2021 12:43:11 +0100
User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:78.0) Gecko/20100101 Thunderbird/78.11.0
MIME-Version: 1.0
In-Reply-To: <c862f6e1-9a95-6f37-6bed-0111a16b3cc6@huitema.net>
Content-Type: text/plain; charset="utf-8"; format="flowed"
Content-Transfer-Encoding: 8bit
Content-Language: en-GB
X-AntiAbuse: This header was added to track abuse, please include it with any abuse report
X-AntiAbuse: Primary Hostname - ssdrsserver2.hostinginterface.eu
X-AntiAbuse: Original Domain - ietf.org
X-AntiAbuse: Originator/Caller UID/GID - [47 12] / [47 12]
X-AntiAbuse: Sender Address Domain - bobbriscoe.net
X-Get-Message-Sender-Via: ssdrsserver2.hostinginterface.eu: authenticated_id: in@bobbriscoe.net
X-Authenticated-Sender: ssdrsserver2.hostinginterface.eu: in@bobbriscoe.net
X-Source:
X-Source-Args:
X-Source-Dir:
Archived-At: <https://mailarchive.ietf.org/arch/msg/tcpm/oY57ptht-dFsFIzMn-9zRgpJdaw>
Subject: Re: [tcpm] [EXTERNAL] Re: Seeking WG opinions on ACKing ACKs with good cause
X-BeenThere: tcpm@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: TCP Maintenance and Minor Extensions Working Group <tcpm.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/tcpm>, <mailto:tcpm-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/tcpm/>
List-Post: <mailto:tcpm@ietf.org>
List-Help: <mailto:tcpm-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/tcpm>, <mailto:tcpm-request@ietf.org?subject=subscribe>
X-List-Received-Date: Sun, 11 Jul 2021 11:43:24 -0000

Christian,

On 11/07/2021 06:38, Christian Huitema wrote:
>
> On 7/10/2021 2:13 PM, Bob Briscoe wrote:
>> Yoshi, Richard, Mirja, tcpm list,
>>
>> We came to a good consensus in late March on the conditions to place 
>> around ACKing ACKs in order to keep congestion feedback flowing, but 
>> to ensure ping-pong ACKs are rapidly damped as well as avoiding false 
>> DupACK detection. I have written that up in a rev of the draft, which 
>> I will post separately so as not to distract from the question I have 
>> here...
>> ...While writing up, I realized we had glossed over a possibly more 
>> important point, when we proposed to change the number in the basic 
>> rule from 2 to 3, as follows:
>>
>>       An AccECN Data Receiver MUST immediately send an ACK once 'n'
>>       CE marks have arrived since the previous ACK, where 'n' SHOULD
>>       be 3 and MUST be in the range 3 to 6 inclusive.
>>
>>
>> I'd like to suggest an alternative:
>>
>>       An AccECN Data Receiver MUST immediately send an ACK once 'n'
>>       CE marks have arrived since the previous ACK. If there is new
>>       data to acknowledge, 'n' SHOULD be 2.  If there is no new data
>>       to acknowledge, 'n' SHOULD be 3 and MUST be no less than 3.
>>       In either case, 'n' MUST be no greater than 6.
>>
>>
>> Rationale: The data ACKs case shouldn't be compromised by the ACKs of 
>> ACKs case.
>>
>> When we were originally only thinking of the data ACKs case (before 
>> we realized this rule might trigger ACKs of ACKs), we recommended 
>> n=2  because it ensures congestion information is kept fresh, and 
>> during runs of congestion it generates more ACKs, which might make it 
>> more likely that at least one ACK survives some types of coalescing 
>> before the ACE field wraps. Also 2 is the default for the equivalent 
>> repetition of DCTCP feedback.
>>
>> Then, we noticed the ACKs of ACKs case, and we wanted to damp any ACK 
>> ping-pong, so we recommended 3. Without really discussing the pros 
>> and cons, we extended 3 to all cases (both data ACKs and ACKs of 
>> ACKs). I suspect the main reason was that it is just simpler to have 
>> a single rule. But these two cases are likely to be controlled from 
>> different parts of the code anyway.
>>
>> I've also realized that, in the data ACKs case, it was wrong to set a 
>> lower bound on 'n', let alone a /mandatory/ lower bound. There might 
>> be scenarios where it makes sense to trigger an ACK every CE mark; we 
>> have no reason to stop implementers doing that if they want. A lower 
>> bound could even be perversely read to mean that you're not allowed 
>> to immediately ACK data until you've received 'n' CE marks.
>>
>> Thoughts? 
>
> [CH] I am looking at this from the QUIC point of view. In general, 
> frequent ACKs provide better control, but there are two points of 
> tension: managing low bandwidth return paths; and, reducing the impact 
> of ACK on performance.
>
> There are a number of scenarios in which the return path used by ACKs 
> has a much lower bandwidth than the data path. If ACKs are two 
> frequent, the return path can become congested. ACKing every two or 3 
> packets is only sustainable if the return path bandwidth is not too 
> small compared to the data path -- 1/10th is generally sustainable, 
> but lower that 1/20th and the ACKs have to be spaced. The alternative 
> to spacing would be increased RTT due to congestion on the return 
> path, and random losses of ACKs. Of course, even with ACK spacing, 
> there are limits to asymmetry: control becomes loose if the transport 
> does not receive multiple ACKs per RTT.
>
> ACK frequency has been found a key factor in the performance of QUIC 
> implementations. For example, implementations may achieve 5Gbps with 
> spaced ACK, but the performance drops to a few 100 Mbps if we insist 
> on an ACK for 2 packets. High performance requires sending packets in 
> batches, using API like UDP GSO, and performance seems best when the 
> frequency of ACKs more or less matches the frequency of batches.

[BB] I think a brief summary of AccECN might be in order here - it seems 
like you're starting from scratch, but we're aware of all the above.

The implementation will have its own ACK ratio - that's out of scope of 
AccECN, except to set this max of 6 CEs, which is to mitigate wrap of 
the 3-bit counter of CE-marks (which in the worst case of 100% marking 
in the data direction could then induce 1 ACK per 6 packets). This 
shouldn't limit forward performance, because it only increases the 
reverse ACK rate if there is heavy congestion in the forward path, when 
the Data Sender should be reducing the forward rate anyway.

AccECN can also use the AccECN TCP Option that provides a much larger 
counter (24-bit), but AccECN has to work reasonably well with just the 
3-bit counter in the main TCP header in case the option can't traverse 
the path. However, the Data Receiver doesn't know whether the option is 
reaching the other end, so it has to assume it might not be.

My original question asks what the recommended value of 'n' should be 
for how much the counter of CE-marked data packets should be allowed to 
increment between ACKs. We don't have to recommend anything, but it 
helps implementers if we do. AccECN is TCP only. So when we originally 
recommended 2, we figured that TCP's default ACK ratio is 2. So, even if 
ECN marking in the forward direction was 100%, a default increment of 2 
CE marked packets ('n') wouldn't cause any more frequent ACKs than the 
default.

Nonetheless, if the implementer has chosen a longer ACK ratio (or 
perhaps allowed the ratio to be controlled by the sender e.g. 
draft-gomez-tcpm-ack-rate-request) they don't have to use the 
recommended n=2; the proposed wording allows a range.

Don't worry, we certainly wouldn't "insist on an ACK for 2 packets".


Also, FYI, AccECN provides feedback about ECN marking on the ACK stream, 
back to the Data Receiver who is generating the ACKs. It then has a hook 
on which it can hang feedback control of TCP's ACK stream without 
waiting for anyone else to deploy anything (it's up to the implementer 
or the IETF to add this if they want). However, the scope of the AccECN 
spec is deliberately wire protocol only, so it doesn't say what to do 
with this feedback.

Cheers


Bob

>
> -- Christian Huitema
>
>

-- 
________________________________________________________________
Bob Briscoe                               http://bobbriscoe.net/