Re: [EToSat] Interop runner with satellite links

Christian Huitema <huitema@huitema.net> Mon, 28 February 2022 19:55 UTC

Return-Path: <huitema@huitema.net>
X-Original-To: etosat@ietfa.amsl.com
Delivered-To: etosat@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 022A83A1481 for <etosat@ietfa.amsl.com>; Mon, 28 Feb 2022 11:55:18 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.899
X-Spam-Level:
X-Spam-Status: No, score=-1.899 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, NICE_REPLY_A=-0.001, SPF_HELO_NONE=0.001, T_SCC_BODY_TEXT_LINE=-0.01, T_SPF_PERMERROR=0.01, URIBL_BLOCKED=0.001] autolearn=unavailable autolearn_force=no
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id qkGBXEF3pujf for <etosat@ietfa.amsl.com>; Mon, 28 Feb 2022 11:55:15 -0800 (PST)
Received: from mx36-out10.antispamcloud.com (mx36-out10.antispamcloud.com [209.126.121.30]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 4F9243A147B for <etosat@ietf.org>; Mon, 28 Feb 2022 11:55:15 -0800 (PST)
Received: from xse272.mail2web.com ([66.113.197.18] helo=xse.mail2web.com) by mx136.antispamcloud.com with esmtp (Exim 4.92) (envelope-from <huitema@huitema.net>) id 1nOm6v-0006c9-3U for etosat@ietf.org; Mon, 28 Feb 2022 20:55:13 +0100
Received: from xsmtp22.mail2web.com (unknown [10.100.68.61]) by xse.mail2web.com (Postfix) with ESMTPS id 4K6rh83jPxzBw5 for <etosat@ietf.org>; Mon, 28 Feb 2022 11:55:04 -0800 (PST)
Received: from [10.5.2.16] (helo=xmail06.myhosting.com) by xsmtp22.mail2web.com with esmtps (TLS1.0:DHE_RSA_AES_256_CBC_SHA1:256) (Exim 4.92) (envelope-from <huitema@huitema.net>) id 1nOm6q-0008QF-Cd for etosat@ietf.org; Mon, 28 Feb 2022 11:55:04 -0800
Received: (qmail 15765 invoked from network); 28 Feb 2022 19:54:58 -0000
Received: from unknown (HELO [192.168.1.105]) (Authenticated-user:_huitema@huitema.net@[172.58.43.208]) (envelope-sender <huitema@huitema.net>) by xmail06.myhosting.com (qmail-ldap-1.03) with ESMTPA for <Chi-Jiun.Su=40hughes.com@dmarc.ietf.org>; 28 Feb 2022 19:54:55 -0000
Message-ID: <0138234c-a642-662e-afe7-518bbba9afcc@huitema.net>
Date: Mon, 28 Feb 2022 11:54:49 -0800
MIME-Version: 1.0
User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:91.0) Gecko/20100101 Thunderbird/91.6.1
Content-Language: en-US
To: "Su, Chi-Jiun" <Chi-Jiun.Su=40hughes.com@dmarc.ietf.org>, Sebastian Endres <basti.endres@fau.de>, "etosat@ietf.org" <etosat@ietf.org>, "quic@ietf.org" <quic@ietf.org>
Cc: "joerg.deutschmann@fau.de" <joerg.deutschmann@fau.de>
References: <2572762.KRHqeOQrTU@7b74a564-70da-4baa-82f8-b74434554dd0> <2321206.ElGaqSPkdT@7b74a564-70da-4baa-82f8-b74434554dd0> <BL1PR11MB53038951CA6E71DC10A7F6E5CE3A9@BL1PR11MB5303.namprd11.prod.outlook.com> <1f27c94e-e9ca-e93e-f158-b696fc2e82a8@huitema.net> <BL1PR11MB5303217652820041562CF87DCE019@BL1PR11MB5303.namprd11.prod.outlook.com>
From: Christian Huitema <huitema@huitema.net>
In-Reply-To: <BL1PR11MB5303217652820041562CF87DCE019@BL1PR11MB5303.namprd11.prod.outlook.com>
Content-Type: text/plain; charset="UTF-8"; format="flowed"
Content-Transfer-Encoding: 7bit
X-Originating-IP: 66.113.197.18
X-Spampanel-Domain: xsmtpout.mail2web.com
X-Spampanel-Username: 66.113.197.0/24
Authentication-Results: antispamcloud.com; auth=pass smtp.auth=66.113.197.0/24@xsmtpout.mail2web.com
X-Spampanel-Outgoing-Class: unsure
X-Spampanel-Outgoing-Evidence: Combined (0.11)
X-Recommended-Action: accept
X-Filter-ID: Pt3MvcO5N4iKaDQ5O6lkdGlMVN6RH8bjRMzItlySaT8nEgLciUz3Ugqr711xcXEJPUtbdvnXkggZ 3YnVId/Y5jcf0yeVQAvfjHznO7+bT5zGFFoTDPM8FUgarAdxxdv742UuDhyzVYcwl2RB+0Aaejcd Vc6ZFiqf7rUGZSmiztoh55uqY3MhMgFAHq5BxPxPXn36fLqvhISQ5ykyqUZqUd1jhnM/Mbva2XLV /LIEzaL2KoAZhJekBPedneT7f6994ir+F2NBzC6Lcr3eWcqZV4PAgTtUp75uqlx0KezvZHV8InZ+ 7lKRjU3tJ9MJeN7wWQaaSSaRcFTFxaRvADgOuFdAU5fRzM/QzQW9/IoH33AG8ECuCwECazCwODtO F78PiyQEs+dlGXUJLWZ+Gc08Nmllke3azHdKmySKNUVQl4ntlVxnbS8qIO7oudHyb2T1VQ58xe/l rqiRGalI3YPsxOTrFXToVyBmRCgQVX6zVyFUu8qzeMQP6uTHL0d9UjfY+eX5ZvcELCIKs663F/co VFYFvf25LVONYbYifH5OzZDcG6hsRQZiAIgw+z837AqgX7ewI8e1h7RITgN14BHmGVt/ReJ9Mfhz zmbKTH7wI9GEU1utNskUAORCV2WFZX0jUeJfX5HDIsTH89Prkv7CGV7lLXQUcNAszDsnoUOr0BhV 3HJyuM07TiZLiryB9xQrOI+gTB/pfSlbi1HgG7umZ25gpnihbI3Vv1c2tRvdVD2GbN7BITAZon7Z Iz1ONK9yUo4/+EUytKrR9Md9I2Rs15pZq2u3REG1/5U8X7S8nEZPCC/cRgvQKtcrMMueERx3jDXc 5rYnHEzHxxN7GhSZ70qBhMCg+mOWge/UfCuc694LVMcScraAEu0RqZMYbqf5zSotXAmH6EZbH/M6 BW01H36m+UeFXprlCOm3BAEbJtAT1BYHStA0OogdNtRxnRSLF+XCKxIG9XMEgRDdaWpvCv+zESlk TxdSCNcDfRohcehWBb39uS1TjWG2Inx+Ts2QNOYPIz4ynMa7pZQ4hi/HGtuWeHzx9sLaQmDwvYQn 76e9NXttZBkk6PeFqH6So31P
X-Report-Abuse-To: spam@quarantine11.antispamcloud.com
Archived-At: <https://mailarchive.ietf.org/arch/msg/etosat/luOR6oL7ujdcpg7GwazR4FATQdQ>
Subject: Re: [EToSat] Interop runner with satellite links
X-BeenThere: etosat@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: "The EToSat list is a non-WG mailing list used to discuss performance implications of running encrypted transports such as QUIC over satellite." <etosat.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/etosat>, <mailto:etosat-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/etosat/>
List-Post: <mailto:etosat@ietf.org>
List-Help: <mailto:etosat-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/etosat>, <mailto:etosat-request@ietf.org?subject=subscribe>
X-List-Received-Date: Mon, 28 Feb 2022 19:55:18 -0000

Thanks for the kind words, CJ.

A lot of the satellite innovation implemented in Picoquic results from 
the efforts of Nicolas Kuhn, Gorry Fairhurst, John Border, Stephan Emile 
and others in the ETOSAT community. In my mind, the most interesting 
contribution of that community was to define series of scenarios and 
expected results 
(https://datatracker.ietf.org/doc/draft-kuhn-quic-4-sat/). These 
scenarios act as a kind of "benchmark" for implementers. If implementers 
care about the scenarios, they will program the corresponding test cases 
in their test suites, and if needed ship the corresponding fixes. In 
order to work in these scenarios, Picoquic did the following:

1) Allow flow control windows to grow to values larger than the BDP. I 
believe that many bad results seen in Sebastian's tests are due to 
implementations capping the maximum flow control window to a low value.

2) Use Cubic or BBR rather than Reno. Both Cubic and BBR would give OK 
results in the absence of losses. BBR works well by default. Cubic works 
OK if using a filter to ignore isolated losses.

3) Perform ACK coalescing and avoid congesting a lower capacity return 
link. Picoquic implements the ACK frequency draft 
(https://datatracker.ietf.org/doc/draft-ietf-quic-ack-frequency/) and 
tries to not send too many acks.

4) Remember the bandwidth and delay observed in previous connections. 
Sebastian's benchmark did not test for that, but local tests show that 
implementing the 0RTT BDP draft 
(https://datatracker.ietf.org/doc/draft-kuhn-quic-0rtt-bdp/) results in 
much better performance when resuming sessions over GEO satellite links.

These four points are all pretty much obvious, and I would expect to see 
them over time in most implementations. The only controversial one is 
the "flow control" issue. Some implementers may feel that opening flow 
control too much carries risks of memory exhaustion. On the other hand, 
some implementations are doing adaptive flow control already, i.e., 
"opening flow control enough but not too much". So there is hope.

On top of the four fairly standard points above, Picoquic implements two 
simple features:

5) Fast bandwidth estimate during slow start, in order to accelerate the 
slow start phase on high BDP paths 
(https://huitema.wordpress.com/2020/04/21/faster-slow-start-for-satellite-links/). 
Classic slow start needs log2(BDP/IW) RTTs. Fast estimate allows initial 
CWIN estimation in 4 or 5 RTT, instead of 10 RTT for a 250 Mbps GEO 
path, or 8 for a 20 Mbps GEO path.

6) Preemptive repeat of unacknowledged data at the end of a file 
transfer, in order to avoid waiting 1 or 2 extra RTT for complete 
reception of a file in presence of packet losses.

I don't know if there is any interest in standardizing these two.

-- Christian Huitema


On 2/28/2022 10:43 AM, Su, Chi-Jiun wrote:
> Hi Christian,
>
> Thank you very much for coming up with various innovative approaches to improve QUIC over satellite links.
> As the result shows, picoquic seems to perform very well over the GEO satellites.
> We should continue this effort.
>
>    *   It is a great challenge to improve user's application performance due to lack of corporation between end points and network
>       *   picoquic does preemptive retransmission to speed up the data transfer
>       *   Link layer may employ ARQ or packet level FEC in local links
>       *   Neither end points nor local link is aware of optimization done by the other party.
>       *   As a result, the performance may end up worse than without the optimization.
>       *   Especially for upload cases, where return link in wireless/sat is more limited and resource allocation is more involved.
>    *   Repetition code is simplest error correcting code with least complexity.
>       *   Other FEC will offer better bandwidth efficiency.
>       *   Your idea of using some form of FEC will be useful.
>
> thanks.
> cj
>
>    *
>       *
>
> ________________________________
> From: EToSat <etosat-bounces@ietf.org> on behalf of Christian Huitema <huitema@huitema.net>
> Sent: Friday, February 25, 2022 9:39 PM
> To: Su, Chi-Jiun <Chi-Jiun.Su=40hughes.com@dmarc.ietf.org>; Sebastian Endres <basti.endres@fau.de>; etosat@ietf.org <etosat@ietf.org>; quic@ietf.org <quic@ietf.org>
> Cc: joerg.deutschmann@fau.de <joerg.deutschmann@fau.de>
> Subject: Re: [EToSat] Interop runner with satellite links
>
> **EXTERNAL EMAIL**
>
>
> On 2/21/2022 10:16 AM, Su, Chi-Jiun wrote:
>> Hi Sebastian,
>>
>> Thanks for the great work.
>> Some comments/questions.
>>
>>     *   Sec. IV,D, 5: Interesting to know "picoquic" does speculative retransmission. As you argue, this may not always help. Did you confirm with the author?
> This is indeed supported in picoquic. There are APIs that allows the
> implementation to turn the feature on or off:
> `picoquic_set_preemptive_repeat_policy(quic_context, do_repeat)` is used
> to set the policy per Quic context, i.e., for all new connections;
> `picoquic_set_preemptive_repeat_per_cnx(connection_context, do_repeat)`
> is used to set the policy for a specific connection.
>
> The speculative retransmission happens at the lowest priority, i.e.,
> only happens if there is nothing else to send. It is subject to
> congestion control and rate limiting. The "nothing to send" rule means
> that it will only kick after all data of a stream and the FIN mark have
> been sent. The selection of data for speculative retransmission is based
> on stream level acknowledgements. The code deduces the acknowledged
> parts of a data stream from the packet acknowledgements. It will look at
> data that has been sent, but not yet acknowledged.
>
> As you say, this does not always help. If the packet loss rate is low,
> most of the preemptively repeated packets will be useless. On the other
> hand, when sending a file over a lossy link, the application may wait a
> long time for retransmission of packets lost in the last RTT. If loss
> rate and data rates are high enough, some of these packets will have to
> be repeated twice, or maybe three times. So we have a tradeoff: waste
> bandwidth, or waste time. The API allows the application to consider
> that tradeoff and decide whether it is useful or not.
>
> I would very much like to replace the current implementation of
> preemptive repeat by some version of FEC. FEC in general is a poor fit
> for transmission of big files, because it is always less bandwidth
> efficient than just repeating the packets that are lost. But if the
> application knows that the file transmission is almost complete, because
> there is just one CWIN worth of data left, it could turn own FEC for the
> the duration of the last RTT. It will "waste" a modest number of FEC
> packets, while drastically reducing the impact of packet losses on the
> duration of the file transfer. We could even imagine only sending the
> FEC packets after the FIN mark has been sent, making sure that FEC does
> not increase the duration of transfer if there were no errors.
>
> -- Christian Huitema
>
>
>>     *    The results show loss-based CC does not perform well compared to BBR
>>     *   Production software performs well more like picoquic or not ?
>>        *   how big difference is between production vs these implementations in the test?
>>     *   Any Time-Offset graphs for EUTELSAT case ?
>>     *   Research overview page: additional column on emulated or real Sat link will be helpful
>>
>> Good useful work.
>> thanks.
>> cj
>> ________________________________
>> From: QUIC <quic-bounces@ietf.org> on behalf of Sebastian Endres <basti.endres@fau.de>
>> Sent: Friday, February 18, 2022 7:02 AM
>> To: etosat@ietf.org <etosat@ietf.org>; quic@ietf.org <quic@ietf.org>
>> Cc: joerg.deutschmann@fau.de <joerg.deutschmann@fau.de>
>> Subject: Re: [EToSat] Interop runner with satellite links
>>
>> **EXTERNAL EMAIL**
>>
>> Dear all,
>>
>> we've published a pre-print of our paper in which we present the QUIC-Interop-runner extended to include satellite scenarios and our measurement results using numerous publicly available QUIC implementations:
>>
>> https://urldefense.com/v3/__https://arxiv.org/abs/2202.08228__;!!Emaut56SYw!mFvp0jheUG95wMrw7L6ZHr3AUjZZgp63MEXhAZnEYvwEkQITw9uIFauoKkW-JRj-Kg$
>>
>> Best regards,
>>
>> Sebastian
>>
>> On Mittwoch, 29. September 2021 21:38:05 CET Sebastian Endres wrote:
>>> Dear all,
>>>
>>> for my master's thesis we ran measurements of all publicly available QUIC implementations over an emulated satellite link. The results are available online: https://urldefense.com/v3/__https://interop.sedrubal.de/__;!!Emaut56SYw!mFvp0jheUG95wMrw7L6ZHr3AUjZZgp63MEXhAZnEYvwEkQITw9uIFauoKkUlf0EgaQ$
>>>
>>> A click on the results also shows time-offset plots, but are not available for every combination.
>>>
>>> In general, the performance of QUIC over high latency (e.g., geostationary satellites) is rather poor, especially if there is packet loss.
>>>
>>> Would it make sense to add such tests with challenging link characteristics to the official QUIC interop runner?
>>>
>>> Best regards,
>>>
>>> Sebastian
>>>
>>>
>>> _______________________________________________
>>> EToSat mailing list
>>> EToSat@ietf.org
>>> https://urldefense.com/v3/__https://www.ietf.org/mailman/listinfo/etosat__;!!Emaut56SYw!mFvp0jheUG95wMrw7L6ZHr3AUjZZgp63MEXhAZnEYvwEkQITw9uIFauoKkXTZO8rhQ$
>>>
>>
>>
>> _______________________________________________
>> EToSat mailing list
>> EToSat@ietf.org
>> https://urldefense.com/v3/__https://www.ietf.org/mailman/listinfo/etosat__;!!Emaut56SYw!nXc1E9u7JuP0YnV6VSYRN1raD2YV53cI9SG-w15XNrNM6F4tNALqWp9LY3Xoq4YQvw$
> _______________________________________________
> EToSat mailing list
> EToSat@ietf.org
> https://urldefense.com/v3/__https://www.ietf.org/mailman/listinfo/etosat__;!!Emaut56SYw!nXc1E9u7JuP0YnV6VSYRN1raD2YV53cI9SG-w15XNrNM6F4tNALqWp9LY3Xoq4YQvw$
>
>
> _______________________________________________
> EToSat mailing list
> EToSat@ietf.org
> https://www.ietf.org/mailman/listinfo/etosat