Re: UDP send costs in Linux

Phillip Hallam-Baker <phill@hallambaker.com> Wed, 04 April 2018 16:28 UTC

Return-Path: <hallam@gmail.com>
X-Original-To: quic@ietfa.amsl.com
Delivered-To: quic@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 555AD127978 for <quic@ietfa.amsl.com>; Wed, 4 Apr 2018 09:28:51 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: 0.589
X-Spam-Level:
X-Spam-Status: No, score=0.589 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, FREEMAIL_FORGED_FROMDOMAIN=0.25, FREEMAIL_FROM=0.001, HEADER_FROM_DIFFERENT_DOMAINS=0.249, HTML_MESSAGE=0.001, HTTPS_HTTP_MISMATCH=1.989, RCVD_IN_DNSWL_NONE=-0.0001, SPF_PASS=-0.001] autolearn=no autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id mOXfjqRfoaej for <quic@ietfa.amsl.com>; Wed, 4 Apr 2018 09:28:49 -0700 (PDT)
Received: from mail-ot0-x234.google.com (mail-ot0-x234.google.com [IPv6:2607:f8b0:4003:c0f::234]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id CF025120724 for <quic@ietf.org>; Wed, 4 Apr 2018 09:28:48 -0700 (PDT)
Received: by mail-ot0-x234.google.com with SMTP id m22-v6so23974592otf.10 for <quic@ietf.org>; Wed, 04 Apr 2018 09:28:48 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:sender:in-reply-to:references:from:date:message-id :subject:to:cc; bh=o/UKnXq8LGgjByp3g7WRuBhEDe+BTRj+KwyoaArl/+o=; b=u7iknLhy5RCkW2HQJa30NFclDyMEVwqeXMDN18D+XebBKeDBSI+B0dhiauRO5El5m7 1gLBp+TGd8GtN0KeWNnnRXSpbryuEvHr8/TgTQ8qN0K0T3iognz7XH562OqSmnXxv14D Xf5dcYewixAUK6olfFupXzdo9xU57Bg0CUpCCra481vrgBwlWcPEsQtCi8qA4OmxDvOa I5J6d63JrQz3LOd8SJGleE/3o0veAqkv8OH1WuFJV/t2aEb/XgMraEML15IRURqFoh3z 0dSYuZM3mwVpp8vuyjdGN0rHzZo+vnmcAXgFuiUmkeUMP3y++m/gxJuXcR3aw6zc2gnS vIuQ==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:sender:in-reply-to:references:from :date:message-id:subject:to:cc; bh=o/UKnXq8LGgjByp3g7WRuBhEDe+BTRj+KwyoaArl/+o=; b=nRgpubTIW5Hek/zskzQhpNux65i9FXK9r8OHPieUVaAo57NlP8Co/WtkPz6djoeUrD rOzWmVnYpqb9gXWQrulTTnWzUTO7Qj2qOJaTln68PkdnhPJMoB+8GNA9kDHHfjvMZP4H vQ1tgN/Ym6lXMj9/edcTbgd3qe/Lrk56vg6h7fuRtqSq9mkjOdgDBcYtzOZQsIvrXVAo CO9aSg549fzS7lQcJcZRSSQrjG6/ASZVfGBSWZ49dkbDdVrJeBq/mND6aGnyYoji7sKs ijl4bNUt10nkPj63k1r3Nm1kSgmSUdjreAa1aOq8bfjHzEq4ojiturGRUr+K0/JRmmJj thcA==
X-Gm-Message-State: AElRT7GDchIz2GQAabGPNSsx2pBgkZ5gJJ9OxQdZI2U0IiItbZ6ITbIr Y3AEe6BVc4oLcK2epme0kNR71TrG1CG3YjpRBNE=
X-Google-Smtp-Source: AIpwx49YJTNM+HGE9ttmaCRPaKidPEdT6a7CfVbfcD0jOht5rcA7/hz3nRM1anINdqusYeDfUx53BIr5Z1kbE/66WgY=
X-Received: by 2002:a9d:4a52:: with SMTP id d18-v6mr11962675otj.380.1522859328227; Wed, 04 Apr 2018 09:28:48 -0700 (PDT)
MIME-Version: 1.0
Sender: hallam@gmail.com
Received: by 2002:a9d:233c:0:0:0:0:0 with HTTP; Wed, 4 Apr 2018 09:28:47 -0700 (PDT)
In-Reply-To: <CAN1APde+YNN0QT0=CDN30qYr6PUinv96zjd10jAYU9-onL5Q9Q@mail.gmail.com>
References: <CAKcm_gP4zz1bW5T-_N2Oxy6o5Sw2mEs3DFU9_HrmfkuaJyLz0A@mail.gmail.com> <MWHPR15MB18215781CA00A71CF1AD0137B6A40@MWHPR15MB1821.namprd15.prod.outlook.com> <CAKcm_gO1BdLJOfyQeURWU0jmJo7q9Zft4U9fu1o9py9Bys5NeA@mail.gmail.com> <CAN1APde+YNN0QT0=CDN30qYr6PUinv96zjd10jAYU9-onL5Q9Q@mail.gmail.com>
From: Phillip Hallam-Baker <phill@hallambaker.com>
Date: Wed, 04 Apr 2018 12:28:47 -0400
X-Google-Sender-Auth: A7-t4T3n-A-3qZPwIkhvgklotpo
Message-ID: <CAMm+Lwi7XLWECXKhND7gK2JPZUTySu+ZFVXAChMWTgP2XN87BQ@mail.gmail.com>
Subject: Re: UDP send costs in Linux
To: Mikkel Fahnøe Jørgensen <mikkelfj@gmail.com>
Cc: Ian Swett <ianswett=40google.com@dmarc.ietf.org>, Subodh Iyengar <subodh@fb.com>, IETF QUIC WG <quic@ietf.org>
Content-Type: multipart/alternative; boundary="0000000000004d4ca20569085173"
Archived-At: <https://mailarchive.ietf.org/arch/msg/quic/SfH4vyot4gBCYxJrhiN2HN-uB_k>
X-BeenThere: quic@ietf.org
X-Mailman-Version: 2.1.22
Precedence: list
List-Id: Main mailing list of the IETF QUIC working group <quic.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/quic>, <mailto:quic-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/quic/>
List-Post: <mailto:quic@ietf.org>
List-Help: <mailto:quic-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/quic>, <mailto:quic-request@ietf.org?subject=subscribe>
X-List-Received-Date: Wed, 04 Apr 2018 16:28:51 -0000

I would not worry too much at this point.

The reason we want to be able to work at the application level is backwards
compatibility. It has to be possible to deploy QUIC on any machine even
without OS support or it won't be deployable.

It does not have to be performant on every platform. If people are using
QUIC, whatever needs to be moved into the kernel for performance reasons
will move there.




On Wed, Apr 4, 2018 at 11:32 AM, Mikkel Fahnøe Jørgensen <mikkelfj@gmail.com
> wrote:

> I have no data to add on the Linux UDP stack, but another issue is the
> lack of netmap support in cloud hosting environments.
> I have not yet been working with this, but have looked into the problem
> and asked around.
>
> netmap is default in FreeBSD and optional in Linux. But neither works
> efficiently without a hypervisor patch that is also available for netmap.
> With support for netmap, the user space application can send directly to
> the network adapter with very little overhead. There is also dpdk and some
> other interfaces that might be slightly faster but more vendor specific.
>
> Assuming an application has access to optimized netmap, the only hurdle is
> address lookup, but if the application also manages that, or at least does
> the caching, there shouldn’t be much in the way of OS interference.
>
> Of course, netmap blocks the entire network stack, so no PING or SSH.
> CloudFlare added a netmap patch so only some traffic would be routed fra
> the network interface to netmap, and netmap also supports efficient packet
> forwarding to the OS or other applications.
>
> None of this works well in general, but for a cloud host that can be
> bootet automatically and destroyed rather than serviced, there is some
> opportunity.
>
> but only if cloud service providers starts adding support their supported
> images and hypervisors. Not sure if any are working on this now.
>
>
> Mikkel
>
>
> On 4 April 2018 at 15.11.17, Ian Swett (ianswett=40google.com@dmarc.
> ietf.org) wrote:
>
> I hope some of these patches will be available soon, but I'm not sure if
> soon is a month or 6.
>
> On Tue, Apr 3, 2018 at 10:50 PM Subodh Iyengar <subodh@fb.com> wrote:
>
>> Thanks for sharing this Ian.
>>
>>
>> This definitely matches some of the observations we've seen as well in
>> the UDP write path. Some of the other paths that we saw that added overhead
>> was the route table lookup in linux udp stack. Connected UDP sockets did
>> amortize that.
>>
>>
>> I'm looking forward to a smarter sendmmsg with GSO and zero copy. Is
>> there any indication of the timeline for these patches to make it to linux?
>> Would be happy to try any of these out to help iron out the API.
>>
>>
>> Subodh
>> ------------------------------
>> *From:* QUIC <quic-bounces@ietf.org> on behalf of Ian Swett
>> <ianswett=40google.com@dmarc.ietf..org>
>> *Sent:* Tuesday, April 3, 2018 5:20:08 PM
>> *To:* IETF QUIC WG
>> *Subject:* UDP send costs in Linux
>>
>> One challenge with QUIC at the moment is the increased CPU cost of
>> sending UDP packets vs TCP payloads.  I've seen this across every platform
>> Google has deployed QUIC on, so it's a widespread issue.
>>
>> Here's an excellent presentation on what's causing the increased CPU
>> consumption on Linux from Willem de Bruijn(UDP starts on slide 9).
>> http://vger.kernel.org/netconf2017_files/rx_hardening_and_udp_gso.pdf
>>
>> And while you're thinking of CPU usage, it's worth looking at the
>> presentation on timing wheel based packet pacing(which is minimum release
>> time based) and is ideal for QUIC(and TCP for that matter):
>> https://conferences.sigcomm.org/sigcomm/2017/
>> files/program/ts-9-4-carousel.pdf
>> <https://urldefense.proofpoint.com/v2/url?u=https-3A__conferences.sigcomm.org_sigcomm_2017_files_program_ts-2D9-2D4-2Dcarousel.pdf&d=DwMFaQ&c=5VD0RTtNlTh3ycd41b3MUw&r=h3Ju9EBS7mHtwg-wAyN7fQ&m=tgghgvFkps7jYaFNdNyZBNFf0epVxFZbOGhhybFwPiE&s=87gOGfz3S0lLbw8jy-lz3M9vPGChkmgtiJVzUxbMfvY&e=>
>>
>> -Ian
>>
>