Re: Never fragment: getting PMTU info transmitted reliably

Mark Smith <markzzzsmith@gmail.com> Thu, 17 January 2019 03:11 UTC

Return-Path: <markzzzsmith@gmail.com>
X-Original-To: ipv6@ietfa.amsl.com
Delivered-To: ipv6@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 7B9D3130DD6 for <ipv6@ietfa.amsl.com>; Wed, 16 Jan 2019 19:11:08 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -0.5
X-Spam-Level:
X-Spam-Status: No, score=-0.5 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, FREEMAIL_FROM=0.001, FROM_LOCAL_NOVOWEL=0.5, HK_RANDOM_ENVFROM=0.001, HK_RANDOM_FROM=0.999, RCVD_IN_DNSWL_NONE=-0.0001, SPF_PASS=-0.001] autolearn=no autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id Rtv_Oa5ALfQD for <ipv6@ietfa.amsl.com>; Wed, 16 Jan 2019 19:11:07 -0800 (PST)
Received: from mail-oi1-x243.google.com (mail-oi1-x243.google.com [IPv6:2607:f8b0:4864:20::243]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 2928A12785F for <ipv6@ietf.org>; Wed, 16 Jan 2019 19:11:07 -0800 (PST)
Received: by mail-oi1-x243.google.com with SMTP id w13so5012952oiw.9 for <ipv6@ietf.org>; Wed, 16 Jan 2019 19:11:07 -0800 (PST)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=CfKulwk9vOgDm0ZUS/ZcbE15PVtn6iCpx4rhh+czWBo=; b=hgCOhAVpdnKa8RC3n7kC+GGHfEiPGHQxXk0YggkVCQcgrH4lkZBoGuqihhDh8/vGoI l0GvDnu9CqEbRexslxJSp7+qPz6qVXP2ChoA6fUGxOyGKT8xTOli5eqWzCfpox7opxmA fNUj4rb+M+sMm3/5fCyHxV/RufPoGMR6iJlQWeOEKV1YjQdup5qWy/inBHXrCPjtUY1c UTioJ5xlDsJCISNGYSQ9pV6BN8vJ1ykw+DuuSDN4aDU9RJ7I54e3+F4JU+cBTkVkNWAM q5Nd8cotryvUrEeWoPWpObF1GcfXxA1Uxj/rqgz09XwqdETS4t5JPZ+uN/agerKvKqwp cg8g==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=CfKulwk9vOgDm0ZUS/ZcbE15PVtn6iCpx4rhh+czWBo=; b=PZp4XC9TF2iPrAjKfv+mkRmh9WitpvjgnENlECKAzPJfDHU4p/FGVD0DQKwZPIHcZI SebeMXSaHiB9RpG2+ygRCWqhNhfwGRmxuk6KvkcER1wxbNMjssgPK7fn3wFZvAZmSKpN wXdYM5Q1FlMzrGrVOvjM1+1MU2G0fJGxBktlsZuwwaJooWylVOQLGFpc3iqadT0nH84+ jpTl3W9988jC1QJkdktAGqIEjqOTgBmdjoZi1BduW4KfZ8FgDf4MqkqmUXtLXVlNmVlW DuiionLHyYIoeNW5zz3YFXWEx/3BUhrRYaYUIYp2ts8SO6spGlIEsg69QOzRy9YsE+Y/ Dq9A==
X-Gm-Message-State: AJcUukdSq6feBdRL2ypnMIMpE2yJjfv06E/6QyvY0fxybuNgh0tTamOT SvMyIkyJCA3sfM/80Z1Z2x7vGn2j4LHzw45F9is=
X-Google-Smtp-Source: ALg8bN52GtwoETCUwjO+MhqIpKHHFcF1g0x+AUWl+lDW3CvP+iHnTLq/YsAu4DLUn3wOFV/HleGdzGIauubpYlFwLH4=
X-Received: by 2002:aca:cd48:: with SMTP id d69mr1749145oig.326.1547694666343; Wed, 16 Jan 2019 19:11:06 -0800 (PST)
MIME-Version: 1.0
References: <CAOSSMjV0Vazum5OKztWhAhJrjLjXc5w5YGxdzHgbzi7YVSk7rg@mail.gmail.com> <AEA47E27-C0CB-4ABE-8ADE-51E9D599EF8F@gmail.com> <6aae7888-46a4-342d-1d76-10f8b50cebc4@gmail.com> <EC9CC5FE-5215-4105-8A34-B3F123D574B9@employees.org> <4c56f504-7cd7-6323-b14a-d34050d13f4e@foobar.org> <9E6D4A6E-8ABA-4BAB-BEC5-969078323C96@employees.org> <CAAedzxpdF+yhBXfnwUcaQb-HkgdaqXRU3L+S7v8sS1F0OkwM9A@mail.gmail.com> <78a8a0e0-8808-364c-41f7-f81f90362432@gont.com.ar> <CAAedzxpjxhP0nOZVU0CTwA1u3fsPFthrJASjDEfnLcRNvr2gBQ@mail.gmail.com> <c9be798e-5a32-7c3e-a948-9ca2fab30411@si6networks.com> <CAHw9_i+M2-420pykp99LcgMNSG=eeDqsZK8+hN20t_uUdANHfA@mail.gmail.com> <d6e52c30-bbd1-1ee7-144c-fa13a9df5f38@gmail.com> <0f4a6c88-1def-6766-235b-1bcd2cc5e33b@si6networks.com> <CAHw9_i+FB-tb8c+G22FCUxNg9BDpMfwqur8gSn5QaXteBcABZA@mail.gmail.com> <14135.1547681760@localhost> <a044c327-d9ce-573e-a158-6c4b157f2d6c@joelhalpern.com> <d3ee03ad-bd24-f353-ddc9-c3cf8a4eb89b@gmail.com>
In-Reply-To: <d3ee03ad-bd24-f353-ddc9-c3cf8a4eb89b@gmail.com>
From: Mark Smith <markzzzsmith@gmail.com>
Date: Thu, 17 Jan 2019 14:10:39 +1100
Message-ID: <CAO42Z2z3AxkMH3KNakZeDSr7sSDRY7-mT23stRSY2MatS_W5Eg@mail.gmail.com>
Subject: Re: Never fragment: getting PMTU info transmitted reliably
To: Brian E Carpenter <brian.e.carpenter@gmail.com>
Cc: "Joel M. Halpern" <jmh@joelhalpern.com>, Michael Richardson <mcr+ietf@sandelman.ca>, IPv6 List <ipv6@ietf.org>
Content-Type: text/plain; charset="UTF-8"
Archived-At: <https://mailarchive.ietf.org/arch/msg/ipv6/Ji_rpOCp0twZZWPm33_16Yy8AvE>
X-BeenThere: ipv6@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: "IPv6 Maintenance Working Group \(6man\)" <ipv6.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/ipv6>, <mailto:ipv6-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/ipv6/>
List-Post: <mailto:ipv6@ietf.org>
List-Help: <mailto:ipv6-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/ipv6>, <mailto:ipv6-request@ietf.org?subject=subscribe>
X-List-Received-Date: Thu, 17 Jan 2019 03:11:08 -0000

On Thu, 17 Jan 2019 at 11:30, Brian E Carpenter
<brian.e.carpenter@gmail.com> wrote:
>
> On 2019-01-17 13:12, Joel M. Halpern wrote:
> > Just to clarify one aspect of the way entropy in path selection, I want
> > to point out a complication.
> >

<snip>

> I am strongly against breaking the flow label just at the time when
> the major o/s are starting to set it correctly.
>

It would be being set quite widely now.

OS X and Linux (courtesy of Tom Herbert) have set it by default for a
number of years.

The Windows 10 Creator Update released in July of 2017 started setting it

https://blogs.technet.microsoft.com/networking/2017/07/13/core-network-stack-features-in-the-creators-update-for-windows-10/

And Microsoft have said that as of June last year Windows 10 was on
700 million PCs.

https://windowsreport.com/windows-10-market-share-2018/

So there would have to be be 100s of millions of devices now setting
the flow label per RFC 6437.

> I'm all for fixing the fragmentation problem; draft-ietf-intarea-frag-fragile
> exists for a reason. But not by breaking something else.
>
>    Brian
>
> >
> > Yours,
> > Joel
> >
> > On 1/16/19 6:36 PM, Michael Richardson wrote:
> >>
> >> Warren writes about putting MTU info into flow Label:
> >>      >> to signal up to a 9K MTU, we would need 13bits (LN(9K-1280)/LN(2) =
> >>      >> 12.9).
> >>      >> 20 - 13 gives 7 bits (128) for the hash entropy. "7 bits of entropy,
> >>      >> evenly distributed, should be enough for anyone", he said, hoping
> >>      >> no-one points at
> >>      >> the obvious correlation to 640K of RAM...
> >>
> >> Brian E Carpenter <brian.e.carpenter@gmail.com> wrote:
> >>      > I guess it all depends what you expect from the entropy. 20 bits gives
> >>      > you a 1-in-a-million chance of a clash. 7 bits gives you a 1-in-128 chance
> >>      > of a clash. This probably doesn't matter for a simple ECMP or LAG kind
> >>      > of load sharing, but who's to say it doesn't matter for some more
> >>      > fancy kind of sharing across a large array of servers?
> >>
> >> You'd have to have more than 128 servers and/or paths.
> >> Maybe one of our SPRING people in this WG can tell us if that's a real
> >> problem today.  Obviously, we can't guess if it will be bad in the future,
> >> but if it's a problem today, then we can know that immediately.
> >>
> >> Now when I pushed for draft-ietf-6man-rfc6434-bis-09 and RFC8200 to say
> >> that PLMTUD to be made MUST, I got various push backs that amounted to:
> >>    1) we don't have enough evidence yet.
> >>    2) it doesn't work for UDP and other traffic.
> >>
> >> (1) turned out to be a real issue. I thought that some of the big players
> >>      could easily get, or already would have, that kind of evidence.
> >>      (Linux does not ship with PLPMTUD on by default.  If you want it, btw,
> >>      sysctl -w net.ipv4.tcp_mtu_probing=2.  Yes. ipv4. It affects both)
> >>      Turns out I was told that they always set their TCP segment size such that
> >>      they likely will never fragment for v4 or v6, because due to hardware
> >>      Transit Offload, the cost of missing a tx-op exceeds the benefit of
> >>      making the packet slightly bigger.  I suspect that this is true in
> >>      general to UDP and QUIC traffic too.
> >>      I imagine a next generation 10G NICs might offer QUIC offload, including
> >>      doing the crypto.  That's what I'd be coding if I worked in that space.
> >>
> >> 2) I will admit that I personally don't care that much about UDP traffic,
> >>     except in that it lets me run IPsec through NAT44s.   I know that I use
> >>     QUIC and WebRTC regularly, and that corporate enterprise users gets
> >>     screwed by lack of UDP regularly.
> >>
> >> So I think question is: is there really a problem that needs to be solved?
> >> Maybe I will have to back and read the beginning of this thread again to
> >> recall what the issue was.   Does this belong in SPUD?
> >>
> >> I wish that SCTP had flown higher...
> >>
> >> --
> >> Michael Richardson <mcr+IETF@sandelman.ca>, Sandelman Software Works
> >>   -= IPv6 IoT consulting =-
> >>
> >>
> >> --------------------------------------------------------------------
> >> IETF IPv6 working group mailing list
> >> ipv6@ietf.org
> >> Administrative Requests: https://www.ietf.org/mailman/listinfo/ipv6
> >> --------------------------------------------------------------------
> >>
> >
> > --------------------------------------------------------------------
> > IETF IPv6 working group mailing list
> > ipv6@ietf.org
> > Administrative Requests: https://www.ietf.org/mailman/listinfo/ipv6
> > --------------------------------------------------------------------
> >
>
> --------------------------------------------------------------------
> IETF IPv6 working group mailing list
> ipv6@ietf.org
> Administrative Requests: https://www.ietf.org/mailman/listinfo/ipv6
> --------------------------------------------------------------------