Re: Never fragment: getting PMTU info transmitted reliably

Michael Richardson <mcr+ietf@sandelman.ca> Thu, 17 January 2019 02:39 UTC

Return-Path: <mcr+ietf@sandelman.ca>
X-Original-To: ipv6@ietfa.amsl.com
Delivered-To: ipv6@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id A1777130E9C for <ipv6@ietfa.amsl.com>; Wed, 16 Jan 2019 18:39:44 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -4.201
X-Spam-Level:
X-Spam-Status: No, score=-4.201 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, RCVD_IN_DNSWL_MED=-2.3, SPF_PASS=-0.001] autolearn=ham autolearn_force=no
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id twC8tBxWquka for <ipv6@ietfa.amsl.com>; Wed, 16 Jan 2019 18:39:42 -0800 (PST)
Received: from tuna.sandelman.ca (tuna.sandelman.ca [IPv6:2607:f0b0:f:3:216:3eff:fe7c:d1f3]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 9A8E7130EE0 for <ipv6@ietf.org>; Wed, 16 Jan 2019 18:39:42 -0800 (PST)
Received: from sandelman.ca (obiwan.sandelman.ca [209.87.249.21]) by tuna.sandelman.ca (Postfix) with ESMTP id 51C613808A; Wed, 16 Jan 2019 21:39:04 -0500 (EST)
Received: by sandelman.ca (Postfix, from userid 179) id 6B4351CD5; Wed, 16 Jan 2019 21:39:41 -0500 (EST)
Received: from sandelman.ca (localhost [127.0.0.1]) by sandelman.ca (Postfix) with ESMTP id 69DE5BF6; Wed, 16 Jan 2019 21:39:41 -0500 (EST)
From: Michael Richardson <mcr+ietf@sandelman.ca>
To: Brian E Carpenter <brian.e.carpenter@gmail.com>
cc: "Joel M. Halpern" <jmh@joelhalpern.com>, IPv6 List <ipv6@ietf.org>
Subject: Re: Never fragment: getting PMTU info transmitted reliably
In-Reply-To: <d3ee03ad-bd24-f353-ddc9-c3cf8a4eb89b@gmail.com>
References: <CAOSSMjV0Vazum5OKztWhAhJrjLjXc5w5YGxdzHgbzi7YVSk7rg@mail.gmail.com> <AEA47E27-C0CB-4ABE-8ADE-51E9D599EF8F@gmail.com> <6aae7888-46a4-342d-1d76-10f8b50cebc4@gmail.com> <EC9CC5FE-5215-4105-8A34-B3F123D574B9@employees.org> <4c56f504-7cd7-6323-b14a-d34050d13f4e@foobar.org> <9E6D4A6E-8ABA-4BAB-BEC5-969078323C96@employees.org> <CAAedzxpdF+yhBXfnwUcaQb-HkgdaqXRU3L+S7v8sS1F0OkwM9A@mail.gmail.com> <78a8a0e0-8808-364c-41f7-f81f90362432@gont.com.ar> <CAAedzxpjxhP0nOZVU0CTwA1u3fsPFthrJASjDEfnLcRNvr2gBQ@mail.gmail.com> <c9be798e-5a32-7c3e-a948-9ca2fab30411@si6networks.com> <CAHw9_i+M2-420pykp99LcgMNSG=eeDqsZK8+hN20t_uUdANHfA@mail.gmail.com> <d6e52c30-bbd1-1ee7-144c-fa13a9df5f38@gmail.com> <0f4a6c88-1def-6766-235b-1bcd2cc5e33b@si6networks.com> <CAHw9_i+FB-tb8c+G22FCUxNg9BDpMfwqur8gSn5QaXteBcABZA@mail.gmail.com> <3 eead7ba-dcb4-ed52-05bb-a41a5602f251@gmail.com> <14135.1547681760@localhost> <a044c327-d9ce-573e-a158-6c4b157f2d6c@joelhalpern.com> <d3ee03ad-bd24-f353-ddc9-c3cf8a4eb89b@gmail .com>
X-Mailer: MH-E 8.6; nmh 1.7+dev; GNU Emacs 24.5.1
X-Face: $\n1pF)h^`}$H>Hk{L"x@)JS7<%Az}5RyS@k9X%29-lHB$Ti.V>2bi.~ehC0; <'$9xN5Ub# z!G,p`nR&p7Fz@^UXIn156S8.~^@MJ*mMsD7=QFeq%AL4m<nPbLgmtKK-5dC@#:k
MIME-Version: 1.0
Content-Type: multipart/signed; boundary="=-=-="; micalg="pgp-sha256"; protocol="application/pgp-signature"
Date: Wed, 16 Jan 2019 21:39:41 -0500
Message-ID: <24583.1547692781@localhost>
Archived-At: <https://mailarchive.ietf.org/arch/msg/ipv6/1wUKKZEycejRC46a5P0HBU0fWB8>
X-BeenThere: ipv6@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: "IPv6 Maintenance Working Group \(6man\)" <ipv6.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/ipv6>, <mailto:ipv6-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/ipv6/>
List-Post: <mailto:ipv6@ietf.org>
List-Help: <mailto:ipv6-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/ipv6>, <mailto:ipv6-request@ietf.org?subject=subscribe>
X-List-Received-Date: Thu, 17 Jan 2019 02:39:45 -0000

Brian E Carpenter <brian.e.carpenter@gmail.com> wrote:
    > On 2019-01-17 13:12, Joel M. Halpern wrote:
    >> Just to clarify one aspect of the way entropy in path selection, I want
    >> to point out a complication.
    >>
    >> It is not anywhere near enough to have as much entropy data as the
    >> number of choices.  The problem is that you need enough randomness so
    >> that you can expect a good distribution of flows.  And that even the
    >> smaller number of larger flows will likely get distributed across the
    >> choices.    Reducing the amount of available entropy can be quite
    >> problematic.

    > Right. And for the server farm case, I don't think it's science fiction
    > these days to think about hundreds or thousands of servers. Also, if the
    > load sharing algorithm attempts to ensure that a given server has only
    > one big job at a time, then a high collision rate in the hash can
    > defeat it. A form of the birthday paradox applies: not "what is the
    > chance of a clash per flow" but "what is the chance that out of a
    > thousand servers, one of them gets two big jobs at the same time"?

Based upon my reading of the netflix blogs, they have experiemented
extensively with the load sharing, and they really don't care about
flow-labels in their decision process. (Of course, because IPv4 has
no such things)

It's about how fresh the (disk read) caches on the servers are, what content
is being streamed, and other things that have nothing to do with the
packets themselves.

Architecturally with IPv6, if you have an entire /64 (or more) to play with
and you can statelessly forward packets at wire speed,  then there are
other interesting off-path choices one can do.  (For instance, assign
new server/128 for each client connection, and then when the connection
arrives, dynamically map it to a particular server.  This pushes the state
storage from layer-4 to the neighbour cache, which might not be a win)

So I seriously question whether any of this matters to server farms.

    > I am strongly against breaking the flow label just at the time when
    > the major o/s are starting to set it correctly.

:-)

    > I'm all for fixing the fragmentation problem;
    > draft-ietf-intarea-frag-fragile
    > exists for a reason. But not by breaking something else.

My quick read says that it looks great to me.

Again, I don't really think that using the flow label to seed PLPMTUD
is much of a win, but if it did provide something useful, I think it could be
done without too much harm.

To reiterate: I don't think the benefit is high enough to warrant the
   risk, despite the fact that I don't think the risk is as high as you
   are suggesting.

--
]               Never tell me the odds!                 | ipv6 mesh networks [
]   Michael Richardson, Sandelman Software Works        |    IoT architect   [
]     mcr@sandelman.ca  http://www.sandelman.ca/        |   ruby on rails    [


--
Michael Richardson <mcr+IETF@sandelman.ca>, Sandelman Software Works
 -= IPv6 IoT consulting =-