Re: Non-Last Small IPv6 Fragments

Brian E Carpenter <brian.e.carpenter@gmail.com> Wed, 16 January 2019 19:47 UTC

Return-Path: <brian.e.carpenter@gmail.com>
X-Original-To: ipv6@ietfa.amsl.com
Delivered-To: ipv6@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id F37B2131101 for <ipv6@ietfa.amsl.com>; Wed, 16 Jan 2019 11:47:32 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.999
X-Spam-Level:
X-Spam-Status: No, score=-1.999 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, FREEMAIL_FROM=0.001, RCVD_IN_DNSWL_NONE=-0.0001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id dU-wVUCtITsZ for <ipv6@ietfa.amsl.com>; Wed, 16 Jan 2019 11:47:31 -0800 (PST)
Received: from mail-pf1-x436.google.com (mail-pf1-x436.google.com [IPv6:2607:f8b0:4864:20::436]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id E4290131100 for <ipv6@ietf.org>; Wed, 16 Jan 2019 11:47:30 -0800 (PST)
Received: by mail-pf1-x436.google.com with SMTP id b85so3563822pfc.3 for <ipv6@ietf.org>; Wed, 16 Jan 2019 11:47:30 -0800 (PST)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=subject:to:cc:references:from:message-id:date:user-agent :mime-version:in-reply-to:content-language:content-transfer-encoding; bh=ni3qNmtD0RbtQU2ww/+YQUKjb9Z6at0JzRBQJB+IE54=; b=o9j65RAz8IoAHTblxnrAiZuZFzu25/Xqe4FYwrgs0l6BIa1I/6bwW//DC37GiW1eSQ nM38nghQQMgQqVOJCnwyTTO0ksQrQ/rLTzgg5Xxo1DRv1+bXnz3LV3YHSiMVt9nRsZ4n kuiXwPsaNFwMH3WY+9JHj1H3Rqfcd0bsOS4pHYAST4LCkCGhkteg3okIy9iSGcBhRqxK NOsl10xo2huI4J8lpmFdojNqgOopPBukV0/eM8pvUWSIHSgnNorjXq3pVlnDnCG1dz6w v8awZgsm9PZzuvefWuEnunlyuO5KcFAlEBvvVZAC5u74U4iPOGFKyEp1wN4V51W048L6 jaBQ==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:subject:to:cc:references:from:message-id:date :user-agent:mime-version:in-reply-to:content-language :content-transfer-encoding; bh=ni3qNmtD0RbtQU2ww/+YQUKjb9Z6at0JzRBQJB+IE54=; b=L2xOsLn8bXk0AzQUkCLE5yIRnGF1UbOrjWbxH+67xpIU1VKLJHvc1npRH7lgoKDpsM dD7FYv2jo9loX6X+F90BohDSciRjGjUL4Gh2Fdm3mF3lApM8KuF9L1dHEDP13nhKJlq8 BHWSVQ5D14kAY8a/1cMW3N1sTlXO+mDAIwfEvf3yUddlgrBspxC7kZQ6vvcBidHdIK7p 42mjfdtus71R7z9e8htLo5meirK/PrGd/iI/3EtNzM/+NwJ4YQ8FdFiDzwQhHJT8HjAf HOPfSfmexB5KBvJs8hOPr/v5PEJL4/1vyN/65p99FbCY0q7g/8WJT5xC4ZHmaSnjDBxx TFYA==
X-Gm-Message-State: AJcUukdtm4sNnpzuTpUWD7NJMUaAV95sfnrqFK6YReVu/tcIYG4NNF+Y 2TH4bnCEFflp7AsP05IbklA=
X-Google-Smtp-Source: ALg8bN55CGsmIuvktM0QOCOo/NjNJiu5rAhW6/2SU/jcoI+1e7yC0IWXHGRp8oqCMGJcYBJYW0M8+w==
X-Received: by 2002:a65:6392:: with SMTP id h18mr10485131pgv.107.1547668050217; Wed, 16 Jan 2019 11:47:30 -0800 (PST)
Received: from [192.168.178.30] ([118.148.79.176]) by smtp.gmail.com with ESMTPSA id a90sm11225730pfj.109.2019.01.16.11.47.26 (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Wed, 16 Jan 2019 11:47:29 -0800 (PST)
Subject: Re: Non-Last Small IPv6 Fragments
To: Warren Kumari <warren@kumari.net>, Fernando Gont <fgont@si6networks.com>
Cc: Bob Hinden <bob.hinden@gmail.com>, IPv6 List <ipv6@ietf.org>, Fernando Gont <fernando@gont.com.ar>, ek@loon.co
References: <CAOSSMjV0Vazum5OKztWhAhJrjLjXc5w5YGxdzHgbzi7YVSk7rg@mail.gmail.com> <973A1649-55F6-4D97-A97F-CEF555A4D397@employees.org> <CALx6S34YbBe8xBod3VsWVO33TpZcdxh2uV1vaO8Z_NKnVXp66g@mail.gmail.com> <A3C3F9C0-0A07-41AF-9671-B9E486CB8246@employees.org> <AEA47E27-C0CB-4ABE-8ADE-51E9D599EF8F@gmail.com> <6aae7888-46a4-342d-1d76-10f8b50cebc4@gmail.com> <EC9CC5FE-5215-4105-8A34-B3F123D574B9@employees.org> <4c56f504-7cd7-6323-b14a-d34050d13f4e@foobar.org> <9E6D4A6E-8ABA-4BAB-BEC5-969078323C96@employees.org> <CAAedzxpdF+yhBXfnwUcaQb-HkgdaqXRU3L+S7v8sS1F0OkwM9A@mail.gmail.com> <78a8a0e0-8808-364c-41f7-f81f90362432@gont.com.ar> <CAAedzxpjxhP0nOZVU0CTwA1u3fsPFthrJASjDEfnLcRNvr2gBQ@mail.gmail.com> <c9be798e-5a32-7c3e-a948-9ca2fab30411@si6networks.com> <CAHw9_i+M2-420pykp99LcgMNSG=eeDqsZK8+hN20t_uUdANHfA@mail.gmail.com> <d6e52c30-bbd1-1ee7-144c-fa13a9df5f38@gmail.com> <0f4a6c88-1def-6766-235b-1bcd2cc5e33b@si6networks.com> <CAHw9_i+FB-tb8c+G22FCUxNg9BDpMfwqur8gSn5QaXteBcABZA@mail.gmail.com>
From: Brian E Carpenter <brian.e.carpenter@gmail.com>
Message-ID: <3eead7ba-dcb4-ed52-05bb-a41a5602f251@gmail.com>
Date: Thu, 17 Jan 2019 08:47:22 +1300
User-Agent: Mozilla/5.0 (Windows NT 10.0; WOW64; rv:60.0) Gecko/20100101 Thunderbird/60.4.0
MIME-Version: 1.0
In-Reply-To: <CAHw9_i+FB-tb8c+G22FCUxNg9BDpMfwqur8gSn5QaXteBcABZA@mail.gmail.com>
Content-Type: text/plain; charset="utf-8"
Content-Language: en-US
Content-Transfer-Encoding: 7bit
Archived-At: <https://mailarchive.ietf.org/arch/msg/ipv6/sdG3r0u2GjU_ZYaTBVXqj7lZR48>
X-BeenThere: ipv6@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: "IPv6 Maintenance Working Group \(6man\)" <ipv6.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/ipv6>, <mailto:ipv6-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/ipv6/>
List-Post: <mailto:ipv6@ietf.org>
List-Help: <mailto:ipv6-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/ipv6>, <mailto:ipv6-request@ietf.org?subject=subscribe>
X-List-Received-Date: Wed, 16 Jan 2019 19:47:33 -0000

On 2019-01-17 04:11, Warren Kumari wrote:
> On Wed, Jan 16, 2019 at 2:52 AM Fernando Gont <fgont@si6networks.com> wrote:
> 
>> On 15/1/19 22:02, Brian E Carpenter wrote:
>>> For my comment, search for "Aaargh"
>>
>> :-) -- I was catching up with email, and read them in sequence. --
>> thanks for the note, I *expected* it :-)
>>
>>
>>
>>>>     > [Silly Idea #2]
>>>>     > Separate, even sillier idea: everyone converges on using 16 bits
>> of
>>>>     > the flow ID to encode the lowest MTU encountered along the path.
>> Here
>>>>     > again, TCP MSS clamping style behaviour would be applied to this
>>>>
>>>>
>>>> https://media.giphy.com/media/3o6ZthnDqxfAOFIhdC/giphy.gif
>>>> Because there are deployed systems already, I don't think that this can
>> be deployed **and fully relied upon**, but Ido think that it might actually
>> be really useful. If the first 2 bits of the flow label are '11' or '10' or
>> something, then it could mean that the next N (13?) bits of the flow label
>> contain the minimum MTU along the path.
>>>
>>> Aaargh. That's a gross violation of the standard for the flow label.
>> Short form: you can't use the flow label for signaling. Long form: RFC6436,
>> RFC6437, plus RFC6294 for a menagerie of other terrible ways to abuse the
>> flow label.
>>
>> Or, put another way: if we were to use the FL for this, it would screw
>> up the currently-intended use.
>>
> 
> Yup, I had actually read those the above, but it was a while back...
> 
> I'm not really sure it does screw up the intended use -- the FL is 20 bits.
> If we were to do something like Erik had proposed, and we wanted to be able
> to signal up to a 9K MTU, we would need 13bits (LN(9K-1280)/LN(2) = 12.9).
> 20 - 13 gives 7 bits (128) for the hash entropy. "7 bits of entropy, evenly
> distributed, should be enough for anyone", he said, hoping no-one points at
> the obvious correlation to 640K of RAM...

I guess it all depends what you expect from the entropy. 20 bits gives
you a 1-in-a-million chance of a clash. 7 bits gives you a 1-in-128 chance
of a clash. This probably doesn't matter for a simple ECMP or LAG kind
of load sharing, but who's to say it doesn't matter for some more
fancy kind of sharing across a large array of servers?

> If we only wanted to signal full granularity between 1500 and 1280, and
> then require that any MTU between 1500 and 9K falls into one of 36 buckets,
> we could reserve one byte to do so (1500-1280=220. 256-220=36).
> 
> This likely is still a horrible, no good idea, but it does at least make
> the MTU available to things looking at the L3 header only, and doesn't
> require routers to have to dig into extension headers.
> 
> Whatever the case, when we finally design IP-NG-NG, let's set aside 512bits
> (at least) to be "reserved for future abuse" :-P Or, perhaps, every bit
> should be fully cast in stone (with no room for extensions) so that they
> don't become attractive nuisances for ugly hacks...

I hope that by then we will have found a better approach to flexibility
and extensibility that allows for interfering middleboxes from the start.
But that discussion is definitely out of scope for the "IPv6 Maintenance" WG ;-)

    Brian

> 
> :-)
> W
> 
>>
>>
>>
>> --
>> Fernando Gont
>> SI6 Networks
>> e-mail: fgont@si6networks.com
>> PGP Fingerprint: 6666 31C6 D484 63B2 8FB1 E3C4 AE25 0D55 1D4E 7492
>>
>>
>>
>>
>>
>