RE: draft-bonica-6man-frag-deprecate

"Templin, Fred L" <Fred.L.Templin@boeing.com> Mon, 01 July 2013 16:00 UTC

Return-Path: <Fred.L.Templin@boeing.com>
X-Original-To: ipv6@ietfa.amsl.com
Delivered-To: ipv6@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 6EF7911E8215 for <ipv6@ietfa.amsl.com>; Mon, 1 Jul 2013 09:00:03 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -6.484
X-Spam-Level:
X-Spam-Status: No, score=-6.484 tagged_above=-999 required=5 tests=[AWL=0.115, BAYES_00=-2.599, RCVD_IN_DNSWL_MED=-4]
Received: from mail.ietf.org ([12.22.58.30]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id FhUs01FodXnT for <ipv6@ietfa.amsl.com>; Mon, 1 Jul 2013 08:59:56 -0700 (PDT)
Received: from blv-mbsout-01.boeing.com (blv-mbsout-01.boeing.com [130.76.32.231]) by ietfa.amsl.com (Postfix) with ESMTP id 338D111E814F for <ipv6@ietf.org>; Mon, 1 Jul 2013 08:59:48 -0700 (PDT)
Received: from blv-mbsout-01.boeing.com (localhost.localdomain [127.0.0.1]) by blv-mbsout-01.boeing.com (8.14.4/8.14.4/DOWNSTREAM_MBSOUT) with ESMTP id r61G0SQR026640 for <ipv6@ietf.org>; Mon, 1 Jul 2013 09:00:28 -0700
Received: from XCH-NWHT-06.nw.nos.boeing.com (xch-nwht-06.nw.nos.boeing.com [130.247.25.110]) by blv-mbsout-01.boeing.com (8.14.4/8.14.4/UPSTREAM_MBSOUT) with ESMTP id r61G0RpT026632 (version=TLSv1/SSLv3 cipher=RC4-MD5 bits=128 verify=OK); Mon, 1 Jul 2013 09:00:27 -0700
Received: from XCH-BLV-103.nw.nos.boeing.com (130.247.25.118) by XCH-NWHT-06.nw.nos.boeing.com (130.247.25.110) with Microsoft SMTP Server (TLS) id 8.3.297.1; Mon, 1 Jul 2013 08:59:46 -0700
Received: from XCH-BLV-504.nw.nos.boeing.com ([169.254.4.48]) by XCH-BLV-103.nw.nos.boeing.com ([169.254.3.252]) with mapi id 14.02.0328.011; Mon, 1 Jul 2013 08:59:46 -0700
From: "Templin, Fred L" <Fred.L.Templin@boeing.com>
To: Ole Troan <otroan@employees.org>
Subject: RE: draft-bonica-6man-frag-deprecate
Thread-Topic: draft-bonica-6man-frag-deprecate
Thread-Index: AQHOdJWoUZaGJnzSMU6ZOUB9nA7EbJlP+NBA
Date: Mon, 01 Jul 2013 15:59:46 +0000
Message-ID: <2134F8430051B64F815C691A62D983180B19B9@XCH-BLV-504.nw.nos.boeing.com>
References: <2CF4CB03E2AA464BA0982EC92A02CE2509F85151@BY2PRD0512MB653.namprd05.prod.outlook.com> <51C32FA9.1090207@gmail.com> <2CF4CB03E2AA464BA0982EC92A02CE2509F85F38@BY2PRD0512MB653.namprd05.prod.outlook.com> <20130624204008.GB3647@virgo.local> <20130624205226.GC3647@virgo.local> <2CF4CB03E2AA464BA0982EC92A02CE2509F8761C@BY2PRD0512MB653.namprd05.prod.outlook.com> <51C902DC.9000408@gmail.com> <m24ncmaozs.wl%randy@psg.com> <2EA20F89-02F5-4D06-90EE-A7D2974045A3@employees.org> <m2li5yj7u3.wl%randy@psg.com> <8C48B86A895913448548E6D15DA7553B9268E3@xmb-rcd-x09.cisco.com> <m2ehbpij86.wl%randy@psg.com> <51CB91E4.5090603@gmail.com> <2134F8430051B64F815C691A62D983180AEA5F@XCH-BLV-504.nw.nos.boeing.com> <B7796834-C80E-498D-9F9C-02F74A9E53C9@employees.org> <2134F8430051B64F815C691A62D983180B0221@XCH-BLV-504.nw.nos.boeing.com> <520FB317-19E8-4DCA-81FB-124B17732FE0@employees.org>
In-Reply-To: <520FB317-19E8-4DCA-81FB-124B17732FE0@employees.org>
Accept-Language: en-US
Content-Language: en-US
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
x-originating-ip: [130.247.104.6]
Content-Type: text/plain; charset="us-ascii"
Content-Transfer-Encoding: quoted-printable
MIME-Version: 1.0
X-TM-AS-MML: disable
Cc: 6man <ipv6@ietf.org>
X-BeenThere: ipv6@ietf.org
X-Mailman-Version: 2.1.12
Precedence: list
List-Id: "IPv6 Maintenance Working Group \(6man\)" <ipv6.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/ipv6>, <mailto:ipv6-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/ipv6>
List-Post: <mailto:ipv6@ietf.org>
List-Help: <mailto:ipv6-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/ipv6>, <mailto:ipv6-request@ietf.org?subject=subscribe>
X-List-Received-Date: Mon, 01 Jul 2013 16:00:03 -0000

Hi Ole,

> -----Original Message-----
> From: Ole Troan [mailto:otroan@employees.org]
> Sent: Friday, June 28, 2013 11:56 PM
> To: Templin, Fred L
> Cc: Brian E Carpenter; 6man
> Subject: Re: draft-bonica-6man-frag-deprecate
> 
> Fred,
> 
> >> in my understanding the 1280 minimum MTU for IPv6 was already chosen
> to
> >> accommodate nested tunnels.
> >
> > Right, but..
> >
> >> the expectation made, I believe, was that native links support at
> least
> >> an MTU of 1500 bytes,
> >
> > But, that expectation went out the window at the same time the IPv6
> > minMTU was set to 1280 - so now 1280 is all that can be expected.
> 
> quite the opposite.
> physical link MTUs aren't getting smaller.
> do you have any evidence that operators set the IPv6 MTU to a value
> smaller than the MTU supported
> by the link-layer?

The evidence is in RFC2460, where the minMTU is set to 1280.
Plus, there are links (e.g., 6lowpan, aviation data links,
tactical military links etc.) where setting an MTU larger
than 1280 might not be practical.
 
> as far as I can see the setting IPv6 minimum MTU to 1280, which gives
> ample room for
> nested tunnels works as designed.

But, what would that look like in real life? The first tunnel
(T1) would have to set a 1280 MTU so that its tunneled packets
would emerge as 1320 bytes. Then, the next tunnel (T2) would
have to set a 1320 MTU so its tunneled packets would emerge as
1360. Then the next tunnel (T3) would have to set a 1360 MTU
so that its tunneled packets would emerge as 1400, etc. until
we run out of available MTU. The question is, how can those
tunnels be so carefully coordinated so that the nesting is
nailed down enough so that we will never have an MTU infraction?
In a single administrative domain where an operator can lay
hands on every tunnel ingress this may be possible, but we
need to engineer for the general case.

> > So, without fragmentation, when the tunnel ingress receives a 1280
> > packet, it emits a (1280+HLEN) packet. If that packet is dropped
> > at a link that configures a 1280 MTU, then it simply black holes.
> >
> >> allowing 220 bytes for tunnel encap (5 IPv6 headers).
> >
> > Tunnels within tunnels presents an additional pain point, but the
> > plain truth is that IPv6 links only need to configure a 1280 MTU;
> > not a 1500.
> 
> again, why would anyone do that?

Performance. What matters is time on the wire, and some "wires"
may not be fast enough to support anything larger than 1280
(see above).

> the pain here is fragmentation, not supporting an MTU larger than 1280.

Fragmentation and reassembly is a pain point for some (but
not all) routers. The pain may be observed by the hosts
which would (selfishly) like to do something to improve
their performance. With SEAL, to get better performance (and
in the process ease the burden on the routers) the hosts can
use RFC4821 and begin sending packets larger than 1500. 
 
> >> you want to avoid reassembly at tunnel endpoints, if you care about
> >> performance that is.
> >
> > Unfortunately, reassembly is absolutely necessary if the tunnel is
> > configured over a path that contains at least one 1280 link (see
> above).
> 
> reassembly in the network is a non-starter.

Let me say again what I said several messages ago. Never say
"always" and never say "never". The answer as to whether
reassembly can be tolerated is: "it depends" (i.e., it
depends on where the tunnel endpoints are positioned). 

> the vendor I'm most familiar with does not do this in hardware. sure,
> you can get reassembly, but at an order of one or two magnitudes less.
> if anyone know of platforms doing IPv6 fragment reassembly at line rate
> for 10G or 100G interfaces I'll stand corrected. ;-)
> for now reassembly in the network is not realistic.

Reassembly is required by RFC2460, RFC2473 and I think others
have also been mentioned. To support tunnels, and especially
tunnels within tunnels, it needs to be there.

Thanks - Fred
fred.l.templin@boeing.com

> cheers,
> Ole