Re: [IPv6] FW: I-D Action: draft-templin-6man-ipid-ext-00.txt

"Templin (US), Fred L" <Fred.L.Templin@boeing.com> Fri, 08 December 2023 15:37 UTC

From: "Templin (US), Fred L" <Fred.L.Templin@boeing.com>
To: Christian Huitema <huitema@huitema.net>, Tom Herbert <tom@herbertland.com>
CC: IPv6 List <ipv6@ietf.org>
Thread-Topic: [IPv6] FW: I-D Action: draft-templin-6man-ipid-ext-00.txt
Thread-Index: AQHaKexRv5N91RNy70Ke3shxH5tPTA==
Date: Fri, 08 Dec 2023 15:36:31 +0000
Message-ID: <BN0P110MB1420A66D481B00EF33487E36A38AA@BN0P110MB1420.NAMP110.PROD.OUTLOOK.COM>
References: <13091d25c5874d5ba27b2de77d337646@boeing.com> <CALx6S371iasRTW+gzjgCPT1BY-KxZZau2Fu3qGYnoHpiu3o9tQ@mail.gmail.com> <BN0P110MB14205F118B67DD0225A18634A38BA@BN0P110MB1420.NAMP110.PROD.OUTLOOK.COM> <CALx6S36TZqh9h4aZ-o5gkY5Hp1Md2w5gPwpyO4weWeVwqXC5yQ@mail.gmail.com> <c0d3f33b-1193-470a-9f72-2c39dcbacb4f@huitema.net>
In-Reply-To: <c0d3f33b-1193-470a-9f72-2c39dcbacb4f@huitema.net>
Archived-At: <https://mailarchive.ietf.org/arch/msg/ipv6/DUtSkbbRQzho_IdREmRKIAX4BiA>
Subject: Re: [IPv6] FW: I-D Action: draft-templin-6man-ipid-ext-00.txt
List-Id: "IPv6 Maintenance Working Group \(6man\)" <ipv6.ietf.org>

Christian, I am working with DTN LTP over UDP, and what I have found is that performance
increases only when the segment size increases, even when that size exceeds the path MTU.
I have shown performance increases with segment sizes all the way up to 64KB even over
1500B path MTUs, and I believe that still larger segment sizes (over paths with sufficient
MTUs) would do even better. This was also a well-known characteristic of NFS over UDP
back in the early days, and I believe we will find other transports today that would
benefit from larger packets.
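
For a sense of scale, here is a minimal sketch of the fragmentation arithmetic. It is not
taken from the LTP measurements. It assumes IPv6 with its 40-byte base header, an 8-byte
Fragment header, and no other extension headers; the function name is only illustrative.

```python
import math

IPV6_HEADER = 40   # fixed IPv6 base header, bytes
FRAG_HEADER = 8    # IPv6 Fragment extension header, bytes
UDP_HEADER = 8     # UDP header, bytes


def ipv6_fragment_count(udp_payload: int, path_mtu: int = 1500) -> int:
    """Number of IPv6 fragments needed to carry one UDP datagram.

    Assumption: no extension headers other than the Fragment header.
    """
    fragmentable = UDP_HEADER + udp_payload      # bytes that get split up
    if IPV6_HEADER + fragmentable <= path_mtu:
        return 1                                 # fits in one packet
    # Every fragment except the last must carry a multiple of 8 bytes.
    per_fragment = (path_mtu - IPV6_HEADER - FRAG_HEADER) // 8 * 8
    return math.ceil(fragmentable / per_fragment)


for size in (1400, 8192, 65527):
    print(size, ipv6_fragment_count(size))
```

Under these assumptions, the largest non-jumbo UDP payload (65527 bytes) comes to 46
fragments over a 1500B path MTU.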

I have tried many ways to apply the "conventional wisdom" you have expressed to LTP/UDP
but have seen no appreciable performance increases using those methods. I tried using
sendmmsg()/recvmmsg() and they did nothing to improve performance. I then implemented
GSO/GRO and again the performance increase, if any, was minimal. I even implemented a
first pass at IP parcels and sent 64KB parcels with ~1500B segments over an OMNI interface.
That did give some minor performance increase due to the reduction in header overhead,
but nothing close to simply sending larger packets, where the performance increases
were multiplicative.

I object to categorizing this as a transport issue. This is an Internetworking issue:
large packet sizes currently are not well supported, especially when they exceed the path
MTU. I believe many transports will benefit from using larger packets, and that a robust
fragmentation and reassembly service is essential for maximizing performance in the
Internet. My drafts clearly explain why that is so.

Fred

> -----Original Message-----
> From: Christian Huitema <huitema@huitema.net>
> Sent: Thursday, December 07, 2023 3:59 PM
> To: Tom Herbert <tom@herbertland.com>; Templin (US), Fred L <Fred.L.Templin@boeing.com>
> Cc: IPv6 List <ipv6@ietf.org>
> Subject: Re: [IPv6] FW: I-D Action: draft-templin-6man-ipid-ext-00.txt
>
> On 12/7/2023 11:51 AM, Tom Herbert wrote:
> > On Thu, Dec 7, 2023 at 7:58 AM Templin (US), Fred L
> > <Fred.L.Templin=40boeing.com@dmarc.ietf.org>  wrote:
> >> Tom, to the point on performance:
> >>
> >>> Please provide references to these studies. Also, note IP
> >>> fragmentation is only one possibility, PMTUD and transport layer
> >>> segmentation is another and that latter seems more prevalent.
> >> If by transport layer segmentation you mean GSO/GRO, it is not the same thing
> >> as IP fragmentation at all. GSO/GRO provide a means for the application of the
> >> source to transfer a block of data containing multiple MTU- or smaller-sized
> >> segments to the kernel in a single system call, then the kernel breaks the
> >> segments out into individual packets that are all no larger than the path MTU
> >> and sends them to the destination. The destination kernel then gathers them
> >> up and posts them to the local application in a reassembled buffer possibly
> >> as large as that used by the original source. But, if some packets are lost,
> >> the destination kernel instead sends up what it has gathered so far which
> >> may be less than the block used by the original source.
> >>
> >> IP fragmentation is very different and operates on a single large transport
> >> layer segment instead of multiple smaller ones. And, the studies I am referring
> >> to show that performance was most positively affected by increasing the
> >> segment size even to larger than the path MTU. I implemented GSO/GRO
> >> in the ion-dtn LTP/UDP implementation and noted that the performance
> >> increase I saw was very minor and related to more efficient packaging
> >> and not a system call bottleneck. Conversely, when I increased the segment
> >> sizes to larger than the path MTU and intentionally invoked IP fragmentation
> >> the performance increase was dramatic. You can see this in the charts I
> >> showed at IETF118 intarea here:
> >>
> >> https://datatracker.ietf.org/meeting/118/materials/slides-118-intarea-identification-extension-for-the-internet-protocol-00
> 
> I don't doubt your experience, but this is not what we saw with QUIC. In
> the early stages of QUIC development, performance was gated by the
> cost of the UDP socket API. I have benchmarks showing that sendmsg was
> accounting for 70 to 80% of CPU on sender side. Using GSO was key to
> lowering that, with one single call to sendmsg for 64K worth of data.
> 
> 
> >> Again, GSO/GRO address performance limitations of the application/kernel
> >> system call interface which seems to have a positive performance effect for
> >> some applications. But, IP fragmentation addresses a performance limitation
> >> of transport layer protocols in allowing the transport protocol to use larger
> >> segment sizes and therefore have fewer segments to deal with.
> 
> At the cost of very inefficient error correction, repeating 64K bytes if
> 1500 bytes are lost. The processing cost of retransmissions with
> selective acknowledgement is not large, it hardly shows in the flame
> graphs. Also, the next more important cost after sendmsg/recvmsg is the
> cost of encryption. If the application had to resend 64KB, it also has
> to encrypt 64KB again, and that costs more than re-encrypting 1500B.
> Given that, I am not sure that for QUIC we would see a lower CPU by
> delegating fragmentation to the IP stack.
> 
> That does not mean that larger packets would not result in lower CPU
> load. It would, but only if the larger packet size did not involve
> fragmentation, reassembly, and the overhead caused by the occasional
> loss of a fragment.
> 
> > Hi Fred,
> >
> > Fewer segments, but NOT fewer packets. The net amount of work in the
> > system is unchanged when sending larger segments instead of smaller so
> > there won't be any material performance differences other than maybe
> > implementation effects at the host and no effect at routers. Segments
> > are the unit of congestion management and retransmission in a
> > transport protocol, but fragments are transparent to the transport
> > protocol-- this distinction can cause material issues in performance.
> >
> > It's pretty easy to see why this is. Consider that the minimum number
> > of segments for a connection would be to use 64K segments and fragment
> > them. For a 1500 MTU one segment then would be sent in 43 fragments.
> > The problem is that if just one fragment is dropped in a segment then
> > the whole segment is retransmitted. Furthermore, the fragments
> > themselves are likely to be the cause of the congestion at routers. So
> > there is a high likelihood of creating congestion in the network and
> > needing a lot of retransmissions. Even if CWND goes to one, each
> > connection can still send 43 packets and SACKs don't help because
> > there's no granularity at 64K segments so congestion control really
> > wouldn't be effective. The net effect is likely to be very poor TCP
> > performance.
> 
> Yes. That's actually a known issue with GSO, and why GSO is typically
> limited to no more than 64K. If the sender does not implement some form
> of pacing, the segments will be sent back to back, causing short peaks
> of traffic that can cause queues to fill up and overflow. But it is
> difficult to delegate this pacing to the kernel, because the API only
> expresses the pacing in "milliseconds between packets". Segmentation in
> the kernel or the drivers would have the same issues.
> 
> > While I think there might be some incidental positive performance
> > effects in host implementation by using fragmentation, I really don't
> > see how it addresses any fundamental performance limitation in a
> > transport layer protocol like TCP. In fact, I don't see how IP
> > fragmentation could possibly be better than doing PMTUD with SACKs
> > especially on the Internet.
> 
> Yet another issue is that Fred is not the only one with that particular
> bad idea. The UDP options defined in TSVWG include a
> segmentation/fragmentation option that looks very similar. The two bad
> ideas would probably have to be reconciled in a single bad idea.
> 
> In any case, Fred is making arguments related to transport, which means
> this draft ought to be discussed in TSVWG.
> 
> -- Christian Huitema
> 
> 
>
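
On the quoted point that losing a single fragment forces the whole segment to be resent,
here is a rough sketch of the arithmetic. It assumes fragment losses are independent with
a fixed per-fragment loss rate, which real congestion loss need not satisfy. The function
names are illustrative, and the 43-fragment figure is the one quoted above.

```python
def segment_loss_probability(fragment_loss: float, fragments: int) -> float:
    """Probability that at least one of `fragments` fragments is lost.

    Assumption: each fragment is lost independently with probability
    `fragment_loss`.
    """
    return 1.0 - (1.0 - fragment_loss) ** fragments


def expected_bytes_sent(segment_bytes: int, fragment_loss: float,
                        fragments: int) -> float:
    """Expected bytes transmitted until one whole segment arrives intact,
    when any lost fragment forces a full-segment retransmission."""
    p_intact = (1.0 - fragment_loss) ** fragments
    return segment_bytes / p_intact   # mean of a geometric number of tries


# Compare a few per-fragment loss rates for a 64KB segment in 43 fragments.
for loss in (0.0001, 0.001, 0.01):
    p = segment_loss_probability(loss, 43)
    sent = expected_bytes_sent(65536, loss, 43)
    print(f"loss={loss:.4f}  segment_loss={p:.4f}  expected_bytes={sent:.0f}")
```

Under these assumptions, a 1% per-fragment loss rate means roughly a third of 43-fragment
segments lose at least one fragment, while at 0.01% the effect is well under 1%.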