Re: [Int-area] Call for WG adoption of draft-templin-intarea-parcels-10

Tom Herbert <tom@herbertland.com> Mon, 11 July 2022 20:34 UTC

Return-Path: <tom@herbertland.com>
X-Original-To: int-area@ietfa.amsl.com
Delivered-To: int-area@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 68719C14F742 for <int-area@ietfa.amsl.com>; Mon, 11 Jul 2022 13:34:00 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.794
X-Spam-Level:
X-Spam-Status: No, score=-1.794 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, HTML_MESSAGE=0.001, HTTPS_HTTP_MISMATCH=0.1, RCVD_IN_ZEN_BLOCKED_OPENDNS=0.001, SPF_HELO_NONE=0.001, T_SCC_BODY_TEXT_LINE=-0.01, T_SPF_PERMERROR=0.01, URIBL_BLOCKED=0.001, URIBL_DBL_BLOCKED_OPENDNS=0.001, URIBL_ZEN_BLOCKED_OPENDNS=0.001] autolearn=no autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=herbertland-com.20210112.gappssmtp.com
Received: from mail.ietf.org ([50.223.129.194]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id zPVpvKDWAwV4 for <int-area@ietfa.amsl.com>; Mon, 11 Jul 2022 13:33:56 -0700 (PDT)
Received: from mail-lj1-x230.google.com (mail-lj1-x230.google.com [IPv6:2a00:1450:4864:20::230]) (using TLSv1.3 with cipher TLS_AES_128_GCM_SHA256 (128/128 bits) key-exchange X25519 server-signature RSA-PSS (2048 bits) server-digest SHA256) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 26C64C14F5E1 for <int-area@ietf.org>; Mon, 11 Jul 2022 13:33:56 -0700 (PDT)
Received: by mail-lj1-x230.google.com with SMTP id a39so7572016ljq.11 for <int-area@ietf.org>; Mon, 11 Jul 2022 13:33:55 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=herbertland-com.20210112.gappssmtp.com; s=20210112; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=itZdn5JYybFC4zYfOCDFnt0Ntjr16ZXTLRxvSxg+71Q=; b=5m6p7PnMw76ABj80ffpWkhpm5O2lHBgXeKrFJo7fmPJkSyvSPoA7bGJTohriTWCXOG vC9g/tPohAMw5G/8Wh3ZsWVGJFKSIBDUuHlQhOM9qlP3p9DzTDZ/hoCVVuo4STQZ+fYz wKLqn3XgzZD8HjFMGY9ukXf4acbL024gEuE37atvciS9VrI9Y+KrMljBqVnPWxyiChuh nJhLI0cG4yAaV5jP2VLizfb+hI0kRxO53d3GiHCH86XQE+htEIvWhEbZ7jBvJpo/Frla Dj0JGlxL+Ic0oAo4ciO9H/yUB0TLIMJsJb13tImziEvWeqLxRDTzJJv2gMOERXDw5eW4 HZhg==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=itZdn5JYybFC4zYfOCDFnt0Ntjr16ZXTLRxvSxg+71Q=; b=dPOuvaj1Z1MIvR3fp9vgjqcVrZ3iQj9wSiciaV6QDhYC8ddrYGnP32rYgySr2x+pHk 2NHl0Mfa6dDL85XdUlMk6K4fxqW8o5om8a/O8ud9Mh4PCwd1VV5jqdlOo4CnUlCLlMCF fydC419cmi+PM9oY59uZTRWwliG7Tw5BJTN5X0wtj8WMF1+vBI59jHy59SLhe6zwLenJ n7xNtAjOlAEjMwWW9mUnyYh4Ti/V2O1ugpb8JhVp+OPeY/ZPmkb5zMZJiJuJaOjir2fK nGlnSq7sWTHDScJp5hO6luRfQsN01Q3vsVtoJ7aY70ZUu996dQf5GXvNUrwChYBDrzXK 8evA==
X-Gm-Message-State: AJIora/oBqMpx4AxfoNzSoRxslo6hqiml8U2C/46JH41eTvLmd/JhBsN o+BQJDJBR5wxyjC4Z6peASpKK152UuB7SEqbf/v9nvW7h7J6cg==
X-Google-Smtp-Source: AGRyM1uUPa218d5lnVBDsXxH3V7CiijxQsljuERAWaXoYiVZhyahntZO9kxsnhfvoARev76LEtvi87Rslvq3kyt5MAA=
X-Received: by 2002:a2e:bc2a:0:b0:25b:c248:2efe with SMTP id b42-20020a2ebc2a000000b0025bc2482efemr10537287ljf.301.1657571634022; Mon, 11 Jul 2022 13:33:54 -0700 (PDT)
MIME-Version: 1.0
References: <SJ0PR11MB57692D589B130307F4C9C823D1B29@SJ0PR11MB5769.namprd11.prod.outlook.com> <BYAPR13MB2279C0291E3AC5998958824D87BD9@BYAPR13MB2279.namprd13.prod.outlook.com> <f90f462f9df94e2a82a1b8afe598a204@boeing.com>
In-Reply-To: <f90f462f9df94e2a82a1b8afe598a204@boeing.com>
From: Tom Herbert <tom@herbertland.com>
Date: Mon, 11 Jul 2022 13:33:42 -0700
Message-ID: <CALx6S35Ute7f3D_zdbxauBiR+KG7J5gqSwXhH+MQcfQGuHGHzQ@mail.gmail.com>
To: "Templin (US), Fred L" <Fred.L.Templin@boeing.com>
Cc: Richard Li <richard.li@futurewei.com>, "Juan Carlos Zuniga (juzuniga)" <juzuniga=40cisco.com@dmarc.ietf.org>, "int-area@ietf.org" <int-area@ietf.org>
Content-Type: multipart/alternative; boundary="0000000000006fa5b905e38d7821"
Archived-At: <https://mailarchive.ietf.org/arch/msg/int-area/0TQLOVwZ3z7FArLe8306AQcC8Pk>
Subject: Re: [Int-area] Call for WG adoption of draft-templin-intarea-parcels-10
X-BeenThere: int-area@ietf.org
X-Mailman-Version: 2.1.39
Precedence: list
List-Id: IETF Internet Area WG Mailing List <int-area.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/int-area>, <mailto:int-area-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/int-area/>
List-Post: <mailto:int-area@ietf.org>
List-Help: <mailto:int-area-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/int-area>, <mailto:int-area-request@ietf.org?subject=subscribe>
X-List-Received-Date: Mon, 11 Jul 2022 20:34:00 -0000

On Mon, Jul 11, 2022 at 12:22 PM Templin (US), Fred L <
Fred.L.Templin@boeing.com> wrote:

> Richard and others, thank you for these comments and for the ensuing
> discussion that
>
> took place over the time I was away on vacation. Strange how the timing
> hit when I
>
> was away from the office and off the grid - I was on a camping trip in
> Canada not far
>
> from where Steve Deering lives although I did not visit him.
>
>
>
> In any event, I was able to push out a new draft version ahead of the
> deadline that
>
> may address some (but likely not all) of your concerns:
>
>
>
> https://datatracker.ietf.org/doc/draft-templin-intarea-parcels/
>
>
>
> The major change is that the draft now talks about interactions with upper
> layer
>
> protocols including TCP and UDP, whereas the previous draft versions were
> silent
>
> regarding upper layer protocol framing.
>
>
>
> To others who have commented, I beg to differ and maintain that IP parcels
> do
>
> represent a significant improvement over the current state of affairs and
> over
>
> just regular IP jumbograms. In particular:
>

Hi Fred, some comments in line.


>
>
> 1) IP parcels make it so that the loss unit is a single segment instead of
> the entire
>
> packet/parcel, and loss of a segment often results in retransmission of
> just that
>
> segment instead of the entire packet/parcel.
>

Yes, I agree if the packet is fragmented by the network then this is a nice
feature. However, today we already have this from a host perspective
property by just sending "small" packets.


>
>
> 2) IP parcels are more efficient than sending a single segment per IP
> packet, since
>
> the parcel includes a single IP header plus single full {TCP,UDP} header
> for possibly
>
> many segments. This can result in significant savings in terms of bits
> over the wire
>
> for omitting unnecessary header bytes.
>

I'm not sure the savings qualify as significant. 9K MTUs are becoming
common in data centers and the standard TCP/IPv6 header is 80 bytes so
that's already less than 1% overhead.


> Consider the postal service analogy; when
>
> many items can be sent together in a single package/parcel there is a
> large savings
>
> in shippeing and handling costs than when each individual item is shipped
> separately.
>

As I already mentioned, this is addressed by the BiGTCP work (
https://lwn.net/Articles/884104). Sending or receiving multi-megabytes TCP
segments in one system call is now feasible. Also, it's inevitable that NIC
vendors will apply this also to be able to offload TCP jumbo grams. Given
this is just software that doesn't require hardware change or on-the-wire
protocols to change, it's immediately deployable with just a softwar change
which is a huge benefit to datacenter operators.

>
>
> 3) IP parcels improve large packet integrity by including a separate
> checksum for
>
> each segment instead of a single checksum for the entire packet.
>

All modern NIC HW can deal with offloading a single checksum per packet,
it's going to be a major effort for them to offload multiple checksum like
IP parcels needs. Without checksum offload, this would be a non-starter for
a lot of deployments.

This means that
>
> large parcels (up to a few MB) can be sent in one piece over links with
> sufficiently
>
> large MTU without requiring the link itself to provide strong integrity
> checks over
>
> the entire length of the parcel. This means that link MTUs significantly
> larger than
>
> 9KB are now safely possible.
>
>
>
> 4) IP parcels offer all of the efficiency advantages to upper layers as
> are offered
>
> by GSO/GRO, etc. but also provide benefits 1) through 3) above that are not
>
> offered by GSO/GRO.
>

Most of this is doable in GSO/GRO.


>
>
> 5) Plus, the idea is just plain neat. Better packaging is good. More
> efficient
>
> handling is good. Reduced header overhead is good. SAFE larger MTUs are
>
> good. The idea itself is good.
>

I'm not convinced of that. For instance, I'm skeptical that intermediate
devices trying to reassemble packets that aren't addressed to themselves
could ever be robust or efficient (i.e. complexity, non-work conserving
resource requirements, security issues with reassembly, multi-path that
causes latency increase, potential DoS vector, etc.). Can you comment on
this?

Tom


>
> Fred
>
>
>
> *From:* Int-area [mailto:int-area-bounces@ietf.org] *On Behalf Of *Richard
> Li
> *Sent:* Friday, July 01, 2022 3:11 PM
> *To:* Juan Carlos Zuniga (juzuniga) <juzuniga=40cisco.com@dmarc.ietf.org>
> *Cc:* int-area@ietf.org
> *Subject:* Re: [Int-area] Call for WG adoption of
> draft-templin-intarea-parcels-10
>
>
>
> Chairs and Authors,
>
>
>
> I always like every new idea and effort to improve the Internet
> performance, and thus I have read this draft with a great interest. The
> following are my observations/comments/questions. If they don’t make any
> sense to you, please accept my apology, and disregard them.
>
>
>
> 1.      The text “multiple upper layer protocol segments” is ambiguous.
> It seems that you really mean “multiple segments from ‘the same’ upper
> layer protocol”, doesn’t it? It seems that multiple segments from different
> upper layer protocols are not allowed in your parcel.
>
>
>
> 2.      Is the following a fair statement? All segments in the same
> packet come from the same application identified by the 5-tupe (source
> address, destination address, source port, destination port, protocol
> number).
>
>
>
> 3.      Segment size
>
> You require that their sizes be the same except for the last one. Is this
> required for easy implementation or what? Do you require it for any other
> reasons?
>
>
>
> 4.      TTL issue
>
> You described how parcels are forwarded over the Internetwork, and in
> particular you described what the ingress/egress middlebox does about
> parcels. I understand that the ingress middlebox may break the parcel into
> smaller ones, which may rejoin at the egress middlebox. My question is
> about TTL. As different smaller parcels may traverse along different paths,
> as a result their TTLs may be different when they reach the egress
> middlebox . How does the egress middlebox set up the TTL value? Please
> provide more descriptions.
>
>
>
> 5.      Reordering at the egress middlebox
>
> The parcels would arrive one after another, and therefore the egress
> middlebox would “wait” for a little bit to identify and pick up enough
> parcels/packets for their rejoining and repackaging. A description of the
> egress middlebox behavior would be useful and helpful, in particular I
> would like to know more about the waiting time if any, and how you deal
> with the reordering and loss.
>
>
>
> 6.      IPv4 option
>
> Does IETF still allow to change/add IPv4 option fields? I might be wrong,
> but aren’t they frozen? Also, do commercial routers still care about IPv4
> options?
>
>
>
> 7.      IPv6 option
>
> This draft has defined a hop-by-hop option, it will require every
> intermediate IPv6 router to inspect this option. There have been some
> discussions on the pros/cons about Hop-by-Hop IPv6 Option. Is there any
> feedback from WG 6man?
>
>
>
> 8.      Parcel Path Qualification
>
> This draft has described a method for parcel path qualification probe from
> end to end. It is nice to have it, but it is unreliable simply for the
> following reason: a probe parcel goes along one specific path, and your
> real application parcels may take different paths.
>
>
>
> 9.      Integrity
>
> First paragraph of Section 7. More explanation/elaboration should be
> useful. I might have missed it in previous paragraphs, but if I do, please
> provide a reference to it such as “as described in …”.
>
>
>
> 10.   Implementation Status
>
> In section 10. TSO’s performance gain and Parcel’s gain should be regarded
> as two different things. Since this draft is adding a hop-by-hop option,
> every intermediate router is required to process the hop-by-hop option,
> which will, theoretically speaking, lead to performance downgrade. Of
> course, the whole performance would depend on many other factors, such as
> the total numbers of routing table lookups and number of segments.
>
>
>
> 11.   General observation
>
> This proposal essentially tries to solve a problem caused by MTU. If MTU
> be very big, one would simply put the whole data in a single packet. Since
> MTU is limited, a packet has to be cut into many smaller pieces (segments).
> In the existing specification, when an intermediate router sees a packet
> with its size larger than MTU, the router would be expected to fragment it
> so that the fragments could be forwarded. Here let me call it
> “fragmentation as needed”. In reality, however, some (if not all)
> commercial routers don’t do “fragmentation as needed”, instead of
> fragmenting the packet they simply discard it in order to achieve the
> wire-speed. This draft defines a new way to address the MTU issue: when a
> router sees a packet with its size larger than MTU, the router is asked to
> fragment it in a prescribed way (fragment it into pre-packaged segments).
> If I may, let me call it “fragmentation as prescribed”. Both “fragmentation
> as needed” and “fragmentation as prescribed” would require the support from
> intermediate routers. As the same as fragmentation as needed, fragmentation
> as prescribed may downgrade the performance of intermediate routers. What
> is more, intermediate routers/boxes may perform “rejoining and
> repackaging”, which will adversely impact the performance of the
> intermediate routers/boxes.
>
>
>
>
>
> Best regards,
>
>
>
> Richard
>
>
>
>
>
>
>
> *From:* Int-area <int-area-bounces@ietf.org> *On Behalf Of *Juan Carlos
> Zuniga (juzuniga)
> *Sent:* Wednesday, June 22, 2022 12:25 PM
> *To:* int-area@ietf.org
> *Subject:* [Int-area] Call for WG adoption of
> draft-templin-intarea-parcels-10
>
>
>
> Dear IntArea WG,
>
>
>
> We are starting a 2-week call for adoption of the IP-Parcels draft:
>
> https://www.ietf.org/archive/id/draft-templin-intarea-parcels-10.html
> <https://nam11.safelinks.protection.outlook.com/?url=https%3A%2F%2Fwww.ietf.org%2Farchive%2Fid%2Fdraft-templin-intarea-parcels-10.html&data=05%7C01%7Crichard.li%40futurewei.com%7C715b5db213134932c70208da5484f702%7C0fee8ff2a3b240189c753a1d5591fedc%7C1%7C1%7C637915227299598680%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C1000%7C%7C%7C&sdata=w4G5ypaSRv%2FR31%2F%2B857XT2xUqHdEXv90ubD5GGjqBEQ%3D&reserved=0>
>
>
>
> The document has been discussed for some time and it has received multiple
> comments.
>
>
>
> If you have an opinion on whether this document should be adopted by the
> IntArea WG please indicate it on the list by the end of Wednesday July 6th
> .
>
>
>
> Thanks,
>
>
>
> Juan-Carlos & Wassim
>
> (IntArea WG chairs)
>
>
> _______________________________________________
> Int-area mailing list
> Int-area@ietf.org
> https://www.ietf.org/mailman/listinfo/int-area
>