[Taps] Review of draft-trammell-taps-post-sockets-03

Michael Welzl <michawe@ifi.uio.no> Sat, 11 November 2017 01:44 UTC

From: Michael Welzl <michawe@ifi.uio.no>
Content-Type: text/plain; charset="us-ascii"
Content-Transfer-Encoding: quoted-printable
Mime-Version: 1.0 (Mac OS X Mail 10.3 \(3273\))
Message-Id: <CF0546DA-422B-4AA3-A593-1BC133AD730D@ifi.uio.no>
Date: Sat, 11 Nov 2017 09:43:51 +0800
To: "taps@ietf.org" <taps@ietf.org>
Archived-At: <https://mailarchive.ietf.org/arch/msg/taps/G5XOaPnsEXP82uQl8hzS3Ht7vTw>
Subject: [Taps] Review of draft-trammell-taps-post-sockets-03
Precedence: list

Dear all,

Because of the planned request for draft-trammell-taps-post-sockets-03 to become a WG item, I gave the draft another close read and decided to write a review based on this. I hope it's useful.

To me, post-sockets aims in the right direction - I like a number of things about it: the introduction of higher-layer concepts such as the carrier and association; I suspect that the long-term association management could also be implemented one-sided, which is nice. I like the idea of "forking" a carrier, which would clearly translate into doing multi-streaming "under the hood" whenever that's possible (side note: from the list on pages 7/8, it seems that *only* "Listener" can be forked. Why not a Source? This seems odd to me). Finally, obviously I also agree that being message-oriented is the right way to go.

On the negative side, I still see several problems here (actually almost nothing has changed from --02 to -03), listed below:

The biggest problem I see is that this has very limited usage scenarios, as it requires a post-socket system on both sides (there's quite a bit of functionality in there that won't work otherwise - to give an illustrative example: "When a Carrier is forked, its corresponding Carrier at the remote endpoint receives a fork request, .."). I find it particularly disappointing because, in my opinion, it wouldn't take much to make this able to fall-back to TCP such that e.g. a post-sockets client could talk to an existing TCP server. Anyway, what IS this falling back to? If you add functionality both on the sender and receiver side, you require a post-sockets system on both sides - and then you could just as well consistently run over UDP or TCP or HTTP/2 or whatever else you want, without any fall-backs (and as this develops new transport functionality, we're then really moving away from the TAPS charter). I think we must better understand what types of fall-backs are envisioned here.

The document talks about "asynchronous reception", as if this was a feature that would only be attainable with a novel approach, and/or a message-based transport. I see having a callback-based API and a socket-style API as largely orthogonal from the protocol below: one can of course also write a callback-based API over TCP... the real problem is to know after how much received data the receiving app should be notified, by whatever means - that's just a general dilemma, and not a new problem, as the PUSH bit in RFC 793 shows. Messages may seem to make this easier, but then you have to start wondering how large can messages be, how we should handle partial messages... e.g., you may need a signal from the receiver-side TAPS system to the app to say "you have stored half a message, but please throw this data block away now, you'll never get the rest". You have some discussion of partial messages in this draft, and we've been around that block with SCTP. IMO, the right approach is to leave all of that out: define a short max limit, and let an application take care of splitting into smaller messages itself (since you have message dependencies, why do you need to have large messages too, and not just smaller sub-messages that all sequentially depend on another?). This also makes prioritization more meaningful.

Generally, the draft just leaves many questions unanswered at this stage, as it stays at a rather high level and lists most services just as examples, saying that more could be possible. If it's that vague, how should anybody implement such a system? Here's a list of transport features (from minset) that we will lose if we don't offer them to an application, and that I didn't see mentioned in post-sockets (sorry if I missed some!). Some of them may seem a bit odd and unimportant, others less so (this is from appendix A2, where "T" and "U" mean "can be supported with a fall-back to TCP and UDP, respectively):

o T,U: Specify number of attempts and/or timeout for the first
establishment message
o T: Change timeout for aborting connection (using retransmit limit
or time value)
o T: Suggest timeout to the peer
o T,U: Disable Nagle algorithm
o T,U: Notification of Excessive Retransmissions (early warning
below abortion threshold)
o T,U: Specify DSCP field
o T,U: Notification of ICMP error message arrival
o T,U: Set Cookie life value
o T,U: Disable checksum when sending
o T,U: Disable checksum requirement when receiving
o T,U: Specify checksum coverage used by the sender
o T,U: Specify minimum checksum coverage required by receiver
o T,U: Specify DF field
o T,U: Get max. transport-message size that may be sent using a non-
fragmented IP packet from the configured interface
o T,U: Get max. transport-message size that may be received from the
configured interface
o T,U: Obtain ECN field
o T,U: Enable and configure a "Low Extra Delay Background Transfer"
o T,U: Request not to delay the acknowledgement (SACK) of a message
o T,U: Notification that the stack has no more user data to send

From this list, a few functions that I would consider important are:

- DSCP, and enabling and configuring LEDBAT-like behavior: these two could be lumped together in a higher-level request, but they're not just a priority. Sure, your message "niceness" could be translated into a DSCP choice, but I mean that an app may want to be able to control more (e.g. do a general per-carrier configuration for low latency or LBE, for example). Speaking of LEDBAT, post-sockets also doesn't say anything about whether the transport is congestion controlled or not (which translates into: does the app have to worry about it?). Another congestion control thing: access to the ECN flag in case of UDP should also be covered in some form.

- Notification that the stack has no more user data to send: this is the TCP_NOWAIT_LOWAT function. Hmmm... wait! At first I thought the binding to an .OnAcked() event handler can't be done with TCP, but if the post-sockets layer would intelligently use TCP_NOWAIT_LOWAT, then you could pull this off.

- Get max. transport-message size that may be sent using a non-fragmented IP packet from the configured interface: this is useful to avoid fragmentation overhead, and

- Specify DF field => if we fall back to UDP, we need this in order to do PMTUD.

... and then there's also the need for early configuration of some things to guide protocol choice, which I wrote as a decision tree in draft-ietf-taps-minset-00. While I agree with your idea of specifying a limited set of protocols to use - we do the same with policy in NEAT - that can't be all: a TAPS system should probably prevent, e.g., first picking UDP and then getting a request for reliability. That brings us back to the question about what you're planning to fall back to, of course.

I hope this was helpful,

cheers,
Michael

[Taps] Review of draft-trammell-taps-post-sockets… Michael Welzl
Re: [Taps] Review of draft-trammell-taps-post-soc… Tommy Pauly
Re: [Taps] Review of draft-trammell-taps-post-soc… Michael Welzl
Re: [Taps] Review of draft-trammell-taps-post-soc… Tommy Pauly
Re: [Taps] Review of draft-trammell-taps-post-soc… Michael Welzl
[Taps] API design: dependencies or buffer control? Michael Welzl
Re: [Taps] API design: dependencies or buffer con… Tommy Pauly
Re: [Taps] Review of draft-trammell-taps-post-soc… Tommy Pauly
Re: [Taps] API design: dependencies or buffer con… Michael Welzl
Re: [Taps] Review of draft-trammell-taps-post-soc… Michael Welzl
Re: [Taps] API design: dependencies or buffer con… Tommy Pauly
Re: [Taps] API design: dependencies or buffer con… Michael Welzl