Re: [tsvwg] I-D Action: draft-ietf-tsvwg-ecn-l4s-id-06.txt

"Scheffenegger, Richard" <rs.ietf@gmx.at> Wed, 24 April 2019 11:03 UTC

Return-Path: <rs.ietf@gmx.at>
X-Original-To: tsvwg@ietfa.amsl.com
Delivered-To: tsvwg@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 2825412008B; Wed, 24 Apr 2019 04:03:39 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.599
X-Spam-Level:
X-Spam-Status: No, score=-2.599 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, FREEMAIL_FROM=0.001, RCVD_IN_DNSWL_LOW=-0.7, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (1024-bit key) header.d=gmx.net
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id qQh-Thmk3hUJ; Wed, 24 Apr 2019 04:03:35 -0700 (PDT)
Received: from mout.gmx.net (mout.gmx.net [212.227.15.19]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 6050412003F; Wed, 24 Apr 2019 04:03:35 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=gmx.net; s=badeba3b8450; t=1556103812; bh=dPMilXXjIEKX29d+dMB7JnAbpywXCi8TjDervKQNBxs=; h=X-UI-Sender-Class:Subject:To:References:From:Date:In-Reply-To; b=CYkUPV9iNtulEt1QwBT5B/ntJUDQyEZvcDoz5xOJGentccg4SxV6D7YUEir7edqTC bUV6lXiL1FkIQNKRt++hiOTMiaFEjj3Tovj9Az9t/Iyq/rEqWnZhD0x/0AOXy8pKKt k/VUHvd/1qqFrkvfBrvYssxT4dRQyvfyDBOuAiCU=
X-UI-Sender-Class: 01bb95c1-4bf8-414a-932a-4f6e2808ef9c
Received: from [10.249.68.248] ([217.70.210.6]) by mail.gmx.com (mrgmx001 [212.227.17.190]) with ESMTPSA (Nemesis) id 0MKYLf-1hLBEB03Oh-001vPc; Wed, 24 Apr 2019 13:03:32 +0200
To: tsvwg@ietf.org, internet-drafts@ietf.org, i-d-announce@ietf.org
References: <155234888454.23211.14096201623514072643@ietfa.amsl.com>
From: "Scheffenegger, Richard" <rs.ietf@gmx.at>
Message-ID: <3d296bef-c316-d9ca-9b14-a08917a4ab0d@gmx.at>
Date: Wed, 24 Apr 2019 13:03:30 +0200
User-Agent: Mozilla/5.0 (Windows NT 10.0; WOW64; rv:60.0) Gecko/20100101 Thunderbird/60.6.1
MIME-Version: 1.0
In-Reply-To: <155234888454.23211.14096201623514072643@ietfa.amsl.com>
Content-Type: text/plain; charset="utf-8"; format="flowed"
Content-Transfer-Encoding: quoted-printable
X-Provags-ID: V03:K1:v885HBOKpoic2c6gMIp26Bxgvw99N9KKcXvRYhZwXn+nLwyIKzT Xc2waz2EpASEKb2cN2LZ5A/RPzAk7EU4vYrJYhYHpVL1j/nn1TrFu5z9vgdbJ8c3kMaP2Y7 j+UMdagZFTu2A9XmJCk9P2tntyscllnedNuQc99MWPzD/62AlICuHUeM/xxI9bGYc84Wql9 0atkpKtZ3QjrmMxFEoNRg==
X-UI-Out-Filterresults: notjunk:1;V03:K0:g7Km/zJUfMs=:cI6df5HUKudyzw9kZdl04i 8Pot2my0L1kZeOObEnZSom/pyFbGbCEh7jTLjDi9TNhbymHDmybGHjjbuGwFi/z2zYLqim63H Mb0PP6dYkZnQXOxgBHHhY2YArNGmPwIC7b8lvgL6LHC7keVnCAGFA4Avd3QQL86XAWPPzPWVz EsArrsaE43rpCMiIyfhtcMEYo01ObHnQOdreHrKwX3LdAb7xNC9kpO1uWHDjqLNg4rnciG7mv IY9c9vOtocqe8RAZtSJpQkyRny3LfDG/SrcH5iWM4NAitFYiASpifZC0LdsKTHYIvxLTzibTr I19TpunYraJvHCZoAyiv6bggVTQdyysp05duELfWWukajJBH4QN4JRbunDbb3dmeZguVN6G9G ZlV6yPS0hL8xvCXglvk7uwNt+obiKz7m9rPylo/3yL22QCjaT9J5SHsn+dWLszoEZ7aGTr+Pl jWuAYCeoOmUbC4G0NomuHgthdmnOueCYAek9tCa4Vi8CjXW1zsl325GoJlf6AzZ3NzWbvgwji m5FtJErx/Xowz2Vgf09+iNz8YuK9EKNPBbJzGRpi3AoTi0lnFQ66fAIcsXY+aQ4dcIJeWemXq d03TMrzGSgzTch+7chK2z8dppd1webz7Bq3a1omL/OKD4UF5YJAkzZeS68LAXoahHh/RM+vGM VdOUkRjvnMXrtPGb4iPSD1vNR5eBPP4zo8tI/E10MayZRGYwJw8LjkI05iyHyv1hL7DQrJqr3 NcEhp6cfCd4/hJI/Ph93PrOiipexZesZ7U6ZIAg2xIMC+bH5quXCwQV3aJSSzjvG8KfOis52j ZHnVO2kt6vesm2WdDXocwxr1EFedS8ngcgbibJcUslm1TwUX3q4GoxOCvWoqb/9Gw8qp8iV0c SJsW8FqKJoJ/38QdRpR45bKuU1bpC3m7Ozr2DZY6pxShW5NXyEIimCc/pDNGdhI39pIq+nsW3 Ofw7d8QS1og==
Archived-At: <https://mailarchive.ietf.org/arch/msg/tsvwg/bsvEWdGsS4Gr6m8jRPGKp8DPx4o>
Subject: Re: [tsvwg] I-D Action: draft-ietf-tsvwg-ecn-l4s-id-06.txt
X-BeenThere: tsvwg@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: Transport Area Working Group <tsvwg.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/tsvwg>, <mailto:tsvwg-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/tsvwg/>
List-Post: <mailto:tsvwg@ietf.org>
List-Help: <mailto:tsvwg-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/tsvwg>, <mailto:tsvwg-request@ietf.org?subject=subscribe>
X-List-Received-Date: Wed, 24 Apr 2019 11:03:39 -0000

Forwarding to the entire list:

Hi Bob,

I’m ok with all the answers you provided.
I would like to encourage that draft to the attention of implementers of
current RDMA hardware, for them to think about L4S implications 😊

Regarding appendix B.1:

My suggestion would be to mention DSACK as another technique (currently
only used by Linux) to mitigate the effect of spurious CC reactions,
when it turns out that the reaction was unnecessary.

With RFC6675 ("New" SACK), there is a slightly higher chance of
triggering loss recovery then three consecutive, reordered packets. For
regular non-SACK flows and the old three-dupack rule, you are correct
that singular reorderings are not an issue, ever.




 >> Apdx A.1.7, ~para 5
 >>
 >> As another example, LACP restricts individual flows to use only
 >> one specific constituent link - effectively restricting the
 >> maximum bandwidth attainable for a single flow to no more than
 >> the bandwidth of a single link - even when the majority of the
 >> remaining links is idle. (this is a common misconception by
 >> enterprise network operators, who believe LACP can "scale up"
 >> the bandwidth available for  a single mount)

 > Yup. As long as LACP is still being used e2e, this single link
 > constraint would have to remain. The L4S requirement about counting
 > in time units might be applied to whatever e2e protocol replaced LACP
 > in future, then it would scale with the transfer rate.

LACP is p2p not e2e 😊. And many devices already allow a round-robin
load balancing (some devices really don’t like this though – older
silicon switching engines usually). TCP RACK  /time based loss detection
would allow the use of RR load balancing for LACP (ethernet), to
actually increase the single-flow throughput across LACP trunks . Sorry
for not making this point more clear.


[removed invalid objection discussed privately]

From: Bob Briscoe <ietf@bobbriscoe.net>
Sent: Montag, 25. März 2019 01:49
To: Scheffenegger, Richard <Richard.Scheffenegger@netapp.com>;
gorry@erg.abdn.ac.uk; koen.de_schepper@nokia.com
Cc: tsvwg-chairs@ietf.org; 'Wesley Eddy' <wes@mti-systems.com>
Subject: Re: Review on tsvwg of one of the L4S WG drafts

NetApp Security WARNING: This is an external email. Do not click links
or open attachments unless you recognize the sender and know the content
is safe.

Richard,

Thx for your obviously v thorough review. Inline...
On 24/03/2019 12:52, Scheffenegger, Richard wrote:
Review of draft-ietf-tsvwg-ecn-l4s-id:

Overall, a well-written document. However, I found the following nits,
clarification questions that may need to be answered by Mellanox, before
referenced in the draft, and one incorrect claim about the CE
classification into the L4S Queue:



Nit: I understand Bobs urge to use the British English (borrowed from
French?) centre instead of center. However, at least in "Data Center
TCP" I think the spelling should be US-english.
OK


Section 1.2:

L4S:

May need to solicit input from Mellanox around their RoCEv2 CC mechanism
DCQCN - if this would also fall - and be interoperable with - the L4S
definition. Based on the publicly available info, they designed DCQCN to
work along with instantaneous queue marking, and a variable-extent
feedback mechanism (inspired by QCN). However, I've not seen and
independent studies where DCTCP and DCQCN would run in the same Q.

(RDMA is currently en-vogue in DC environments; Mellanox doing the dev
on RoCE behind closed doors doesn't really inspire too much confidence
in the outcome with me though.
I don't think you're asking me to do anything here. Whatever the outcome
of this question, it doesn't alter what section 1.2 defines L4S to be,
or what it should be.

RoCEv2 is a proprietary protocol. And it's only used in controlled
environments. If it falls under the definition of L4S given here, fine.
If it doesn't, also fine. Whether or not it is compatible with this
definition of L4S does not stop it being used in the same controlled
environments as it is today. Or am I missing something?




Section 5.2.

Cross ref link not yet exists.
Thx for catching this.


Section 5.3

Is text around this eventuality really required in the draft? Wouldn't
long-term identification of flows beyond a single packet be outside the
scope  normally? Just wondering if this complicates the draft more than
it helps.
An FQ implementation with L4S AQM in some queues and Classic in others
is a valid alternative to the DualQ Coupled AQM.

In the stateless case it's straightforward: ECT(1) identifies a packet
as L4S.
But with flow state, we need to define what an L4S flow is, which
implies we have to deal with the case where there's a mix of ECT(0) and
ECT(1) in the flow.




Section 5.4

Another example of a protocol that would benefit from low-latency
treatment, but is low volume would be NTP, and perhaps LDAP (but the
latter can be voluminous at times).
I've added them both.



Regarding ECN++ - references to it are currently only in the appendices.
It appears to me, that and successful operational deployment of a L4S
transport might be dependent on this, to warrant mentioning it somewhere
in the body of the draft too?
The reason it's only in the Appendix is because the Prague requirements
are split into mandatory requirements and optional performance
improvements. Only the mandatory reqs are in the body.

Altho a performance improvement is important, it doesn't need to be
mandated (cos there's no interop implication or downside so it's a
no-brainer without having to be a requirement).




Apdx A.1.7, ~para 5

As another example, LACP restricts individual flows to use only one
specific constituent link - effectively restricting the maximum
bandwidth attainable for a single flow to no more than the bandwidth of
a single link - even when the majority of the remaining links is idle.
(this is a common misconception by enterprise network operators, who
believe LACP can "scale up" the bandwidth available for  a single mount)
Yup. As long as LACP is still being used e2e, this single link
constraint would have to remain. The L4S requirement about counting in
time units might be applied to whatever e2e protocol replaced LACP in
future, then it would scale with the transfer rate.


Para6: type "ussed".
Thx
Note that the wording might incorrectly imply that only RoCE can be used
for RDMA, while at least one IETF alternative exists (iWARP). Also, I
believe the extremely strict in-order requirement was with RoCEv1, while
RoCEv2 did learn from real-world deployments where in-order and
loss-less delivery is not always attainable... Again, something to get
Mellanox to comment about.
Good point. So I could say:
    Also note that in some controlled environments no reordering is
    tolerated by some transports (e.g.  RoCEv2 used for RDMA, in
contrast to iWARP),




Apdx B.1

 >> A reference to DSACK could be made here - while no stack (perhaps
 >> Linux does) performs cwnd unwinding in case of a DSACK for fast
 >> retransmissions (more stacks to unrolling of RTO state changes
 >> when DSACK is found), this would be a mechanism which also mitigates
 >> the problem of spurious retransmissions.

 > What point are you wanting me to make here? I'm not sure a point
about > DSACK would be relevant to the thread of the argument here.
 >
 > Regards and thanks again...
 >
 >Bob





Best regards,
    Richard


Am 12.03.2019 um 01:01 schrieb internet-drafts@ietf.org:
>
> A New Internet-Draft is available from the on-line Internet-Drafts directories.
> This draft is a work item of the Transport Area Working Group WG of the IETF.
>
>          Title           : Identifying Modified Explicit Congestion Notification (ECN) Semantics for Ultra-Low Queuing Delay (L4S)
>          Authors         : Koen De Schepper
>                            Bob Briscoe
> 	Filename        : draft-ietf-tsvwg-ecn-l4s-id-06.txt
> 	Pages           : 40
> 	Date            : 2019-03-11
>
> Abstract:
>     This specification defines the identifier to be used on IP packets
>     for a new network service called low latency, low loss and scalable
>     throughput (L4S).  It is similar to the original (or 'Classic')
>     Explicit Congestion Notification (ECN).  'Classic' ECN marking was
>     required to be equivalent to a drop, both when applied in the network
>     and when responded to by a transport.  Unlike 'Classic' ECN marking,
>     for packets carrying the L4S identifier, the network applies marking
>     more immediately and more aggressively than drop, and the transport
>     response to each mark is reduced and smoothed relative to that for
>     drop.  The two changes counterbalance each other so that the
>     throughput of an L4S flow will be roughly the same as a 'Classic'
>     flow under the same conditions.  However, the much more frequent
>     control signals and the finer responses to them result in ultra-low
>     queuing delay without compromising link utilization, and low delay is
>     maintained during high load.  Examples of new active queue management
>     (AQM) marking algorithms and examples of new transports (whether TCP-
>     like or real-time) are specified separately.  The new L4S identifier
>     is the key piece that enables them to interwork and distinguishes
>     them from 'Classic' traffic.  It gives an incremental migration path
>     so that existing 'Classic' TCP traffic will be no worse off, but it
>     can be prevented from degrading the ultra-low delay and loss of the
>     new scalable transports.
>
>
> The IETF datatracker status page for this draft is:
> https://datatracker.ietf.org/doc/draft-ietf-tsvwg-ecn-l4s-id/
>
> There are also htmlized versions available at:
> https://tools.ietf.org/html/draft-ietf-tsvwg-ecn-l4s-id-06
> https://datatracker.ietf.org/doc/html/draft-ietf-tsvwg-ecn-l4s-id-06
>
> A diff from the previous version is available at:
> https://www.ietf.org/rfcdiff?url2=draft-ietf-tsvwg-ecn-l4s-id-06
>
>
> Please note that it may take a couple of minutes from the time of submission
> until the htmlized version and diff are available at tools.ietf.org.
>
> Internet-Drafts are also available by anonymous FTP at:
> ftp://ftp.ietf.org/internet-drafts/
>