[Detnet] Proposed update of draft-ietf-detnet-scaling-requirements

Peng Liu <liupengyjy@chinamobile.com> Mon, 25 September 2023 04:50 UTC

Return-Path: <liupengyjy@chinamobile.com>
X-Original-To: detnet@ietfa.amsl.com
Delivered-To: detnet@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 8E244C151530 for <detnet@ietfa.amsl.com>; Sun, 24 Sep 2023 21:50:29 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -0.994
X-Spam-Level:
X-Spam-Status: No, score=-0.994 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, HTML_MESSAGE=0.001, NUMERIC_HTTP_ADDR=1.242, OBFU_TEXT_ATTACH=0.34, RCVD_IN_DNSWL_LOW=-0.7, RCVD_IN_ZEN_BLOCKED_OPENDNS=0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, T_HTML_ATTACH=0.01, T_KAM_HTML_FONT_INVALID=0.01, T_OBFU_HTML_ATTACH=0.01, T_SCC_BODY_TEXT_LINE=-0.01, URIBL_DBL_BLOCKED_OPENDNS=0.001, URIBL_ZEN_BLOCKED_OPENDNS=0.001] autolearn=ham autolearn_force=no
Received: from mail.ietf.org ([50.223.129.194]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id wmilu15gKp3y for <detnet@ietfa.amsl.com>; Sun, 24 Sep 2023 21:50:25 -0700 (PDT)
Received: from cmccmta1.chinamobile.com (cmccmta1.chinamobile.com [221.176.66.79]) by ietfa.amsl.com (Postfix) with ESMTP id 7D08DC14F747 for <detnet@ietf.org>; Sun, 24 Sep 2023 21:50:16 -0700 (PDT)
X-RM-TagInfo: emlType=0
X-RM-SPAM-FLAG: 00000000
Received: from spf.mail.chinamobile.com (unknown[172.16.121.87]) by rmmx-syy-dmz-app04-12004 (RichMail) with SMTP id 2ee4651111843ed-a26cf; Mon, 25 Sep 2023 12:50:12 +0800 (CST)
X-RM-TRANSID: 2ee4651111843ed-a26cf
X-RM-TagInfo: emlType=0
X-RM-SPAM-FLAG: 00000000
Received: from CMCC-LP (unknown[10.2.53.201]) by rmsmtp-syy-appsvrnew04-12029 (RichMail) with SMTP id 2efd65111181e84-14f49; Mon, 25 Sep 2023 12:50:12 +0800 (CST)
X-RM-TRANSID: 2efd65111181e84-14f49
Date: Mon, 25 Sep 2023 12:51:41 +0800
From: Peng Liu <liupengyjy@chinamobile.com>
To: detnet <detnet@ietf.org>
Cc: "xiong.quan" <xiong.quan@zte.com.cn>, "kiran.ietf" <kiran.ietf@gmail.com>
References: 202308231747071855622@zte.com.cn, CAFV7YBrVMzy2UQUGqfrz8v6WMYgudgURrZkaBYLQWKAp2OCXWw@mail.gmail.com, <202308241905321080389@zte.com.cn>
X-Priority: 3
X-GUID: 071D6574-2CFE-4CDC-9893-FBD10489EFC0
X-Has-Attach: yes
X-Mailer: Foxmail 7.2.21.453[cn]
Mime-Version: 1.0
Message-ID: <2023092512514137492441@chinamobile.com>
Content-Type: multipart/mixed; boundary="----=_001_NextPart478374167872_=----"
Archived-At: <https://mailarchive.ietf.org/arch/msg/detnet/PWx6x1PlmRO4zsNrOdu3jXs44Ec>
Subject: [Detnet] Proposed update of draft-ietf-detnet-scaling-requirements
X-BeenThere: detnet@ietf.org
X-Mailman-Version: 2.1.39
Precedence: list
List-Id: Discussions on Deterministic Networking BoF and Proposed WG <detnet.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/detnet>, <mailto:detnet-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/detnet/>
List-Post: <mailto:detnet@ietf.org>
List-Help: <mailto:detnet-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/detnet>, <mailto:detnet-request@ietf.org?subject=subscribe>
X-List-Received-Date: Mon, 25 Sep 2023 04:50:29 -0000

Hi, All, 

There were good comments from Kiran, Quan, Toerless and good discussions in the interim. The documents is updated accordingly and here are the main changes:
 
1. Section 3.1.3. Change the title to 'Provide Mechanisms not Requiring Strict Time Synchronization'.
2.  Section 3.4. Change the title to 'Be Scalable to The Large Number of Flows and Tolerate High Utilization of Bandwidth'.
Try to make it clear of the reqs of High Utilization of Bandwidth'. It is for the case that the bandwidth utilization is more than 75% and/or up to near 100%.
Considering the resources required for detnets are usually reserved and it is not sure if there is any space for the queuing solutions can improve the bandwidth utilization based on the current methods except CQF, I prefer to make it optional and use the word ‘it is better to...’
3. Section 3.6. Make it clear to ‘DetNet flows’ rather than all the network flows.
4. Section 3.7. Propose the requirements of latency, jitter in the case of complex topology according to Quan's suggestion. Considering the diversity of the potential solutions now and in the future, it lists in which case it is acceptable or unacceptable, and also leave the opportunity to others. It also want to meet the proposal of add a req or evaluation of 'support tight jitter/sync-control loops'. 
5. Section 3.8. Change the third type of the flows in figure to ‘Acyclic, lower latency requirements’
    Section 3.8.1. Change the title to 'Support Provisioning of Multiple Mechanisms'
    Section 3.8.2. Add the reference to I-D.ietf-detnet-controller-plane-framework
6. There are also some miner changes in Section 3.1.1, 3.2, please find and check them.

Attachments are the updated text and the diff, please find them and more comments are welcome.

Regards,
Peng



liupengyjy@chinamobile.com
 
From: xiong.quan@zte.com.cn
Date: 2023-08-24 19:05
To: kiran.ietf@gmail.com
CC: detnet@ietf.org
Subject: Re: [Detnet]Comments on draft-ietf-detnet-scaling-requirements-03.txt

Hi Kiran,

Thanks for your quick reply!
Please see inline with [Quan].

Regards,
Quan


Thanks for your prompt reply Quan! I will look forward to next revision. Added some comments inline with [km].


On August 23, 2023 at 2:47:25 AM, xiong.quan@zte.com.cn (xiong.quan@zte.com.cn) wrote:
Hi Kiran, 

Thank you for the detailed review and comments! Much appreciated. 
As a co-author, I try to reply you but I am not sure my understanding is correct. Happy to discuss with you! 
Please see inline with Quan->. 
Thanks! 


<Hello Authors, 
<My apologies for the late review. 
<The document is in a very good shape and covers several large-scale aspects. My only concern is in the very last comment at the bottom of this <email. Would like to hear your thoughts about that. Hope you find these comments useful. 
<Section 1. Introduction * There are a large number of flows which may be difficult to identify per-flow state and cause the high utilization.(Section <3.4) >>High utilization of what? - could be bandwidth, buffers, energy, cpu, etc. 
Quan->Thanks for pointing out. I think it mainly focus on high utilization of bandwidth. It is better to fix with “...and cause the high utilization of bandwidth ". 
[KM] ack. If I am right, this should apply to bursty traffic also. Right?
[Quan]Yes, I think so.

< * The mechanisms used to ensure bounded latency (e.g. queuing mechanism) may be multiple or have different configuration/parameters in <multi-domains. >> The first part of the sentence is not clear whether the "mechanisms may be multiple" in a single or multi-domain. I suggest use <'different' instead of 'multiple'. Did authors mean one domain will implement one type of mechanism? Or do you consider the situation where each <hop may process packet for bounded/definite latency differently? 
Quan->It may support more than one type of queuing mechanisms in a DetNet domain based on the different bounded latency requirements. But for now, one domain will select one type of mechanism for a flow and the queuing mechanism and the packet processing of the nodes along the path will be the same. 

<Section 3.1.1.: ‘This can be done, for example, by putting extra buffer space at the ingress of a new domain, increasing the dead time as a guard <band, or using some timing compensation mechanism.’ >> Can you provide references to dead time and timing compensation details for offline <reading? I am not familiar with technical details here but identifying 'how much' 2 domains are out of sync is also a requirement. Similarly, in <section 3.1.1 should we not say "identify (or estimate), “recover and absorb such time variance..." 
Quan-> The TSN connections scenario is a traditional use case for DetNet. I am not sure that it is a necessary requirement especially when DetNet domain provides mechanisms not requiring synchronization.The references of dead time and timing compensation details may refer to IEEE TSN such as ECQF[1]. 
[1]https://1.ieee802.org/tsn/802-1qdv/ 
[km] I believe that the references are necessary. When I read " increasing the dead time as a guard “ it was not obvious what the authors referring to. Either you would remove the example or provide the background context to it. Is there an expectation to be familiar with the TSN to understand the document?
[Quan]Yes, your comment will be revised in next version.


<Section 3.1.3 >>wondering what is a better term - Full time synchronization or end-to-end time synchronization? I am also concerned that partial <synchronization could mean something else – like the traffic was only synchronized for a certain part. What if we used alternate terms like inter-<domain time synchronization and intra-domain time synchronization to mean full and partial? 
Quan-> I think section 3.1.3 mainly proposed an example such as frequency synchronization not focus on the inter-domain, intra-domain or end-to-end time synchronization. I am not sure about the definition of full time synchronization and partial synchronization. If there is no consensus, maybe“Provide Mechanisms not Requiring Strict Time Synchronization" is better? 
[km] Yes, that works. What’s confusing in this paragraph is use of full synchronization, strict synchronization and partial synchronization. These terms can be confusing based on the context. I will also suggest to replace all occurrences of 'full synchronization' with 'strict synchronization'.
[Quan]Yes, this may be revised in next version.


<Section 3.2 ‘It is much greater than that of a LAN, and introduces impacts on queuing mechanisms, such as cyclic or time aware scheduling <method. So the queuing mechanism for LAN networks needs to be extended,’ >> I think we can rephrase the second path of the sentence. The <requirement is to consider propagation delays in end-to-end computations. Why only restrict to LAN queueing. "queuing for LAN needs to be <extended" does not appeal as correct to me.  
Quan->Good point! The mechanisms can not be restricted to TSN mechanisms extensions. 


<Section 3.3, It is practical to have a few flow- based or aggregated-flow based status in the local network.But in higher speed and larger scale <networks, it is hardly feasible. If [IEEE802.1Qcr] is in use, it requires more buffers comparing to the other full/partial time synchronized mechanisms. <Therefore, it requires optimizations to support higher link speeds. >> The text started to discuss flow-state in small-scale vs large-scale networks. <But then there is a discussion about buffers which is related to packet-queueing. I think these are 2 separate things. Do you want to say explicitly <that managing status for large number of flows will need more resources to maintain the state on per flow basis.  
Quan->When supporting higher link speeds, more data can be sent if the network transmission time remains the same. Then more flows and aggregated flows may be transmitted. And the ATS [IEEE802.1Qcr] is a per-flow queuing mechanism which means more states should be maintained and more buffers should be reserved. 

<Section 3.4, maybe related to 3.6 Curious question: DetNet are described to have always sufficient resources so that they will be congestion-free. <Does high-utilization mean that there is a possibility of congestion? 
Quan->In my understanding, one of the three techniques of DetNet architecture [RFC8655] is to allocate sufficient resources to avoid the congestion but that can not guarantee the bounded latency. For a node at a particular time, the large number of flows will be transmitted at the same time and they may be queued and it will significantly fill up the capacity of the link. The instantaneous bandwidth utilization of this link may be high but can not over 100%. The time-based queuing mechanisms can schedule the flows and tolerate high utilization. 

<Section 3.6 ,It is required to support bursty traffic and some methods to decrease the micro-burst. So the pre-planned , ingress traffic <conditioning, scalable queuing, and enhanced capacity of buffer are required to accommodate the flow fluctuation, >> It is slightly difficult to infer <whether traffic above means mixed traffic or is it always deterministic traffic? Is it expected to support both deterministic and best-effort traffic <over the same network. I'd suggest a discussion to clarify this aspect should be done (one or 2 sentences in introduction).  
Quan->I agree to make clarification for the bursty traffic. It includes the deterministic traffic and non-deterministic traffic and the ingress traffic control policy may be different. For non-deterministic traffic, the ingress may just drop it based on the policy. 
[km] ack


<Section 3.8.1 Support Configuration of Multiple Mechanisms >> have authors considered provisioning as a better term? Since the scale of <distributing queueing mechanisms and corresponding metadata can be a daunting task in large-scale networks, more dynamic or in-band means <may be needed. Moreover, it will be good to discuss that having a common set of set of parameters for different mechanisms will help with <interoperability. Section 3.8.2 s/Both of the two cases may may generate/Both of the cases may generate· 
Quan-> Thanks for your advice. I agree with you. “Support provisioning of Multiple Mechanisms” is better. And the common data field for different mechanisms has been discussed in draft [1] and [2]. 
[1]https://datatracker.ietf.org/doc/draft-xiong-detnet-data-fields-edp/ 
[2]https://datatracker.ietf.org/doc/draft-geng-detnet-dataplane-enchancment-encap/ 

<Section 4. Data plane enhancements >>If the data plane is enhanced, should we say something about the existing DetNet data plane and <backward compatibility with it? 
Quan->Yes, I agree with you. The DetNet data plane should be enhanced and the new functions and related metadata should be supported as per [1]. Backward compatibility should be taken into account. 
[1]https://datatracker.ietf.org/doc/draft-xiong-detnet-large-scale-enhancements/ 
[km] I have not gone through this document. Is this a solution or requirements? If it is not a solution, it should be merged here otherwise we should  add requirements text.
[Quan] I think it is a framework for enhanced DetNet data plane based on RFC8938. And I presented it in IETF meetings before. Maybe it can be merged if we get concensus after discussion.  

<Lastly, I found control plane considerations section missing. Since large scale networks are formed from stitching of <multiple domains, at least <one activity standards worthy is how the domains will exchange information together? This is somewhat covered in Section 3.8.2. But my overall <concern with this requirements document is that it has very few constraints. There are no limits on time-sync requirements, mechanisms, <topologies, link speeds, buffers etc. Then how do we verify/claim that the DetNet flows will meet the requested QoS for all DetNet flows in that <large-scale network. Maybe we also need a requirement for control planes to signal feasibility, violations, and monitoring performance of the <deterministic flows. 
Quan->This draft mainly focus on the scaling-requirements in enhanced DetNet data plane. The control plane considerations has been discussed several times in IETF meetings before. Chairs suggested to discuss and update that in existing control plane draft[1]. And the control plane considerations and extensions has been proposed in [2][3]. Discussions and comments are very welcome. 
[1] https://datatracker.ietf.org/doc/draft-ietf-detnet-controller-plane-framework/ 
[2] https://datatracker.ietf.org/doc/draft-xiong-detnet-teas-te-extensions/ 
[3] https://www.ietf.org/archive/id/draft-xiong-detnet-large-scale-enhancements-02.html#name-controller-plane-management 
[km] I suggest to add a reference to these documents for the sake of completeness.
[km] Authors should also add a requirement that states verification mechanisms for successful delivery of flows or signaling for failures. And look forward to revision with comments addressed.
[Quan] Thanks. I will discuss with other authors to see if we should add requirements in control plane and manegement plane.

Cheers,
Kiran


<Thanks Kiran