Re: [tsvwg] Genart last call review of draft-ietf-tsvwg-ecn-encap-guidelines-20

Bob Briscoe <in@bobbriscoe.net> Fri, 10 November 2023 14:27 UTC

Return-Path: <in@bobbriscoe.net>
X-Original-To: tsvwg@ietfa.amsl.com
Delivered-To: tsvwg@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id C16BFC15C297; Fri, 10 Nov 2023 06:27:45 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -7.105
X-Spam-Level:
X-Spam-Status: No, score=-7.105 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, HTML_MESSAGE=0.001, RCVD_IN_DNSWL_HI=-5, RCVD_IN_ZEN_BLOCKED_OPENDNS=0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, T_SCC_BODY_TEXT_LINE=-0.01, URIBL_BLOCKED=0.001, URIBL_DBL_BLOCKED_OPENDNS=0.001, URIBL_ZEN_BLOCKED_OPENDNS=0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=bobbriscoe.net
Received: from mail.ietf.org ([50.223.129.194]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id ZPBz0nJMiAyF; Fri, 10 Nov 2023 06:27:41 -0800 (PST)
Received: from mail-ssdrsserver2.hostinginterface.eu (mail-ssdrsserver2.hostinginterface.eu [185.185.85.90]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 20FDAC151096; Fri, 10 Nov 2023 06:27:37 -0800 (PST)
DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=bobbriscoe.net; s=default; h=In-Reply-To:References:Cc:To:From:Subject: MIME-Version:Date:Message-ID:Content-Type:Sender:Reply-To: Content-Transfer-Encoding:Content-ID:Content-Description:Resent-Date: Resent-From:Resent-Sender:Resent-To:Resent-Cc:Resent-Message-ID:List-Id: List-Help:List-Unsubscribe:List-Subscribe:List-Post:List-Owner:List-Archive; bh=n7s3ReEqDLt3yHjFqmeu+Eqg8D/HZTvhkZ9MHqBa0oc=; b=u/dtcxT62p9KL8yaBa7uZi3Nat n/FYCpGXnEUlvNpWE0QXe3LdBlBJO0B5EeUyZV4kMfAPVvTYsFk+sUzxBKDXfowA5SiDvCBGwXOBP s8OigXpTdDAOzwFX42k3FWVyh4QYRhLpDuIF+TsFU2w/g7oTlZJ83q+C6VDiv4Ql4Ja8wE0B2RQqY pDRem/B/VcjHmBVAdojHDtVcdMOj/YPwb7JiH+rjOh6LWCinyUxcZLoZQh80+YWGELli/cgeQ05wM KxQvf0PtuMZng0gh7sevpEFcKvc8/8TBeRNRtzdudNgBIgB4b/geiAP0LOF0ZR5ERRwDAJYjsaSBe IlF+MvGA==;
Received: from 67.153.238.178.in-addr.arpa ([178.238.153.67]:53966 helo=[192.168.1.3]) by ssdrsserver2.hostinginterface.eu with esmtpsa (TLS1.2) tls TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256 (Exim 4.96.2) (envelope-from <in@bobbriscoe.net>) id 1r1STv-0007Tl-0V; Fri, 10 Nov 2023 14:27:35 +0000
Content-Type: multipart/alternative; boundary="------------OAk7WVCQJLpqdy3brwrg89XB"
Message-ID: <d678f8aa-4f51-4ba2-902b-8d7fc5b43127@bobbriscoe.net>
Date: Fri, 10 Nov 2023 14:27:33 +0000
MIME-Version: 1.0
User-Agent: Mozilla Thunderbird
Content-Language: en-GB
From: Bob Briscoe <in@bobbriscoe.net>
To: Susan Hares <shares@ndzh.com>, gen-art@ietf.org
Cc: draft-ietf-tsvwg-ecn-encap-guidelines.all@ietf.org, last-call@ietf.org, tsvwg@ietf.org
References: <169853147405.56049.1540851343012580745@ietfa.amsl.com> <ace8a926-eb85-441b-b6c8-472dcbd90d70@bobbriscoe.net>
In-Reply-To: <ace8a926-eb85-441b-b6c8-472dcbd90d70@bobbriscoe.net>
X-MagicSpam-TUUID: 0fe7372c-13ea-4e83-abfb-cfaad959906c
X-MagicSpam-SUUID: 829be711-eeff-4c42-8d61-ea66f4f2c577
X-AntiAbuse: This header was added to track abuse, please include it with any abuse report
X-AntiAbuse: Primary Hostname - ssdrsserver2.hostinginterface.eu
X-AntiAbuse: Original Domain - ietf.org
X-AntiAbuse: Originator/Caller UID/GID - [47 12] / [47 12]
X-AntiAbuse: Sender Address Domain - bobbriscoe.net
X-Get-Message-Sender-Via: ssdrsserver2.hostinginterface.eu: authenticated_id: in@bobbriscoe.net
X-Authenticated-Sender: ssdrsserver2.hostinginterface.eu: in@bobbriscoe.net
X-Source:
X-Source-Args:
X-Source-Dir:
Archived-At: <https://mailarchive.ietf.org/arch/msg/tsvwg/bq7Lsoe2gpkEaSnRnWu58To8m5Y>
Subject: Re: [tsvwg] Genart last call review of draft-ietf-tsvwg-ecn-encap-guidelines-20
X-BeenThere: tsvwg@ietf.org
X-Mailman-Version: 2.1.39
Precedence: list
List-Id: Transport Area Working Group <tsvwg.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/tsvwg>, <mailto:tsvwg-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/tsvwg/>
List-Post: <mailto:tsvwg@ietf.org>
List-Help: <mailto:tsvwg-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/tsvwg>, <mailto:tsvwg-request@ietf.org?subject=subscribe>
X-List-Received-Date: Fri, 10 Nov 2023 14:27:45 -0000

Sue, I followed up my own email with another - see one addition tagged [BB2]

On 08/11/2023 09:54, Bob Briscoe wrote:
> Susan,
>
> Thank you for your review. See [BB]
>
> On 28/10/2023 23:17, Susan Hares via Datatracker wrote:
>> Reviewer: Susan Hares
>> Review result: Ready with Issues
>>
>> I am the assigned Gen-ART reviewer for this draft. The General Area
>> Review Team (Gen-ART) reviews all IETF documents being processed
>> by the IESG for the IETF Chair.  Please treat these comments just
>> like any other last-call comments.
>>
>> For more information, please see the FAQ at
>>
>> <https://wiki.ietf.org/en/group/gen/GenArtFAQ>.
>>
>> Document: draft-ietf-tsvwg-ecn-encap-guidelines-??
>> Reviewer: Susan Hares
>> Review Date: 2023-10-28
>> IETF LC End Date: 2023-11-02
>> IESG Telechat date: Not scheduled for a telechat
>>
>> Summary: The document summarizes decades of work on congestion work in IEEE
>> 802.1 and IETF.  It provides a set of good guidelines/recommendations for
>> designers of Layer-2 (L2) or L2/L3 shim layer.
>>
>> The authors (John Kaippallimalil and Bob Briscoe) and the original author/
>> now-contributor (Pat Thaler) should be commended for their work.
>>
>> The text is generally readable with relatively few English and editorial
>> issues.  However, the authors would be wise to fix the editorial issues prior
>> to sending it to the RFC editor since phrasing needs to be precise.
>>
>> Major issues: none.
>>
>> Minor issues:
>>
>> 1. Have the authors considered the SR-routing pathways with tunnels in this
>> draft?
>>     If so, the authors might add a side note in the document.
>
> [BB] Not specifically. The idea is to provide guidelines for any 
> protocol designers who aim to add ECN. Might there be something 
> specific in SR that would make this generic guidance inapplicable? 
> Other than not having protocol space, which is obviously not something 
> that guidelines can solve (similarly for all IEEE802 protocols).
>
>> 2. Has this document been circulated by the IETF liaisons to IEEE 802.1, MEF,
>> and 3GPP? 3.
> [BB]
>
>   * IEEE802.1: no current relevant work (Pat Thaler was a deliberate
>     choice of co-author at the time, given she was the IEEE802
>     liaison, and worked on 802.1Q congestion signalling - separate
>     control plane messages)
>       o Here's the liaison request:
>         https://datatracker.ietf.org/liaison/1364/ (I can't find the
>         response)
>   * 3GPP: responded with 6 relevant WGs and 20 relevant TRs, which
>     were subsequently analysed and a formal response returned.
>       o Formal liaison response:
>         https://datatracker.ietf.org/liaison/1499/
>       o Or, for summary, see slides 3-8 from the time:
>         https://www.ietf.org/proceedings/95/slides/slides-95-tsvwg-11.pdf
>   * MEF: No liaison. The rationale was to focus on SDOs that might
>     define ECN-like extensions to their lower layer protocols. I think
>     the MEF is unlikely to define protocols at that level; it is more
>     about using protocols defined in other bodies like IEEE802. IYO,
>     was this a bad judgement call?
>
>
>> Are you really sure your security and manageability sections are
>> complete?
>
> [BB]
>
>   * Security:
>       o yes definitely complete
>   * Manageability:
>       o We didn't write a separate manageability section, but I
>         believe we covered the various aspects well within the text,
>         but I'll let you decide by quickly listing them:
>       o Config:
>           + ECN has been designed to have zero to one alternatives
>               # decap has no choices
>               # encap is fairly minimal wrt config, the main aspect
>                 being config of encaps where it doesn't know whether
>                 the egress supports ECN decap because it doesn't
>                 negotiate with it (which is heavily covered in the draft).
>           + in numerous places, the draft distinguishes approaches
>             that will work in managed environments from those that are
>             more appropriate in non-managed plug-and-play scenarios.
>       o Monitoring:
>           + The guidelines on encap justify themselves based on being
>             able to monitor congestion over the whole path from the
>             origin load source, but also allow the congestion
>             introduced over the scope of the tunnel, link or subnet to
>             be measured.
>       o Anomaly detection:
>           + propagating ECN over encaps doesn't change the usefulness
>             or otherwise of ECN in detecting anomalies, so there's no
>             mention of this.
>       o Deployment & Coexistence:
>           + A large part of the draft is about addressing the problem
>             of incrementally deploying ECN support, in particular
>             where the egress of a subnet or tunnel does not support
>             ECN decap.
>       o Scaling:
>           + The good scaling properties of ECN itself are not changed
>             by encaps, other than whether the propagation process can
>             be implemented as line rate scales. The design of the ECN
>             protocol determines that, so there's nothing more this
>             draft can say about that than, you will have to work out
>             how to implement efficiently (which goes without saying).
>
>>          RFC7713 predates large deployment of SR routing.
>
> [BB] I was rather surprised to see these two mentioned in the same 
> sentence under security considerations, as if they might offer similar 
> functions.
> However, all I knew about segment routing was what Rob Shakir had 
> described to me many years ago, so I've been reading up. I still don't 
> see anything in SR that could help with assurance of e2e integrity of 
> congestion signals. The problem addressed by RFC7713 is how to ensure 
> that everything around the whole end-to-end congestion control loop is 
> reporting congestion to the next device honestly. Just as each network 
> node in isolation can lie about how congested it is, so can a segment, 
> or an SR domain. So I can't see how SR could make any contribution to 
> a solution to this problem.
>
> But if I'm missing some key insight here, pls enlighten me.
>
> BTW, altho RFC7713 is cited here, it's only to say that an integrity 
> solution exists if its needed. It's not been deployed widely, not 
> least because the problem hasn't arisen. Hence the draft describes 
> these bullets as "experimental or proposed".
>
>> Nits/editorial comments:
>>
>> Formatting in the pdf seems to be problematic.  I am not commenting on this
>> point since html form does not have hte same problem.
>
> [BB] I've just looked through the PDF - what problem should I look 
> for? I can't see anything wrong.
>
>> Nits in Editing:
>> Section 2
>> Old/Not-ECN-PDU:
>> A PDU at the IP layer or below that is part of a congestion control
>> feedback-loop within which at least one node necessary to propagate any
>> explicit congestion notification signals back to the Load Regulator is not
>> capable of doing that propagation./
>> New:/Not-ECN-PDU: A PDU at the IP layer or
>> below that is part of a congestion control feedback-loop within which at least
>> one node necessary to propagate any explicit congestion notification signals
>> back to the Load Regulator this PDU is not capable of doing that propagation./
>
> [BB] This isn't what it meant, which implies that the sentence might 
> be on the edge of parseability.
> Whether it's an ECN-PDU or a Not-ECN-PDU is defined as a property of 
> the control loop that the PDU is bound to (by its addressing or 
> labelling), not necessariiy a direct property of the PDU itself. This 
> was meant to be a way of getting round the problem of a packet 
> adopting multiple L2 headers of multiple different L2 protocols as it 
> traverses the path.  Then each PDU has to have some way of indicating 
> whether it is an ECN-PDU or not. In some protocols that will be 
> self-described. But in others, it might be described indirectly in the 
> control plane. I think I'm going to need to explicitly say all this in 
> the terminology explanation.
>
> Normally, I'd give proposed new text, but I'm just heading for my 
> flight, so I'll press sent now.

[BB2] Here's proposed text:

           A PDU at the IP layer or below that is
           part of a congestion control feedback loop that is not capable of
           propagating explicit congestion notification signals back to 
the Load
           Regulator, because at least one of the nodes necessary to 
propagate
           the signals is incapable of doing that propagation. Note that 
this
           definition is a property of the feedback-loop, not 
necessarily of the
           PDU itself, because in some protocols the PDU will 
self-describe the
           property, but in others the property might be carried in a 
separate
           control-plane context that is somehow bound to the PDU.


Bob

>
> [BTW, I've also noticed there are places where it says a congested 
> buffer marks or drops packets without making it clear that it only 
> marks or drops a proportion, not all packets. Apparently this caused 
> problems with the whole attempt to use ECN for Voice over LTE (VoLTE) 
> in 3GPP, where they defined ECN as either 100% on or 100% off, because 
> it didn't say otherwise anywhere in RFC3168 (!). So I'd better fix this.]
>
>> Section 3
>> Text:/
>> The router forwards the marked L3 header into subnet 2, and when it adds a new
>> L2 header it copies the L3 marking into the L2 header as well, as shown by the
>> 'C's in both layers (assuming the technology of subnet 2 also supports explicit
>> congestion marking)./
>>
>> Question - did you mean subnet b instead of subnet 2 (per figure 1)
>
> [BB] Yes. Also caught by another reviewer - strange no-one in the WG 
> noticed this.
>
>> Section 4.3
>> Text:/
>> This ensures that bulk congestion monitoring of outer headers (e.g. by a
>> network management node monitoring ECN in passing frames) will measure
>> congestion accumulated along the whole upstream path — since the Load Regulator
>> not just since the ingress of the subnet. /
>>
>> The portion of this sentence that has me confused is "since the Load Regulator
>> not just since the ingress of the subnet.".   I'm not really sure what you are
>> trying to say - so I have no suggested new text.
>
> [BB]
> will measure
> congestion accumulated along the whole upstream path —*starting from*  the Load Regulator
> not just*starting from*  the ingress of the subnet.
>
> Would that make it comprehensible?
>
>
>
> Bob
>
>
>
>
>
> -- 
> ________________________________________________________________
> Bob Briscoehttp://bobbriscoe.net/

-- 
________________________________________________________________
Bob Briscoehttp://bobbriscoe.net/