Re: [ippm] 1st AD review of rfc8889bis
Martin Duke <martin.h.duke@gmail.com> Thu, 02 December 2021 19:23 UTC
Return-Path: <martin.h.duke@gmail.com>
X-Original-To: ippm@ietfa.amsl.com
Delivered-To: ippm@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 0532A3A0417; Thu, 2 Dec 2021 11:23:20 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.097
X-Spam-Level:
X-Spam-Status: No, score=-2.097 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, FREEMAIL_FROM=0.001, HTML_MESSAGE=0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id g7RLJT3lQFyH; Thu, 2 Dec 2021 11:23:15 -0800 (PST)
Received: from mail-ua1-x936.google.com (mail-ua1-x936.google.com [IPv6:2607:f8b0:4864:20::936]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 47AE13A0412; Thu, 2 Dec 2021 11:23:15 -0800 (PST)
Received: by mail-ua1-x936.google.com with SMTP id l24so1129689uak.2; Thu, 02 Dec 2021 11:23:15 -0800 (PST)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20210112; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=egrPgf+WDyqt6ZPBU/5XNZnBvML6/JdxsvH+DypP9zI=; b=X29uYozPe6k9sGllcp2W2V7opa5G50p6jJZTeUhM7RqIbYFRmfDCtIF7j3qIljQ4gC WQ53JGJA5OQa79UNNks4TWE72cRELagVJor+50xjaG/xtYZuxEBhwPqxB/Twgn0lZrMm PknlY3cjNH1zOPUZrx6grGvbhIXCoTtj7l/xS5iBOWoEJnw/c6fQwh8MUSqXib9/tv80 vpo5Az1fZkcSNWQmEsQcZQsOpFS1/oki3PIykIf1qAv06ptMc5Z8R65yHAhsaB2rNKkv max0GJmprW50wFbAFnbTe8YTbLBMTOBA+ZvzXcDs5AxUZ3Ex584lDQe5hVy4f84VpLOp mpgw==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=egrPgf+WDyqt6ZPBU/5XNZnBvML6/JdxsvH+DypP9zI=; b=6nhETQlWVfs4nhrzYmj1F2CKeKCIR6MULkleoeLKOjtCmL8CFGxUqQh1eYilMEAuaW z8010L7qNf/eOGiWpBeDKYz92g7i3o06uJD3ADIr5Tyjp54GcXZ9yaf/BOEpe3jd7Rtw o0/GioAiwLIWV5YYg0c5AUpijkiqjRDciIqA0Fd540GqjzzbI+RKhQSzAhfFvhV6vSQW jHbrspgk4DpMbzgu6PSOxK7rczO3P5EA6NvOQWaNu4effapt/y/Tts9IrkwanGW0fRLI ArtpWVVIMioTyinDIXvdUG4WVpC4f5P8NNouAmDjoSsp0knMRW+es2fxLQ0psSD7huXu AFpA==
X-Gm-Message-State: AOAM533BvCSQoZ5wTn6GS6yjtP6xIItYDYBnNlRGyc78wxTgnFdqlEh5 NrEHD2rn1Xl0B10PU0qr+FVId6k6mVnXbk+gy2lq7gCZ
X-Google-Smtp-Source: ABdhPJyHGxcQK1Ons+qVV7Wum5x7Me8yU2LvMIgHAYquaMH6yqVvoKufYWSwhQOlwXShcJL+LGVMh9Q9ijAgmHJQ4ws=
X-Received: by 2002:a67:f950:: with SMTP id u16mr17463102vsq.68.1638472993429; Thu, 02 Dec 2021 11:23:13 -0800 (PST)
MIME-Version: 1.0
References: <CAM4esxSq3bY+LATXWhgM5Xth+PwUJYPv0cxWFZk6LDPDcJUAHA@mail.gmail.com> <6ef48bea72684e4984e496c906a7723e@huawei.com>
In-Reply-To: <6ef48bea72684e4984e496c906a7723e@huawei.com>
From: Martin Duke <martin.h.duke@gmail.com>
Date: Thu, 02 Dec 2021 11:22:57 -0800
Message-ID: <CAM4esxRmLhpOT+M47i7yDw+bA6sOvbA50eaKQzkHcQOHHiQyZA@mail.gmail.com>
To: Giuseppe Fioccola <giuseppe.fioccola@huawei.com>
Cc: "draft-fioccola-rfc8889bis.all@ietf.org" <draft-fioccola-rfc8889bis.all@ietf.org>, IETF IPPM WG <ippm@ietf.org>
Content-Type: multipart/alternative; boundary="000000000000bf5c6505d22eb841"
Archived-At: <https://mailarchive.ietf.org/arch/msg/ippm/Aq2hDJPD-ImZKbZHK5jriPKgUOk>
Subject: Re: [ippm] 1st AD review of rfc8889bis
X-BeenThere: ippm@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: IETF IP Performance Metrics Working Group <ippm.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/ippm>, <mailto:ippm-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/ippm/>
List-Post: <mailto:ippm@ietf.org>
List-Help: <mailto:ippm-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/ippm>, <mailto:ippm-request@ietf.org?subject=subscribe>
X-List-Received-Date: Thu, 02 Dec 2021 19:23:20 -0000
On Thu, Dec 2, 2021 at 3:46 AM Giuseppe Fioccola < giuseppe.fioccola@huawei.com> wrote: > > *From:* Martin Duke <martin.h.duke@gmail.com> > > > - I am confused as to whether ECMP is covered by 8321 or 8889. The 8321 > section on reordering talks about ECMP as a cause, which leads me to > believe it's to be covered; but in the intro 8889 claims this use case. I > suppose it depends on how much you're localizing the measurement? > > > > [GF]: In RFC8321 it was mentioned ECMP just to explain that reordering > can be caused by ECMP paths, but the monitoring was assumed point-to-point > in the whole document. I can omit that sentence in RFC8321 if it is > confusing. While, RFC8889 covers the general case of > multipoint-to-multipoint unicast flows (including ECMP). The BUM traffic > was not covered in the original RFC8889, but Alternate Marking can be > applied in case of multicast. Maybe, in a later version, we can add a short > section in one of the two bis documents to explain how to monitor multicast > traffic by simply applying RFC8321 link by link. > > > > - The intro claims that 8321 covers BUM traffic. Isn't multicast a classic > point-to-multipoint situation? Indeed, doesn't multicast involve packet > replication, which inherently breaks the packet loss mechanism? > > > > [GF]: With the “clustered” approach of RFC8889 you cannot monitor BUM > traffic but only multipoint-to-multipoint unicast flows. For BUM traffic, > you can apply the basic method of RFC8321 but link by link and therefore > split the multicast flow into separate unicast point-to-point links. As > said, a new section on BUM traffic could be useful to clarify this point. > This distinction seems artificial to me; the split between the drafts is not the type of traffic but the type of traffic combined with the scope of the measurement. For example, if you're measuring ECMP at the source and destination, 8321 is fine. Similarly, as you point out BUM traffic for 8321 only works if you combine it with the links. I don't think a substantive change is needed here but maybe the text could be a bit clearer? > > > - 8889 brings up IP fragmentation, but this seems like a general concern > for 8321 too. Although it probably ought to be written out in 8321 and not > here, this seems under-addressed here. Some questions: > > (a) if a router has to fragment a packet, should it replicate the marking > on all the fragments? > > (b) at a measurement point, is a packet "lost" if any fragment is lost? > what if the fragments entered separately in the network under test? > > (c) What if the marking location is not replicated across the fragments > (e.g. it's in the transport header)? > > > > [GF]: Agree, few considerations can be included in RFC8321. The assumption > in RFC8889 was that the fragmentation can be managed if it happens outside > the portion of the network that is monitored. > > Regarding your questions: > > a) both possibilities could be considered. If you do not replicate marking > you may have less problems to apply the methodology but you are missing > fragments and possible losses. > > b) yes, if we mark all the fragments. If the fragments enter separately it > is not a problem since they behave as normal packets (we are assuming > fragmentation and reassembly outside the domain). > > c) if the marking is not applied to all the fragments, the fragmentation > could also happen within the domain but the monitoring is not complete > since you are missing fragments. > I agree that there are many possibilities, but a standard should either provide guidance or (at least) work through the implications of each. It should be more prescriptive; marking nodes and measurement nodes need to have common expectations about the marking rules, although they can deterministically depend on whether or not one can mark fragments (which is obviously deployment-dependent). I'm happy to be corrected by an expert, but after thinking about it for a few minutes I would propose the following: (i) marking nodes MUST mark all fragments if there is a codepoint to use (i.e. it is in the IP header or encapsulation), as if they were separate packets. (ii) routers that fragment packets in a network under test MUST NOT replicate marks (thus signalling that it was fragmented internally), but SHOULD mark the first fragment if they have the capability to do so. (iii) Measurement points MAY simply ignore unmarked fragments and count marked fragments as full packets. However, if resources allow, Measurement Points SHOULD make note of marked initial fragments and only increment its counter if (a) other fragments are also marked, or (b) it observes all other fragments and they are unmarked. > - I don't understand the L determination when there are multiple marking > nodes (Sec 7). "The source measurement intervals can be of different > lengths and with different offsets" Some questions: > > (a) What is a "source measurement interval"? Does it have anything to do > with L? > > (b) So let's say that source nodes start marking at arbitrary times. Isn't > the "offset" then directly a function of L? And if so, how can we scale L > based on the offset m? > > (c) if the idea is that there is a clock time when marking should begin, > then why isn't the value of A [clock skew] embedded in d sufficient? > > > > [GF]: For a multipoint-to-multipoint flow, you may have multiple marking > nodes, so there is an additional contribution to take into account. Indeed, > the marking nodes may apply the markings at different moments and this time > offset does not depend from the clock error or from the network delay but > it is an intrinsic delay of the router. > > Regarding your questions: > > a) it is the marking period. I will replace it to be aligned with the > terminology of the rest of the document. > > b) yes, there is the formula: L - 2m - 2d > 0 > > c) m is not given by clock skew, indeed A is already included in d. > I wasn't clear enough about (b). Let's say node A has period L1, and node B is trying to set its L but is going to start at an arbitrary time. What is m for node B? It is a uniform distribution over [0, L1]. so L2 has to be >> L1. But the same is true of A! The only way to disambiguate the blocks is for each L to be much bigger than all the other Ls, which is of course impossible. The only way I can see for this to work is for the marking nodes to agree a clock time to switch blocks, with fixed L, and then you're only constrained by the clock skew and delay as in 8321. To me this is the central problem 8889 is trying to solve, and it's under-baked. Maybe there's a constraint I'm not seeing? > > > - I can't understand the purpose of "delay measurements on a single-packet > basis" in Section 8. I cannot grok the context of Section 8.2 at all. What > are these techniques trying to achieve? This reads like an informational > exploration of options rather than a normative standard. 8321 has detailed > examples of sample measurements and how you derive a measurement; something > like that would be very useful here. > > > > [GF]: I agree with you. Indeed, in my first local revision of the document > I reduced a lot these sections. As you noticed, it is only an exploration > that it is ok for an experimental document but less relevant for a PS. > great, as we discussed earlier let's delete irrelevant text. >
- [ippm] 1st AD review of rfc8889bis Martin Duke
- Re: [ippm] 1st AD review of rfc8889bis Giuseppe Fioccola
- Re: [ippm] 1st AD review of rfc8889bis Martin Duke
- Re: [ippm] 1st AD review of rfc8889bis Giuseppe Fioccola
- Re: [ippm] 1st AD review of rfc8889bis Martin Duke
- Re: [ippm] 1st AD review of rfc8889bis Giuseppe Fioccola