Re: [bmwg] Martin Duke's Discuss on draft-ietf-bmwg-evpntest-09: (with DISCUSS and COMMENT)

Martin Duke <martin.h.duke@gmail.com> Thu, 01 July 2021 17:13 UTC

Return-Path: <martin.h.duke@gmail.com>
X-Original-To: bmwg@ietfa.amsl.com
Delivered-To: bmwg@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id E97AF3A0BF4; Thu, 1 Jul 2021 10:13:38 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.097
X-Spam-Level:
X-Spam-Status: No, score=-2.097 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, FREEMAIL_FROM=0.001, HTML_MESSAGE=0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id LR-ebYFDc0eO; Thu, 1 Jul 2021 10:13:34 -0700 (PDT)
Received: from mail-il1-x131.google.com (mail-il1-x131.google.com [IPv6:2607:f8b0:4864:20::131]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id D34053A0BF1; Thu, 1 Jul 2021 10:13:33 -0700 (PDT)
Received: by mail-il1-x131.google.com with SMTP id b5so7050893ilc.12; Thu, 01 Jul 2021 10:13:33 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=CS8F0N9TzokLMDZbV5xwp3zJbEZGUSe03C5MAwyb3Xw=; b=g/KJHvaHvuBBoGhoVys0n6wMp6GnnXKnJDRhXxOs/xHSSPdSSHXFiwX3k2+aDK7PK5 mqTDS3U9u95Od5ZhDTK3K2DiN/xHy6dz8reeZvt+k9VBfXohAtURa8219ZAjPsTyJXoE ZbYV6I5gHKWCNwV7gW0/QpaY8jg8juhQPOLW154Wl/cdT723PfHhIqFdTTfRoCmk/8kB NP7JP3tJNYYkDaRjb7+h4mJX5V3K77AuVZ1kmKs4P7y6zNGlb8Z942IblVghb/p12fIM r7tE9kLq0vgDZXOKwApqDTtBt38la8sszCBHMWyLrcd0U2kbBWZNLG2EoWfZIZL0WFmx s9dA==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=CS8F0N9TzokLMDZbV5xwp3zJbEZGUSe03C5MAwyb3Xw=; b=Lpgji5ASiqTgjRaA1uQynnHCgn6fjiRcBJzpIELU4RmErt20xkz5llw8fDYW0zCbx5 CQjesoh5FObKI+GhXo2AkqcPbDWpRqf0LdDhKTqy6UtlrxHLkMYoPDzziZHIA7N3YmJW +vwVPfl+jaJwy1agMPgepiwUbRs4yJ6s/jvVeXBWlSoYRxh6yGLOBxcxrE8gIV8/1Aur Gw8FvU1XdmExUHdgxdE0VMlQYkmyXmQ4RhOg3VS5uCUw8aFHyQ2iEwSEy3URqINX0Go5 2OT/pZ+8xb7Fc2riNXfDwY+C6NM8paFramKr1QdGgVzq0wBdNrRPuZjX2pLftCKg31vT z8Xw==
X-Gm-Message-State: AOAM533qIY5NN75tJECpHTvqfQgaZXEKAwui6WerZC9tOiRaV/9+D97Y T0bapwhUi44j1lH/EY0owNXGfpdBN9wDWtFedUQ=
X-Google-Smtp-Source: ABdhPJzNwUkcKU94Uv3vpDMX1HgR8zrG+XcXG4K0Qx2nqKvkr0aA90PgxrXdfmAAiFqkyqrshpgRRlIQQTTadt1DIkE=
X-Received: by 2002:a05:6e02:1245:: with SMTP id j5mr299703ilq.249.1625159612754; Thu, 01 Jul 2021 10:13:32 -0700 (PDT)
MIME-Version: 1.0
References: <3DBD2DD7-B7DE-42EA-A2D3-D539E692DB38@encrypted.net> <1624988083.S.6320.15165.f4-234-163.1625124750.23299@webmail.rediffmail.com>
In-Reply-To: <1624988083.S.6320.15165.f4-234-163.1625124750.23299@webmail.rediffmail.com>
From: Martin Duke <martin.h.duke@gmail.com>
Date: Thu, 01 Jul 2021 10:13:22 -0700
Message-ID: <CAM4esxSjwg45sWt7ECGd0pz+X0U4Qaii35=3AqAmKu90dPeP0A@mail.gmail.com>
To: Sudhin <sudhinjacob@rediffmail.com>
Cc: Sarah Banks <sbanks@encrypted.net>, The IESG <iesg@ietf.org>, "bmwg-chairs@ietf.org" <bmwg-chairs@ietf.org>, "draft-ietf-bmwg-evpntest@ietf.org" <draft-ietf-bmwg-evpntest@ietf.org>, "bmwg@ietf.org" <bmwg@ietf.org>
Content-Type: multipart/alternative; boundary="0000000000006bf05105c612f5ef"
Archived-At: <https://mailarchive.ietf.org/arch/msg/bmwg/mKRB0TCYh_ByhiL8a8miUg9vZi8>
Subject: Re: [bmwg] Martin Duke's Discuss on draft-ietf-bmwg-evpntest-09: (with DISCUSS and COMMENT)
X-BeenThere: bmwg@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: Benchmarking Methodology Working Group <bmwg.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/bmwg>, <mailto:bmwg-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/bmwg/>
List-Post: <mailto:bmwg@ietf.org>
List-Help: <mailto:bmwg-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/bmwg>, <mailto:bmwg-request@ietf.org?subject=subscribe>
X-List-Received-Date: Thu, 01 Jul 2021 17:13:39 -0000

Thanks for your responses

I am trying to be quite literal and precise here, and my overall sense is
that the text has many assumptions built into it that I would like to make
explicit.

On Thu, Jul 1, 2021 at 12:32 AM Sudhin <sudhinjacob@rediffmail.com> wrote:

> > ----------------------------------------------------------------------
> > DISCUSS:
> > ----------------------------------------------------------------------
> >
> > (3.8) (4.8) Why is packet loss measured in time? How is learning 2X MAC
> > addresses relevant to the packet loss measurement at the traffic
> generator? How
> > long does the traffic generator have to wait to conclude that the packet
> is
> > lost?
>
> Sudhin>>>>> HA  test must be measured in seconds. Because the learning of
> Mac is needed to ensure frames are not flooded. Traffic generator
> calculations are beyond the scope of the draft. These devices are
> calibrated by the respective vendors.
>

I cannot reconcile this reply with the text in the draft: "Objective:
Measure traffic loss during routing engine fail over."

Maybe the objective is to measure the time till the standby device acquires
all the state, AND the packet loss, if any?

It would appear also that there is a hidden requirement that the topology
must not have a delay or jitter than exceeds the loss detection algorithm
of the generator.


> >
> > (3.9) Is a single failure to learn an address sufficient to determine
> that the
> > device has reached capacity? Or could packet loss or some other
> phenomenon lose
> > some addresses? In other words, be more precise on how polling reveals
> the
> > capacity.
> Sudhin>>> This is explained in the procedure section, which means the DUT
> can't learn any incremental values of MAC+IP/MAC+ipv6.
>

I read the procedure section, and I still have these questions. If I send X
ARP/ND messages and there are X-1 entries in the table, does that
conclusively prove that the limit is X-1, or might a packet have been lost?

>
> > Is there some lower bound on the time between sending ARP/ND packets and
> > querying the DUT?
>
> Sudhin>>> As explained above each 5% increase the data is validated.
>

That is not my question. If I send an ARP or ND message, and then query 1
us later, should I expect to see that entry in the result? (I see in [3.10]
below, IIUC, that you answer that the expectation is 1 sec.)

>
> > (3.11, 3.12, 4.10, 4.11) Does the traffic generator send F frames in
> total or F
> > ffs? The spec says both. Are there any constraints on F, perhaps an
> integer
> > multiple of X?
>
> Sudhin> F as variable is selected due to the fact it must be different
> from Mac values which is denoted by 'X'. Yes F is an integer value to
> denote frames per sec.
>

Alright please apply s/Send F frames/Send F frames per second in these
sections. Please also add text defining limits ando/or considerations for
choosing F.

>
> > ----------------------------------------------------------------------
> > COMMENT:
> > ----------------------------------------------------------------------
> >
> > (3.10, 4.9) Again, is there a minimum time between sending the traffic
> and
> > querying the result?
>
> Sudhin>>>>  The traffic is continuous and query interval to poll script
> must have minimum delay of 1s.
>

Wonderful -- please add this limit to these sections and (3.9)


> >
> > (3.12, 4.11) I don’t believe you’ve adequately addressed Al’s TSVART
> review.
> > What does “100% compared to the average usage” mean? Is that double?
> Shouldn’t
> > there be a formula to compute average usage?
>
> Sudhin>> CPU usages goes from 0% to 100%. Average usage is the usage of
> DUT during the start of the test. For example it can be 20% or 25% it must
> not spike to 100%.
>

As an example, at the start of the test, the CPU usage is 10%. If at any
point CPU usage is 100%, it fails, but if it never exceeds 99.9%, it
passes? If so, deleting the phrase "compared to the average usage" would
make this clearer.


> >
> > As Al asks, what is the threshold over which an increase in memory usage
> will
> > fail the test?
>
> Sudhin>> As the test says the memory should not increase with respect to
> time. If it is then it is a failure.
> >
>

Say the memory usage at the start of the test is 25%.

If it momentarily increases to 25.1% and then goes back down again, is that
a failure?

If it ends the test at 25.1%, is that failure?


> >
> >
> > _______________________________________________
> > bmwg mailing list
> > bmwg@ietf.org
> > https://www.ietf.org/mailman/listinfo/bmwg
> <https://www.ietf.org/mailman/listinfo/bmwg==>
>
>