Re: WGLC for draft-ietf-bfd-large-packets

Jeffrey Haas <jhaas@pfrc.org> Thu, 26 September 2019 19:46 UTC

Return-Path: <jhaas@slice.pfrc.org>
X-Original-To: rtg-bfd@ietfa.amsl.com
Delivered-To: rtg-bfd@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id AA6DE120ADD for <rtg-bfd@ietfa.amsl.com>; Thu, 26 Sep 2019 12:46:53 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.901
X-Spam-Level:
X-Spam-Status: No, score=-1.901 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, SPF_PASS=-0.001] autolearn=ham autolearn_force=no
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 2Rp35Tj_kTP5 for <rtg-bfd@ietfa.amsl.com>; Thu, 26 Sep 2019 12:46:51 -0700 (PDT)
Received: from slice.pfrc.org (slice.pfrc.org [67.207.130.108]) by ietfa.amsl.com (Postfix) with ESMTP id 60F11120891 for <rtg-bfd@ietf.org>; Thu, 26 Sep 2019 12:46:51 -0700 (PDT)
Received: by slice.pfrc.org (Postfix, from userid 1001) id 9D8031E2F2; Thu, 26 Sep 2019 15:49:48 -0400 (EDT)
Date: Thu, 26 Sep 2019 15:49:48 -0400
From: Jeffrey Haas <jhaas@pfrc.org>
To: "Les Ginsberg (ginsberg)" <ginsberg@cisco.com>
Cc: "Ketan Talaulikar (ketant)" <ketant@cisco.com>, "Reshad Rahman (rrahman)" <rrahman@cisco.com>, "rtg-bfd@ietf.org" <rtg-bfd@ietf.org>
Subject: Re: WGLC for draft-ietf-bfd-large-packets
Message-ID: <20190926194948.GA22700@pfrc.org>
References: <C44550AC-F6E0-4351-9958-CB9144C9F23A@cisco.com> <CY4PR11MB1541F9CEEA547F1D962D79B5C1B30@CY4PR11MB1541.namprd11.prod.outlook.com> <29C6F22D-AC57-446C-8013-408C4E28A94A@pfrc.org> <BYAPR11MB36384BA8A940618DA9FC3F76C18E0@BYAPR11MB3638.namprd11.prod.outlook.com> <20190918152817.GA20672@pfrc.org> <MN2PR11MB3647316C13CAA5EBD4531B06C1890@MN2PR11MB3647.namprd11.prod.outlook.com> <20190919020128.GB20672@pfrc.org> <BYAPR11MB3638E358EF9CE34818ECD010C1890@BYAPR11MB3638.namprd11.prod.outlook.com> <D95235B0-9917-4C06-97E6-1181BFF6F7CC@pfrc.org> <BYAPR11MB36383D1DF70CFA399BFF6E59C1840@BYAPR11MB3638.namprd11.prod.outlook.com>
MIME-Version: 1.0
Content-Type: text/plain; charset="us-ascii"
Content-Disposition: inline
In-Reply-To: <BYAPR11MB36383D1DF70CFA399BFF6E59C1840@BYAPR11MB3638.namprd11.prod.outlook.com>
User-Agent: Mutt/1.5.21 (2010-09-15)
Archived-At: <https://mailarchive.ietf.org/arch/msg/rtg-bfd/5_ZseB1ao2TwN1aQHD5BhkkYLyE>
X-BeenThere: rtg-bfd@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: "RTG Area: Bidirectional Forwarding Detection DT" <rtg-bfd.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/rtg-bfd>, <mailto:rtg-bfd-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/rtg-bfd/>
List-Post: <mailto:rtg-bfd@ietf.org>
List-Help: <mailto:rtg-bfd-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/rtg-bfd>, <mailto:rtg-bfd-request@ietf.org?subject=subscribe>
X-List-Received-Date: Thu, 26 Sep 2019 19:46:54 -0000

Les,

On Tue, Sep 24, 2019 at 10:48:51PM +0000, Les Ginsberg (ginsberg) wrote:
> A few more thoughts - maybe these are more helpful than my previous comments - maybe not. I am sure you will let me know.
> 
> Protocol extensions allowing negotiation and/or advertisement of support for larger PDUs may well be useful - but let's agree that it is desirable to deploy this without protocol extensions just to keep the interoperability bar low.
> 
> My primary angst is with the following paragraph in Section 3:
> 
> "It is also worthy of note that even if an implementation can function
>    with larger transport PDUs, that additional packet size may have
>    impact on BFD scaling.  Such systems may support a lower transmission
>    interval (bfd.DesiredMinTxInterval) when operating in large packet
>    mode.  This interval may depend on the size of the transport PDU."
> 
> Given long experience that CPU use correlates more highly with number of packets than with number of bytes, the first sentence would seem to be weakly supported.
> Given the previously mentioned concerns about detection time, the second sentence seems to compromise the value of the extension.

My experience is largely identical to yours.

The motivation for mentioning anything at all here is TANSTAAFL[1], and
we've already had people ask about possible impacts.  And, as we discussed
previously in the thread we shall inevitably get asked about it during TSV
review in IESG.

The primary reason this is a "may" in the non-RFC 2119 sense is that our
experience also suggests that when the scaling impacts are primarily pps
rather than bps that this feature will likely have no major impact on
implementations beyond your valid concerns about exercising bugs.

I suspect had this not been mentioned at all, you would have been happier.
But you're not the target audience for this weak caveat.

> What might be better?
> 
> 1)Some statement that MTU isn't necessarily a consistent value for all systems connected to an interface - which can impact the results when large BFD packets are used. Implementations might then want to consider supporting "bfd-mtu" configuration and/or iterating across a range of packet sizes to determine what works and what doesn't.

I'm not clear what you intend by this statement.

Are you asking that we emphasize the use case in a different way?  The
Introduction currently states: 
  "However,
   some applications may require that the Path MTU [RFC1191] between
   those two systems meets a certain minimum criteria.  When the Path
   MTU decreases below the minimum threshold, those applications may
   wish to consider the path unusable."

I'm also unclear what "Implementations" may refer to here.  BFD?  An
arbitrary user application?  If the latter, the application may not have
strict control over the generation of a given PDU size; e.g. TCP
applications.

> 2)Use of both padded and unpadded packets in combination with draft-ietf-bfd-stability to determine whether a BFD failure is due to padding or a generic forwarding failure.
> 
> Either of these suggestions are really "diagnostic modes" which may help diagnose a problem but aren't meant to be used continuously as part of fast failure detection.

We could certainly add a paragraph or two as an application note about using
this for BFD stability purposes as well.

-- Jeff