RE: WGLC for draft-ietf-bfd-large-packets

"Les Ginsberg (ginsberg)" <ginsberg@cisco.com> Wed, 18 September 2019 05:24 UTC

Return-Path: <ginsberg@cisco.com>
X-Original-To: rtg-bfd@ietfa.amsl.com
Delivered-To: rtg-bfd@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 759031200E5 for <rtg-bfd@ietfa.amsl.com>; Tue, 17 Sep 2019 22:24:24 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -14.5
X-Spam-Level:
X-Spam-Status: No, score=-14.5 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, RCVD_IN_DNSWL_HI=-5, SPF_PASS=-0.001, URIBL_BLOCKED=0.001, USER_IN_DEF_DKIM_WL=-7.5] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (1024-bit key) header.d=cisco.com header.b=Un3stBsx; dkim=pass (1024-bit key) header.d=cisco.onmicrosoft.com header.b=gQvshjOc
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id OGc-rJwIfx6n for <rtg-bfd@ietfa.amsl.com>; Tue, 17 Sep 2019 22:24:21 -0700 (PDT)
Received: from alln-iport-7.cisco.com (alln-iport-7.cisco.com [173.37.142.94]) (using TLSv1.2 with cipher DHE-RSA-SEED-SHA (128/128 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 43B6C12006D for <rtg-bfd@ietf.org>; Tue, 17 Sep 2019 22:24:21 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=cisco.com; i=@cisco.com; l=12870; q=dns/txt; s=iport; t=1568784261; x=1569993861; h=from:to:cc:subject:date:message-id:references: in-reply-to:content-transfer-encoding:mime-version; bh=NoZssTgvYeVB5Efw3J0yN+xKXbqE/gOwYFWCRnljbI0=; b=Un3stBsxd0uXGWKtmJXCq+MdrZJ0Tu72X5eVJ+KPB7l0HkIc6Zs6gJGv PePXoW8MPpZ9h8ZyylE216b/cMAvopQ0M4zvSiUpYxLcbQvJEHxtrmYcw TUfyShoOqxY1R+LUsw2RC5xa2a5bK7IYHUP9Yr8b63W6p2QdSJ7foJws7 k=;
IronPort-PHdr: 9a23:I0cxcRPWNln07xdMrCMl6mtXPHoupqn0MwgJ65Eul7NJdOG58o//OFDEu6w/l0fHCIPc7f8My/HbtaztQyQh2d6AqzhDFf4ETBoZkYMTlg0kDtSCDBj0LfjxZSEgE+xJVURu+DewNk0GUMs=
X-IronPort-Anti-Spam-Filtered: true
X-IronPort-Anti-Spam-Result: A0CwAADlvoFd/4wNJK1mGgEBAQEBAgEBAQEHAgEBAQGBZ4FFUAOBQyAECyqEIoNHA4p3glx+lnWBQoEQA1QJAQEBDAEBLQIBAYQ/AheCZiM4EwIDCQEBBAEBAQIBBQRthS0MhUoBAQEBAxIREQwBASMJCwELBAIBCA4DBAEBAQICJgICAjAVCAgCBAENBQgRAgeEawMdAQKkYgKBOIhhc4Eygn0BAQWFChiCFwmBDCiLeBiBQD+BEAFGghc1PoQRARIBISSCZTKCJoxtKgyCMYVJghCVSwqCIpUdmRyODZkFAgQCBAUCDgEBBYFpIWdxcBU7gmyCQgwXg0+KUgFzgSmMew0XB4InAQE
X-IronPort-AV: E=Sophos;i="5.64,519,1559520000"; d="scan'208";a="328116890"
Received: from alln-core-7.cisco.com ([173.36.13.140]) by alln-iport-7.cisco.com with ESMTP/TLS/DHE-RSA-SEED-SHA; 18 Sep 2019 05:24:20 +0000
Received: from XCH-ALN-004.cisco.com (xch-aln-004.cisco.com [173.36.7.14]) by alln-core-7.cisco.com (8.15.2/8.15.2) with ESMTPS id x8I5OK3R031359 (version=TLSv1.2 cipher=AES256-SHA bits=256 verify=FAIL); Wed, 18 Sep 2019 05:24:20 GMT
Received: from xhs-aln-001.cisco.com (173.37.135.118) by XCH-ALN-004.cisco.com (173.36.7.14) with Microsoft SMTP Server (TLS) id 15.0.1473.3; Wed, 18 Sep 2019 00:24:19 -0500
Received: from xhs-aln-003.cisco.com (173.37.135.120) by xhs-aln-001.cisco.com (173.37.135.118) with Microsoft SMTP Server (TLS) id 15.0.1473.3; Wed, 18 Sep 2019 00:24:19 -0500
Received: from NAM05-BY2-obe.outbound.protection.outlook.com (173.37.151.57) by xhs-aln-003.cisco.com (173.37.135.120) with Microsoft SMTP Server (TLS) id 15.0.1473.3 via Frontend Transport; Wed, 18 Sep 2019 00:24:19 -0500
ARC-Seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=fDqZPRQrG9oENAxshtYTI/Qngv9r3/2Wo6VRi1TIkMNAH66B1gS7S/DO2x1JGfxr49pfFcjj9XljEnkYk3KHHNmf3ejckqwdP6BfIR1gkmXW4IAOEiTdiqcXXZhuP4Isa1/2T596WTLi/adus5ruOh0xtllsqqf0u5NdmyOKL7J6uZeqCBnIs+e6CG2sh3Bb1Jc3YrzN485V2yX8QM9a7HkpSIlntM2PaDa3lhdrRgzwZfUB6GcInxTye+G9rVZkeacYOamXmr8/G0iiFHgaiCMDBktg6jfBQCsBm6sHy3F5IDAPjI+S18FRjbn/XmgLsLwQDnMqf74NGHhIpNZELw==
ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=NoZssTgvYeVB5Efw3J0yN+xKXbqE/gOwYFWCRnljbI0=; b=So12o69dJ8GwZ5MbIy1yeTP3L8WbAutBl3W2CgSYZgNnzXNU2paC0Ayy+3CXYh6V9kEGr8d6Burt7n89hfPYr0XwB3yUYKo4Qk9UuUknDWuNBBiT7rJXLMS+ClQKfPcbhue44F7aLVV+h8hVA7/BQmfmzKJKnYR6kLhiTFQi5AGOzzWscHgvsMz6mWhx8ATYt4m9Doa4O2GKTre+bRT1mmqB9iELAFGPVdl2mGFeOb3JAI+btbpdFl3s+C2MInJBLPhwhSZ2FcbcFF57LEA0pcgmI8q3kP6XoF7K7+Oph2Czq93e/mQ20x3GtwLdsLPg4s6gyxST/te11zFa2tnSWw==
ARC-Authentication-Results: i=1; mx.microsoft.com 1; spf=pass smtp.mailfrom=cisco.com; dmarc=pass action=none header.from=cisco.com; dkim=pass header.d=cisco.com; arc=none
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=cisco.onmicrosoft.com; s=selector2-cisco-onmicrosoft-com; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=NoZssTgvYeVB5Efw3J0yN+xKXbqE/gOwYFWCRnljbI0=; b=gQvshjOcITN1dUt7QRSGRoiYz9kEDzM7GxRf5SXtyk0hGn8nrp8xwOuVPEZLWWIMbHgt16e0lt6n0WBtSuyJMnnBQkgLRwuKcLydoGXDWWWbm+E7gUySaMTXvguV7PwGnkWSUIoIixxV6eX9/oDxONTL0ANHVFxgMERdgccat+M=
Received: from BYAPR11MB3638.namprd11.prod.outlook.com (20.178.237.19) by BYAPR11MB2741.namprd11.prod.outlook.com (52.135.223.149) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.2263.21; Wed, 18 Sep 2019 05:24:17 +0000
Received: from BYAPR11MB3638.namprd11.prod.outlook.com ([fe80::f4d7:dd49:e1da:4474]) by BYAPR11MB3638.namprd11.prod.outlook.com ([fe80::f4d7:dd49:e1da:4474%5]) with mapi id 15.20.2263.023; Wed, 18 Sep 2019 05:24:17 +0000
From: "Les Ginsberg (ginsberg)" <ginsberg@cisco.com>
To: Jeffrey Haas <jhaas@pfrc.org>, "Ketan Talaulikar (ketant)" <ketant@cisco.com>
CC: "Reshad Rahman (rrahman)" <rrahman@cisco.com>, "rtg-bfd@ietf.org" <rtg-bfd@ietf.org>
Subject: RE: WGLC for draft-ietf-bfd-large-packets
Thread-Topic: WGLC for draft-ietf-bfd-large-packets
Thread-Index: AQHVXPUyTxWyJBtWsEOnJ4ETycFi16cjIc8AgAXoLNCAAJsYAIAHXvgQ
Date: Wed, 18 Sep 2019 05:24:17 +0000
Message-ID: <BYAPR11MB36384BA8A940618DA9FC3F76C18E0@BYAPR11MB3638.namprd11.prod.outlook.com>
References: <9ECC2E5C-E87E-4859-9DA8-E8E9403DF759@cisco.com> <C44550AC-F6E0-4351-9958-CB9144C9F23A@cisco.com> <CY4PR11MB1541F9CEEA547F1D962D79B5C1B30@CY4PR11MB1541.namprd11.prod.outlook.com> <29C6F22D-AC57-446C-8013-408C4E28A94A@pfrc.org>
In-Reply-To: <29C6F22D-AC57-446C-8013-408C4E28A94A@pfrc.org>
Accept-Language: en-US
Content-Language: en-US
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
authentication-results: spf=none (sender IP is ) smtp.mailfrom=ginsberg@cisco.com;
x-originating-ip: [2001:420:c0c8:1008::20d]
x-ms-publictraffictype: Email
x-ms-office365-filtering-correlation-id: 4243b2cb-8995-40a6-a57b-08d73bf873f3
x-microsoft-antispam: BCL:0; PCL:0; RULEID:(2390118)(7020095)(4652040)(8989299)(4534185)(4627221)(201703031133081)(201702281549075)(8990200)(5600167)(711020)(4605104)(1401327)(2017052603328)(7193020); SRVR:BYAPR11MB2741;
x-ms-traffictypediagnostic: BYAPR11MB2741:
x-ms-exchange-transport-forked: True
x-microsoft-antispam-prvs: <BYAPR11MB2741DFCE96C0F20DA69735FBC18E0@BYAPR11MB2741.namprd11.prod.outlook.com>
x-ms-oob-tlc-oobclassifiers: OLM:10000;
x-forefront-prvs: 01644DCF4A
x-forefront-antispam-report: SFV:NSPM; SFS:(10009020)(4636009)(366004)(136003)(39860400002)(376002)(346002)(396003)(53754006)(13464003)(85664002)(189003)(199004)(9686003)(110136005)(33656002)(446003)(81166006)(74316002)(81156014)(7736002)(46003)(6246003)(55016002)(305945005)(54906003)(52536014)(5660300002)(478600001)(4326008)(14454004)(8936002)(6506007)(25786009)(256004)(53546011)(86362001)(486006)(8676002)(6116002)(476003)(99286004)(76176011)(2906002)(6636002)(229853002)(11346002)(186003)(6436002)(7696005)(102836004)(316002)(76116006)(66446008)(64756008)(66556008)(66476007)(66946007)(71200400001)(71190400001); DIR:OUT; SFP:1101; SCL:1; SRVR:BYAPR11MB2741; H:BYAPR11MB3638.namprd11.prod.outlook.com; FPR:; SPF:None; LANG:en; PTR:InfoNoRecords; MX:1; A:1;
received-spf: None (protection.outlook.com: cisco.com does not designate permitted sender hosts)
x-ms-exchange-senderadcheck: 1
x-microsoft-antispam-message-info: v5CF7CNdeukU1QPSk4H9pUjg1oykMhUHyQzhaAoXfOJ7BBvGhsWgs6STTgEDO5ystupxVlaVB8jVq6LrRYt0PIi0iMXUy0hkAFFVLGBrBwAqAVMvTb8WD8EW7qvxglfIAHkucJNb6ZEt31xmeu2uBcNVG9yTds00ULbyYyF4cT/Kum3VYOgWEBXiQirGi0FDhCHrBwFNvjpgyf3itJaBfGBF0oHOXSCbgDhgYN0AmKNEyoYmAFczpFq3hVC/cmku+RjiKoAK+BPXbsFpBLfV6QGF1xKwljDDoob7LnO7DRiNDFKTgj2ee0lLTohdusBlN3eKOvdPIC+Qn7OrUKAJXdZc3IDxOoQ811RKrBEXKZ7a2mR9t39YpiUw1xoZBmrhqlhY4u7LYEYKIcVVpXxAGcGGNsvay0C5RfxWG0u88UY=
Content-Type: text/plain; charset="utf-8"
Content-Transfer-Encoding: base64
MIME-Version: 1.0
X-MS-Exchange-CrossTenant-Network-Message-Id: 4243b2cb-8995-40a6-a57b-08d73bf873f3
X-MS-Exchange-CrossTenant-originalarrivaltime: 18 Sep 2019 05:24:17.7611 (UTC)
X-MS-Exchange-CrossTenant-fromentityheader: Hosted
X-MS-Exchange-CrossTenant-id: 5ae1af62-9505-4097-a69a-c1553ef7840e
X-MS-Exchange-CrossTenant-mailboxtype: HOSTED
X-MS-Exchange-CrossTenant-userprincipalname: q0ZLm6pALXTdm8Hv5AhmK+8HVZPEY4rDXKHUghxBpHbM3px36fAmA0dKizogIQiiIqLl5ZeOlMhtbn7rh1J2vA==
X-MS-Exchange-Transport-CrossTenantHeadersStamped: BYAPR11MB2741
X-OriginatorOrg: cisco.com
X-Outbound-SMTP-Client: 173.36.7.14, xch-aln-004.cisco.com
X-Outbound-Node: alln-core-7.cisco.com
Archived-At: <https://mailarchive.ietf.org/arch/msg/rtg-bfd/T8xheNBh9p3U73J08DOWcDqvMaQ>
X-BeenThere: rtg-bfd@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: "RTG Area: Bidirectional Forwarding Detection DT" <rtg-bfd.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/rtg-bfd>, <mailto:rtg-bfd-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/rtg-bfd/>
List-Post: <mailto:rtg-bfd@ietf.org>
List-Help: <mailto:rtg-bfd-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/rtg-bfd>, <mailto:rtg-bfd-request@ietf.org?subject=subscribe>
X-List-Received-Date: Wed, 18 Sep 2019 05:24:25 -0000

A few apologies:

1)Sorry to be late in responding - but just back from vacation.
2)As a number of my comments overlap with Ketan's, I decided to respond to this email - apologies if that adds confusion.
3)I will top post my reply in the hopes it will more universally readable - apologies if the lack of context makes things harder to understand.

With that out of the way...

I support this draft - but I do not think it is ready for last call.

There are very legitimate concerns about the impact supporting padded BFD PDUs may have on existing hardware implementations - both functionally and in terms of scalability.
As a standards track document I think more needs to be said about operational considerations - and I think there may be legitimate reasons to consider negotiation of the capability.
In particular the statement in Section 3:

    "Additional changes to the base BFD   protocol may be required to permit negotiation of this functionality
         and the padding value."

If these protocol changes are to be made, shouldn't they be specified in this document?? Otherwise the document would seem just informational.

Section 3 also suggests 

      "support[ing] a lower transmission  interval (bfd.DesiredMinTxInterval)"

as a means of minimizing scalability impacts. But in early discussions of this draft it had been suggested that rather than use BFD for PathMTU validation we could use another protocol (e.g., IS-IS hellos) to do this. But that was rejected because it was stated that the deployment requirement required much faster detection. If that argument holds, then the suggestion in the draft to use longer timers seems inappropriate - or at least without sufficient context.

(BTW, by "lower transmission interval" I assume you mean a longer interval (i.e., a larger value for bfd.DesiredMinTxInterval)??)

I think these points need to be addressed in the draft before it is can be considered complete.

    Les


> -----Original Message-----
> From: Rtg-bfd <rtg-bfd-bounces@ietf.org> On Behalf Of Jeffrey Haas
> Sent: Friday, September 13, 2019 5:34 AM
> To: Ketan Talaulikar (ketant) <ketant@cisco.com>
> Cc: Reshad Rahman (rrahman) <rrahman@cisco.com>; rtg-bfd@ietf.org
> Subject: Re: WGLC for draft-ietf-bfd-large-packets
> 
> Ketan,
> 
> Thanks for your comments.
> 
> 
> > On Sep 12, 2019, at 11:55 PM, Ketan Talaulikar (ketant)
> <ketant@cisco.com> wrote:
> >
> > Hi All,
> >
> > I would like to ask some questions and seek clarifications on this draft.
> >
> > 	• I am aware that this draft originates from practical pain points at a
> specific operator. During the adoption calls, the scenarios were debated in
> detail. It was basically a L2 WAN circuit service over a provider network and
> the challenge was that the PMTU of the underlying path in the SP network
> changes dynamically. However, from the Enterprise POV, the L2 circuit is
> seen as a single link and the BFD runs in directly connected mode. The draft
> however, discusses BFD multi-hop which is an entirely different use-case.
> 
> This is perhaps some poor wording choices in the Introduction section.  The
> intent is really that this is usable for all cases where BFD may be used.  The
> Introduction text was intended to note that for single-hop scenarios, BFD
> Echo procedures may be sufficient to solve the problem.  (See comment to
> Carlos elsewhere in the thread.)  However, for any form of multi-hop, you
> can't solve this via Echo.
> 
> General note about the "pain points at a specific author": While Albert
> certainly was the first customer I'd had willing to help stamp their name on
> such a draft, I'd been approached in my chair capacity both in IETF and at my
> employer over the years about similar headaches many times.  This is sadly
> not a rare problem.
> 
> > When doing multi-hop, the BFD packet could go over entirely different
> paths in the network with different PMTUs (especially different from the
> application sending large packets) – this makes things flaky? So shouldn’t this
> mechanism be actually focussed on the single hop/directly connected mode
> instead?
> 
> Even for single-hop this may be slightly flakey.  Consider "directly connected"
> links that are composed from LAGs.
> 
> The draft doesn't try to completely address all such forms of multipath
> problems, even for directly connected solutions.  When we were doing BFD
> for LAGs, we realized that trying to get overly specific in terms of the
> implementation not only got ourselves perilously into the specifics of a given
> vendor, but potentially into conflict with IEEE in terms of discussions about
> distributing traffic.  To that end, we've been silent on the matter in this draft.
> 
> It's reasonable in any BFD implementation dealing with multipath issues to
> potentially interact with the load balancer and decide to exercise specific
> links.  I believe it's the case for some scenarios that your own employer may
> do this in pre BFD-on-LAG implementations for single hop BFD based on
> some old mailing list discussion.  However, it's proven unreasonable over the
> life of the BFD protocol to try to get too pedantic over how to do such a thing;
> some hardware may simply not support it.
> 
> 
> >
> > 	• There are implementations with BFD offload to H/W out there.
> What happens when a particular implementation cannot handle such large
> packet sizes (and I am not specifically aware of one such)? Like other aspects
> of monitoring like the intervals, would there be a value in indicating a support
> for large packets during the signalling? The draft does raise the need for it
> but doesn’t seem to do anything about it – why? Either we need it and this
> draft should define it. Or we don’t need it and it would placing the onus on
> the operator to enable this when they know both ends support it. Then it is
> something for operational consideration section perhaps?
> 
> This is effectively a criticism of BFD in general rather than specifically this
> feature.  Part of the motivation for RFC 7419, Common Interval Support in
> Bidirectional Forwarding Detection, was to specify a number of common
> implementation timings.  But even then, that document doesn't talk much
> about scaling considerations for a given set of intervals.  Common
> considerations for implementors and operators tend to be "my
> implementation can support X sessions at 3.3ms, Y and 10ms, and Z much
> slower if we leave the fully distributed to the line card mode".  This just
> simply means the product sheet gets another axis in its listings.
> 
> What you're asking for in the protocol otherwise is a mechanism to negotiate
> or renegotiate a set of timers over a given set of interfaces based on existing
> or future re-provisioning of session quantity and parameters.  I think you'll
> find operators aren't particularly supportive of that.
> 
> FWIW, we expect exactly these same questions during IESG review.  BFD had
> exactly this type of question when doing even the base RFC 5880 work.  (BFD
> tends to make the current Transport AD unhappy.)
> 
> 
> >
> > 	• There was a discussion on the list about whether this needs to be
> done for every packet or not. I don’t find that discussion or the result
> captured in the draft. The draft just says that perhaps longer intervals should
> be applied to BFD when doing large packets – but this will defeat the purpose
> of fast-detection. What would be good is we have both fast-detection and
> slow PMTU validation? Perhaps we need some analysis of whether large
> packets should be used always or intermittently and the pros/cons or
> guidelines for the same?
> 
> See my comment to Carlos in-thread about why this makes the timers tricky.
> See also how BFD Echo can solve this problem in some circumstances without
> negotiation.
> 
> It's worth noting that BFD is quite capable of dynamically re-negotiating its
> timers.  This means an implementation could have a behavior such as:
> 
> 1. Start by bringing up its session with timers for regular sized packets, and
> 3.3ms
> 2. Decide to shift to large packets, but renegotiate to 10ms timers first.
> 
> The "signaling" that a system is willing to accept the large packets is shifting to
> a longer timer.
> 
> 
> >
> > 	• The draft is missing an operational considerations and
> manageability considerations sections. Some of this info is placed in sec 3, but
> would help if things were elaborated in their individual sections. It would
> provide more insight into how exactly this mechanism in BFD is envisaged to
> be actually deployed and used. More importantly, perhaps how it should
> NOT be used?
> 
> Is there specific information you think should go into such a section that isn't
> otherwise present in the draft?  The "how it should NOT" be used is a
> challenging section to write - somewhat similar to proving negatives - without
> specific content.
> 
> >
> > Can the authors and WG discuss the above? I think it seems too rushed to
> go for WGLC just as yet?
> 
> The purpose of WGLC is to provide the working group an opportunity to drive
> out commentary like that above when things are otherwise quiet. :-)  Thanks
> for providing it.
> 
> -- Jeff