Re: [Lsr] Dynamic flow control for flooding

"Les Ginsberg (ginsberg)" <ginsberg@cisco.com> Tue, 23 July 2019 21:24 UTC

Return-Path: <ginsberg@cisco.com>
X-Original-To: lsr@ietfa.amsl.com
Delivered-To: lsr@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 592821209A8 for <lsr@ietfa.amsl.com>; Tue, 23 Jul 2019 14:24:59 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -14.499
X-Spam-Level:
X-Spam-Status: No, score=-14.499 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, HTML_MESSAGE=0.001, RCVD_IN_DNSWL_HI=-5, SPF_PASS=-0.001, URIBL_BLOCKED=0.001, USER_IN_DEF_DKIM_WL=-7.5] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (1024-bit key) header.d=cisco.com header.b=KP8qhJ51; dkim=pass (1024-bit key) header.d=cisco.onmicrosoft.com header.b=g5Gxx5Ya
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 6EmcWvz7BiCt for <lsr@ietfa.amsl.com>; Tue, 23 Jul 2019 14:24:56 -0700 (PDT)
Received: from rcdn-iport-8.cisco.com (rcdn-iport-8.cisco.com [173.37.86.79]) (using TLSv1.2 with cipher DHE-RSA-SEED-SHA (128/128 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 4A3EA120973 for <lsr@ietf.org>; Tue, 23 Jul 2019 14:24:56 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=cisco.com; i=@cisco.com; l=17402; q=dns/txt; s=iport; t=1563917096; x=1565126696; h=from:to:cc:subject:date:message-id:references: in-reply-to:mime-version; bh=tD4SIh0pkvGbasHP6UMd3rf/9ytruQF12K6kKqJkJ8g=; b=KP8qhJ51cbWbCYRd9MAF0kawEt2G+0trlNYy8tOcsm3sBLNsFSY8weUZ jwh7Lxb5bzkIVO6BSySjABufvuW5jDaqKwKrBhsp4HWZYcNnv83uU75q4 gSgNn733VrBFVlQLPpAuqZFipWJgb7EywRwa5k98U1DW7o60CKaYQZgd+ Q=;
IronPort-PHdr: 9a23:TGYu2xUclqpJXf09Bt/6j3kPFhPV8LGuZFwc94YnhrRSc6+q45XlOgnF6O5wiEPSA9yJ8OpK3uzRta2oGXcN55qMqjgjSNRNTFdE7KdehAk8GIiAAEz/IuTtank1HcJZXlJ/8FmwMFNeH4D1YFiB6nA=
X-IronPort-Anti-Spam-Filtered: true
X-IronPort-Anti-Spam-Result: A0CqAADEejdd/4gNJK1mGgEBAQEBAgEBAQEHAgEBAQGBZ4EVLyQsA21VIAQLKoQdg0cDjgCCW4lUiSeEVYJSA1QJAQEBDAEBLQIBAYRAAheCNyM4EwEDAQEEAQECAQZthR4MhUoBAQEBAxIRChMBATcBDwIBBgIRBAEBKwICAh8RHQgCBA4FCBMHgwGBHU0DHQECjyeQYAKBOIhgcYEygnkBAQWFCw0LghMJgTSLXxeBQD+BEAFGgU5QLj6CGoIKAQEgJIJlMoImjFUBgiKEfogsP41GQAkCghmQGIQPmAqWco4TAgQCBAUCDgEBBYFnIYFYcBWDJ4JCDBeDToNGhw1ygSmLYYJDAQE
X-IronPort-AV: E=Sophos;i="5.64,300,1559520000"; d="scan'208,217";a="601241378"
Received: from alln-core-3.cisco.com ([173.36.13.136]) by rcdn-iport-8.cisco.com with ESMTP/TLS/DHE-RSA-SEED-SHA; 23 Jul 2019 21:24:54 +0000
Received: from XCH-RCD-005.cisco.com (xch-rcd-005.cisco.com [173.37.102.15]) by alln-core-3.cisco.com (8.15.2/8.15.2) with ESMTPS id x6NLOskk031854 (version=TLSv1.2 cipher=AES256-SHA bits=256 verify=FAIL); Tue, 23 Jul 2019 21:24:54 GMT
Received: from xhs-rcd-003.cisco.com (173.37.227.248) by XCH-RCD-005.cisco.com (173.37.102.15) with Microsoft SMTP Server (TLS) id 15.0.1473.3; Tue, 23 Jul 2019 16:24:54 -0500
Received: from xhs-rcd-003.cisco.com (173.37.227.248) by xhs-rcd-003.cisco.com (173.37.227.248) with Microsoft SMTP Server (TLS) id 15.0.1473.3; Tue, 23 Jul 2019 16:24:53 -0500
Received: from NAM02-SN1-obe.outbound.protection.outlook.com (72.163.14.9) by xhs-rcd-003.cisco.com (173.37.227.248) with Microsoft SMTP Server (TLS) id 15.0.1473.3 via Frontend Transport; Tue, 23 Jul 2019 16:24:53 -0500
ARC-Seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=NAY1XzYQGk4CDpKZjUVIHcwqWhkWjjin4CHGyKy2Gv6nAW2TOXcBE/vvYuQSulhqtft38Ru1jjjAonaUqim+N2Zn7IxR0AFxhJCvLQiOm8bpduh6XB3PYMcSCzC0e+A4e+A8REeYBWYhMEyRfRnweg6LaiN1Q4CX/B22rweJrsGZsuYhnx0490VbHOTEy4Fz/CxXvSASpGIyeM9YFk2W36UrjgOuhTh0egPkzwQsnXF1j783rHiO5WFo92QkFW03VZcW5BvlRcx/rU/JlNj2UCHqWz03ZMHyXKczscrcqb7m8m8s5aGnh5mMd/Zi4uG4tHt7xt9yfo+HPiez9ZS+Rg==
ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=tD4SIh0pkvGbasHP6UMd3rf/9ytruQF12K6kKqJkJ8g=; b=jRKbl2EfrNBL9/U8eXJiFpqG984eJV3CjZTmWlHSfAjYEgVTmZAL4V3qr92/zqIGl1MaX8Fhba4hPqm6d65P8va0eynHB0j2Kwb/RwCaZ7VJZ4SozVZx5HbK8p94ZeButFv4LJftMhkUERKKgwF2XzbLoBTGuwEzwoN+vnREZPVlATrkBQav7fzBHdr5LFbCy+H9KOh3m59vuyUfAzPYPVUC7KH3zCLu/91BU/+J+sbnEP40VfL5pTHBRvu4hml8MBVo1TJYgUvvwG763mh98mMkiabRpx5kZBle4+p49X76EOTcjfgtEwPTezPCXqUq+PJFP2Eslmun3vcoqjNiqQ==
ARC-Authentication-Results: i=1; mx.microsoft.com 1;spf=pass smtp.mailfrom=cisco.com;dmarc=pass action=none header.from=cisco.com;dkim=pass header.d=cisco.com;arc=none
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=cisco.onmicrosoft.com; s=selector2-cisco-onmicrosoft-com; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=tD4SIh0pkvGbasHP6UMd3rf/9ytruQF12K6kKqJkJ8g=; b=g5Gxx5YajzWEb49SEiAHxJmW9u5dsluTQHi2x/TFZe/Y+eGqqS5sFnPt8GzveTKSy1+xx9J6dYM7ceqpRxLrMBKFmeFJbCKpK8Gk2jSbWTq25rTIRiNd1J1pqysnEzwIjUfE68mpmvu4JSPuk/dul2fhE5RGG4m8vasKO58wk/o=
Received: from BYAPR11MB3638.namprd11.prod.outlook.com (20.178.237.19) by BYAPR11MB2728.namprd11.prod.outlook.com (52.135.227.158) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.2094.16; Tue, 23 Jul 2019 21:24:51 +0000
Received: from BYAPR11MB3638.namprd11.prod.outlook.com ([fe80::c8b3:b0b0:581d:e1ce]) by BYAPR11MB3638.namprd11.prod.outlook.com ([fe80::c8b3:b0b0:581d:e1ce%6]) with mapi id 15.20.2094.013; Tue, 23 Jul 2019 21:24:51 +0000
From: "Les Ginsberg (ginsberg)" <ginsberg@cisco.com>
To: Tony Przygienda <tonysietf@gmail.com>
CC: Tony Li <tony.li@tony.li>, "lsr@ietf.org" <lsr@ietf.org>
Thread-Topic: [Lsr] Dynamic flow control for flooding
Thread-Index: AQHVQVt0dc+yhfYADEyrMMJ609g9uKbYO7RggABzlYCAAAOu0A==
Date: Tue, 23 Jul 2019 21:24:51 +0000
Message-ID: <BYAPR11MB363844E1525B8B466AD35443C1C70@BYAPR11MB3638.namprd11.prod.outlook.com>
References: <CAMj-N0LdaNBapVNisWs6cbH6RsHiXd-EMg6vRvO_U+UQsYVvXw@mail.gmail.com> <BYAPR11MB36382C89363202D1B5659614C1C70@BYAPR11MB3638.namprd11.prod.outlook.com> <CA+wi2hM5QqjfyakGYPVwmb4amXKRSy5-_YuQEY5V1CePzv77cw@mail.gmail.com>
In-Reply-To: <CA+wi2hM5QqjfyakGYPVwmb4amXKRSy5-_YuQEY5V1CePzv77cw@mail.gmail.com>
Accept-Language: en-US
Content-Language: en-US
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
authentication-results: spf=none (sender IP is ) smtp.mailfrom=ginsberg@cisco.com;
x-originating-ip: [2001:420:c0c8:1006::374]
x-ms-publictraffictype: Email
x-ms-office365-filtering-correlation-id: 17d6d2df-04bc-449a-19f2-08d70fb432db
x-microsoft-antispam: BCL:0; PCL:0; RULEID:(2390118)(7020095)(4652040)(8989299)(4534185)(4627221)(201703031133081)(201702281549075)(8990200)(5600148)(711020)(4605104)(1401327)(2017052603328)(7193020); SRVR:BYAPR11MB2728;
x-ms-traffictypediagnostic: BYAPR11MB2728:
x-ms-exchange-purlcount: 2
x-microsoft-antispam-prvs: <BYAPR11MB2728C992575637FF5AF78159C1C70@BYAPR11MB2728.namprd11.prod.outlook.com>
x-ms-oob-tlc-oobclassifiers: OLM:10000;
x-forefront-prvs: 0107098B6C
x-forefront-antispam-report: SFV:NSPM; SFS:(10009020)(4636009)(346002)(376002)(366004)(396003)(39860400002)(136003)(189003)(199004)(81166006)(7696005)(14454004)(5660300002)(1411001)(7736002)(229853002)(99286004)(6246003)(54896002)(68736007)(11346002)(446003)(46003)(9686003)(53936002)(86362001)(6116002)(66574012)(790700001)(81156014)(8936002)(2906002)(76176011)(55016002)(6306002)(14444005)(316002)(64756008)(66446008)(256004)(6436002)(186003)(71190400001)(71200400001)(76116006)(33656002)(66946007)(561944003)(66476007)(74316002)(53546011)(4326008)(476003)(486006)(66556008)(478600001)(52536014)(6916009)(8676002)(54906003)(102836004)(6506007)(25786009); DIR:OUT; SFP:1101; SCL:1; SRVR:BYAPR11MB2728; H:BYAPR11MB3638.namprd11.prod.outlook.com; FPR:; SPF:None; LANG:en; PTR:InfoNoRecords; MX:1; A:1;
received-spf: None (protection.outlook.com: cisco.com does not designate permitted sender hosts)
x-ms-exchange-senderadcheck: 1
x-microsoft-antispam-message-info: DzjaIUGbGmELVy7MgKpr55afQXG/UkPd1Fm/3UsdeQR/y4xQx3x39WUzfhONyx7xljepaUjPJSLn1tH/etye0XvQCCwqQrqXxRAgZnmVqfM5S1GGiuIZOBhV+6BclXX9t2FdqAFBCSJASpipUIjGh1bcm7Z710XHkINkPHrmfTv7ilQqcwMAzysyBaH/qObcgAXlJEAjL4pbtf9Dxo8B6pvo2i4CcNtNLCuvoFbDjKcXntz1+kvuYq9LpRWPopscZty9n8EXD/IUDNyzbsp2lSl3093Rx2l50Yp9Y/+9KF8vtweJ445X7YPETZp8+4HLzUSO7GC60Ai3S2DgAWJKh1rVyUS/sy5WEQxnyw0rQ2+l9cpg4k0VE5iw06YomGtMmUKurgLKfCfIGpvKT2UKRf7iUlHM/5szJQU5qS3JTlk=
Content-Type: multipart/alternative; boundary="_000_BYAPR11MB363844E1525B8B466AD35443C1C70BYAPR11MB3638namp_"
MIME-Version: 1.0
X-MS-Exchange-CrossTenant-Network-Message-Id: 17d6d2df-04bc-449a-19f2-08d70fb432db
X-MS-Exchange-CrossTenant-originalarrivaltime: 23 Jul 2019 21:24:51.6585 (UTC)
X-MS-Exchange-CrossTenant-fromentityheader: Hosted
X-MS-Exchange-CrossTenant-id: 5ae1af62-9505-4097-a69a-c1553ef7840e
X-MS-Exchange-CrossTenant-mailboxtype: HOSTED
X-MS-Exchange-CrossTenant-userprincipalname: ginsberg@cisco.com
X-MS-Exchange-Transport-CrossTenantHeadersStamped: BYAPR11MB2728
X-OriginatorOrg: cisco.com
X-Outbound-SMTP-Client: 173.37.102.15, xch-rcd-005.cisco.com
X-Outbound-Node: alln-core-3.cisco.com
Archived-At: <https://mailarchive.ietf.org/arch/msg/lsr/JtRHLMLI1TRzx7by34mLm_8TH8E>
Subject: Re: [Lsr] Dynamic flow control for flooding
X-BeenThere: lsr@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: Link State Routing Working Group <lsr.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/lsr>, <mailto:lsr-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/lsr/>
List-Post: <mailto:lsr@ietf.org>
List-Help: <mailto:lsr-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/lsr>, <mailto:lsr-request@ietf.org?subject=subscribe>
X-List-Received-Date: Tue, 23 Jul 2019 21:25:11 -0000

Tony –

As usual, you cover a lot of territory – and even after a couple of readings I am not sure I got everything.
Still, I dare to reply.
Inline.

From: Tony Przygienda <tonysietf@gmail.com>
Sent: Tuesday, July 23, 2019 1:56 PM
To: Les Ginsberg (ginsberg) <ginsberg@cisco.com>
Cc: Tony Li <tony.li@tony.li>; lsr@ietf.org
Subject: Re: [Lsr] Dynamic flow control for flooding



It is a mistake to equate LSP flooding with a set of independent P2P “connections” – each of which can operate at a rate independent of the other.



At least my experience much disagrees with that and such a proposal seems to steer towards slowest receiver in the whole network problem so I wait for others to chime in.
[Les:] This is NOT AT ALL where I am going.
If I have a “large network” and I have a node which consistently cannot support the flooding rates necessary to deal with Tony Li’s example (node w many neighbors fails) then the network has a problem.
Slowing down everyone to meet the flooding speed of the slowest speed is not something I would expect a customer to accept. The network will not be able to deliver the convergence expected. The node in question needs to be identified and steps taken to either fix it or upgrade or replace it or…

The point I am also making is trying to run the network with some links flooding fast and some links flooding slow isn’t a solution either.

Then, to clarify on Tony's mail, the "problem" I mentioned anecdotally yesterday as behavior I saw on things I did in their time was of course when processors were still well under 1GHz and links in Gigs and not 10s and 100s of Gigs we have today but yes, the limiting factor was the flooding rate (or rather effective processing rate of receiver AFAIR before it started drop the RX queues or was late enough to cause RE-TX on senders) in terms of losses/retransmissions necessary that were causing transients to the point it looked to me then the cure seemed worse than the disease (while the disease was likely a flu then compared to today given we didn't have massively dense meshes we steer towards today). The base spec & mandated flooding numbers didn't change but what is possible in terms of rates when breaking the spec did change of course in terms of CPU/links speed albeit most ISIS implementations go back to megahertz processors still ;-) And the dinner was great BTW ;-)

So yes, I do think that anything that will flood @ reasonable rate without excessive losses will work well on well-computed double-flood-reduced-graph, the question is how to get the "reasonable" in place both in terms of numbers as well as mechanism for which we saw tons lively discussions/proposal yesterday, most obvious being of course going and manually bumping e'one's implementation to the desired (? ;-) value ...  Other consideration is having computation always trying to get more than 2 links in minimal cut on the graph of course which should alleviate any bottleneck or rather, make the cut less likely. Given quality of max-disjoint-node/link graph computation algorithms that should be doable by gut feeling. If e.g. the flood rate per link is available the algorithms should be doing even better in centralized case.

[Les:] Convergence issues and flooding overload as a result of excessive redundant flooding is a real issue – but it is a different problem (for which we have solutions) and we should not introduce that issue into this discussion.

   Les

BTW, with all that experience (MANET did its share in different space as we know in terms of flood reduction as well) in RIFT we chose a solution based on MANET derivative where every source chooses a different set of trees to flood on using Fisher-Yates hashes but that seems possible only if you have directionality on the graph (that's what I said once on the mike that doing flood reduction in a lattice [partial rank-ordered graph with upper & lower bounds] is fairly trivial, on generic graphs not so much necessarily). But maybe Pascal reads that and gives it a think ;-)

as usual, 2 cents to improve the internet ;-)

--- tony