Re: [Lsr] Dynamic flow control for flooding

Robert Raszuk <rraszuk@gmail.com> Wed, 24 July 2019 13:34 UTC

Return-Path: <rraszuk@gmail.com>
X-Original-To: lsr@ietfa.amsl.com
Delivered-To: lsr@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 2E43F1203A8 for <lsr@ietfa.amsl.com>; Wed, 24 Jul 2019 06:34:14 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.997
X-Spam-Level:
X-Spam-Status: No, score=-1.997 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, FREEMAIL_FROM=0.001, HTML_MESSAGE=0.001, RCVD_IN_DNSWL_NONE=-0.0001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id SRFjA8szY7bv for <lsr@ietfa.amsl.com>; Wed, 24 Jul 2019 06:34:11 -0700 (PDT)
Received: from mail-pg1-x52f.google.com (mail-pg1-x52f.google.com [IPv6:2607:f8b0:4864:20::52f]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 83FED12037A for <lsr@ietf.org>; Wed, 24 Jul 2019 06:34:11 -0700 (PDT)
Received: by mail-pg1-x52f.google.com with SMTP id i70so10510204pgd.4 for <lsr@ietf.org>; Wed, 24 Jul 2019 06:34:11 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=3DWlfRBL8T2XW29k3I62KnV0mn1ybHdAmm/8ZhNKYBY=; b=jrEFS2qrUos6Op5Kdi/jirCLwhHOwj4gsmFj/3jNu1E/jZQjtUgELEnzAAHwKuOaHb eFXRMTm++bCWHacwcMOvJQiid+14/cpmagl7OJLhWDZnGk/uLM3gf+tz2/TxJWcdbsI/ 0e8oetsEMXyrQ0q4vWYcQz0qfeTkiBSfDedBKg1Zvirjn4yG9KNn+bbhSz5lnozx/s7d HxTuY/1OmZ/wGzoPoLKUcGfUaZGpDJV2a7qiQ8CPT5ISxuwlsl2AeP5MNLOBqOhzH33n Kqm2txcSNsDsyutt6WOgM2L9/rmLSmR+faibcotoYeUoieFgxsxkY1a1BeIHS1OU7T8k OBUA==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=3DWlfRBL8T2XW29k3I62KnV0mn1ybHdAmm/8ZhNKYBY=; b=BMjzr11CSKvEx1GHa9qh251tcUd3ld8frfVWwzzrcrutZHCPKt5ArSHP+ySyUE+cyA /f6lv2jEvPMXqISIillnHfDbXObBk2e/Hoy1IkTA7p4cKXG8Mk07sBmhmV62ukNBbQ3V dbu0TD1vYnFfnbvFo2OTVdGEBmcEFa7EPi7Ha+Z0JSaAu3V2tkKhQdpHqGzpN/WnJt11 99aBMz2bo2uXdCwlIeADchr2u2LMW8ewdP4i6FI0aXcvK+/ox+Tm9kq+TWW7/6dZcq/L LnZdtbWE/53C0svZwAlYS9BafJmj+/tIcJ+UJ2P2Pq3yR+j0rBbs0lR4I/jIkneuD759 gpOQ==
X-Gm-Message-State: APjAAAWCh9X8Woc5q3p7agpqzFUhKtAYDVhPgeWygKXEBFfklQfMclvw NjH7+iM5V5dW4pgqs4CDZG0J6Ppixm025qODiIY=
X-Google-Smtp-Source: APXvYqxVaWh+RvFQ4bLHNSblfxBz+vi3YzzoUKlEQc2iuSuaqr/haYJ5i6hKPu0/mG3ROFBO2emodYKjSkxLlHkIB/k=
X-Received: by 2002:a62:be04:: with SMTP id l4mr11005309pff.77.1563975250580; Wed, 24 Jul 2019 06:34:10 -0700 (PDT)
MIME-Version: 1.0
References: <CAMj-N0LdaNBapVNisWs6cbH6RsHiXd-EMg6vRvO_U+UQsYVvXw@mail.gmail.com> <BYAPR11MB36382C89363202D1B5659614C1C70@BYAPR11MB3638.namprd11.prod.outlook.com> <5841_1563943794_5D37E372_5841_105_1_9E32478DFA9976438E7A22F69B08FF924D9C373E@OPEXCAUBMA3.corporate.adroot.infra.ftgroup> <BYAPR11MB363856BB026992DFBB3BB224C1C60@BYAPR11MB3638.namprd11.prod.outlook.com> <8376a87831ffa6f5298c5122907c6e66@xs4all.nl>
In-Reply-To: <8376a87831ffa6f5298c5122907c6e66@xs4all.nl>
From: Robert Raszuk <rraszuk@gmail.com>
Date: Wed, 24 Jul 2019 15:33:56 +0200
Message-ID: <CA+b+ER=LOZxoyoonPtC7VKppSNcQohGQdx+n8D3+LndnHdsofQ@mail.gmail.com>
To: Henk Smit <henk.ietf@xs4all.nl>
Cc: "Les Ginsberg (ginsberg)" <ginsberg@cisco.com>, "<stephane.litkowski@orange.com>" <stephane.litkowski@orange.com>, Tony Li <tony.li@tony.li>, lsr@ietf.org
Content-Type: multipart/alternative; boundary="0000000000003f9a5c058e6d5dd4"
Archived-At: <https://mailarchive.ietf.org/arch/msg/lsr/BIQv79JCtKMULK8KPFZm6_FHC2I>
Subject: Re: [Lsr] Dynamic flow control for flooding
X-BeenThere: lsr@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: Link State Routing Working Group <lsr.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/lsr>, <mailto:lsr-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/lsr/>
List-Post: <mailto:lsr@ietf.org>
List-Help: <mailto:lsr-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/lsr>, <mailto:lsr-request@ietf.org?subject=subscribe>
X-List-Received-Date: Wed, 24 Jul 2019 13:34:18 -0000

Hey Henk & all,

If acks for 1000 LSPs take 16 PSNPs (max 66 per PSNP) or even as long as
Tony mentioned the full flooding as Tony said may take 33 sec - is this
really a problem ?

Remember we are not talking about protocol convergence after link flap or
node going down. We are talking about serious network partitioning which
itself may have lasted for minutes, hours or days. While just considering
absolute numbers yelds desire to go faster and faster, if we put things in
the overall perspective is there really a problem to be solved in the first
place ?

Would there still be a problem if LSR WG recommends faster acking maybe not
for each LSP but for say 20 or 30 max ?

Thx,
R.








On Wed, Jul 24, 2019 at 3:18 PM Henk Smit <henk.ietf@xs4all.nl> wrote:

>
> Hello Les,
>
> Les Ginsberg (ginsberg) wrote on 2019-07-24 07:17:
>
> > If you accept that, then it makes sense to look for the simplest way
> > to do flow control and that is decidedly not from the RX side. (I
> > expect Tony Li to disagree with that 😊 – but I have already
> > outlined why it is more complex to do it from the Rx side.)
>
> In your talk on Monday you called the idea in
> draft-decraene-lsr-isis-flooding-speed-01 "receiver driven flow
> control".
> You don't like that. You want "transmit based flow control".
> You argued that you can do "transmit based flow control" on the sender
> only.
> Therefor your algorithm is merely a "local trick".
> And "local tricks" don't need RFCs. I agree with that.
> But I don't agree that your algorithm is just a "local trick".
>
>
> In your algorithm, a "sender" sends a number of LSPs to a receiver.
> Without waiting for acks (PNSPs). Like in any sliding window protocol.
> The sending router keeps an eye on the number of unacked LSPs.
> And it determines how fast it can send more LSPs based on the current
> number of unacked LSPs. Every time the sender receives a PSNP, it
> knows the receiver got a number of LSPs, so it can increase its
> send-window again, and then send more LSPs.
> Correct ?
>
> I agree that the core idea of this algorithm makes sense.
> After all, it looks a lot like TCP.
> I believe the authors of draft-decraene-lsr-isis-flooding-speed were
> planning something like that for the next version of their draft.
>
>
> However, I do not agree with the name "tx driven flow control".
> I also do not agree that this algorithm is "a local trick".
> Therefor I also do not think this algorithm doesn't need to be
> documented (in an RFC).
>
> In your "tx based flow control", the sender (tx) sends LSPs at a rate
> that is derived from the rate at which it receives PSNPs. Therefor
> it is the sender of the PSNPs that sets the speed of transmission !
> So it is still the receiver (of LSPs) that controls the flow control.
> The name "tx based flow control" is a little misleading, imho.
>
>
> It is important to realize that the success of your algorithm actually
> depends on the behaviour of the receiver. How does it send PSNPs ?
> Does it send one PSNP per received LSP ? Or does it pack multiple acks
> in one PSNP ? Does it send a PSNP immediatly, or does it wait a short
> time ? Does it try to fill a PSNP to the max (putting ~90 acks in one
> PSNP) ? Does the receiver does something in between ? I don't think
> the behaviour is specified exactly anywhere.
>
> I know about an IS-IS implementation from the nineties. When a router
> would receive an LSP, it would a) set the SSN bit (for that
> LSP/interface),
> and b) start the psnp-timer for that interface (if not already running).
> The psnp-timer would expire 2 seconds later. The router would then walk
> the LSPDB, find all LSPs with the SSN-bit set for that interface. And
> then build a PSNP with acks for all those LSPs. The result would be
> that: a) the first PSNP would be send 2 seconds (+/- jitter) after
> receiving the first LSP, and b) the PSNP would include ~66 acks. (As
> a router receiving at full speed would have received 66 LSPs in 2
> seconds).
>
> For your "tx based flow control" algorithm to work properly, this has
> to change. The receiving router must send PSNPs more quickly and more
> aggressively. The result would be that there will be less acks in each
> PSNP. And thus more PSNPs will be sent.
>
> This makes us realize: in the current situation, if a router receives
> a 1000 LSPs, and sends those LSPs to 64 neighbors, it would receive:
> - the 1000 LSPs from an upstream neighbor, plus
> - 1000/66 = 16 PSNPs from each downstream neighbor = 64 * 16 = 1024
> PSNPs.
> This makes a total of ~2000K PDUs received.
>
> If routers would send one PSNP per LSP (to have faster flow control),
> then the router in this example would receive:
> - the 1000 LSPs from an upstream neighbor, plus
> - 1000 PSNPs from each downstream neighbor * 16 = 1600 PSNPs.
> This makes a total of ~17000 PDUs received.
>
> The total number of PDUs received on this router would go from 2K PDUs
> to 17K PDUs.
>
> Remember that the problem we're trying to solve here is to make sure
> that routers do not get overrun on the receipt side with too many
> packets too quickly. It seems an aggressive PSNP-scheme, to achieve
> faster flow-control, is actually very counter-productive.
>
> Of course the algorithm can be tweaked. E.g. TCP sends one ack per
> every 2 received segments (if I'm not mistaken). If we do that here,
> the number of PDUs would go down from 17K to 9K PDUs. What do you
> propose ? How do you want the feedback of PSNPs to be quick, while
> maintaining an efficient packing of multiple acks per PSNP ?
>
>
> In any case, the points I'm trying to make here:
> *) Your algorithm is not sender-driven, but still receiver-driven.
> *) Your algorithm changes/dictates behaviour both on sender and
> receiver.
> *) Interaction between a sender and a receiver is what we call a
> protocol.
>     If you want to make this work, especially in multi-vendor
> environments,
>     we need to document these algorithms. Aka in an RFC.
>
> Kind regards,
>
> henk.
>
> _______________________________________________
> Lsr mailing list
> Lsr@ietf.org
> https://www.ietf.org/mailman/listinfo/lsr
>