Re: [Lsr] Seeking feedback to the revised draft-dunbar-lsr-5g-edge-compute

Gyan Mishra <hayabusagsm@gmail.com> Sat, 15 January 2022 06:52 UTC

Return-Path: <hayabusagsm@gmail.com>
X-Original-To: lsr@ietfa.amsl.com
Delivered-To: lsr@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id F3BBF3A2621 for <lsr@ietfa.amsl.com>; Fri, 14 Jan 2022 22:52:33 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.097
X-Spam-Level:
X-Spam-Status: No, score=-2.097 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, FREEMAIL_FROM=0.001, HTML_MESSAGE=0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 1VXjz8Ie3giq for <lsr@ietfa.amsl.com>; Fri, 14 Jan 2022 22:52:28 -0800 (PST)
Received: from mail-pj1-x102d.google.com (mail-pj1-x102d.google.com [IPv6:2607:f8b0:4864:20::102d]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id B23F03A1BD9 for <lsr@ietf.org>; Fri, 14 Jan 2022 22:52:28 -0800 (PST)
Received: by mail-pj1-x102d.google.com with SMTP id o1so3255845pjr.2 for <lsr@ietf.org>; Fri, 14 Jan 2022 22:52:28 -0800 (PST)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20210112; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=EI9ew//8IbKtGyQAVBqa/j86oeTKNyNW4P+EviczYQA=; b=K5FgkNKuSYb4EZGfBXfQm8JMQKOCCCg9yNeWUglBXfG8Peagfq0j3Uu+ju7VbEM8kg ej1juqh3IDW7JSEqd7yjKNr4xtC6XdmwKVZm0SoeVnMDhtm5fNnuLuum8oOHTWY35nLR TsS2d3u0JQnzHb1U9WsWPHZfJD8WoJ9U8B13YkDi2/M921eRt5xz3da2qYWe0Mp6v4YG niaFmTDpqkCE42bGMW4tpFOw+CAHR4gtvpne+Vm4/+UsT7Oz9Tr4kqBcwtF3zVDUCwct bChRzF010GDxAE3vb6rG/o7ZXa9bL23zSOmZO64psczqiuWcEVnQBPv4B5IoVZ7qN4S8 HfDA==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=EI9ew//8IbKtGyQAVBqa/j86oeTKNyNW4P+EviczYQA=; b=y5rKwteZORRB/75kkouCXG155bFVtsJU20PI7YtDjLAkCwgvZ04RVuX5evryTP67f1 e3jdg8OAXpcXP2PVD/cPK9F7CUXluva7jtyqzh/7pZMre4bNZx6fgqJixnaP7ey9sApv N3j+moGHhR6kfEeGb6SMBML1Y1+AqlW9AGJ4Os6RpUc1J0yuD8qYqpDgpqTfxyyhVSAR nL38NrP5DBTw22BeyF+jySeDEn+ZCXghE+3zDm4RhVqE3eYLqXyvmr+nPlRxnRG2W+Xv ra8zodN7MDqzPyhFsEOrLEg8EwTKatVM76pWveYWto6LZy5nSrdSuFQ/q8dC9uAuSFWA rZ9Q==
X-Gm-Message-State: AOAM532lR/FKjXbmduQu8k4o/IMl7Wv9LKOn7FK6puNtx4+nYjtRMS0q t6Y9+8m42z8sJe8H86RdWNCtMMRQLefLlJn5xT0=
X-Google-Smtp-Source: ABdhPJytB8JUNQVQcdrkLJ5Nb0r6kUkDCElDvXP12aCDRhP8DF9NN09PIdo4Uimt8t+v6gSPgd1bgB9l7TAc22X52HE=
X-Received: by 2002:a17:902:e5cf:b0:14a:8068:a5a with SMTP id u15-20020a170902e5cf00b0014a80680a5amr9889298plf.80.1642229546760; Fri, 14 Jan 2022 22:52:26 -0800 (PST)
MIME-Version: 1.0
References: <CO1PR13MB49208C0CFE0AE200E9654D62854D9@CO1PR13MB4920.namprd13.prod.outlook.com> <BY5PR11MB43374CE0329A2D0D4CBBB56CC1509@BY5PR11MB4337.namprd11.prod.outlook.com> <CO1PR13MB49209FB2780A390C060981D285529@CO1PR13MB4920.namprd13.prod.outlook.com> <BY5PR11MB43373269E621CCC90F47D650C1529@BY5PR11MB4337.namprd11.prod.outlook.com> <CO1PR13MB492084E011B67AFF7EB45B3585529@CO1PR13MB4920.namprd13.prod.outlook.com> <BY5PR11MB43376DA8FD239B220AAA7F03C1529@BY5PR11MB4337.namprd11.prod.outlook.com> <CO1PR13MB4920EDA93692C2BE3CC49F7A85529@CO1PR13MB4920.namprd13.prod.outlook.com> <CAOj+MMH0ockwESPepB0PH-_jHxtSJ2+n0cJCsm-oGGB6ztvQtQ@mail.gmail.com> <CO1PR13MB4920561C1237ECD319B2C17B85529@CO1PR13MB4920.namprd13.prod.outlook.com> <CAOj+MMEMt845bRwhn-KTTx=7DvinocYc0JYZyzPp9BR7jC1C+w@mail.gmail.com> <CABNhwV1dB5TwtibkMthxamsSZvtm36h1vrGOhtucw8fi4avfFw@mail.gmail.com> <BY5PR11MB4337053667F17BB2F2B4BA3BC1539@BY5PR11MB4337.namprd11.prod.outlook.com> <CABNhwV0=i448_7m60ownptafJyb8VE0un_s3NVNTw8=JdZF7UQ@mail.gmail.com> <CAOj+MMGj7bnhDSr0KrW5SffZmPfmrTYRy1P4h6McT+UdFuxLuA@mail.gmail.com> <CABNhwV0d51rhrMco6JZxEo506ZcdnaH_K9EpuQNC0jcYBe0dCg@mail.gmail.com> <CAOj+MMECwfit3pFky7juKZcvoPW6Jab_bKTDoY-r6_bx10pbZA@mail.gmail.com>
In-Reply-To: <CAOj+MMECwfit3pFky7juKZcvoPW6Jab_bKTDoY-r6_bx10pbZA@mail.gmail.com>
From: Gyan Mishra <hayabusagsm@gmail.com>
Date: Sat, 15 Jan 2022 01:52:15 -0500
Message-ID: <CABNhwV31x7r2DLj7+N+QSmPr1YVK2go5GQp9aQ6rn1vB-vrtZw@mail.gmail.com>
To: Robert Raszuk <robert@raszuk.net>
Cc: "Les Ginsberg (ginsberg)" <ginsberg@cisco.com>, Linda Dunbar <linda.dunbar@futurewei.com>, "lsr@ietf.org" <lsr@ietf.org>
Content-Type: multipart/related; boundary="000000000000c6b5f705d5995cb1"
Archived-At: <https://mailarchive.ietf.org/arch/msg/lsr/-tDVWAXroirvNI65fiG4KHSqhMs>
Subject: Re: [Lsr] Seeking feedback to the revised draft-dunbar-lsr-5g-edge-compute
X-BeenThere: lsr@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: Link State Routing Working Group <lsr.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/lsr>, <mailto:lsr-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/lsr/>
List-Post: <mailto:lsr@ietf.org>
List-Help: <mailto:lsr-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/lsr>, <mailto:lsr-request@ietf.org?subject=subscribe>
X-List-Received-Date: Sat, 15 Jan 2022 06:52:35 -0000

Hi Robert

Many Thanks for your feedback on the draft.

I understand your POV and views as agreed that server load balancing can be
solved at the application layer and maybe with other possible solutions.

Understood.

The thoughts behind Linda draft is an alternative to solving at the
application layer using IGP based solution.   I know there is a lot
involved that gets factored into the server pool resources provisioning and
monitoring and even in this solution the server health checks are closely
monitored by the load balancers and trigger of a HRI-host route injection
withdrawal as soon as the LBV VIP monitor fails.

Also it maybe possible but Linda can speak further on it that GEO DNS based
application based load balancing hybrid Model with IGP site cost
interleaved could possibly apply to this as well so it’s not all IGP based
as well as other application layer appliance or virtual based load
balancing concepts.




Kind Regards

Gyan
On Fri, Jan 14, 2022 at 4:19 AM Robert Raszuk <robert@raszuk.net> wrote:

> Gyan,
>
> This is not a network discussion. There are well proven techniques to
> direct user sessions or user requests to a pool of servers deployed and
> operational. All provide robust services. Network plays very little to no
> role in that.
>
> There are also lot's of factors involved in making that decision (CPU
> load, RAM, Storage, IO, CPU Temp etc ...) and IMO it would be very bad to
> now make IGP to carry it and make routing decisions (even if separate
> topology) based on that information.
>
> I do not see this like a move into the right direction. That is my input.
>
> Kind regards,
> Robert.
>
>
>
>
>
>
>
>
> On Fri, Jan 14, 2022 at 4:53 AM Gyan Mishra <hayabusagsm@gmail.com> wrote:
>
>> Robert
>>
>> Responses in-line
>>
>>
>>
>> On Thu, Jan 13, 2022 at 5:55 AM Robert Raszuk <robert@raszuk.net> wrote:
>>
>>> Gyan,
>>>
>>> I see what the draft is trying to do now. /* I did not even consider
>>> this for the reason described below. */
>>>
>>> But what you wrote requires little correction:
>>>
>>> "So now the server you are on gets overloaded and a site cost gets
>>> advertised in the IGP at which point the connection receives a TCP reset"
>>>
>>> if you *s/connection/all connections/* then you quickly realize that
>>> what is proposed here is a disaster.
>>>
>>
>>    Gyan>  Remember this is Anycast proximity based routing along with
>> ECMP / UCMP flow based load balancing and most vendors modern routers
>> support some sort of  x-tuple ECMP/UCMP hash so the flows would be evenly
>> distributed, so if you have 10s of 100s of paths, the flows would be evenly
>> distributed across all the paths.  Since it’s Anycast proximity each path
>> leads to a different Application LB VIP and backend server.  So all the TCP
>> connections would be uniformly distributed across all the paths.
>>
>> Only the connections hashed to the path with overloaded server would get
>> reset and it would be no different then if the server went down as the
>> connections would get reset anyway in that case.
>>
>>  In this case instead of being pinned to a bad connection you are now
>> reset to a good connection resulting in better QOE for the end user and a
>> Happy customer.
>>
>> To me thats a positive not a negative.
>>
>>  A good analogy would be let’s say you are on WIFI and on the same
>> channel that your neighbors are on and have horrible bandwidth.  Do you
>> stay on that bad channel and limp along all day or to you flip to an unused
>> channel.
>>
>> Another example is if you have a server that has run out of resources.
>> Do you fail production traffic off the server taking it out of rotation or
>> let it stay as is and pray things get better.  This draft is a good example
>> of how IGP can save the day with site metric.
>>
>>>
>>> Breaking all existing flows going to one LB to suddenly timeout and all
>>> go to the other LB(s) is never a technique any one would seriously deploy
>>> in a production network.
>>>
>>
>> Gyan> Application load balancing can be done with DNS based GEO load
>> balancing based on client and server IP database where you have more
>> discrete control however the failover is much slower.
>>
>>>
>>> Leave alone that doing that has potential to immediately overload the
>>> other LB(s)/server(s) too.
>>>
>>
>> Gyan> The idea with Anycast load balancing is that you may have 10 or
>> even 100s of paths, so if one server fails the load can be evenly
>> distributed based on statistical multiplexing algorithm calculated by the
>> server teams engineering the sizing of the server clusters to ensure what
>> you described won’t happen.
>>
>>>
>>> For me the conclusion is that IGP transport level is not the proper
>>> layer to address the requirement.
>>>
>>> Cheers,
>>> Robert.
>>>
>>>
>>> On Thu, Jan 13, 2022 at 7:05 AM Gyan Mishra <hayabusagsm@gmail.com>
>>> wrote:
>>>
>>>>
>>>> Hi Les
>>>>
>>>> Agreed.
>>>>
>>>> My thoughts are that the context of the draft is based on an Anycast
>>>> VIP address of a server which is proximity based load balancing and not
>>>> necessarily ECMP/UCMP and only if the proximity is the same for multiple
>>>> paths to the Anycast VIP would there be a ECMP/UCMP possibility.
>>>>
>>>> Let’s say if it’s proximity based and one path is preferred, the flow
>>>> will take that path.  So now the server you are on gets overloaded and a
>>>> site cost gets advertised in the IGP at which point the connection receives
>>>> a TCP reset and flow re-establishes on the alternate path based on the site
>>>> cost and remains there until the server goes down or  gets overloaded or a
>>>> better path comes along.
>>>>
>>>> For ECMP case, ECMP has flow affinity so the flow will stay on the same
>>>> path long lived until the connection terminates.
>>>>
>>>> So now in ECMP case the flow hashed to a path and maintains its
>>>> affinity to that path.  Now all of sudden the server gets overloaded and we
>>>> get a better site cost advertised.  So now the session terminates on
>>>> current path and establishes again on the Anycast VIP new path based on the
>>>> site cost advertised.
>>>>
>>>> The failover I believe results in the user refreshing their browser
>>>> which is better than hanging.
>>>>
>>>> As the VIP prefix is the only one that experiences reconvergence on new
>>>> path based on site cost if there is any instability with the servers that
>>>> will be reflected to the IGP Anycast prefix as well.
>>>>
>>>> Is that a good or bad thing.  I think if you had to pick your poison as
>>>> here the issue Linda is trying to solve is a server issue but leveraging
>>>> the IGP to force re-convergence when the server is in a half baked state
>>>> meaning it’s busy and connections are being dropped or very slow QOE for
>>>> end user.  If you did nothing and let it ride the the user would be stuck
>>>> on a bad connection.
>>>>
>>>> So this solution dynamically fixed the issue.
>>>>
>>>> As far as oscillation that is not a big deal as you are in a much worse
>>>> off state connected to a dying server on its last leg as far as memory and
>>>> CPU.
>>>>
>>>> This solution I can see can apply to any client / server connection and
>>>> not just 5G and can be used by enterprises as well as SP for their
>>>> customers to have an drastically improved QOE.
>>>>
>>>> I saw some feedback on the TLV and I think once that is all worked out
>>>> I am in favor of advancing this draft.
>>>>
>>>> Kind Regards
>>>>
>>>> Gyan
>>>>
>>>>
>>>> On Wed, Jan 12, 2022 at 10:16 PM Les Ginsberg (ginsberg) <
>>>> ginsberg@cisco.com> wrote:
>>>>
>>>>> Gyan –
>>>>>
>>>>>
>>>>>
>>>>> The difference between ECMP and UCMP is not significant in this
>>>>> discussion.
>>>>>
>>>>> I don’t want to speak for Robert, but for me his point was that IGPs
>>>>> can do “multipath” well – but this does not translate into managing flows.
>>>>>
>>>>> Please see my other responses on this thread.
>>>>>
>>>>>
>>>>>
>>>>> Thanx.
>>>>>
>>>>>
>>>>>
>>>>>     Les
>>>>>
>>>>>
>>>>>
>>>>>
>>>>>
>>>>> *From:* Gyan Mishra <hayabusagsm@gmail.com>
>>>>> *Sent:* Wednesday, January 12, 2022 5:26 PM
>>>>> *To:* Robert Raszuk <robert@raszuk.net>
>>>>> *Cc:* Les Ginsberg (ginsberg) <ginsberg@cisco.com>; Linda Dunbar <
>>>>> linda.dunbar@futurewei.com>; lsr@ietf.org
>>>>> *Subject:* Re: [Lsr] Seeking feedback to the revised
>>>>> draft-dunbar-lsr-5g-edge-compute
>>>>>
>>>>>
>>>>>
>>>>>
>>>>>
>>>>> Robert
>>>>>
>>>>>
>>>>>
>>>>> Here are a few examples of UCMP drafts below used in core and data
>>>>> center use cases.
>>>>>
>>>>>
>>>>>
>>>>>
>>>>> https://datatracker.ietf.org/doc/html/draft-ietf-bess-evpn-unequal-lb-15
>>>>>
>>>>>
>>>>>
>>>>>
>>>>> https://datatracker.ietf.org/doc/html/draft-mohanty-bess-weighted-hrw-02
>>>>>
>>>>>
>>>>>
>>>>> https://datatracker.ietf.org/doc/html/draft-ietf-idr-link-bandwidth
>>>>>
>>>>>
>>>>>
>>>>> https://datatracker.ietf.org/doc/html/draft-mohanty-bess-ebgp-dmz
>>>>>
>>>>>
>>>>>
>>>>>
>>>>>
>>>>>
>>>>>
>>>>> There are many use cases in routing for unequal cost load balancing
>>>>> capabilities.
>>>>>
>>>>>
>>>>>
>>>>> Kind Regards
>>>>>
>>>>>
>>>>>
>>>>> Gyan
>>>>>
>>>>>
>>>>>
>>>>> On Wed, Jan 12, 2022 at 2:23 PM Robert Raszuk <robert@raszuk.net>
>>>>> wrote:
>>>>>
>>>>> Linda,
>>>>>
>>>>>
>>>>>
>>>>> > IGP has been used for the Multi-path computation for a long time
>>>>>
>>>>>
>>>>>
>>>>> IGP can and does ECMP well. Moreover, injecting metric of anycast
>>>>> server destination plays no role in it as all paths would inherit that
>>>>> external to the IGP cost.
>>>>>
>>>>>
>>>>>
>>>>> Unequal cost load balancing or intelligent traffic spread has always
>>>>> been done at the application layer *for example MPLS*
>>>>>
>>>>>
>>>>>
>>>>> Thx a lot,
>>>>>
>>>>> R.
>>>>>
>>>>>
>>>>>
>>>>> On Wed, Jan 12, 2022 at 8:18 PM Linda Dunbar <
>>>>> linda.dunbar@futurewei.com> wrote:
>>>>>
>>>>> Robert,
>>>>>
>>>>>
>>>>>
>>>>> Please see inline in green:
>>>>>
>>>>>
>>>>>
>>>>> *From:* Robert Raszuk <robert@raszuk.net>
>>>>> *Sent:* Wednesday, January 12, 2022 1:00 PM
>>>>> *To:* Linda Dunbar <linda.dunbar@futurewei.com>
>>>>> *Cc:* Les Ginsberg (ginsberg) <ginsberg@cisco.com>; lsr@ietf.org
>>>>> *Subject:* Re: [Lsr] Seeking feedback to the revised
>>>>> draft-dunbar-lsr-5g-edge-compute
>>>>>
>>>>>
>>>>>
>>>>> Hi Linda,
>>>>>
>>>>>
>>>>>
>>>>> *[LES:] It is my opinion that what you propose will not achieve your
>>>>> goals – in part because IGPs only influence forwarding on a per packet
>>>>> basis – not a per flow/connection basis.*
>>>>>
>>>>> *[Linda] Most vendors do support flow based ECMP, with Shortest Path
>>>>> computed by attributes advertised by IGP.*
>>>>>
>>>>>
>>>>>
>>>>> I am with Les here. ECMP has nothing to do with his point.
>>>>>
>>>>>
>>>>>
>>>>> [Linda] Les said that “IGP only influence forwarding on a per packet
>>>>> basis”.  I am saying that vendors supporting “forwarding per flow” with
>>>>> equal cost computed by IGP implies  that forwarding on modern routers are
>>>>> no longer purely per packet basis.
>>>>>
>>>>>
>>>>>
>>>>>
>>>>>
>>>>> Draft says:
>>>>>
>>>>>
>>>>>
>>>>> *When those multiple server instances share one IP address (ANYCAST),
>>>>> the transient network and load conditions can be incorporated in selecting
>>>>> an optimal path among server instances for UEs.*
>>>>>
>>>>>
>>>>>
>>>>> So if we apply any new metric to indicate load of a single anycast
>>>>> address how is this going to help anything ?
>>>>>
>>>>>
>>>>>
>>>>> [Linda] The “Load” or “Aggregated Site Cost” is to differentiate
>>>>> multiple paths with the same routing distance.
>>>>>
>>>>>
>>>>>
>>>>>
>>>>>
>>>>> You would need a mechanism where the network is smart and say per
>>>>> src-dst tuple or more spreads the traffic. IGP does not play that game
>>>>> today I am afraid.
>>>>>
>>>>> [Linda] There is one SRC and multiple paths to one DST. IGP has been
>>>>> used for the Multi-path computation for a long time.
>>>>>
>>>>>
>>>>>
>>>>> Thank you, Linda
>>>>>
>>>>>
>>>>>
>>>>> Thx a lot,
>>>>> R.
>>>>>
>>>>>
>>>>>
>>>>>
>>>>>
>>>>>
>>>>>
>>>>>
>>>>>
>>>>>
>>>>>
>>>>>
>>>>>
>>>>>
>>>>>
>>>>> _______________________________________________
>>>>> Lsr mailing list
>>>>> Lsr@ietf.org
>>>>> https://www.ietf.org/mailman/listinfo/lsr
>>>>>
>>>>> --
>>>>>
>>>>> [image: Image removed by sender.] <http://www.verizon.com/>
>>>>>
>>>>> *Gyan Mishra*
>>>>>
>>>>> *Network Solutions Architect *
>>>>>
>>>>> *Email gyan.s.mishra@verizon.com <gyan.s.mishra@verizon.com>*
>>>>>
>>>>> *M 301 502-1347*
>>>>>
>>>>>
>>>>>
>>>> --
>>>>
>>>> <http://www.verizon.com/>
>>>>
>>>> *Gyan Mishra*
>>>>
>>>> *Network Solutions A**rchitect *
>>>>
>>>> *Email gyan.s.mishra@verizon.com <gyan.s.mishra@verizon.com>*
>>>>
>>>>
>>>>
>>>> *M 301 502-1347*
>>>>
>>>> --
>>
>> <http://www.verizon.com/>
>>
>> *Gyan Mishra*
>>
>> *Network Solutions A**rchitect *
>>
>> *Email gyan.s.mishra@verizon.com <gyan.s.mishra@verizon.com>*
>>
>>
>>
>> *M 301 502-1347*
>>
>> --

<http://www.verizon.com/>

*Gyan Mishra*

*Network Solutions A**rchitect *

*Email gyan.s.mishra@verizon.com <gyan.s.mishra@verizon.com>*



*M 301 502-1347*