Re: [Lsr] Seeking feedback to the revised draft-dunbar-lsr-5g-edge-compute

Robert Raszuk <robert@raszuk.net> Thu, 13 January 2022 10:59 UTC

Return-Path: <robert@raszuk.net>
X-Original-To: lsr@ietfa.amsl.com
Delivered-To: lsr@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 33D573A148B for <lsr@ietfa.amsl.com>; Thu, 13 Jan 2022 02:59:45 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.098
X-Spam-Level:
X-Spam-Status: No, score=-2.098 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, HTML_MESSAGE=0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=raszuk.net
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id SOO-V3eJeQj1 for <lsr@ietfa.amsl.com>; Thu, 13 Jan 2022 02:59:40 -0800 (PST)
Received: from mail-ua1-x933.google.com (mail-ua1-x933.google.com [IPv6:2607:f8b0:4864:20::933]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 267463A1489 for <lsr@ietf.org>; Thu, 13 Jan 2022 02:59:40 -0800 (PST)
Received: by mail-ua1-x933.google.com with SMTP id m90so10298908uam.2 for <lsr@ietf.org>; Thu, 13 Jan 2022 02:59:40 -0800 (PST)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=raszuk.net; s=google; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=hdUiAFsJ53ytLdpoGrX3Xn3FTk62dQFaK6mykjTpxzI=; b=SHR3kbfjPah0gCUe+H6VMgei8zYImFNSWV3QWVbAH97VAZk9lPSLn8ms6hDvD61Mhm ugCospEjk68pDX1hBwsGnRX41G+NZfK0ukmT4a1uyKjQ+z9HCu8Pc88HD3FTnENvr5fh s0jHf8z0Tr9xCvFYCxXz48+65dEKwh8zB7HiXPsftDf2NSsTtA6V+JIIy7gQkdkBwawA 9IFkT4CBIER4SmnmD6fsgSnf1iJPzRiLIgGPE1mo/+PLEp6k5kz9ImwNjy4bfn9CLtpb rZj2ik1HH62q9Nk97DYuGgpAjytAirTQpxq2+xpV5PKRwHJn4f/kcM4ok37ysl7ib33P KSHA==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=hdUiAFsJ53ytLdpoGrX3Xn3FTk62dQFaK6mykjTpxzI=; b=AFTB1WO3BAVl7s4cOwURZopqz4GEJjF6PsebktQ1TWmZAgMpxJZrieXnGzXG8PZyZw TORPK3XGFPTbgvWXiXJpRjwGEirj7hMG05j1E0vvqRPLhgS2ymfA4ju8TRYmP9C/y4bU 1toRC8/GN0wTYx41fkMja+Tpr4sy5BJPXiXa/zy4FnZPxodmlEJELX519JMgLYgWrjKk d5BvDdE0BzoyiDMEqXBvvHyL/tfVX8anmNUrQVBLp+eB78GhGv3SIYYBXA/SOiGV01gB 5Ida9HzLYvkGFz+Q0PowTPHqxMIruMZdRbs9Xm+QaL0T1ude4ewR5jwjbMYugiRGZxgd 6fDA==
X-Gm-Message-State: AOAM5325Pdg+Ra2Y/wdmygvCRHvO8QcZ6u7WQoEZo22ZumB+1qUsKkmM dZLuDEzny+FpS/3c/Y4h1E407AJjNnmjMFDviYYCaJ+Qj2hKSw==
X-Google-Smtp-Source: ABdhPJzEyS5j9tY9W3h50LVdzDDkvRoFmk5KamvcgVVGIljZ2wsMWoSdEq1opucvW+GXH1MlnLV4CB84eFG12KaoYQQ=
X-Received: by 2002:a67:e192:: with SMTP id e18mr1933970vsl.85.1642071577758; Thu, 13 Jan 2022 02:59:37 -0800 (PST)
MIME-Version: 1.0
References: <CO1PR13MB49208C0CFE0AE200E9654D62854D9@CO1PR13MB4920.namprd13.prod.outlook.com> <BY5PR11MB43374CE0329A2D0D4CBBB56CC1509@BY5PR11MB4337.namprd11.prod.outlook.com> <CO1PR13MB49209FB2780A390C060981D285529@CO1PR13MB4920.namprd13.prod.outlook.com> <BY5PR11MB43373269E621CCC90F47D650C1529@BY5PR11MB4337.namprd11.prod.outlook.com> <CO1PR13MB492084E011B67AFF7EB45B3585529@CO1PR13MB4920.namprd13.prod.outlook.com> <BY5PR11MB43376DA8FD239B220AAA7F03C1529@BY5PR11MB4337.namprd11.prod.outlook.com> <CO1PR13MB4920EDA93692C2BE3CC49F7A85529@CO1PR13MB4920.namprd13.prod.outlook.com> <CAOj+MMH0ockwESPepB0PH-_jHxtSJ2+n0cJCsm-oGGB6ztvQtQ@mail.gmail.com> <CO1PR13MB4920561C1237ECD319B2C17B85529@CO1PR13MB4920.namprd13.prod.outlook.com> <CAOj+MMEMt845bRwhn-KTTx=7DvinocYc0JYZyzPp9BR7jC1C+w@mail.gmail.com> <CABNhwV1dB5TwtibkMthxamsSZvtm36h1vrGOhtucw8fi4avfFw@mail.gmail.com> <BY5PR11MB4337053667F17BB2F2B4BA3BC1539@BY5PR11MB4337.namprd11.prod.outlook.com> <CABNhwV0=i448_7m60ownptafJyb8VE0un_s3NVNTw8=JdZF7UQ@mail.gmail.com> <CAOj+MMGj7bnhDSr0KrW5SffZmPfmrTYRy1P4h6McT+UdFuxLuA@mail.gmail.com>
In-Reply-To: <CAOj+MMGj7bnhDSr0KrW5SffZmPfmrTYRy1P4h6McT+UdFuxLuA@mail.gmail.com>
From: Robert Raszuk <robert@raszuk.net>
Date: Thu, 13 Jan 2022 11:59:41 +0100
Message-ID: <CAOj+MMFRpsX6es7GphepFb6WUAge8zRAXz7fgZrTdWmf6SnT_Q@mail.gmail.com>
To: Gyan Mishra <hayabusagsm@gmail.com>
Cc: "Les Ginsberg (ginsberg)" <ginsberg@cisco.com>, Linda Dunbar <linda.dunbar@futurewei.com>, "lsr@ietf.org" <lsr@ietf.org>
Content-Type: multipart/related; boundary="00000000000016e1b205d5749512"
Archived-At: <https://mailarchive.ietf.org/arch/msg/lsr/6y74K6EJsHvHFzBzTwI5Sld48Oo>
Subject: Re: [Lsr] Seeking feedback to the revised draft-dunbar-lsr-5g-edge-compute
X-BeenThere: lsr@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: Link State Routing Working Group <lsr.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/lsr>, <mailto:lsr-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/lsr/>
List-Post: <mailto:lsr@ietf.org>
List-Help: <mailto:lsr-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/lsr>, <mailto:lsr-request@ietf.org?subject=subscribe>
X-List-Received-Date: Thu, 13 Jan 2022 10:59:45 -0000

And just to provide a sound alternative to the objective of this work.

Please consider using Traefik - https://traefik.io/

Thx,
R.


On Thu, Jan 13, 2022 at 11:56 AM Robert Raszuk <robert@raszuk.net> wrote:

> Gyan,
>
> I see what the draft is trying to do now. /* I did not even consider this
> for the reason described below. */
>
> But what you wrote requires little correction:
>
> "So now the server you are on gets overloaded and a site cost gets
> advertised in the IGP at which point the connection receives a TCP reset"
>
> if you *s/connection/all connections/* then you quickly realize that what
> is proposed here is a disaster.
>
> Breaking all existing flows going to one LB to suddenly timeout and all go
> to the other LB(s) is never a technique any one would seriously deploy in a
> production network.
>
> Leave alone that doing that has potential to immediately overload the
> other LB(s)/server(s) too.
>
> For me the conclusion is that IGP transport level is not the proper layer
> to address the requirement.
>
> Cheers,
> Robert.
>
>
> On Thu, Jan 13, 2022 at 7:05 AM Gyan Mishra <hayabusagsm@gmail.com> wrote:
>
>>
>> Hi Les
>>
>> Agreed.
>>
>> My thoughts are that the context of the draft is based on an Anycast VIP
>> address of a server which is proximity based load balancing and not
>> necessarily ECMP/UCMP and only if the proximity is the same for multiple
>> paths to the Anycast VIP would there be a ECMP/UCMP possibility.
>>
>> Let’s say if it’s proximity based and one path is preferred, the flow
>> will take that path.  So now the server you are on gets overloaded and a
>> site cost gets advertised in the IGP at which point the connection receives
>> a TCP reset and flow re-establishes on the alternate path based on the site
>> cost and remains there until the server goes down or  gets overloaded or a
>> better path comes along.
>>
>> For ECMP case, ECMP has flow affinity so the flow will stay on the same
>> path long lived until the connection terminates.
>>
>> So now in ECMP case the flow hashed to a path and maintains its affinity
>> to that path.  Now all of sudden the server gets overloaded and we get a
>> better site cost advertised.  So now the session terminates on current path
>> and establishes again on the Anycast VIP new path based on the site cost
>> advertised.
>>
>> The failover I believe results in the user refreshing their browser which
>> is better than hanging.
>>
>> As the VIP prefix is the only one that experiences reconvergence on new
>> path based on site cost if there is any instability with the servers that
>> will be reflected to the IGP Anycast prefix as well.
>>
>> Is that a good or bad thing.  I think if you had to pick your poison as
>> here the issue Linda is trying to solve is a server issue but leveraging
>> the IGP to force re-convergence when the server is in a half baked state
>> meaning it’s busy and connections are being dropped or very slow QOE for
>> end user.  If you did nothing and let it ride the the user would be stuck
>> on a bad connection.
>>
>> So this solution dynamically fixed the issue.
>>
>> As far as oscillation that is not a big deal as you are in a much worse
>> off state connected to a dying server on its last leg as far as memory and
>> CPU.
>>
>> This solution I can see can apply to any client / server connection and
>> not just 5G and can be used by enterprises as well as SP for their
>> customers to have an drastically improved QOE.
>>
>> I saw some feedback on the TLV and I think once that is all worked out I
>> am in favor of advancing this draft.
>>
>> Kind Regards
>>
>> Gyan
>>
>>
>> On Wed, Jan 12, 2022 at 10:16 PM Les Ginsberg (ginsberg) <
>> ginsberg@cisco.com> wrote:
>>
>>> Gyan –
>>>
>>>
>>>
>>> The difference between ECMP and UCMP is not significant in this
>>> discussion.
>>>
>>> I don’t want to speak for Robert, but for me his point was that IGPs can
>>> do “multipath” well – but this does not translate into managing flows.
>>>
>>> Please see my other responses on this thread.
>>>
>>>
>>>
>>> Thanx.
>>>
>>>
>>>
>>>     Les
>>>
>>>
>>>
>>>
>>>
>>> *From:* Gyan Mishra <hayabusagsm@gmail.com>
>>> *Sent:* Wednesday, January 12, 2022 5:26 PM
>>> *To:* Robert Raszuk <robert@raszuk.net>
>>> *Cc:* Les Ginsberg (ginsberg) <ginsberg@cisco.com>; Linda Dunbar <
>>> linda.dunbar@futurewei.com>; lsr@ietf.org
>>> *Subject:* Re: [Lsr] Seeking feedback to the revised
>>> draft-dunbar-lsr-5g-edge-compute
>>>
>>>
>>>
>>>
>>>
>>> Robert
>>>
>>>
>>>
>>> Here are a few examples of UCMP drafts below used in core and data
>>> center use cases.
>>>
>>>
>>>
>>> https://datatracker.ietf.org/doc/html/draft-ietf-bess-evpn-unequal-lb-15
>>>
>>>
>>>
>>> https://datatracker.ietf.org/doc/html/draft-mohanty-bess-weighted-hrw-02
>>>
>>>
>>>
>>> https://datatracker.ietf.org/doc/html/draft-ietf-idr-link-bandwidth
>>>
>>>
>>>
>>> https://datatracker.ietf.org/doc/html/draft-mohanty-bess-ebgp-dmz
>>>
>>>
>>>
>>>
>>>
>>>
>>>
>>> There are many use cases in routing for unequal cost load balancing
>>> capabilities.
>>>
>>>
>>>
>>> Kind Regards
>>>
>>>
>>>
>>> Gyan
>>>
>>>
>>>
>>> On Wed, Jan 12, 2022 at 2:23 PM Robert Raszuk <robert@raszuk.net> wrote:
>>>
>>> Linda,
>>>
>>>
>>>
>>> > IGP has been used for the Multi-path computation for a long time
>>>
>>>
>>>
>>> IGP can and does ECMP well. Moreover, injecting metric of anycast server
>>> destination plays no role in it as all paths would inherit that external to
>>> the IGP cost.
>>>
>>>
>>>
>>> Unequal cost load balancing or intelligent traffic spread has always
>>> been done at the application layer *for example MPLS*
>>>
>>>
>>>
>>> Thx a lot,
>>>
>>> R.
>>>
>>>
>>>
>>> On Wed, Jan 12, 2022 at 8:18 PM Linda Dunbar <linda.dunbar@futurewei.com>
>>> wrote:
>>>
>>> Robert,
>>>
>>>
>>>
>>> Please see inline in green:
>>>
>>>
>>>
>>> *From:* Robert Raszuk <robert@raszuk.net>
>>> *Sent:* Wednesday, January 12, 2022 1:00 PM
>>> *To:* Linda Dunbar <linda.dunbar@futurewei.com>
>>> *Cc:* Les Ginsberg (ginsberg) <ginsberg@cisco.com>; lsr@ietf.org
>>> *Subject:* Re: [Lsr] Seeking feedback to the revised
>>> draft-dunbar-lsr-5g-edge-compute
>>>
>>>
>>>
>>> Hi Linda,
>>>
>>>
>>>
>>> *[LES:] It is my opinion that what you propose will not achieve your
>>> goals – in part because IGPs only influence forwarding on a per packet
>>> basis – not a per flow/connection basis.*
>>>
>>> *[Linda] Most vendors do support flow based ECMP, with Shortest Path
>>> computed by attributes advertised by IGP.*
>>>
>>>
>>>
>>> I am with Les here. ECMP has nothing to do with his point.
>>>
>>>
>>>
>>> [Linda] Les said that “IGP only influence forwarding on a per packet
>>> basis”.  I am saying that vendors supporting “forwarding per flow” with
>>> equal cost computed by IGP implies  that forwarding on modern routers are
>>> no longer purely per packet basis.
>>>
>>>
>>>
>>>
>>>
>>> Draft says:
>>>
>>>
>>>
>>> *When those multiple server instances share one IP address (ANYCAST),
>>> the transient network and load conditions can be incorporated in selecting
>>> an optimal path among server instances for UEs.*
>>>
>>>
>>>
>>> So if we apply any new metric to indicate load of a single anycast
>>> address how is this going to help anything ?
>>>
>>>
>>>
>>> [Linda] The “Load” or “Aggregated Site Cost” is to differentiate
>>> multiple paths with the same routing distance.
>>>
>>>
>>>
>>>
>>>
>>> You would need a mechanism where the network is smart and say per
>>> src-dst tuple or more spreads the traffic. IGP does not play that game
>>> today I am afraid.
>>>
>>> [Linda] There is one SRC and multiple paths to one DST. IGP has been
>>> used for the Multi-path computation for a long time.
>>>
>>>
>>>
>>> Thank you, Linda
>>>
>>>
>>>
>>> Thx a lot,
>>> R.
>>>
>>>
>>>
>>>
>>>
>>>
>>>
>>>
>>>
>>>
>>>
>>>
>>>
>>>
>>>
>>> _______________________________________________
>>> Lsr mailing list
>>> Lsr@ietf.org
>>> https://www.ietf.org/mailman/listinfo/lsr
>>>
>>> --
>>>
>>> [image: Image removed by sender.] <http://www.verizon.com/>
>>>
>>> *Gyan Mishra*
>>>
>>> *Network Solutions Architect *
>>>
>>> *Email gyan.s.mishra@verizon.com <gyan.s.mishra@verizon.com>*
>>>
>>> *M 301 502-1347*
>>>
>>>
>>>
>> --
>>
>> <http://www.verizon.com/>
>>
>> *Gyan Mishra*
>>
>> *Network Solutions A**rchitect *
>>
>> *Email gyan.s.mishra@verizon.com <gyan.s.mishra@verizon.com>*
>>
>>
>>
>> *M 301 502-1347*
>>
>>