Re: [bess] WGLC, IPR and implementation poll for draft-ietf-bess-mvpn-fast-failover

Greg Mirsky <gregimirsky@gmail.com> Tue, 20 August 2019 17:38 UTC

Return-Path: <gregimirsky@gmail.com>
X-Original-To: bess@ietfa.amsl.com
Delivered-To: bess@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 0F8F41207FE; Tue, 20 Aug 2019 10:38:19 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -0.596
X-Spam-Level:
X-Spam-Status: No, score=-0.596 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, FREEMAIL_FROM=0.001, HTML_COMMENT_SAVED_URL=1.391, HTML_MESSAGE=0.001, RCVD_IN_DNSWL_NONE=-0.0001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, T_HTML_ATTACH=0.01, URIBL_BLOCKED=0.001] autolearn=no autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id WpOPP95iQDqu; Tue, 20 Aug 2019 10:38:14 -0700 (PDT)
Received: from mail-lf1-x134.google.com (mail-lf1-x134.google.com [IPv6:2a00:1450:4864:20::134]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 45C13120984; Tue, 20 Aug 2019 10:38:13 -0700 (PDT)
Received: by mail-lf1-x134.google.com with SMTP id 62so4764836lfa.8; Tue, 20 Aug 2019 10:38:13 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=lgRoFN6gi4cphr6xpox+E6iCwWjNw0bQh9S+Xa7cKmU=; b=rq11dHWeGiX9lQBj+k0hnlIaZX5Fa2RTX6/lr/o4jkXI01wmKG26cP/MLdgCfubGF/ Z51Yrm2p72Y0dl6fHI6CRkua7r1aToBzzxy6EWkCNbrNWC8TLtetyMr+zC1WSFllMy5w p8x1eaa+SWBLHOvJeq+qYN8lzoQXDWoFL17ipU2shAZMkApROyALvIMlcmqD4/YwhLyx O3Z8KI9xPndKyo+o8wqd+gcu5CCuudBVMT9nVCv5tcH030NY9qkkI/HohBvvL0d3TWP1 mOLROF6J/lIYFU1TGcGOdcLa9GTE2FAk+3xIgLP4cxAIfBfr2WHuZEd1t52griop0Iem 86bw==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=lgRoFN6gi4cphr6xpox+E6iCwWjNw0bQh9S+Xa7cKmU=; b=CqE0Oqk5VUwAlUxYRdB5iVyrtV3Vt/8jN1y63XmfIfUCLCcntqnx9cMBeuX5YR54ep +gsZnoeOLTlMttMu4+P/o9k3W0R9ls/lQQILezma1JEZKRl0wdmVcFB5TG6CQ8ydBfBN A2FLpYB2ecGJJMe6Q04RbSNRdDo/yITy+Lo4nZdysA7ASs1/Kumz+zOlYRvADhjLkJMf ZoQ5+bIvbo5T2R1MftMj/vaYEe1+gwWy6f47q26wLNc0cK6pePpGdCHDzIgxogk1txDv P+ysdD3HjRBavvYc4ze3XKEBomSjYaWhxdOw41gGxM0z2LH0772Rln0RZaVjJqr7Kfbp +EtQ==
X-Gm-Message-State: APjAAAW+a6C2txigkS0YmC3LK02q5bvoPtG1JLkXMhqX37h0G8FMtUmr g8Cin5xnLKb3Qn8Ps0UhluSQxtOwSXIE9mCHW+0=
X-Google-Smtp-Source: APXvYqwScokKBebgAp63Wj/d+8vN5gzgyYAZ2AkkjcnXSYFLJz+1+f0R1XCXHApMNnmdVvXcOc2qrA86hUlVYSnKbzQ=
X-Received: by 2002:a19:9111:: with SMTP id t17mr15863512lfd.113.1566322691376; Tue, 20 Aug 2019 10:38:11 -0700 (PDT)
MIME-Version: 1.0
References: <26502_1542873261_5BF660AD_26502_47_1_9E32478DFA9976438E7A22F69B08FF924B7752E9@OPEXCLILMA4.corporate.adroot.infra.ftgroup> <BL0PR05MB5025A934922FDDC316AFD2E4D4AC0@BL0PR05MB5025.namprd05.prod.outlook.com> <CA+RyBmXcu3b9dObX=G9vyHNJtEuJ4wWqMtQXvxCNxgNOSCsmWw@mail.gmail.com> <CO2PR05MB24550CC9932A560B1DC7B19BD44B0@CO2PR05MB2455.namprd05.prod.outlook.com> <CA+RyBmV6oigz+ODY9C6QEqkDQY1X+x=yDpWqPoiODyyqVeTwHA@mail.gmail.com> <DM5PR05MB354829730C814ADE41F373CFD4310@DM5PR05MB3548.namprd05.prod.outlook.com> <CA+RyBmWNmdavTzoeGK+b1Tz-am6foNJ=1c5Kz7iKJ1gc7Lsvcg@mail.gmail.com> <DM5PR05MB35486A07622B1E28E781AE08D4D90@DM5PR05MB3548.namprd05.prod.outlook.com> <CA+RyBmXb-MrxwjQ9C5av+5XahJ1KjBb2QF=EHunMrCnidsqp=Q@mail.gmail.com> <DM5PR05MB35487E88B38742D6DEB5F666D4D60@DM5PR05MB3548.namprd05.prod.outlook.com>
In-Reply-To: <DM5PR05MB35487E88B38742D6DEB5F666D4D60@DM5PR05MB3548.namprd05.prod.outlook.com>
From: Greg Mirsky <gregimirsky@gmail.com>
Date: Tue, 20 Aug 2019 10:38:00 -0700
Message-ID: <CA+RyBmUScYZdFQSxToH8PZ92yZQs4vhGU9oAhgJmeCiMhADhJg@mail.gmail.com>
To: "Jeffrey (Zhaohui) Zhang" <zzhang@juniper.net>
Cc: "bess-chairs@ietf.org" <bess-chairs@ietf.org>, "EXT - thomas.morin@orange.com" <thomas.morin@orange.com>, Robert Kebler <rkebler@juniper.net>, BESS <bess@ietf.org>
Content-Type: multipart/mixed; boundary="000000000000a09adf05908febb2"
Archived-At: <https://mailarchive.ietf.org/arch/msg/bess/Ytq-Tgsh9HZyUX2_0TAznGtq2uI>
Subject: Re: [bess] WGLC, IPR and implementation poll for draft-ietf-bess-mvpn-fast-failover
X-BeenThere: bess@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: BGP-Enabled ServiceS working group discussion list <bess.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/bess>, <mailto:bess-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/bess/>
List-Post: <mailto:bess@ietf.org>
List-Help: <mailto:bess-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/bess>, <mailto:bess-request@ietf.org?subject=subscribe>
X-List-Received-Date: Tue, 20 Aug 2019 17:38:19 -0000

Hi Jeffrey,
many thanks for the most detailed explanation. Please find my notes and
answers in-line tagged GIM5>>. Also, trimmed text to clear issues that we
agree has been resolved already.

Attached is the new working version of the draft and the diff to highlight
the changes.

Regards,
Greg

On Fri, Aug 9, 2019 at 1:38 PM Jeffrey (Zhaohui) Zhang <zzhang@juniper.net>
wrote:

> Hi Greg,
>
>
>
> I’ve trimmed some text. Please see zzh3> below.
>
>
>
> Zzh2> I checked the surrounding text in this draft and section 5.1.3 in
> RFC6513. I believe section 3 of this document, before its subsection 3.1
> should be re-written as following:
>
> …
>
> Zzh2> The reason is that for the candidate set is not ordered – it’s just
> a set to select from (either based on IP address or hashing).
>
> GIM4>> Many thanks, Jeffrey! Please check the working verion or diff and
> let me know if I've correctly applied the changes.
>
>
>
> Zzh3> There is an extra “o”:
>

>
>    o  The first two options select the Upstream PE from a candidate PE
>
>       set either based on IP address or a hashing algorithm.  When used
>
>       together   with the optional procedure of considering the P-tunnel
>
>       status as in   this document, a candidate upstream PE is included
>
>       in the set if it either:
>
>
>
>    o  ß EXTRA
>
GIM5>> I think I've managed to clear it.

>
>
>       A.  advertise a PMSI bound to a tunnel, where the specified tunnel
>
>           is not known to be down or up
>
>
>
>
>
>
>
> 3.1.7.  Per PE-CE link BFD Discriminator
>
>  …
>
>  Zzh2> Because you still want to track the tunnel state (in addition to
> pe-ce interface state), you would need at least two discriminators – one
> for the tunnel and one for the PE-CE link. However, the new “BGP- BFD
> attribute” defined in this spec only accommodates one discriminator (and my
> understanding is that you can’t have more than one of the same attribute).
>
> GIM4>> It is implied that the PE-CE link is monitored by p2p BFD session,
> most likely as described in RFC 5881 for single-hop BFD. That would not
> require bootstrapping.
>
>
>
> Zzh3> I was saying that if you use “Per PE-CE link BFD Discriminator”,
> then …
>
> Zzh2> The simplest solution is that just use the same discriminator (vs.
> per PE-CE link discriminator). With that, the ENTIRE section 3.1.7
> (including its subsections) become the following:
>
> GIM4>> I'm confused by "use the same Discriminator". The root advertises
> its Discriminator to the downstream PEs. The value is only locally unique
> for the root, not for any downstream PE. For a PE-CE link, if BFD is used,
> each PE must pick its locally unique value to use it as My Discriminator.
> CE uses that value in Your Discriminator field and thus the PE
> demultiplexes p2p sessions using its locally unique value in the Your
> Discriminator field. Note that p2mp BFD session among the root and the
> downstream PEs is such that PEs receives BFD control packets with the value
> of Your Discriminator field zeroed, and PEs use a different mechanism to
> demultiplex p2mp BFD sessions (as described in RFC 8562).
>
>
>
> Zzh3> I meant that you don’t use PE-CE link specific discriminator (e.g.
> value1 for the tunnel status, value2 for PE-CE link1 and value3 for PE-CE
> link2). Whether you track the PE-CE link status or not, you just include
> the discriminator that corresponds to the tunnel. I don’t mean that all PEs
> use the same discriminator.
>

>
> 3.1.7 Tracking upstream PE-CE link status
>
>
>
>    In case the PE-CE link on an upstream PE failed, even though the
> provider tunnel is still up,
>
>    It is desired for the downstream PEs to switch to a backup upstream PE.
> To achieve that,
>
>    If the upstream PE detects that its PE-CE link fails, it SHOULD set the
> bfd.LocalDiag of the
>
>    p2mp BFD session to Concatenated Path Down and/or Reverse Concatenated
> Path Down,
>
>    unless it switches to a new PE-CE link immediately (in that case the
> upstream PE will start tracking
>
>    the status of the new PE-CE link).
>
>    When a downstream PE receives that bfd.LocalDiag code, it treats as if
> the tunnel itself
>
>    failed and tries to switch to a backup PE.
>
> GIM4>> Would the downstream PE be switching to the backup Provider Tunnel,
> not to a backup PE? If yes, that option already listed in section 3.1.7.2
>
>
>
> Zzh3> No.
>
> Zzh3> Take one step back. When we don’t track PE-CE link status on the
> ingress PE, we only care about the tunnel status. If it is down, we don’t
> use the corresponding PE. There is no “backup tunnel”. There is only a
> “backup upstream PE”.
>
> Zzh3> Now add the PE-CE link to the picture. Even if the tunnel remains up
> but if the PE-CE link is down, we don’t use that upstream PE anymore. From
> the downstream PE’s point of view, there is no difference whether it is the
> tunnel down or upstream PE-CE link down. It should not care.
>
> Zzh3> That’s why I say that the ENTIRE section of 3.1.7 should be replaced
> with my proposed text. No more 3.1.7.1 and 3.1.7.2.
>
GIM5>>  I agree and hopefully the new section 3.1.7 captured your proposal
and addressed this concern. One question though. The text recommends that
the upstream PE notifies the downstream PEs of PE-CE link failure "unless
it switches to a new PE-CE link immediately". As we know, nothing happens
immediately. Should we tie it to the value of bfd.DesiredMinTxInterval for
the p2mp BFD session? One interval or multiplied by bfd.DetectMult? Or
leave it outside of BFD by saying "within the separately defined time
interval"?


>
> Jeffrey
>
>
>
> Juniper Business Use Only
>