RE: Why we consider the method of "label sharing for fast PE protection"

<bruno.decraene@orange.com> Thu, 14 November 2013 10:12 UTC

From: bruno.decraene@orange.com
To: Mingui Zhang <zhangmingui@huawei.com>
Subject: RE: Why we consider the method of "label sharing for fast PE protection"
Thread-Topic: Why we consider the method of "label sharing for fast PE protection"
Thread-Index: AQHO2+geoImqeN4PM02OPNhQaq9T5JohspVQgAJ79ZCAAEGdEA==
Date: Thu, 14 Nov 2013 10:12:21 +0000
Message-ID: <28062_1384423942_5284A206_28062_2935_1_53C29892C857584299CBF5D05346208A070A4BFE@PEXCVZYM11.corporate.adroot.infra.ftgroup>
References: <4552F0907735844E9204A62BBDD325E733660DBA@nkgeml508-mbx.china.huawei.com> <19124_1384271977_52825069_19124_7825_1_53C29892C857584299CBF5D05346208A070A3C8F@PEXCVZYM11.corporate.adroot.infra.ftgroup> <4552F0907735844E9204A62BBDD325E7336623C5@nkgeml508-mbx.china.huawei.com>
In-Reply-To: <4552F0907735844E9204A62BBDD325E7336623C5@nkgeml508-mbx.china.huawei.com>
Accept-Language: fr-FR, en-US
Content-Language: fr-FR
Content-Type: text/plain; charset="us-ascii"
Content-Transfer-Encoding: quoted-printable
MIME-Version: 1.0
Cc: "pwe3@ietf.org" <pwe3@ietf.org>, "l3vpn@ietf.org" <l3vpn@ietf.org>
Precedence: list

Hi Mingui,

Thanks for the answers.
You have added pwe in cc. I'm not sure why as I'm following up on your presentation during L3 VPN WG on L3 VPN PE protection.

My comments inline below [Bruno]

>From: >Mingui Zhang >Sent: Thursday, November 14, 2013 5:57 AM
>
>Hi Bruno,
>
>My comments inline below. [Mingui]
>
>>1)      The document does not describe the scaling impact on the core as it
>>creates, in the core IGP, states on a per CE (customers'site) basis. For large SP,
>>this will represent a significant impact on the IGP and will degrade the IGP
>>convergence time. ( o(number of important IGP prefixes to rewrite) )
>
>[Mingui] Here, I need to point out that the vNH is not created per CE. If
>multiple CEs are connected to an RG, one vNH is enough. When you read the
>first sentence of Section 2.1, you will be clear.
>[Mingui] "A vNH router is created in IGP to represent the set of CEs which
>are dual-homed to the same egress PEs in the Service Provider's backbone".

[Bruno] Agreed. 

>[Mingui] Exactly, it's per RG basis. In a real network, the numbers of RGs
>is comparable to the number of PEs, so it's similar to the per PE basis
>choice (the best choice) in draft-minto.

[Bruno] First, consider that in a "POP" there may be multiple PE, i.e. more than the 2 required for the dual homing. Let's say 5 PE.
The number of RG is flexible, but this also comes with drawbacks.
One option is to have a single RG. In this case, you indeed need a single vNH but you require global label allocation for the perimeter of those 5 PE. Which roughly divide the available label space by 5. Then if this represents a scaling issue, I guess you can call for bigger labels, e.g. by stacking 2 labels. But this requires more lookup work in the forwarding plane. Roughly equivalent to the label context space that you did not want in the first time...
Another option is to have one RG per couple of PE. In this case you need 10 vNH (IINM). That's roughly the number of PE but that's still *3 the number of states in the IGP so I think the document should talk about this.


>>2)  > A master PE is elected in the same way the DR is selected (see
>section 7.3
>>of [RFC2328] <http://tools.ietf.org/html/rfc2328#section-7.3> ), or the DIS
>is
>>selected [ISIS
>><http://tools.ietf.org/html/draft-zhang-l3vpn-label-sharing-01#ref-ISIS> ].
>>I don't think that this is directly applicable. You may need to elaborate
>on this
>>and specify the corner cases (e.g. CE starts and advertise its routes to
>both PE
>>at the same time. Hence both PE believes they are the first one and hence
>are
>>the master.) Also in BGP VPN or IGP there is no advertisement of "Router
>>Priority". I guess that this can always be specified, but this remains to
>be done.
>
>[Mingui] That's a valid point. We did define the "Priority" in the "vNH TLV"
>in Section 2.3.3 of the companion draft (http://tools.ietf.org/html/draft-
>zhang-pwe3-iccp-label-sharing-00). We can change the words to fix this
>inconsistence.

[Bruno] Ack. 

><snip>
>>IMHO additional elaboration may be required as the nominal PE
>>may be the nominal because it has advertised a more specific route. Hence a
>>look up in the VRF will match the more specific route from the
>nominal/failed PE.
>>Not sure how a vanilia PE implementation would react. The issue is that
>when
>>receiving the packet, the backup PE does not know whether the nominal is
>dead
>>or not. In contrast to draft-minto for which the PE know whether it's used
>as
>>backup or not, thanks to the contextual label.
>
>[Mingui] So you propose the scenario that the backup PE directs packets to
>the primary PE since the primary PE advertised a more specific route. This
>makes the primary PE be an inevitable point on all forwarding paths to the
>CE. Eric ever used "BGP LOCAL_PREF" to compose a similar scenario
>(http://www.ietf.org/mail-archive/web/l3vpn/current/msg03691.html).
>
>[Mingui] For this subtle case, the redundant connection is not utilized in
>the first place. If customers prefer resilience to more specific routes, the
>backup PE should not use the primary PE as the next hop.

[Bruno] IMO this is a normal routing behavior which needs to be supported. It doesn't mean that the redundant is not utilized because another more specific prefix could be "attached" to this redundant connection. It probably means that the CE/customer is using TE to better load balance the flows across both connections.

>[Mingui] All in all, if customers really want the resilience, those routes
>via the primary PE SHOULD NOT be installed in the FIBs of backup PEs.

[Bruno] Good point. So you probably should add this to the document.

>[Mingui] By the way, the scenario you designed may suffer from transient
>loop issue. Imagine that the more specific routes fails on the primary PE,
>the primary PE will resort to the backup PE. At that time, it's probably
>that the backup PE is unaware of the failure. It will send packet back to
>the primary PE. Oops, loop occurs. Following the above "SHOULD NOT be
>installed" requirement, such kind of loops can be avoided.

[Bruno] You are right. I suspect that RFC 4364 implies that a PE should not reflect back to the core, a VPN packet received from the core. But I haven't checked.

><snip>
>>More seriously:
>>- Draft minto does not need to define a new protocol (ICCP) in order to
>learn the
>>labels of the backup PE. BGP already provides this. While ICCP is a 79
>pages
>>draft, plus 11 for the extension.
>>- Draft zhang relies on the having the same label on redundant PE. While
>this is
>>doable, this is a significant change compared to the existing deployed
>model.
>>- regarding context label tabel
>>                  - From a standardization standpoint, context-specific
>label
>>space is already defined in RFC 5331, so nothing new.
>>                  - From an implementation standpoint, I agree that this
>>requires some additional resources in the forwarding plane. But draft minto
>>does not require that all PE support this. This can be performed by a
>protector
>>PE if some PE do not support it.
>
>[Mingui] If nothing new for protocols, this becomes an implementation issue.
>Then, should this be considered in IETF? :-)

[Bruno] That's not a local implementation issue as this is a global behavior/feature at the network level. And to allow this behavior we need all implementations to implement this feature.
Note that all IETF documents do not describe change of the bits sent on the wire.
Finally, one protocol changed will be required.

>[Mingui] We do not define ICCP, we make use of it.

[Bruno] idem for context label :-)

> If we can have a solution happen in purely the control plane, as IETFers, we should be happy. 

[Bruno] Sure. But the solution relies on global VPN labels which is not what is deployed today. So it all depends on whether you will be able to deploy such change before. Which will requires all implementations to do it while draft minto has no such dependency.

> If customers want this feature, I promise you will see this feature supported
>by products in several months.

[Bruno] Ok. But I guess you mean a subset of the products currently deployed. There may be multiple vendors/implementations in a given network. And some implementations are sometimes even considered as legacy (by the vendor).

> For vendors who have already implemented
>ICCP, it can be faster. More importantly, this is a "once for all" effort
>compared to the redundant forwarding tables.
>
>[Mingui] As Robert and I discussed previously, existing deploy model does
>not prohibit PE to advertise the same label. 

[Bruno] Agreed.

> I don't think this changes the existing deploy model.

[Bruno] I think it does.


>>So label table need not be stored repeatedly on RG members.
>>
>>#Bruno: agreed.
>>
>>
>>2. Also, it's easy to be deployed. It does not bring any change to P
>routers
>>(control plane & data plane). It even does not change the data plane of PE
>>routers.
>>
>>#Bruno: It's not easy to deploy as it mandates to update all PE routers,
>both
>>nominal and backup. While with draft-minto-2547-egress-node-fast-protection
>><http://tools.ietf.org/html/draft-minto-2547-egress-node-fast-protection-
>02>
>>the deployment of a single protector PE can fast protect all PE of the AS
>with no
>>change on existing PE.
>
>[Mingui] As I explained above, the update to the control plane should be OK.
>
>[Mingui] OK. Let me give an analysis of the draft-minto-2547-egress-node-
>fast-protection. I hope this is constructive when the draft is updated.
>
>[Mingui] a.	The co-located and hybrid models require configuration, which is
>error prone and increases a management burden to operators.

#Bruno: You need to configure the pairs of PE. But this is a one time configuration on a per PE basis when the PE is provisioned. I don't think per VPN configuration is required as by listening to BGP routes, the backup PE can discover that it needs to back up the nominal one. Your proposition probably need the same configuration effort (configuring RG and vNH)

>[Mingui] b.	The centralized model is not robust. How do you handle the
>failure of the centralized protector itself? This problem had been proposed
>for the "central controller" stuff.

#Bruno: If one wants to protect the protection one may use a second (redundant) protector. 
However, as the protector is expected to be in use only few seconds per day at most, I'm not sure people would consider such specific case of double failure.

>[Mingui] c.	The delegation of protector is topology dependent. I guess the
>number of possible alternative paths held by the protector is a much smaller
>set of all possible alternative paths.

#Bruno: Indeed, currently the location of the protector is topology independent. Simply because this relies on the existing FRR mechanism used in the core and this FRR may have coverage limitations (e.g. LFA). There are plans to remove this limitation.
I'm not sure to understand your second point (second sentence).

>[Mingui] d.	The centralized protector has to manage the context label
>assignment and maintain multiple forwarding tables for all RGs. I am afraid
>this model is not scalable.

#Bruno: Just like a PE does (maintain multiple forwarding tables) so I think this is just as scalable as 4364 VPNs are. Besides, we are talking about dedicated protector PE so it can be choosen to scale in that direction (just like for VPN BGP RR)

>[Mingui] e.	There is a brief description on Fast ReRoute in Section 5.3. For
>the "RSVP" choice, it depends on another individual draft
>[http://tools.ietf.org/html/draft-minto-2547-egress-node-fast-protection-
>02]. Does this draft mandates update on P and PE routers? For the "LDP"
>choice, it's not clear how IGP can find alternative paths (I find the
>"beyond the scope" words.). I think the coverage of alternative paths cannot
>be very good, especially when the centralized protector is used.

#Bruno: For LDP, cf 2 points above.

>>3. In addition, it does not bear the restriction of "no penultimate-hop-
>popping".
>>
>>#Bruno: ultimate hop poping is only required for the label associated with
>the
>>context table of the backup PE. This is required to identify the context-
>specific
>>label space. This is not an additional argument compared to 1 (context
>label
>>table).
>
>[Mingui] I agree.
>
>Thanks,
>Mingui

_________________________________________________________________________________________________________________________

Ce message et ses pieces jointes peuvent contenir des informations confidentielles ou privilegiees et ne doivent donc
pas etre diffuses, exploites ou copies sans autorisation. Si vous avez recu ce message par erreur, veuillez le signaler
a l'expediteur et le detruire ainsi que les pieces jointes. Les messages electroniques etant susceptibles d'alteration,
Orange decline toute responsabilite si ce message a ete altere, deforme ou falsifie. Merci.

This message and its attachments may contain confidential or privileged information that may be protected by law;
they should not be distributed, used or copied without authorisation.
If you have received this email in error, please notify the sender and delete this message and its attachments.
As emails may be altered, Orange is not liable for messages that have been modified, changed or falsified.
Thank you.

Why we consider the method of "label sharing for … Mingui Zhang
RE: Why we consider the method of "label sharing … Jakob Heitz
RE: Why we consider the method of "label sharing … Mingui Zhang
Re: Why we consider the method of "label sharing … Robert Raszuk
RE: Why we consider the method of "label sharing … Mingui Zhang
RE: Why we consider the method of "label sharing … bruno.decraene
RE: Why we consider the method of "label sharing … Mingui Zhang
RE: Why we consider the method of "label sharing … bruno.decraene
Re: Why we consider the method of "label sharing … Stewart Bryant
RE: Why we consider the method of "label sharing … Mingui Zhang
RE: Why we consider the method of "label sharing … Mingui Zhang
RE: Why we consider the method of "label sharing … Eric Osborne (eosborne)
答复: Why we consider the method of "label sharing … Lizhenbin
RE: Why we consider the method of "label sharing … UTTARO, JAMES
RE: Why we consider the method of "label sharing … UTTARO, JAMES
Re: Why we consider the method of "label sharing … Jakob Heitz
RE: Why we consider the method of "label sharing … Eric Osborne (eosborne)
RE: Why we consider the method of "label sharing … UTTARO, JAMES
Re: Why we consider the method of "label sharing … Jakob Heitz
Re: Why we consider the method of "label sharing … Stewart Bryant
RE: Why we consider the method of "label sharing … UTTARO, JAMES
Re: Why we consider the method of "label sharing … Robert Raszuk
RE: Why we consider the method of "label sharing … Mingui Zhang
RE: Why we consider the method of "label sharing … Mingui Zhang
Re: Why we consider the method of "label sharing … Robert Raszuk
Re: Why we consider the method of "label sharing … Jeff Tantsura
RE: Why we consider the method of "label sharing … Mingui Zhang
答复: Why we consider the method of "label sharing … Lizhenbin
答复: Why we consider the method of "label sharing … Lizhenbin
答复: Why we consider the method of "label sharing … Lizhenbin
RE: Why we consider the method of "label sharing … Jakob Heitz
RE: Why we consider the method of "label sharing … UTTARO, JAMES
Re: Why we consider the method of "label sharing … Stewart Bryant
Re: 答复: Why we consider the method of "label shar… Stewart Bryant
RE: Why we consider the method of "label sharing … Mingui Zhang
RE: Why we consider the method of "label sharing … Mingui Zhang
Re: Why we consider the method of "label sharing … Jakob Heitz
RE: Why we consider the method of "label sharing … Mingui Zhang
Re: Why we consider the method of "label sharing … Stewart Bryant
Re: Why we consider the method of "label sharing … Robert Raszuk
RE: Why we consider the method of "label sharing … Eric Gray
Re: Why we consider the method of "label sharing … Robert Raszuk
RE: Why we consider the method of "label sharing … Eric Gray
Re: Why we consider the method of "label sharing … Robert Raszuk
Re: Why we consider the method of "label sharing … Jakob Heitz
Re: Why we consider the method of "label sharing … Robert Raszuk
RE: Why we consider the method of "label sharing … Eric Gray
Re: Why we consider the method of "label sharing … Robert Raszuk
RE: Why we consider the method of "label sharing … bruno.decraene
RE: Why we consider the method of "label sharing … Zhoupeng (Jewpon)
RE: Why we consider the method of "label sharing … UTTARO, JAMES
RE: Why we consider the method of "label sharing … Mingui Zhang
Re: Why we consider the method of "label sharing … Jakob Heitz
RE: Why we consider the method of "label sharing … Mingui Zhang
RE: Why we consider the method of "label sharing … bruno.decraene
RE: Why we consider the method of "label sharing … Zhoupeng (Jewpon)
RE: Why we consider the method of "label sharing … UTTARO, JAMES
Re: Why we consider the method of "label sharing … Jakob Heitz
Re: Why we consider the method of "label sharing … Jeff Tantsura
RE: Why we consider the method of "label sharing … Mingui Zhang