Re: [Pce] draft-ietf-pce-pce-initiated-lsp

Jonathan Hardwick <Jonathan.Hardwick@metaswitch.com> Tue, 28 February 2017 10:02 UTC

Return-Path: <Jonathan.Hardwick@metaswitch.com>
X-Original-To: pce@ietfa.amsl.com
Delivered-To: pce@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id C26521294F2 for <pce@ietfa.amsl.com>; Tue, 28 Feb 2017 02:02:59 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.812
X-Spam-Level:
X-Spam-Status: No, score=-1.812 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, RCVD_IN_DNSWL_NONE=-0.0001, RCVD_IN_MSPIKE_H4=-0.01, RCVD_IN_MSPIKE_WL=-0.01, SPF_HELO_PASS=-0.001, SPF_PASS=-0.001, T_DKIM_INVALID=0.01] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=fail (1024-bit key) reason="fail (body has been altered)" header.d=metaswitch.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id sh5_bnthlAih for <pce@ietfa.amsl.com>; Tue, 28 Feb 2017 02:02:57 -0800 (PST)
Received: from NAM03-DM3-obe.outbound.protection.outlook.com (mail-dm3nam03on0128.outbound.protection.outlook.com [104.47.41.128]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-SHA384 (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id E194E129449 for <pce@ietf.org>; Tue, 28 Feb 2017 02:02:56 -0800 (PST)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=metaswitch.com; s=selector1; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version; bh=NWPyh3di4ParfRXoHNQ0L15KpPz+q8rKllTv7z5fZaI=; b=WJms/QUCm81gcIanVeMBwWR/+/UWqcL6ZViaTNhZiYIbjpJa3ZKD11KQXtWZeTBoPCasJrVEoQVjBQ4Sw/bOrvWmhzd5xXS0rxRbAoOhQ3UFG/U36IBEDQasAhI5tmh8JN01OU0gk3j2iE4g8+bA9fiei99/5sLH6vQyiG7zdeU=
Received: from BY2PR0201MB1910.namprd02.prod.outlook.com (10.163.75.152) by BY2PR0201MB1909.namprd02.prod.outlook.com (10.163.75.151) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_CBC_SHA384_P384) id 15.1.933.12; Tue, 28 Feb 2017 10:02:54 +0000
Received: from BY2PR0201MB1910.namprd02.prod.outlook.com ([10.163.75.152]) by BY2PR0201MB1910.namprd02.prod.outlook.com ([10.163.75.152]) with mapi id 15.01.0933.019; Tue, 28 Feb 2017 10:02:54 +0000
From: Jonathan Hardwick <Jonathan.Hardwick@metaswitch.com>
To: Julien Meuric <julien.meuric@orange.com>, Aaron Itskovich <aitskovi@cisco.com>, Cyril Margaria <cmargaria@juniper.net>, Robert Varga <nite@hq.sk>
Thread-Topic: [Pce] draft-ietf-pce-pce-initiated-lsp
Thread-Index: AQHSZsaLoAs2ttFXEki3plxhYOOUL6EqEZnQgAA1EQCABea2MIAAK5eAgE4q41A=
Date: Tue, 28 Feb 2017 10:02:54 +0000
Message-ID: <BY2PR0201MB19106B133D0FDC5EBB67A4A984560@BY2PR0201MB1910.namprd02.prod.outlook.com>
References: <f632749c-0f2c-b30b-6013-bc05725361a3@cisco.com> <BY2PR0201MB19106FFF4425C7A04EFE3F7A84600@BY2PR0201MB1910.namprd02.prod.outlook.com> <6abff8b0-3b68-c17c-dfaf-3e636292ed3b@cisco.com> <BY2PR0201MB191005DB14CD2F18759AB68084640@BY2PR0201MB1910.namprd02.prod.outlook.com> <c8b5e7a4-fdcc-1cf2-6f7f-f8ca74ac7777@orange.com>
In-Reply-To: <c8b5e7a4-fdcc-1cf2-6f7f-f8ca74ac7777@orange.com>
Accept-Language: en-GB, en-US
Content-Language: en-US
X-MS-Has-Attach: yes
X-MS-TNEF-Correlator:
authentication-results: spf=none (sender IP is ) smtp.mailfrom=Jonathan.Hardwick@metaswitch.com;
x-originating-ip: [86.132.75.121]
x-ms-office365-filtering-correlation-id: 55950de8-f99b-40fa-5854-08d45fc0f6c2
x-microsoft-antispam: UriScan:; BCL:0; PCL:0; RULEID:(22001); SRVR:BY2PR0201MB1909;
x-microsoft-exchange-diagnostics: 1; BY2PR0201MB1909; 7:n9vftp7EYhbSV9qi6ZnLDmXqMBumV7n9TQOGLqYYv37uNfjvCUhvxyMlKsQXmWJ6gVM1z4GZTfTw5JosIkRjsUZpctvfOru7xEdkwInDlr3CYIsbQowDpBSEmYARU8uEDeOR41B4IBGkpHlM6lIRykdU1ZXPIZ4D4CN8EoZn9e8eABnr6ltsUrtX8J34YmxhR3MsqFk986ai8b1emEJOPyhcYvOUVvnsXABOlmUuZFOJgUlxkFd8jEJqy6XCiwmkDr80RZxyHAnDSYDCUKoV7a/zOkPnZJC/i9SogJ821aVmJopWYr1NYcA+BoTH1d68QompBQhHtiC18lDqzV0AAw==
x-microsoft-antispam-prvs: <BY2PR0201MB19097E3267B4BC4365E1085A84560@BY2PR0201MB1909.namprd02.prod.outlook.com>
x-exchange-antispam-report-test: UriScan:(60795455431006)(158342451672863)(138986009662008)(95692535739014)(18271650672692);
x-exchange-antispam-report-cfa-test: BCL:0; PCL:0; RULEID:(102415395)(6040375)(601004)(2401047)(8121501046)(5005006)(3002001)(10201501046)(6041248)(20161123558025)(20161123564025)(20161123562025)(20161123555025)(20161123560025)(6072148); SRVR:BY2PR0201MB1909; BCL:0; PCL:0; RULEID:; SRVR:BY2PR0201MB1909;
x-forefront-prvs: 0232B30BBC
x-forefront-antispam-report: SFV:NSPM; SFS:(10019020)(6009001)(7916002)(39450400003)(199003)(13464003)(24454002)(51914003)(189002)(377454003)(76104003)(377424004)(53754006)(3280700002)(6436002)(99286003)(8666007)(4326007)(86362001)(7736002)(106356001)(53936002)(74316002)(6506006)(66066001)(81166006)(106116001)(2900100001)(561944003)(2950100002)(6246003)(81156014)(9686003)(6306002)(55016002)(2906002)(3660700001)(305945005)(1941001)(25786008)(92566002)(8936002)(53546006)(8676002)(5890100001)(38730400002)(50986999)(76176999)(54356999)(68736007)(99936001)(101416001)(77096006)(122556002)(93886004)(229853002)(6116002)(105586002)(102836003)(3846002)(230783001)(33656002)(5660300001)(7696004)(97736004)(189998001); DIR:OUT; SFP:1102; SCL:1; SRVR:BY2PR0201MB1909; H:BY2PR0201MB1910.namprd02.prod.outlook.com; FPR:; SPF:None; PTR:InfoNoRecords; A:1; MX:1; LANG:en;
received-spf: None (protection.outlook.com: metaswitch.com does not designate permitted sender hosts)
spamdiagnosticoutput: 1:99
spamdiagnosticmetadata: NSPM
Content-Type: multipart/mixed; boundary="_002_BY2PR0201MB19106B133D0FDC5EBB67A4A984560BY2PR0201MB1910_"
MIME-Version: 1.0
X-OriginatorOrg: metaswitch.com
X-MS-Exchange-CrossTenant-originalarrivaltime: 28 Feb 2017 10:02:54.1873 (UTC)
X-MS-Exchange-CrossTenant-fromentityheader: Hosted
X-MS-Exchange-CrossTenant-id: 9d9e56eb-f613-4ddb-b27b-bfcdf14b2cdb
X-MS-Exchange-Transport-CrossTenantHeadersStamped: BY2PR0201MB1909
Archived-At: <https://mailarchive.ietf.org/arch/msg/pce/3m8NjDnIwZeM--2fki42ml68So4>
Cc: "pce@ietf.org" <pce@ietf.org>
Subject: Re: [Pce] draft-ietf-pce-pce-initiated-lsp
X-BeenThere: pce@ietf.org
X-Mailman-Version: 2.1.17
Precedence: list
List-Id: Path Computation Element <pce.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/pce>, <mailto:pce-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/pce/>
List-Post: <mailto:pce@ietf.org>
List-Help: <mailto:pce-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/pce>, <mailto:pce-request@ietf.org?subject=subscribe>
X-List-Received-Date: Tue, 28 Feb 2017 10:03:00 -0000

Hi all

I would like to get this thread closed down ASAP.  Let me summarize where I think we have got to.
1. Robert has clarified the allowed PCC behaviour in the attached email.
2. I agree that Aaron's proposal would be useful to some implementations but I don't see it as essential.
3. Julien (who is shepherding this document) has reminded us that the timeline is such that we need to keep feature extensions for another document.

Given this, does anyone want to propose a change to draft-ietf-pce-pce-initiated-lsp?  If so, please reply ASAP with your suggested text (copying the PCE list), so we can review it.

Best regards
Jon

-----Original Message-----
From: Julien Meuric [mailto:julien.meuric@orange.com] 
Sent: 09 January 2017 16:13
To: Jonathan Hardwick <Jonathan.Hardwick@metaswitch.com>; Aaron Itskovich <aitskovi@cisco.com>; pce@ietf.org; Cyril Margaria <cmargaria@juniper.net>
Subject: Re: [Pce] draft-ietf-pce-pce-initiated-lsp

Hi,

This topic was tackled a while ago in the context of
draft-ietf-pce-pce-initiated-lsp:
https://mailarchive.ietf.org/arch/msg/pce/wn4gGwZnTZS53pbyg1eCHw3YMVE

It looks like the I-D would benefit from clarification on this matter before sending it to the IESG. Authors, what to do you think?

Aaron (Cyril?), while keeping feature extensions for another document (cf. Dhruv's proposal), would you have some clarification text to propose here?

Thanks,

Julien


Jan. 09, 2017 - Jonathan.Hardwick@metaswitch.com:
> Hi Aaron
> 
> Thanks.  So actually, I need to backtrack and agree that there is a mechanism defined for a backup PCE to request that an orphaned LSP is delegated to it - I had missed that.
> 
> I am not sure that this implies that the PCC cannot re-delegate of its own free will.  It would be useful to have that clarified in the draft.
> 
> However, given this mechanism exists, it does raise the question about how PCEs should properly cooperate in terms of how orphaned LSPs are detected and who should request ownership.  I know that there is work ongoing in the following draft which discusses co-operation between redundant PCEs.  I am not sure whether your idea is best discussed in the context of that draft or in draft-ietf-pce-pce-initiated-lsp.  Perhaps the authors could chime in?
> https://tools.ietf.org/id/draft-litkowski-pce-state-sync-00.txt
> 
> Best regards
> Jon
> 
> 
> -----Original Message-----
> From: Aaron Itskovich [mailto:aitskovi@cisco.com]
> Sent: 05 January 2017 19:30
> 
> Hi Jon,
> 
> Thanks for your reply.
> My interpretation of the following section of the existing draft-ietf-pce-pce-initiated-lsp is that PCC cannot just re-delegate PCE initiated LSP to a PCE.
> 
> <snip>
> 
>     In case of PCEP session failure, control over PCE-initiated LSPs
>     reverts to the PCC at the expiration of the redelegation timeout.  At
>     this point, the LSP is an "orphan" until the expiration of the State
>     Timeout timer.  To obtain control of a PCE-initiated LSP, a PCE
>     (either the original or one of its backups) sends a PCInitiate
>     message, including just the SRP and LSP objects, and carrying the
>     PLSP-ID of the LSP it wants to take control of.  Receipt of a
>     PCInitiate message with a non-zero PLSP-ID normally results in the
>     generation of a PCErr.  If the LSP is an orphan, the PCC MUST NOT
>     generate an error and MUST redelegate the LSP to the PCE.  The State
>     Timeout timer for the LSP is stopped upon the re-delegation.
> </snip>
> 
> I read it as PCC is not free to delegate PCE initiated LSP even in orphan state, it rather waits (using State Timer) for other PCE server to "adopt" such "orphan" LSP.
> This behavior is different for the PCE initiated LSPs vs PCC initiated LSPs.
> Hence my suggestion was a mechanism that will help redundant PCE servers to discover PCE initiated LSPs that enter "orphan" state and trigger "adoption"
> process which is different then the just re-delegation.
> 
> Thanks
> Aaron
> 
> 
> On 2017-01-05 11:25 AM, Jonathan Hardwick wrote:
>> Hi Aaron
>>
>> Thanks for the comment.  There are no procedures defined that would allow a PCE to initiate a take-over of an LSP from a PCC.  Rather, the PCC must delegate the LSP to an appropriate PCE (if the LSP is initiated by a PCE then it must initially be delegated to that PCE).  The cases you mention would be resolved by the PCC making a local decision whether or not to re-delegate the LSP to an alternative PCE.
>>
>> So, I am not sure that the "orphan" flag would be useful to the PCE, at least not for the purpose that you describe.
>>
>> Best regards
>> Jon
>>
>>
>> -----Original Message-----
>> From: Pce [mailto:pce-bounces@ietf.org] On Behalf Of Aaron Itskovich
>> Sent: 04 January 2017 20:08
>> To: pce@ietf.org
>> Subject: Re: [Pce] draft-ietf-pce-pce-initiated-lsp
>>
>>
>> I suggest to add indication that PCE Initiated LSP is orphan to LSP object in PCReport message. This information will be useful in facilitating takeover of an orphan LSP by redundant PCE server.
>>
>> Option 1) Adding bit to the LSP flags (L-bit "parentless/orphan") that will be set in the LSP object of the PCReport message sent when PCE Initiated LSP becomes orphan as result of connection loss to the PCE server that initiated this LSP or because such PCE server returns LSP delegation to the PCC to initiate transfer of LSP "ownership" to another PCE server.
>>
>> Option 2) Adding bit to the LSP flags (D2-bit ) that is set when such LSP is delegated to any PCE server. In this option PCE receiving report will be able to derive when PCE Initiated LSP is in orphan state when C flag in the report is set and the new D2-bit is not set.
>>
>>
>> This proposal (regardless which option 1 or 2 is used ) alows PCC to propagate "orphan" state for the PCE initiated LSP to all PCE servers.
>> It will help to trigger takeover of a PCE initiated LSP. The existing alternative is to rely on the PCE server to server communication for detection of such events which is prone to errors.
>>
>> For example:
>>
>>       - loss of communication between PCE-A and PCE-B may be interpreted as "takeover" trigger which is not necessarily true as PCE-A<->PCC connection may still be up.
>>       - In case where PCE-A <-> PCC connection is down and both PCC 
>> <-> PCE-B and  PCE-A <-> PCE-B connections are up we will need each 
>> PCE server to distribute information about connection with it's 
>> clients to all peers
>>
>> The suggestion frees PCE server from the need to propagate it's client connection status to all other PCE servers.
>>
>> Let me know what you think
>>
>> Thanks
>> Aaron
>>
>> _______________________________________________
>> Pce mailing list
>> Pce@ietf.org
>> https://www.ietf.org/mailman/listinfo/pce
> 
> 
--- Begin Message ---
On 01/09/2017 02:45 PM, Jonathan Hardwick wrote:
> Hi Aaron

Hello Jon,

> Thanks.  So actually, I need to backtrack and agree that there is a mechanism defined for a backup PCE to request that an orphaned LSP is delegated to it - I had missed that.
> 
> I am not sure that this implies that the PCC cannot re-delegate of its own free will.  It would be useful to have that clarified in the draft.

Note that Redelegation Timeout Interval is a PCC-local value, which the
PCC can alter dynamically based on its internal state at any moment (as
per pce-stateful-pce section 2). pce-stateful-pce does not change the
fact that the PCC is in control over an LSP's state.

With pce-pce-initiated-lsp the set of options available to a PCC with
respect to having a say in LSP's shape, form and existence is limited
and involves trade-offs, but at the end of the day PCC can always assume
control of an LSP.

Specifically it can (without implying it should) legally execute the
following sequence based solely on its local policy:
1) terminate the PCEP session with the primary PCE
2) reduce Redelegation Timeout Interval to 0, effectively expiring it
3) delegate the (now orphan) LSP to a currently-connected backup PCE
4) restore Redelegation Timeout Internal back to normal

Step 3 can alternatively involve reducing State Timeout Interval, at
which point the PCC is in complete control of the LSP (can delete it or
revert it to operator-defined default per section 2).

> However, given this mechanism exists, it does raise the question about how PCEs should properly cooperate in terms of how orphaned LSPs are detected and who should request ownership.

pce-stateful-pce was designed with PCC being central authority over LSP
state and how it relates to connected PCEs.

With pce-pce-initiated-lsp there are multiple options how a set of
redundant PCEs can be realized. It may also involve a lot complex
implementation-specific trade-offs, for example in load-balancing, hence
we chose to leave inter-PCE arbitration out of scope.

As outlined above, a PCC implementation with a sufficiently smart local
policy can still act as a coordinator, although quite a limited one.

Regards,
Robert

--- End Message ---