Re: [Mpls-interop] Local and Remote

"Sprecher, Nurit (NSN - IL/Hod HaSharon)" <nurit.sprecher@nsn.com> Wed, 06 May 2009 19:16 UTC

Return-Path: <nurit.sprecher@nsn.com>
X-Original-To: mpls-interop@core3.amsl.com
Delivered-To: mpls-interop@core3.amsl.com
Received: from localhost (localhost [127.0.0.1]) by core3.amsl.com (Postfix) with ESMTP id C624F3A6FC8 for <mpls-interop@core3.amsl.com>; Wed, 6 May 2009 12:16:41 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -3.377
X-Spam-Level:
X-Spam-Status: No, score=-3.377 tagged_above=-999 required=5 tests=[AWL=-0.779, BAYES_00=-2.599, HTML_MESSAGE=0.001]
Received: from mail.ietf.org ([64.170.98.32]) by localhost (core3.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id uggjDvnqerEn for <mpls-interop@core3.amsl.com>; Wed, 6 May 2009 12:16:33 -0700 (PDT)
Received: from demumfd002.nsn-inter.net (demumfd002.nsn-inter.net [217.115.75.234]) by core3.amsl.com (Postfix) with ESMTP id 47DB028C2D1 for <mpls-interop@ietf.org>; Wed, 6 May 2009 12:02:12 -0700 (PDT)
Received: from demuprx016.emea.nsn-intra.net ([10.150.129.55]) by demumfd002.nsn-inter.net (8.12.11.20060308/8.12.11) with ESMTP id n46J3bRO021126 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=FAIL); Wed, 6 May 2009 21:03:37 +0200
Received: from demuexc022.nsn-intra.net (demuexc022.nsn-intra.net [10.150.128.35]) by demuprx016.emea.nsn-intra.net (8.12.11.20060308/8.12.11) with ESMTP id n46J3XdH019085; Wed, 6 May 2009 21:03:37 +0200
Received: from DEMUEXC014.nsn-intra.net ([10.150.128.25]) by demuexc022.nsn-intra.net with Microsoft SMTPSVC(6.0.3790.3959); Wed, 6 May 2009 21:03:36 +0200
Content-class: urn:content-classes:message
MIME-Version: 1.0
Content-Type: multipart/alternative; boundary="----_=_NextPart_001_01C9CE7D.5B9CF673"
X-MimeOLE: Produced By Microsoft Exchange V6.5
Date: Wed, 06 May 2009 21:03:36 +0200
Message-ID: <077E41CFFD002C4CAB7DFA4386A53264AB53B0@DEMUEXC014.nsn-intra.net>
In-Reply-To: <A7C5BA8C5C9C4608B44C34BD9719FE62@your029b8cecfe>
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
Thread-Topic: [Mpls-interop] Local and Remote
Thread-Index: AcnOVZGR05AMrqRSSy6n5wqVkQV2BAAF6GNw
References: <077E41CFFD002C4CAB7DFA4386A53264AB4B6B@DEMUEXC014.nsn-intra.net><4A0157B1.5050408@chello.nl> <73632CCC731C496AACD41C4DEA306288@your029b8cecfe> <077E41CFFD002C4CAB7DFA4386A53264AB5129@DEMUEXC014.nsn-intra.net> <A7C5BA8C5C9C4608B44C34BD9719FE62@your029b8cecfe>
From: "Sprecher, Nurit (NSN - IL/Hod HaSharon)" <nurit.sprecher@nsn.com>
To: Adrian Farrel <adrian@olddog.co.uk>
X-OriginalArrivalTime: 06 May 2009 19:03:36.0379 (UTC) FILETIME=[5BD688B0:01C9CE7D]
Cc: mpls-interop@ietf.org
Subject: Re: [Mpls-interop] Local and Remote
X-BeenThere: mpls-interop@ietf.org
X-Mailman-Version: 2.1.9
Precedence: list
List-Id: IETF MPLS Interoperability Design Team <mpls-interop.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/listinfo/mpls-interop>, <mailto:mpls-interop-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/mpls-interop>
List-Post: <mailto:mpls-interop@ietf.org>
List-Help: <mailto:mpls-interop-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/mpls-interop>, <mailto:mpls-interop-request@ietf.org?subject=subscribe>
X-List-Received-Date: Wed, 06 May 2009 19:16:42 -0000

Hi,

Thanks you Adrian for the great explanation. It really clarifies many of
the issues very well. 

I can agree to your definitions. And it is good to finally have them. 

Please see inline some comments.

The main questions that I have now are as follows:

*         Why do we need to specify in the OAM document a requirement to
distinguish between local and remote faults? Does it require any tool
that is not already required in the document? Does it require any
protocol definition and interoperability between network elements? We
have a requirement to detect loss of continuity (in each layer). We have
a requirement to send indication of a fault detected by the peer. We
have a requirement for connectivity verification, etc. what do we miss? 

*         If we have a requirement to distinguish between remote and
local alarms, don't we need a requirement to distinguish between remote
alarms happen at the same layer network and remote alarms happen at the
server layers? This is actually what is needed if we want the endpoint
of the LSPs for example to have the capability to suppress alarms that
result form a remote defect at the server layer. We would probably not
like to suppress alarms that result at a specific LSP for example as a
result of mis-connectivity in a specific node along the path. Consider
an LSP that transmit over A-B-C-D-E-F-G and there is a failure at the
server link D-E. The endpoints of the server layer are D and E. Consider
that D notifies its client layer of the fault, the fault becomes local
to the client layer at node D. In order to be able suppress alarms at
node A for all LSPs which all LSPs that start at node A and transmit
over the failed server layer, node A needs to identify that this is a
remote failure. But this is not enough to suppress alarms, because maybe
the fault is a result of mis-connectivity and not a failure of the
server layer. Fault localization is required at the same layer and since
the failure is now considered in node D as a local failure also at the
LSP layer (as the server layer notified it about the failure), how can
we distinguish between the cases given the list of functions we require?

Again I am sorry for all these exchanges, but I believe that if we do
not resolve it now, and if we try to "color" the real requirement, we
will get into these discussions in last call when we get comments from
the ITU-T (and I'll not be the one that will issue them)..  

Best regards,

Nurit

 

-----Original Message-----
From: ext Adrian Farrel [mailto:adrian@olddog.co.uk] 
Sent: Wednesday, May 06, 2009 5:18 PM
To: Sprecher, Nurit (NSN - IL/Hod HaSharon); mpls-interop@ietf.org
Subject: Re: [Mpls-interop] Local and Remote

 

Hi,

 

> * We have a requirement in the document for "Remote Defect

> Indication" and it specifies that "The MPLS-TP OAM toolset

> MUST provide a function to enable an End Point to notify its

> associated End Point of the detection of a fault or defect that

> it detects on a PW, LSP or Section between them." I.e. in the

> requirement refers to remote as to the associated endpoint at

> the same layer!

 

Why the exclamation point?

<NS> Because we call the function REMOTE fault indication but ask for a
notification only for faults that are detected by the peer endpoint at
the same layer. maybe we should change the name of the function to
reflect better the specific functionality that is required. 

 

The associated end-point at the same layer is certainly not local. Not
local 

= remote.

<NS> Right. But in this case the remote is in the same layer. 

 

The defect indication comes from somewhere else (not local). That means
it 

is a remote indication.

<NS> Correct

 

> *  I would like to refer to the last paragraph in your e-mail:

>> When a server layer provides a connection that is used as

>> a link in the client layer, a server layer fault (that is remote

>> in the server layer) may be reported to the client layer as

>> a fault in the client layer link (that is local in the client

>> layer)

> Assuming that we have an LSP that traversed nodes

> A-B-C-D-E-F-G and we have a fault in the link

> between D and E. According to the above the server

> layer needs to notify the client layer.

 

Well, I said "may," but I agree it would be useful.

 

> Is it to the client layer in nodes D and E or to the

> endpoints of the client layers, i.e. nodes A and G?

 

To D and E.

The server layer is only providing connectivity between D and E.

<NS> Right. BUT if you refer to the e-mails from Huub and to the
discussions with Malcolm last night they want the endpoints of the
client (e.g. LSPs, PWs) to identify failures that happened remotely at
the server layer! How can the endpoints know about such failures if
there is no requirement for someone to notify it about such a failure? 

 

It has (and should have) no knowledge of the client services carried
over 

the connection.

<NS> Correct. This is exactly the problem I am referring (or at least
try to refer to). 

As far as the server layer is concerned, it provides a connection from D
to 

E.

<NS> Correct. 

The client layer decides to use this connection as the link D-E in the 

client layer, and decides to route the LSP AG over that link.

 

NS> to be more specific, in transport it is the management station or
the Ingress node which calculate the route and make this decision. 

 

> If it is node D and E (this is local) they need now to notify

> nodes A and G

 

First question is "why?"

 

<NS> Because this is the requirement that was written in the revised
document (at least I read it) and that was specified in the mail
exchanges with Huub and Malcolm (at least as I understood it). I do not
remember the exact words but it was something like that the endpoints of
LSP, PW MUST indicate if the fault causing loss of continuity is local
(i.e. within the node containing the end point) or remote. How can the
endpoints know about REMOTE failures if no one notifies them about it? 

 

It may help to think of the client layer network as a single layer
network 

and resolve the issues there first.

 

Let us suppose that the fault was in the client layer at node D. How
would A 

and G find out about the fault?

 

<NS> Exactly. That is the point. If they need to know about it as stated
in the requirement above, then someone needs to notify them about remote
failures. 

 

Now suppose that the fault is in the client layer link D-E. How would A
and 

G find out about the fault?

 

Now we can come back to your question. The fault is in the server layer 

between D and E. It is reported to D and E. D and E map this to a client


layer fault on the link D-E. How would A and G find out about the fault?

 

<NS> This is exactly the issue I have with the requirement that was
specified in the revised document. If the endpoints of an LSP need to
distinguish between local and remote faults, someone needs to notify
them about it. Otherwise they can not know it. 

 

> but according to the requirement document

> and to the framework they may be MIPs of the

> LSPs but they cannot initiate OAM messages. Therefore, it seems

> that the endpoint of the server layer needs to notify the endpoint

> of the client layers of the fault (remote). And IMO we need to be

> clear about the requirement. And also to extend the definition in

> the first bullet for remote (not necessarily endpoint).

 

The answer is that the end points of the link (in the client layer) or
the 

connection (in the server layer) may be MEPs. When they detect faults
they 

raise alarms to their management systems.

 

When the fault is detected by the end points (as surely it will be) they
can 

perform fault localisation to isolate the fault. That is, the MEPs can 

consult the MIPs and (since the MIPs are allowed to respond) determine
where 

the fault is.

 

<NS> > All this is correct. But in order to understand that this is a
remote failure you do not need to know the location of the fault. You
just need to know that it is a remote failure. In such a case all LSPs
need to consult with their immediate MIPS to understand be able to
distinguish between the faults. 

Why do we need at all to specify a requirement for the endpoints to
distinguish between local and remote failures? Does it require any
interoperation between network elements? 

 

If, in the opinion of the deployer, the end points of an LSP need to
know 

about the location of a fault at detection time (I have serious doubts
about 

this) then he can designate every node along the path as a MEP.

<NS> According to the definition, a MEP is an endpoint of LSP, or PW. If
you designate any node along a path as a MEP you have many concatenated
LSPs.  

 

 

> * Assuming that the failure is in the first (server) link. As a

> result we have loss of continuity in the LSPs that transmit over this

> failed link and in the PWs that are attached to these LSPs. As I

> understand from your definition, as these fault are detected by the

> endpoint (to which the link is attached), all these faults are

> considered locals!

 

No.

Please read again.

 

The fault is local if the fault exists in a resource that is local (in
the 

layer that is detecting the fault).

<NS> OK. 

In your example, G detects a fault in the LSP, but G does not defect a
fault 

in the link FG or at the node G. Therefore the fault is not local.

 

Still in your example, E detects a fault in the server connection DE
(say 

DxyzE).

- If the fault is in E or in the link zE, the fault is local in the
server 

layer

- If the fault is not in E or in the link zE, the fault is not local in
the 

server layer

In *both* cases, once the fault has been reported to the client layer,
the 

fault is local in the client layer (it is a failure of the link DE).

<NS> OK

 

May help you to note that local and remote are relative terms. I.e.
"local 

to E" is not the same as "local to G".

 

> Is there still a requirement to distinguish between

> these failures? Note that all of these are now local.

 

Yes, there is a requirement to distinguish.

<NS> Why do we need to specify it in the document? Does it require any
development of a protocol? 

No, they are not all local.

 

 

I would just like to point out that we are now very far into
implementation 

and not standardisation. It is not the job of the IETF (in my personal 

opinion) to tell people how to build boxes.

 

Thanks,

Adrian

 

-----Original Message-----

From: mpls-interop-bounces@ietf.org

[mailto:mpls-interop-bounces@ietf.org] On Behalf Of ext Adrian Farrel

Sent: Wednesday, May 06, 2009 3:16 PM

To: mpls-interop@ietf.org

Subject: [Mpls-interop] Local and Remote

 

 

 

I know that Malcolm and Huub are going to work on the definition of

local

 

and remote.

 

 

 

Huub usefully typed during the meeting yesterday that "remote" might be

 

better stated as "non-local."

 

 

 

Watching some emails this morning, I wonder whether part of the

confusion arises from

- where the fault is detected

- where the fault exists

 

A node can (IMHO) only be detected locally. That is local to the point

of detection.

 

A fault may be reported (to the management plane) from the node of
detection 

(a local report), or may be signaled to another node (through the OAM or


control plane) and reported (to the management plane) from that other
node 

(a remote report).

 

The fault that is detected may be in the detecting node, or in a
resource 

directly connected to that node (such as a link). This is a local fault.

 

But the fault may also exist in a node or resource that is not local to
the 

detecting node. This is a remote fault.

 

 

 

If an LSP continuity check fails, the fault is detected by an end point

 

(local detection), but the fault might be somewhere out in the network

 

(remote fault).

 

 

 

When a server layer provides a connection that is used as a link in the

 

client layer, a server layer fault (that is remote in the server layer)

may

 

be reported to the client layer as a fault in the client layer link

(that is

 

local in the client layer). The client layer does not know about the

route

 

or resources used in the server layer (we MUST assume clean layer

 

separation) so the client layer does not need to know about the location

of

 

the server layer remote fault. If the client layer requests the server

layer

 

to repair the client layer link (i.e. to repair the server layer

connection)

 

the server layer may need to consider the location of the server layer

 

fault.

 

 

 

Am I wrong?

 

Is this really so complicated?

 

 

 

Thanks,

 

Adrian

 

 

 

 

 

_______________________________________________

 

Mpls-interop mailing list

 

Mpls-interop@ietf.org

 

https://www.ietf.org/mailman/listinfo/mpls-interop