Re: [mpls] proposed drafts for aligning MPLS-TP PSC linear protection protocol to transport requirements

Huub van Helvoort <huubatwork@gmail.com> Mon, 22 July 2013 11:19 UTC

Return-Path: <huubatwork@gmail.com>
X-Original-To: mpls@ietfa.amsl.com
Delivered-To: mpls@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 9AB7621F89C3 for <mpls@ietfa.amsl.com>; Mon, 22 Jul 2013 04:19:34 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.599
X-Spam-Level:
X-Spam-Status: No, score=-2.599 tagged_above=-999 required=5 tests=[BAYES_00=-2.599]
Received: from mail.ietf.org ([12.22.58.30]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id DbMuge8E07Ka for <mpls@ietfa.amsl.com>; Mon, 22 Jul 2013 04:19:34 -0700 (PDT)
Received: from mail-ea0-x236.google.com (mail-ea0-x236.google.com [IPv6:2a00:1450:4013:c01::236]) by ietfa.amsl.com (Postfix) with ESMTP id B67DC21F8F78 for <mpls@ietf.org>; Mon, 22 Jul 2013 04:19:31 -0700 (PDT)
Received: by mail-ea0-f182.google.com with SMTP id d10so3759665eaj.41 for <mpls@ietf.org>; Mon, 22 Jul 2013 04:19:30 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=message-id:disposition-notification-to:date:from:reply-to :user-agent:mime-version:to:cc:subject:references:in-reply-to :content-type:content-transfer-encoding; bh=rVVQ+T1bT3YMVH/A/8L048QKF1BnXWA5XsH24BJKOZs=; b=pO5AHQ/V+9s6bDNCGwn7IBcBUue8lnOtvBZPgdQmW1SnDNYrZE9qOItewWKATIJZNa kWBXP4T69X3jbhioMZApnoe4m+SASMpK/uOx7/bpAUg7L6aisnOrEXUeRlRFYpAIjwB1 2gDfVjtTQm0Ng3HyuQqYcVe1G7D1xezBztp/V6xlcJJpgw/mzxJqfWxyQjQCVJwDslVw PUox1jwGdvt//zvIjlAOKz0ocwNuzIZ+LfbgCOom85qd9rz/nPT9PMcPEsQEIupmk2+V J0G2C39oM8mhyft/WOACf1gvoZj6hqontw5jOn3dY4kQf2nBqa2eoLhUdKSPXqyG8w7x Wz5A==
X-Received: by 10.15.21.78 with SMTP id c54mr27036105eeu.14.1374491970813; Mon, 22 Jul 2013 04:19:30 -0700 (PDT)
Received: from McAsterix.local (g215085.upc-g.chello.nl. [80.57.215.85]) by mx.google.com with ESMTPSA id l42sm49958268eeo.14.2013.07.22.04.19.29 for <multiple recipients> (version=TLSv1 cipher=ECDHE-RSA-RC4-SHA bits=128/128); Mon, 22 Jul 2013 04:19:30 -0700 (PDT)
Message-ID: <51ED1541.70108@gmail.com>
Date: Mon, 22 Jul 2013 13:19:29 +0200
From: Huub van Helvoort <huubatwork@gmail.com>
User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.8; rv:17.0) Gecko/20130620 Thunderbird/17.0.7
MIME-Version: 1.0
To: "Eric Osborne (eosborne)" <eosborne@cisco.com>
References: <22257C41A415324A984CD03D63344E271F1B7A8B@TELMBA002RM001.telecomitalia.local> <20ECF67871905846A80F77F8F4A2757210288D02@xmb-rcd-x09.cisco.com>
In-Reply-To: <20ECF67871905846A80F77F8F4A2757210288D02@xmb-rcd-x09.cisco.com>
Content-Type: text/plain; charset="UTF-8"; format="flowed"
Content-Transfer-Encoding: 8bit
Cc: "Huub helvoort (huub.van.helvoort@huawei.com)" <huub.van.helvoort@huawei.com>, "mpls@ietf.org" <mpls@ietf.org>
Subject: Re: [mpls] proposed drafts for aligning MPLS-TP PSC linear protection protocol to transport requirements
X-BeenThere: mpls@ietf.org
X-Mailman-Version: 2.1.12
Precedence: list
Reply-To: huubatwork@gmail.com
List-Id: Multi-Protocol Label Switching WG <mpls.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/mpls>, <mailto:mpls-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/mpls>
List-Post: <mailto:mpls@ietf.org>
List-Help: <mailto:mpls-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/mpls>, <mailto:mpls-request@ietf.org?subject=subscribe>
X-List-Received-Date: Mon, 22 Jul 2013 11:19:34 -0000

Hello Eric,

You wrote:

> Thanks for this; the threads I started some time back seem to have
> died down, it's good to get them going again.

There was atwo week ITU-T plenary meeting, and after that I took
(and still have) a holiday.

> I have two things I never quite understood, can you clarify them
 > for me?

OK, I'll try. See in-line [Huub]

> i) can you explain EXER at a higher level?  I'm not looking for a
> description of the state machine changes, and I'm not looking for the
> one line "It allows the FSM to be tested".  We have all of that in
> the draft and in the equivalent ITU specs.
>
> What I'd like to understand about EXER is where it came from.  The
> ITU specs that define it are pretty hard to follow, they seem to
> assume the reader already knows what EXER is and what problem it
> solves.  It feels very much like a mechanism used to catch a very
> specific implementation bug, back when transport gear was far less
> debuggable than what we have today.

[Huub] EXER was not designed/intended to be used for bug finding
although it will detect problems with implementation.

[Huub] EXER was designed to verify that the state-machine at the
far end is able to respond to APS/PSC messages it receives from
the local end.
Even though state-machines should be tested extensively, there is
no 100% warranty. It can still have stopped due to external
circumstances, be in a deadlock due to unforseen order of events,
etc.

[Huub] note that APS/PSC should continue to operate even if no
control plane is available. The EXER is to enable an operator to
take corrective action before a protection switch request fails
and the 50ms switch time is not met.

> No other state machines that I'm familiar with (RSVP, LDP, BGP, OSPF,
> ISIS) have explicit signaling in them just to ask the neighbor
> whether it *would* be broken if if were, in the future, to be given a
> particular input.

[Huub] All these rely on a control plane, some of then include
"keepalive" messages to see if the far end responds.

> Part of my reluctance to get behind EXER has been
> that I don't feel comfortable with the idea of keeping a 30-year-old
> workaround in a protocol.

[Huub] it is NOT a workaround, it is an essential part of the
protocol.

> Is there more to it than that?  Have I
> misread and misunderstood EXER?  Does modern transport gear ever
> actually detect a problem via EXER/RR that wasn't obvious to the
> operator using other means?

[Huub] if there is no control plane I have no other means.
What means are available to verify if a state-machine that is
in a stable state is still functioning?

> ii) Why the push to standardize the SD state changes before we've
> defined SD?  I certainly agree that handling signal degrade is a good
> idea, but coming up with a definition for it has been challenging.
> What happens if we change the FSM to handle it, then come up with
> something more sophisticated (say, multiple levels of SD) that
> doesn't quite fit with the FSM changes?

[Huub] the definition of the signal degrade defect that causes the
SD event for the APS/PSC is still under discussion, but the APS/PSC
response to an SD event should be included. If we don't do it now
we have to issue another version of PSC later with all kinds of
backwards compatibility issues, additional complexity, and options
that operators do not like to have to manage.

[Huub] APS/PSC now responds to an SF event, based on detected
signal fail defects. However in the future we may extend the
signal fail defect by another probable cause. This will not
affect the existing APS/PCS.

Regards, Huub.



-- 
*****************************************************************
               请记住,你是独一无二的,就像其他每一个人一样