Re: [ippm] Extending TWAMP for Monitoring Service KPIs

Srivathsa Sarangapani <srivathsas@juniper.net> Mon, 18 July 2016 14:27 UTC

From: Srivathsa Sarangapani <srivathsas@juniper.net>
To: P Muthu Arul Mozhi <p.muthu.arul.mozhi@ericsson.com>, IETF IPPM WG <ippm@ietf.org>
Date: Mon, 18 Jul 2016 13:52:52 +0000
Message-ID: <3FA22AA0-8CE8-4EFE-9299-9CCCB0FF5B0A@juniper.net>
References: <0BEE6422-CA88-457A-B651-66C2DE417D16@juniper.net> <256DB779817549478A1637DDB82E83051D89167B@ESESSMB309.ericsson.se> <CE52FF64-54B5-42DD-8667-2664A0B65D24@juniper.net> <256DB779817549478A1637DDB82E83051D891705@ESESSMB309.ericsson.se>
In-Reply-To: <256DB779817549478A1637DDB82E83051D891705@ESESSMB309.ericsson.se>
Archived-At: <https://mailarchive.ietf.org/arch/msg/ippm/AwevBzdpeST7hZXv0QRoF4NMyvU>
Cc: Peyush Gupta <peyushg@juniper.net>
Subject: Re: [ippm] Extending TWAMP for Monitoring Service KPIs

Hi Muthu,

Answers tagged VAT2 inline:

—
Regards,
Vathsa


From: P Muthu Arul Mozhi <p.muthu.arul.mozhi@ericsson.com>
Date: Monday, July 18, 2016 at 3:44 PM
To: Srivathsa Sarangapani <srivathsas@juniper.net>, IETF IPPM WG <ippm@ietf.org>
Cc: Peyush Gupta <peyushg@juniper.net>
Subject: RE: [ippm] Extending TWAMP for Monitoring Service KPIs

Hi Srivathsa,

| VAT>>>Say the network admin wants to know the RTT + the
| liveliness of an application; then he needs to send 2 probes
| every interval: one TWAMP probe for calculating RTT and one
| application-specific probe to check the liveliness. With
| our proposal the desired functionality can be achieved with
| just 1 probe. This reduces the number of probes by 50%
| and uses the network resources more effectively.

Well, you could do that by sending a single HTTP command and have the application return the result in a single command. I think the problem is with the choice of the protocol and not with the protocol deficiencies.
VAT2>>>Can you please elaborate on how you would get the RTT + liveliness using a single HTTP command?
I hope you are not taking RTT to be the time between when the probe was sent from the application layer and when it was received by the application layer.
TWAMP timestamping is mostly implemented in the lowest possible layer (mostly in the PFE data path), and that is what the network admin is interested in.

| VAT>>>Good question. I don’t think the main functionality of
| SIP, XMPP or RTSP is to Measure the IP performance Metric of
| the network.
We don't seem to be using TWAMP to measure IP performance here anyway, but rather to learn the application liveliness, obtain the processing delay, etc.
VAT2>>>What made you think so?
In the TWAMP data probe, the timestamps needed to calculate the RTT are present by default. We want to add the HTTP request/response (assuming we are probing an HTTP server) to the payload, which is currently unused, to get the liveliness of the application.
I have specified the packet formats, which show that the TWAMP data packets still contain all the earlier timestamps. Please let me know if this is not clear in the document so that I can EXPLICITLY add this information.
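
As a rough illustration of the RTT calculation referred to above (a sketch only, not taken from the draft), the four timestamps carried by a reflected TWAMP test packet can be combined as follows. The class and function names are hypothetical, and the sample values are made up.

```python
from dataclasses import dataclass


@dataclass
class TwampTimestamps:
    """Four timestamps available once a reflected test packet returns."""
    t1: float  # sender transmit time (Session-Sender clock)
    t2: float  # reflector receive time (Session-Reflector clock)
    t3: float  # reflector transmit time (Session-Reflector clock)
    t4: float  # sender receive time (Session-Sender clock)


def round_trip_time(ts: TwampTimestamps) -> float:
    """Total elapsed time at the sender minus the reflector's turnaround time."""
    return (ts.t4 - ts.t1) - (ts.t3 - ts.t2)


# Made-up values in seconds; the two clocks are deliberately not synchronized.
sample = TwampTimestamps(t1=100.000, t2=250.004, t3=250.006, t4=100.011)
rtt = round_trip_time(sample)
print(f"RTT is roughly {rtt * 1000:.1f} ms")
```

Each difference is taken on a single clock, so the result does not depend on the sender and reflector clocks being synchronized.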

Regards,
Muthu

From: Srivathsa Sarangapani [mailto:srivathsas@juniper.net]
Sent: Monday, July 18, 2016 11:44 AM
To: P Muthu Arul Mozhi <p.muthu.arul.mozhi@ericsson.com>; IETF IPPM WG <ippm@ietf.org>
Cc: Peyush Gupta <peyushg@juniper.net>
Subject: Re: [ippm] Extending TWAMP for Monitoring Service KPIs

Hi Muthu,

Thanks for your comments. Please see my answers inline:

—
Regards,
Vathsa


From: P Muthu Arul Mozhi <p.muthu.arul.mozhi@ericsson.com>
Date: Monday, July 18, 2016 at 2:31 PM
To: Srivathsa Sarangapani <srivathsas@juniper.net>, IETF IPPM WG <ippm@ietf.org>
Cc: Peyush Gupta <peyushg@juniper.net>
Subject: RE: [ippm] Extending TWAMP for Monitoring Service KPIs

I read these drafts and some of the review comments and arguments. While the justification for this work has been primarily "why not use TWAMP for this purpose", I do not find arguments on why TWAMP is the appropriate protocol for the kind of usages described in the draft.
VAT>>>TWAMP is widely used to calculate RTT between 2 network nodes.
Why shouldn’t it be used to piggyback more relevant information like liveliness? Please note that currently the TWAMP data payload is not used. It is filled with either zeros or pseudo-random values. With our extension this payload can be used efficiently to carry some more relevant information.
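
To make the unused-payload point concrete, here is a minimal sketch of an unauthenticated-mode Session-Sender test packet (sequence number, 64-bit timestamp, error estimate, then padding, as in RFC 4656/RFC 5357). Placing an HTTP request in the padding is purely a hypothetical illustration; the actual packet formats are the ones specified in the draft and are not reproduced here. The function name, the error-estimate value and the host name are placeholders.

```python
import struct

NTP_EPOCH_OFFSET = 2208988800  # seconds between 1900-01-01 and 1970-01-01


def build_sender_packet(seq: int, unix_time: float, error_estimate: int,
                        padding: bytes) -> bytes:
    """Pack the 14-byte test header and append the packet padding."""
    seconds = int(unix_time) + NTP_EPOCH_OFFSET
    fraction = int((unix_time % 1) * 2**32)
    header = struct.pack("!IIIH", seq, seconds, fraction, error_estimate)
    return header + padding


PAD_LEN = 64

# Today: the padding carries zeros (or pseudo-random bytes).
classic = build_sender_packet(1, 1468850000.5, 0x0001, bytes(PAD_LEN))

# Hypothetical extension: the same bytes carry an application probe instead.
http_probe = b"GET / HTTP/1.1\r\nHost: example.test\r\n\r\n"
extended = build_sender_packet(1, 1468850000.5, 0x0001,
                               http_probe.ljust(PAD_LEN, b"\x00"))

print(len(classic), len(extended))
```

Both packets have the same size and the same timestamp header; only the content of the padding differs.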

If the intention is to find whether a HTTP application is alive, one could send the HTTP request directly to the application and wait for a response.
VAT>>>Say the network admin wants to know the RTT + the liveliness of an application; then he needs to send 2 probes every interval: one TWAMP probe for calculating RTT and one application-specific probe to check the liveliness. With our proposal the desired functionality can be achieved with just 1 probe. This reduces the number of probes by 50% and uses the network resources more effectively.
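
The arithmetic behind the 50% figure can be sketched as below. The interval count is a hypothetical value chosen for illustration, and the calculation counts packets only, not bytes on the wire.

```python
def probes_sent(intervals: int, probes_per_interval: int) -> int:
    """Total probes sent over a number of measurement intervals."""
    return intervals * probes_per_interval


intervals = 3600  # hypothetical: one measurement interval per second for an hour

separate = probes_sent(intervals, 2)  # one TWAMP probe + one application probe
combined = probes_sent(intervals, 1)  # one TWAMP probe carrying both
reduction = (separate - combined) / separate

print(f"{separate} vs {combined} probes, reduction {reduction:.0%}")
```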

OTOH, if the intention is to do the same for some other node Z, one could send an HTTP request to an application running in Z and let it initiate the HTTP request to the target application and pass on the result. We could also extend SIP, XMPP, RTSP or any other application protocol to communicate with Z. It isn't clear why TWAMP is more suitable than any of these protocols.
VAT>>>Good question. I don’t think the main functionality of SIP, XMPP or RTSP is to measure the IP performance metrics of the network.
But IPPM protocols are defined and designed for this purpose. I would like to paste some relevant text from the IPPM charter:
“The WG will seek to develop new metrics and models to more accurately characterize the network paths under test and/or the performance of transport and application layer protocols on these paths.”


Regards,
Muthu

From: ippm [mailto:ippm-bounces@ietf.org] On Behalf Of Srivathsa Sarangapani
Sent: Monday, July 18, 2016 7:58 AM
To: IETF IPPM WG <ippm@ietf.org>
Cc: Peyush Gupta <peyushg@juniper.net>
Subject: [ippm] Extending TWAMP for Monitoring Service KPIs

Hi All,

New versions of the TWAMP Service Monitoring extension drafts have been posted, addressing the comments given by Greg, Qin and others on the mailing list.
We request you all to please go through the documents and reply with your comments/suggestions.
The documents are available at the links below:

Name:          draft-spv-ippm-monitor-methodology-services-kpi
Revision:      02
Title:         Monitoring Service KPIs using TWAMP - Methodology
Document date: 2016-07-17
Group:         Individual Submission
Pages:         20
URL:           https://www.ietf.org/internet-drafts/draft-spv-ippm-monitor-methodology-services-kpi-02.txt
Status:        https://datatracker.ietf.org/doc/draft-spv-ippm-monitor-methodology-services-kpi/
Htmlized:      https://tools.ietf.org/html/draft-spv-ippm-monitor-methodology-services-kpi-02
Diff:          https://www.ietf.org/rfcdiff?url2=draft-spv-ippm-monitor-methodology-services-kpi-02

Abstract:
   The TWAMP protocol provides a common architecture for two-way
   measurements in the IP network.  However, IP network performance is
   also affected by the set of L4-L7 services deployed in the network.
   Monitoring the performance of these services in the IP network also
   plays a vital role in network optimization and application-layer
   traffic optimization.  This capability is not supported by the
   existing TWAMP protocol.

   In this document, we extend the TWAMP protocol to support service
   performance monitoring and service KPI calculation.  Some of the
   existing fields in the TWAMP protocol are extended to support new
   modes for calculating these KPIs.  A set of new messages is added to
   the control protocol between the TWAMP client (session sender) and
   the TWAMP server (session reflector).  Services here range from
   Layer 4 to Layer 7 services, such as HTTP-based services, traffic
   load balancers, DPI, video caching, real-time streaming and IPsec.
   The KPIs MAY be service latency, liveliness of an application,
   number of flows and sessions per service, and load balancer
   statistics.

   There is a separate draft
   [I.D-spv-ippm-monitor-implementation-services-kpi] that discusses
   the implementation of monitoring these KPIs in the network using
   TWAMP.  Monitoring these KPIs in the service plane within a network
   plays a vital role in the optimum usage of network resources and in
   improving overall performance and capacity.


Name:          draft-spv-ippm-monitor-implementation-services-kpi
Revision:      02
Title:         KPI Metrics for Service Monitoring using TWAMP
Document date: 2016-07-17
Group:         Individual Submission
Pages:         9
URL:           https://www.ietf.org/internet-drafts/draft-spv-ippm-monitor-implementation-services-kpi-02.txt
Status:        https://datatracker.ietf.org/doc/draft-spv-ippm-monitor-implementation-services-kpi/
Htmlized:      https://tools.ietf.org/html/draft-spv-ippm-monitor-implementation-services-kpi-02
Diff:          https://www.ietf.org/rfcdiff?url2=draft-spv-ippm-monitor-implementation-services-kpi-02

Abstract:
   We are using a new method to calculate service KPIs and metrics in
   the network using the TWAMP protocol.  This draft outlines the
   implementation of the service KPIs and their use cases in the
   service plane in the network.  The KPIs discussed in this draft
   include Service Latency and Application Liveliness detection.

   Service latency is defined as the time spent by the packet from when
   it is injected into the service module or service card until the
   serviced packet is received back by the TWAMP server.  The TWAMP
   server records the timestamp of the packet when it is injected into
   the service module and records a second timestamp when it receives
   the packet after the service is applied in the data plane.
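
The service latency definition above amounts to the difference between two timestamps taken on the same node. A minimal sketch, assuming a hypothetical `apply_service` callable that stands in for the service module or service card (a real implementation would timestamp in the data plane, not in Python):

```python
import time


def measure_service_latency(apply_service, packet: bytes):
    """Timestamp a packet before and after the service is applied."""
    injected_at = time.perf_counter()   # packet handed to the service module
    serviced = apply_service(packet)
    received_at = time.perf_counter()   # serviced packet received back
    return serviced, received_at - injected_at


def fake_service(packet: bytes) -> bytes:
    """Stand-in service that just delays the packet by about 10 ms."""
    time.sleep(0.01)
    return packet


serviced, latency = measure_service_latency(fake_service, b"probe")
print(f"service latency: {latency * 1000:.2f} ms")
```

Because both timestamps come from the same clock, no clock synchronization with the TWAMP client is needed for this KPI.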

   Application Liveliness detection determines whether the application
   is up and running in the network.  This method is applicable when
   you want to monitor, for example, an HTTP application or a DNS
   server and verify that it is up and running.  The implementation
   can be used for liveliness detection of any service in the network.


—
Regards,
Vathsa