[RAI] Using media stream signaling in media plane or in signaling plane (re-send)

Bo Burman <bo.burman@ericsson.com> Mon, 23 July 2012 13:10 UTC

Return-Path: <bo.burman@ericsson.com>
X-Original-To: rai@ietfa.amsl.com
Delivered-To: rai@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 1C08121F86DB for <rai@ietfa.amsl.com>; Mon, 23 Jul 2012 06:10:32 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -6.248
X-Spam-Level:
X-Spam-Status: No, score=-6.248 tagged_above=-999 required=5 tests=[AWL=-0.000, BAYES_00=-2.599, HELO_EQ_SE=0.35, HTML_MESSAGE=0.001, RCVD_IN_DNSWL_MED=-4]
Received: from mail.ietf.org ([12.22.58.30]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 82IWDZoFfN+8 for <rai@ietfa.amsl.com>; Mon, 23 Jul 2012 06:10:30 -0700 (PDT)
Received: from mailgw1.ericsson.se (mailgw1.ericsson.se [193.180.251.45]) by ietfa.amsl.com (Postfix) with ESMTP id 050E921F863C for <rai@ietf.org>; Mon, 23 Jul 2012 06:10:29 -0700 (PDT)
X-AuditID: c1b4fb2d-b7f6c6d000001cc5-98-500d4d44cf29
Received: from esessmw0256.eemea.ericsson.se (Unknown_Domain [153.88.253.125]) by mailgw1.ericsson.se (Symantec Mail Security) with SMTP id 34.CE.07365.44D4D005; Mon, 23 Jul 2012 15:10:29 +0200 (CEST)
Received: from ESESSCMS0361.eemea.ericsson.se ([169.254.1.109]) by esessmw0256.eemea.ericsson.se ([153.88.115.96]) with mapi; Mon, 23 Jul 2012 15:10:29 +0200
From: Bo Burman <bo.burman@ericsson.com>
To: "rai@ietf.org" <rai@ietf.org>
Date: Mon, 23 Jul 2012 15:10:27 +0200
Thread-Topic: Using media stream signaling in media plane or in signaling plane (re-send)
Thread-Index: Ac1o1Ickl0NznOsHTyybSNnNjNLTOQ==
Message-ID: <05F760EF51FA6A4F804F9759C239313A45E750C5AD@ESESSCMS0361.eemea.ericsson.se>
Accept-Language: sv-SE, en-US
Content-Language: en-US
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
acceptlanguage: sv-SE, en-US
Content-Type: multipart/alternative; boundary="_000_05F760EF51FA6A4F804F9759C239313A45E750C5ADESESSCMS0361e_"
MIME-Version: 1.0
X-Brightmail-Tracker: H4sIAAAAAAAAA+NgFjrKLMWRmVeSWpSXmKPExsUyM+Jvra6rL2+AwYM5nBaftzxjdWD0WLLk J1MAYxSXTUpqTmZZapG+XQJXxsyFOxkLTp1lrNi9cQZzA+PitYxdjJwcEgImEgtnNrBB2GIS F+6tB7K5OIQETjFKPJvwAspZyCjx+G8PE0gVm4CGxPwdd8G6RQQUJW483MgMYrMIqEpcnfgf bJKwQJjE53tHWSBqoiXmnn/GBmHrScxs7gabwysQLrFiymdWEJtRQFbi/vd7YPXMAuISt57M Z4K4SEBiyZ7zzBC2qMTLx/+g6mUkTi36zwpRny/xonEGC8RMQYmTM5+wTGAUmoVk1CwkZbOQ lEHE9SRuTJ3CBmFrSyxb+JoZwtaVmPHvEAuy+AJG9lWMwrmJmTnp5YZ6qUWZycXF+Xl6xamb GIFRcXDLb90djKfOiRxilOZgURLn5Ura7y8kkJ5YkpqdmlqQWhRfVJqTWnyIkYmDU6qB0fnT kfrfa9Zph0+8dWfl7123FrQ+uqRxwsInj/VcSsMz+0l/F/68sVr14EL3ezfFM65P/yP52sz5 ftjt1gVPZNheTPedOuXruUVvbTxiHnxieqjj2xn4yNo7eNXRef0bq9wc/FlEtyaf+n1p5XlP u74/d5q8ry6y7nPOjn1pOeXznVjuvkMzDsgrsRRnJBpqMRcVJwIAyOSneVgCAAA=
Subject: [RAI] Using media stream signaling in media plane or in signaling plane (re-send)
X-BeenThere: rai@ietf.org
X-Mailman-Version: 2.1.12
Precedence: list
List-Id: "Real-time Applications and Infrastructure \(RAI\)" <rai.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/rai>, <mailto:rai-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/rai>
List-Post: <mailto:rai@ietf.org>
List-Help: <mailto:rai-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/rai>, <mailto:rai-request@ietf.org?subject=subscribe>
X-List-Received-Date: Mon, 23 Jul 2012 13:10:32 -0000

As part of the comments during IETF 83 on the following drafts

* Media Stream Selection (https://datatracker.ietf.org/doc/draft-westerlund-dispatch-stream-selection/)
* PAUSE / RESUME (https://datatracker.ietf.org/doc/draft-westerlund-avtext-rtp-stream-pause/)
* Codec Operation Point (https://datatracker.ietf.org/doc/draft-westerlund-avtext-codec-operation-point/)

there was a discussion on the reasons to use one technology or the other for signaling related to specific media streams. The community was lacking knowledge of the performance implications coming from transporting information using a specific technology, and wanted a discussion on this mailing list.

I want to provide input to this discussion and have performed a comparison between using RTCP and using SIP as signaling technology for a particular purpose. I chose, more or less arbitrarily, this purpose to be sending a "PAUSE" command from a media receiver to a media sender for a specific RTP media stream (identified by SSRC).

Very briefly, these are the results (extracts from Section 5.1 of PAUSE / RESUME draft above, where more detailed descriptions of pre-conditions and assumptions can be found):

   The targeted topology is outlined in the following figure.

                  Provider A's network  .  Provider B's network
                                        .
   +-----+ SIP +------+ SIP +-------+  SIP  +-------+ SIP +-----+
   |Proxy|<--->| AS A |<--->| SGW A |<--.-->| SGW B |<--->|Proxy|
   +-----+     +------+     +-------+   .   +-------+     +-----+
       ^          ^                     .                    ^
       |          | SIP/H.248           .                    |
       |          v                     .                    |
   SIP |       +-----+ RTCP +-------+ RTCP  +-------+    SIP |
       |       | MCU |<---->| BGW A |<--.-->| BGW B |        |
       |       +-----+      +-------+   .   +-------+        |
       v         ^                      .       ^            v
   +------+     / RTCP                  .        \  RTCP  +------+
   | UA A |<---+                        .         +------>| UA B |
   +------+                             .                 +------+

   In the figure above, UA is a SIP User Agent, Proxy is a SIP Proxy, AS
   is an Application Server, MCU is a Multipoint Conference Unit, SGW a
   Signaling GateWay, and BGW a media Border GateWay.
   ...

   As a partial result, the message sizes can be compared, based on the
   messages defined in this specification (Section 8) and a SIP UPDATE
   with contents (Section 10) as discussed above.  Favorable and
   unfavorable message sizes are presented as stacked bars in the figure
   below.  Message sizes include IPv4 headers but no lower layer data,
   are rounded to the nearest 25 bytes, and the bars are to scale.

            250       500       750      1000      1250      1500 [byte]
   +---------+---------+---------+---------+---------+---------+--> Size
   |
   +-+--+
   | |50| 125 RTCP
   +-+--+---------------+--------------------------------------------+
   | SIP                | 525                                   1650 |
   +--------------------+--------------------------------------------+
   |
   ...
Favorable RTCP size comes from using non-compound RTCP packets.
Favorable SIP size comes from using dynamic SIP SigComp.
   ...

   The signaling delay results of the study are summarized in the
   following two figures.  Favorable and unfavorable values are
   presented as stacked bars.  Since there are many factors that impact
   the calculations, including some random processes, there are
   uncertainty in the calculations and delay values are thus rounded to
   nearest 5 ms.  The bars are to scale.

             50       100       150       200       250       300   [ms]
   +---------+---------+---------+---------+---------+---------+---> t
   |
   | Wireless UA to Wireless UA
   +---------------------+--------------------------------------+
   | SIP                 | 110                                  | 305
   +-----+---------------+-----------------------------+--------+
   |RTCP | 30                                          | 260
   +-----+---------------------------------------------+
   |
   | Wireless UA to MCU
   +-------------+--+
   | SIP         |70| 85
   +----+--------+--+--------------------------+
   |RTCP| 25                                   | 225
   +----+--------------------------------------+
   |
   | MCU to Wireless UA
   +--------------+-----------------------------------+
   | SIP          | 75                                | 255
   +---+----------+------------------------------+----+
   |   | 20 RTCP                                 | 230
   +---+-----------------------------------------+
   |
   ...

It should be noted that RTCP scheduling above is based on a 200 kbps media stream, leaving only 10 kbps (5%) for RTCP traffic, even for the favorable case. Should the media stream instead be 1000 kbps (50 kbps RTCP), the unfavorable RTCP delays will be reduced from 260 ms to 115 ms for Wireless-Wireless, from 225 ms to 80 ms for Wireless-MCU, and from 230 ms to 80 ms for MCU-Wireless.

   ...
   Below are the same cases for fixed access depicted.  Although delays
   are generally shorter, scales are kept the same for easy comparison
   with the previous figure.

             50       100       150       200       250       300   [ms]
   +---------+---------+---------+---------+---------+---------+---> t
   |
   | Fixed UA to Fixed UA
   +------------+
   | SIP        | 65
   +----+-------+---------------------------+
   |RTCP| 25                                | 205
   +----+-----------------------------------+
   |
   | Fixed UA to MCU
   +---------+
   | SIP     | 50
   +---+-----+-----------------------+
   |   | 15 RTCP                     | 200
   +---+-----------------------------+
   |
   | MCU to Fixed UA
   +---------+
   | SIP     | 50
   +---+-----+-----------------------+
   |   | 15 RTCP                     | 200
   +---+-----------------------------+
   |

As can be seen, there is no obvious winner in all cases. RTCP is definitely more compact, even when using SIP SigComp. Regarding response time, RTCP is more beneficial unless it is subject to unfavorable scheduling rules, too low bandwidth, or both, in which case SIP can perform better.

This rises further questions to be able to make an informed choice:
* Are there other, general, reasons or recommendations that should
  be taken into account when choosing signaling technology?
* What should the signaling strategy be when there are multiple nodes
  that need the signaled information, possibly not being in the direct
  route between the signaling peers?

My own conclusion is that I would prefer using RTCP for this type of media stream-related signaling, since:
* The media stream-related signaling does not have any major impact on
  the session itself, only what can be considered normal in-session
  variations, and thus there should be no need to notify all session-
  handling functions via SIP
* All media-handling nodes are already in the media path and will
  automatically have access to this new signaling, but may of course
  have to implement support for it if they need to act on it
* The difference in signaling bandwidth is definitely in RTCP's favor
* The RTCP signaling delay has a clear potential to be lower than SIP
  and it should already be possible to configure RTCP such that this
  low delay can be achieved in a large majority of cases

Comments on the above are greatly appreciated!

Cheers,

Bo Burman

----------------------------------------------------------------------
Multimedia Technologies, Ericsson Research EAB/TVM
----------------------------------------------------------------------
Ericsson AB                 | Phone  +46 10 7141311
Färögatan 6                 | Mobile +46 73 0949021
SE-164 80 Stockholm, Sweden | mailto: bo.burman@ericsson.com
----------------------------------------------------------------------