[AVT] The RTP payload format for multichannel codecs's signalling problem

Magnus Westerlund <magnus.westerlund@ericsson.com> Thu, 05 February 2004 13:00 UTC

Received: from optimus.ietf.org (optimus.ietf.org [132.151.1.19]) by ietf.org (8.9.1a/8.9.1a) with ESMTP id IAA14084 for <avt-archive@odin.ietf.org>; Thu, 5 Feb 2004 08:00:39 -0500 (EST)
Received: from localhost.localdomain ([127.0.0.1] helo=www1.ietf.org) by optimus.ietf.org with esmtp (Exim 4.20) id 1Aoj73-0000K9-NK for avt-archive@odin.ietf.org; Thu, 05 Feb 2004 08:00:09 -0500
Received: (from exim@localhost) by www1.ietf.org (8.12.8/8.12.8/Submit) id i15D09AE001195 for avt-archive@odin.ietf.org; Thu, 5 Feb 2004 08:00:09 -0500
Received: from localhost.localdomain ([127.0.0.1] helo=www1.ietf.org) by optimus.ietf.org with esmtp (Exim 4.20) id 1Aoj6w-0000IL-S8; Thu, 05 Feb 2004 08:00:02 -0500
Received: from odin.ietf.org ([132.151.1.176] helo=ietf.org) by optimus.ietf.org with esmtp (Exim 4.20) id 1Aoj5x-0000F1-QL for avt@optimus.ietf.org; Thu, 05 Feb 2004 07:59:01 -0500
Received: from ietf-mx (ietf-mx.ietf.org [132.151.6.1]) by ietf.org (8.9.1a/8.9.1a) with ESMTP id HAA14037 for <avt@ietf.org>; Thu, 5 Feb 2004 07:59:00 -0500 (EST)
Received: from ietf-mx ([132.151.6.1]) by ietf-mx with esmtp (Exim 4.12) id 1Aoj5w-0003ol-00 for avt@ietf.org; Thu, 05 Feb 2004 07:59:00 -0500
Received: from exim by ietf-mx with spam-scanned (Exim 4.12) id 1Aoj54-0003jf-00 for avt@ietf.org; Thu, 05 Feb 2004 07:58:07 -0500
Received: from eagle.ericsson.se ([193.180.251.53]) by ietf-mx with esmtp (Exim 4.12) id 1Aoj4e-0003ds-00 for avt@ietf.org; Thu, 05 Feb 2004 07:57:40 -0500
Received: from esealmw141.al.sw.ericsson.se ([153.88.254.120]) by eagle.ericsson.se (8.12.10/8.12.10/WIREfire-1.8b) with ESMTP id i15CvdAh027635 for <avt@ietf.org>; Thu, 5 Feb 2004 13:57:39 +0100
Received: from esealnt613.al.sw.ericsson.se ([153.88.254.125]) by esealmw141.al.sw.ericsson.se with Microsoft SMTPSVC(6.0.3790.0); Thu, 5 Feb 2004 13:57:39 +0100
Received: from ericsson.com (research-1fd0e1.ki.sw.ericsson.se [147.214.34.102]) by esealnt613.al.sw.ericsson.se with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2657.72) id 1AZBCS47; Thu, 5 Feb 2004 13:57:39 +0100
Message-ID: <40223D4C.4020209@ericsson.com>
Date: Thu, 05 Feb 2004 13:55:40 +0100
X-Sybari-Trust: efa1a901 6b512624 6c3547e6 00000138
From: Magnus Westerlund <magnus.westerlund@ericsson.com>
User-Agent: Mozilla/5.0 (Windows; U; Windows NT 5.0; en-US; rv:1.4) Gecko/20030624 Netscape/7.1 (ax)
X-Accept-Language: sv, en-us, en
MIME-Version: 1.0
To: avt@ietf.org
Content-Type: text/plain; charset="ISO-8859-1"; format="flowed"
Content-Transfer-Encoding: 7bit
X-OriginalArrivalTime: 05 Feb 2004 12:57:39.0457 (UTC) FILETIME=[A29B5F10:01C3EBE7]
X-Spam-Checker-Version: SpamAssassin 2.60 (1.212-2003-09-23-exp) on ietf-mx.ietf.org
X-Spam-Status: No, hits=0.0 required=5.0 tests=none autolearn=no version=2.60
Content-Transfer-Encoding: 7bit
Subject: [AVT] The RTP payload format for multichannel codecs's signalling problem
Sender: avt-admin@ietf.org
Errors-To: avt-admin@ietf.org
X-BeenThere: avt@ietf.org
X-Mailman-Version: 2.0.12
Precedence: bulk
List-Unsubscribe: <https://www1.ietf.org/mailman/listinfo/avt>, <mailto:avt-request@ietf.org?subject=unsubscribe>
List-Id: Audio/Video Transport Working Group <avt.ietf.org>
List-Post: <mailto:avt@ietf.org>
List-Help: <mailto:avt-request@ietf.org?subject=help>
List-Subscribe: <https://www1.ietf.org/mailman/listinfo/avt>, <mailto:avt-request@ietf.org?subject=subscribe>
Content-Transfer-Encoding: 7bit

Hi,

Colin raised in Minneapolis a concern regarding the channel signalling 
for the AMR-WB+ payload format. The AMR-WB+ codec has both mono and 
stereo modes. On a codec level the difference is detected based on the 
frame type (FT) indicator for each frame. It is technical possible to 
switch between modes, but it does not provide as good switching 
performance as between the classic AMR-WB mono modes. However this codec 
"internal" or inband information about channels, modes etc. creates 
problem on RTP level and for the signalling. The problems described here 
is not limited to the AMR-WB+ codec, but does also effect other channels 
with internal signalling for different channel configuration.

The signalling problem is that the current practice of defining a MIME 
type and parameters and include them in SDP is not really capable of 
expressing the codecs capabilities in a good way. As it looks now, each 
declared payload type (PT) can only mean one channel configuration plus 
other parameter configuration.

This has not been a problem as long as one uses the PT field for its 
intended usage to the fully, i.e. having codec, packetization and all 
parameters being encoded in the PT: For example a stereo 16-bit PCM 
encoded audio stream is easy to describe with a MIME type and a couple 
of parameters: L16, frequency=44000, channels=2. Using the PT as 
intended also has the advantage of simplifying payload design, having 
the information explicit in the RTP header (thus avoiding per payload 
format special investigation for demultiplexing). However with the 
payload inband signalling of this information, it gets concealed within 
the payload.

This results in that declarative explicitness is diminished, and creates 
problem in cases of negotiation, like for SIP using Offer/Answer. For 
example, one possible solution for AMR-WB+ could be to declare a single 
payload type, which is always declared as being stereo. Thus expressing 
the full span of the codec capabilities. However if one only intends to 
use mono modes, this can't be expressed in this solution. It also forces 
a receiver application to set up resources beyond the what is really 
needed.

To maintain the declarative explicitness and uses the available 
mechanisms it will mean that one will need to declare different PTs for 
each channel configuration. Which would mean that an AMR-WB declaration 
possibly using both mono and stereo would look like this in SDP:

m=audio 49120 RTP/AVP 98 99
a=rtpmap:98 AMR-WB+/96000/1
a=fmtp:98 interleaving=30
 a=rtpmap:99 AMR-WB+/96000/2
a=fmtp:99 interleaving=30

However I would also like to point out that using too many PTs are not 
also not advisable, especially in regards to mechanisms like RTP 
retransmission that requires you to double the number of PTs used. 
However this should not be a concern unless we start using 10 or more 
PTs per codec.

So which alternative is better in your opinion, or do you have another 
proposal on how to solve the problem?


Best Regards

Magnus Westerlund 

Multimedia Technologies, Ericsson Research EAB/TVA/A
----------------------------------------------------------------------
Ericsson AB                | Phone +46 8 4048287
Torshamsgatan 23           | Fax   +46 8 7575550
S-164 80 Stockholm, Sweden | mailto: magnus.westerlund@ericsson.com



This communication is confidential and intended solely for the addressee(s). Any unauthorized review, use, disclosure or distribution is prohibited. If you believe this message has been sent to you in error, please notify the sender by replying to this transmission and delete the message without disclosing it. Thank you.

E-mail including attachments is susceptible to data corruption, interruption, unauthorized amendment, tampering and viruses, and we only send and receive e-mails on the basis that we are not liable for any such corruption, interception, amendment, tampering or viruses or any consequences thereof.


_______________________________________________
Audio/Video Transport Working Group
avt@ietf.org
https://www1.ietf.org/mailman/listinfo/avt