Re: [rtcweb] Consensus vs. Voting (was Re: Call for Consensus Regarding Selecting Recommended Audio Codecs) (UNCLASSIFIED)

Andrew Allen <aallen@rim.com> Thu, 17 January 2013 12:54 UTC

Return-Path: <prvs=97290e39eb=aallen@rim.com>
X-Original-To: rtcweb@ietfa.amsl.com
Delivered-To: rtcweb@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 70CEF21F894C for <rtcweb@ietfa.amsl.com>; Thu, 17 Jan 2013 04:54:19 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -5.447
X-Spam-Level:
X-Spam-Status: No, score=-5.447 tagged_above=-999 required=5 tests=[AWL=1.756, BAYES_00=-2.599, GB_I_LETTER=-2, MIME_QP_LONG_LINE=1.396, RCVD_IN_DNSWL_MED=-4]
Received: from mail.ietf.org ([64.170.98.30]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id HB8cQkL7pjOv for <rtcweb@ietfa.amsl.com>; Thu, 17 Jan 2013 04:54:18 -0800 (PST)
Received: from mhs060cnc.rim.net (mhs060cnc.rim.net [208.65.73.34]) by ietfa.amsl.com (Postfix) with ESMTP id 65D1121F895B for <rtcweb@ietf.org>; Thu, 17 Jan 2013 04:54:18 -0800 (PST)
X-AuditID: 0a41282f-b7f8c6d000004b29-cd-50f7f46d4b97
Received: from XCT104ADS.rim.net (xct104ads.rim.net [10.67.111.45]) by mhs060cnc.rim.net (SBG) with SMTP id F9.7C.19241.D64F7F05; Thu, 17 Jan 2013 06:54:05 -0600 (CST)
Received: from XMB104ADS.rim.net ([fe80::2494:a63d:e3:723b]) by XCT104ADS.rim.net ([fe80::90f9:3b89:1d94:aa9b%22]) with mapi id 14.02.0318.001; Thu, 17 Jan 2013 06:54:04 -0600
From: Andrew Allen <aallen@rim.com>
To: "radhika.r.roy.civ@mail.mil" <radhika.r.roy.civ@mail.mil>, "roman@telurix.com" <roman@telurix.com>, "martin.thomson@gmail.com" <martin.thomson@gmail.com>
Thread-Topic: [rtcweb] Consensus vs. Voting (was Re: Call for Consensus Regarding Selecting Recommended Audio Codecs) (UNCLASSIFIED)
Thread-Index: AQHN9Kv2LO2vpjF/Sk2uonpkHFQeHZhNehux
Date: Thu, 17 Jan 2013 12:54:03 +0000
Message-ID: <BBF5DDFE515C3946BC18D733B20DAD2338CF2A76@XMB104ADS.rim.net>
In-Reply-To: <8486C8728176924BAF5BDB2F7D7EEDDF4907E630@ucolhp9l.easf.csd.disa.mil>
Accept-Language: en-US
Content-Language: en-US
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
x-originating-ip: [10.67.110.253]
Content-Type: text/plain; charset="iso-8859-1"
content-transfer-encoding: quoted-printable
MIME-Version: 1.0
X-Brightmail-Tracker: H4sIAAAAAAAAA+NgFjrJKsWRmVeSWpSXmKPExsXC5Zyvq5v75XuAQf8hbYtrZ/4xWqxq/cVu MePCVGaLtf/a2R1YPHbOusvusWTJTyaP9fffM3rcmlIQwBLVwGiTlFhSFpyZnqdvZ5OYl5df kliSqpCSWpxsq+STmp6YoxBQlFmWmFyp4JJZnJyTmJmbWqSkkJliq2SipFCQk5icmpuaV2Kr lFhQkJqXomTHpYABbIDKMvMUUvOS81My89JtlTyD/XUtLEwtdQ2V7HQTOnkyrvVdZS+YpVux 4s0NlgbG6ypdjBwcEgImEusmuXUxcgKZYhIX7q1nA7GFBFYySrz+btzFyAVkb2aU2LN8H1iC TUBZYvnvGYwgCRGBhYwSDcvmgiWYBdQl7iw+xw6SEBZoZZS40reFCaKqjVHi7f4ZrCDrRASM JM5PAGtgEVCV+H1yOwuIzSvgITHt0VxGEJtTIEji65e97CA2I9BJ30+tYYJYIC5x68l8JohT BSSW7DnPDGGLSrx8/I8VwlaU+Lv3OytEvZ7EjalToI7Tlli28DUzxC5BiZMzn7BMYBSdhWTs LCQts5C0zELSsoCRZRWjYG5GsYGZQXJesl5RZq5eXmrJJkZQ2nDU0N/B+Pa9xSFGAQ5GJR7e 0nffA4RYE8uKK3MPMUpwMCuJ8H74ABTiTUmsrEotyo8vKs1JLT7E6AoMiYnMUtzJ+cCUllcS b2xggJujJM4r2Xs5QEggHZh+slNTC1KLYOYwcXCC7OGSEikGJpHUosTSkox4UKqLLwYmO6kG xpl9+qZ3exrVtMIbfvJxhrmuT1j4JuSewkt/5ofTdDy/RQb/UvrMn1/2qjAt8GrGdeY7zmJP 3ktOytRykLBTf6R1fovryrf371XfCGyO9TFSemoXYv+yxNV/QufyvyEfax+fvSDCwDdn43m3 XuVfrlqst1b9nizxOmKeV5DHS9bXChnMqps9lFiKMxINtZiLihMBN258jVwDAAA=
Cc: "rtcweb@ietf.org" <rtcweb@ietf.org>
Subject: Re: [rtcweb] Consensus vs. Voting (was Re: Call for Consensus Regarding Selecting Recommended Audio Codecs) (UNCLASSIFIED)
X-BeenThere: rtcweb@ietf.org
X-Mailman-Version: 2.1.12
Precedence: list
List-Id: Real-Time Communication in WEB-browsers working group list <rtcweb.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/rtcweb>, <mailto:rtcweb-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/rtcweb>
List-Post: <mailto:rtcweb@ietf.org>
List-Help: <mailto:rtcweb-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/rtcweb>, <mailto:rtcweb-request@ietf.org?subject=subscribe>
X-List-Received-Date: Thu, 17 Jan 2013 12:54:19 -0000

As I recall during the MTI discussion it was claimed that OPUS was intended for interoperability between RTCweb endoints and G.711 for interoperability with legacy. That's why we selected 2 MTI Codecs.

Transcoding between G.711 and something else like G.722 will not have anything like the transcoding cost as between OPUS and G.722 - basically the complexity of a media gateway.  Of course you will not get the full fidelity of either OPUS or G.722 but basic audio communications will work fine which is all I think we can guarantee with legacy.

If platforms already have other codecs (such as AMR on mobile devices) that are usable by the browser then it makes perfect sense that they be included in the offer but I don't think IETF should start recommending other codecs to implement other than the 2 MTI. This should be left as product decisions based on commercial needs and not recommendations made by IETF the driver for which I suspect has more to do with the interests of some legacy products/deployments than those of RTCweb. 

Since almost every computing platform has a browser (including most mobile phones) within a few years every computing platform will have RTCweb and OPUS so the need for high fidelity legacy codec interoperability will I think become a mute point.

I think the inteoperability concerns should be more focussed on the video issue where we seem to have a real problem.

Andrew

----- Original Message -----
From: Roy, Radhika R CIV USARMY (US) [mailto:radhika.r.roy.civ@mail.mil]
Sent: Thursday, January 17, 2013 06:13 AM Central Standard Time
To: Roman Shpount <roman@telurix.com>; Martin Thomson <martin.thomson@gmail.com>
Cc: rtcweb@ietf.org <rtcweb@ietf.org>
Subject: Re: [rtcweb] Consensus vs. Voting (was Re: Call for Consensus Regarding Selecting Recommended Audio Codecs) (UNCLASSIFIED)

Classification: UNCLASSIFIED
Caveats: NONE

Hi, all:

A good assessment - thanks to Roman. 

So, the transcoding cost is enormous from both performance and HW/SW
implementation point of view.

In the end it may so appear that we are not hearing each other.

BR/Radhika

-----Original Message-----
From: rtcweb-bounces@ietf.org [mailto:rtcweb-bounces@ietf.org] On Behalf Of
Roman Shpount
Sent: Thursday, January 17, 2013 5:32 AM
To: Martin Thomson
Cc: rtcweb@ietf.org
Subject: Re: [rtcweb] Consensus vs. Voting (was Re: Call for Consensus
Regarding Selecting Recommended Audio Codecs)


On Wed, Jan 16, 2013 at 7:03 PM, Martin Thomson <martin.thomson@gmail.com>
wrote:


	If we don't say "webrtc implementations SHOULD implement
	G.722/AMR/AMR-WB", what is the failure mode for your application?
	Keeping in mind that - because this isn't 2119 MUST - you have to
	expect that some non-negligible proportion of clients will not
support
	this no matter how much extra ink we using in printing large
letters,
	how much pain does this really cause you?
	
	I expect that the transcoding costs are what this come down to.
Does
	anyone care to quantify this?


If common codecs are not implemented in the browser this would mean
transcoding will need to be performed on some sort of server-side media
gateway. There are three distinct issues we need to take into account when
dealing with transcoding. I would list them in what, at least from my point
of view, is the order of importance:

1. If you need to do trancoding, you will need to implement a jitter buffer
on the transcoding media gateway. ICE processing and SRTP en/decoding can be
performed on out of order packets as they are received. You cannot transcode
out of order audio. This means audio packets would need to be delayed by the
jitter buffer size. In cases of good internet connection this means 40 ms
jitter buffer delay. In cases of consumer internet in US (somebody sitting
on a DSL connection at home behind a WiFi router) you are typically looking
at 60-80 ms jitter buffer delay. If you are dealing with international
internet connections jitter of 150 ms to 200 ms is not unheard of. This
means an additional delay due to jitter buffering of 40 ms (best case) to
200 ms. This is in addition to any other delays that are already present on
the audio path. Adding an extra 100 ms of delay can make audio conversions
very awkward or unusable. On the other hand, if the gateway were performing
just ICE and SRTP, it would introduce 1-2 ms of delay at most (In this case
I am talking about a software based implementation running on generic server
hardware. A dedicated DSP solution would probably be even faster).

2. Audio artifacts due to transcoding. Apart from usual audio degradation
due to transcoding (OPUS transcoded to AMR would sound worse then OPUS or
AMR), we would also deal with double packet loss concealment. Media gateway
will conceal lost packets when transcoding. Client will conceal lost packet
received from the gateway. This would actually sound worse then PLC
performed in one place.

3. Scalability of the transcoding gateway. Gateway which just deals with ICE
processing and SRTP en/decoding can handle from 2000 to 10000 concurrent
call when running on a generic server CPU (different scalability is due to
differences in CPU performance and presence of AES specific instructions).
Gateway that would do G.722 to OPUS transcoding would most likely handle
from 500 to 1000 concurrent calls. We are looking at 10x difference in
performance. If we need to transcode AMR to OPUS the difference would be
even higher. This translates in 10x difference in hardware, data center,
power, and operations costs.

_____________
Roman Shpount

Classification: UNCLASSIFIED
Caveats: NONE



---------------------------------------------------------------------
This transmission (including any attachments) may contain confidential information, privileged material (including material protected by the solicitor-client or other applicable privileges), or constitute non-public information. Any use of this information by anyone other than the intended recipient is prohibited. If you have received this transmission in error, please immediately reply to the sender and delete this information from your system. Use, dissemination, distribution, or reproduction of this transmission by unintended recipients is not authorized and may be unlawful.