Re: [rtcweb] Consensus vs. Voting (was Re: Call for Consensus Regarding Selecting Recommended Audio Codecs) (UNCLASSIFIED)

"Roy, Radhika R CIV USARMY (US)" <radhika.r.roy.civ@mail.mil> Thu, 17 January 2013 12:12 UTC

Return-Path: <radhika.r.roy.civ@mail.mil>
X-Original-To: rtcweb@ietfa.amsl.com
Delivered-To: rtcweb@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id CAF7B21F8B19 for <rtcweb@ietfa.amsl.com>; Thu, 17 Jan 2013 04:12:39 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -4.599
X-Spam-Level:
X-Spam-Status: No, score=-4.599 tagged_above=-999 required=5 tests=[BAYES_00=-2.599, GB_I_LETTER=-2]
Received: from mail.ietf.org ([64.170.98.30]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id lR74cp1m4wM5 for <rtcweb@ietfa.amsl.com>; Thu, 17 Jan 2013 04:12:38 -0800 (PST)
Received: from edge-cols.mail.mil (edge-cols.mail.mil [131.64.100.6]) by ietfa.amsl.com (Postfix) with ESMTP id 91D9921F8AE6 for <rtcweb@ietf.org>; Thu, 17 Jan 2013 04:12:38 -0800 (PST)
Received: from UCOLHP3M.easf.csd.disa.mil (131.64.100.152) by UCOLHP4Z.easf.csd.disa.mil (131.64.100.6) with Microsoft SMTP Server (TLS) id 14.2.309.2; Thu, 17 Jan 2013 12:13:52 +0000
Received: from UCOLHP9L.easf.csd.disa.mil ([169.254.2.136]) by UCOLHP3M.easf.csd.disa.mil ([131.64.100.152]) with mapi id 14.02.0309.003; Thu, 17 Jan 2013 12:13:51 +0000
From: "Roy, Radhika R CIV USARMY (US)" <radhika.r.roy.civ@mail.mil>
To: Roman Shpount <roman@telurix.com>, Martin Thomson <martin.thomson@gmail.com>
Thread-Topic: [rtcweb] Consensus vs. Voting (was Re: Call for Consensus Regarding Selecting Recommended Audio Codecs) (UNCLASSIFIED)
Thread-Index: AQHN9EY34EcSa43+0UCb5kw4I+PsEphNUyQAgAAbPuA=
Date: Thu, 17 Jan 2013 12:13:50 +0000
Message-ID: <8486C8728176924BAF5BDB2F7D7EEDDF4907E630@ucolhp9l.easf.csd.disa.mil>
References: <50D2CC6A.4090500@ericsson.com> <6515_1357907583_50F0067F_6515_1738_1_2842AD9A45C83B44B57635FD4831E60A0747CC@PEXCVZYM14.corporate.adroot.infra.ftgroup> <BLU0-SMTP880A602A311CE05C9DC39FD0290@phx.gbl> <A26C56D5-C501-4823-8099-62AF7910B8A4@ntt-at.com> <580BEA5E3B99744AB1F5BFF5E9A3C67D16813E56EC@HE111648.emea1.cds.t-internal.com> <50F41D97.1030508@nostrum.com> <CAD5OKxtsWMfAV=K4sM+zLXoyVCgihwujH2gG9ziA5GuEtsU0sQ@mail.gmail.com> <50F43ACA.80206@nostrum.com> <CAD5OKxug2qB+Xi_cp87Lt7BiPwJ1Eq1rNuioj+zDZFf=RRckPw@mail.gmail.com> <50F44AF0.4060304@nostrum.com> <CAD5OKxs7Ueto0k-5TWnQtgb+Pocp-SSu3ctr3qFs5qrcPgMtkQ@mail.gmail.com> <50F4619F.7040208@nostrum.com> <CAD5OKxu3_JJ3zS8hCeG-nHM72t=0j--ihUR8E5NvL9--wmmnEA@mail.gmail.com> <7CBFD4609D19C043A4AC4FD8381C6E2602386636@DEMUEXC014.nsn-intra.net> <50F5A74C.3030203@nostrum.com> <CABkgnnXRcFHj4gi6WEDDqU+S-adnjd91wQW4bL2S6pO8YtzE3w@mail.gmail.com> <CAD5OKxsz1AqsDG4_cZhGxbLbzmBYeYcqexbCR1LCe7Ecx0PQ9w@mail.gmail.com>
In-Reply-To: <CAD5OKxsz1AqsDG4_cZhGxbLbzmBYeYcqexbCR1LCe7Ecx0PQ9w@mail.gmail.com>
Accept-Language: en-US
Content-Language: en-US
X-MS-Has-Attach: yes
X-MS-TNEF-Correlator:
x-originating-ip: [131.64.62.4]
Content-Type: multipart/signed; protocol="application/x-pkcs7-signature"; micalg="SHA1"; boundary="----=_NextPart_000_0006_01CDF481.F6819B60"
MIME-Version: 1.0
Cc: "rtcweb@ietf.org" <rtcweb@ietf.org>
Subject: Re: [rtcweb] Consensus vs. Voting (was Re: Call for Consensus Regarding Selecting Recommended Audio Codecs) (UNCLASSIFIED)
X-BeenThere: rtcweb@ietf.org
X-Mailman-Version: 2.1.12
Precedence: list
List-Id: Real-Time Communication in WEB-browsers working group list <rtcweb.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/rtcweb>, <mailto:rtcweb-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/rtcweb>
List-Post: <mailto:rtcweb@ietf.org>
List-Help: <mailto:rtcweb-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/rtcweb>, <mailto:rtcweb-request@ietf.org?subject=subscribe>
X-List-Received-Date: Thu, 17 Jan 2013 12:12:39 -0000

Classification: UNCLASSIFIED
Caveats: NONE

Hi, all:

A good assessment - thanks to Roman. 

So, the transcoding cost is enormous from both performance and HW/SW
implementation point of view.

In the end it may so appear that we are not hearing each other.

BR/Radhika

-----Original Message-----
From: rtcweb-bounces@ietf.org [mailto:rtcweb-bounces@ietf.org] On Behalf Of
Roman Shpount
Sent: Thursday, January 17, 2013 5:32 AM
To: Martin Thomson
Cc: rtcweb@ietf.org
Subject: Re: [rtcweb] Consensus vs. Voting (was Re: Call for Consensus
Regarding Selecting Recommended Audio Codecs)


On Wed, Jan 16, 2013 at 7:03 PM, Martin Thomson <martin.thomson@gmail.com>
wrote:


	If we don't say "webrtc implementations SHOULD implement
	G.722/AMR/AMR-WB", what is the failure mode for your application?
	Keeping in mind that - because this isn't 2119 MUST - you have to
	expect that some non-negligible proportion of clients will not
support
	this no matter how much extra ink we using in printing large
letters,
	how much pain does this really cause you?
	
	I expect that the transcoding costs are what this come down to.
Does
	anyone care to quantify this?


If common codecs are not implemented in the browser this would mean
transcoding will need to be performed on some sort of server-side media
gateway. There are three distinct issues we need to take into account when
dealing with transcoding. I would list them in what, at least from my point
of view, is the order of importance:

1. If you need to do trancoding, you will need to implement a jitter buffer
on the transcoding media gateway. ICE processing and SRTP en/decoding can be
performed on out of order packets as they are received. You cannot transcode
out of order audio. This means audio packets would need to be delayed by the
jitter buffer size. In cases of good internet connection this means 40 ms
jitter buffer delay. In cases of consumer internet in US (somebody sitting
on a DSL connection at home behind a WiFi router) you are typically looking
at 60-80 ms jitter buffer delay. If you are dealing with international
internet connections jitter of 150 ms to 200 ms is not unheard of. This
means an additional delay due to jitter buffering of 40 ms (best case) to
200 ms. This is in addition to any other delays that are already present on
the audio path. Adding an extra 100 ms of delay can make audio conversions
very awkward or unusable. On the other hand, if the gateway were performing
just ICE and SRTP, it would introduce 1-2 ms of delay at most (In this case
I am talking about a software based implementation running on generic server
hardware. A dedicated DSP solution would probably be even faster).

2. Audio artifacts due to transcoding. Apart from usual audio degradation
due to transcoding (OPUS transcoded to AMR would sound worse then OPUS or
AMR), we would also deal with double packet loss concealment. Media gateway
will conceal lost packets when transcoding. Client will conceal lost packet
received from the gateway. This would actually sound worse then PLC
performed in one place.

3. Scalability of the transcoding gateway. Gateway which just deals with ICE
processing and SRTP en/decoding can handle from 2000 to 10000 concurrent
call when running on a generic server CPU (different scalability is due to
differences in CPU performance and presence of AES specific instructions).
Gateway that would do G.722 to OPUS transcoding would most likely handle
from 500 to 1000 concurrent calls. We are looking at 10x difference in
performance. If we need to transcode AMR to OPUS the difference would be
even higher. This translates in 10x difference in hardware, data center,
power, and operations costs.

_____________
Roman Shpount

Classification: UNCLASSIFIED
Caveats: NONE