Re: [rtcweb] Transcoding

John Leslie <john@jlc.net> Wed, 15 January 2014 14:32 UTC

Return-Path: <john@jlc.net>
X-Original-To: rtcweb@ietfa.amsl.com
Delivered-To: rtcweb@ietfa.amsl.com
Received: from localhost (ietfa.amsl.com [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 211181AE259 for <rtcweb@ietfa.amsl.com>; Wed, 15 Jan 2014 06:32:24 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -4.738
X-Spam-Level:
X-Spam-Status: No, score=-4.738 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, RCVD_IN_DNSWL_MED=-2.3, RP_MATCHES_RCVD=-0.538] autolearn=ham
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id txUrdu8zh2xH for <rtcweb@ietfa.amsl.com>; Wed, 15 Jan 2014 06:32:22 -0800 (PST)
Received: from mailhost.jlc.net (mailhost.jlc.net [199.201.159.4]) by ietfa.amsl.com (Postfix) with ESMTP id 06B261AE0F4 for <rtcweb@ietf.org>; Wed, 15 Jan 2014 06:32:22 -0800 (PST)
Received: by mailhost.jlc.net (Postfix, from userid 104) id 87D37C94C4; Wed, 15 Jan 2014 09:32:07 -0500 (EST)
Date: Wed, 15 Jan 2014 09:32:07 -0500
From: John Leslie <john@jlc.net>
To: "Cavigioli, Chris" <chris.cavigioli@intel.com>
Message-ID: <20140115143207.GD8358@verdi>
References: <20140112205608.GG47523@verdi> <CAD6AjGQ7X-h9oNtVwztSv5wkmvPAo0Fto=L=6VKuM1WnWe8bwQ@mail.gmail.com> <E36D1A4AE0B6AA4091F1728D584A6AD238213ADE@FMSMSX110.amr.corp.intel.com>
Mime-Version: 1.0
Content-Type: text/plain; charset="us-ascii"
Content-Disposition: inline
In-Reply-To: <E36D1A4AE0B6AA4091F1728D584A6AD238213ADE@FMSMSX110.amr.corp.intel.com>
User-Agent: Mutt/1.4.1i
Cc: rtcweb@ietf.org
Subject: Re: [rtcweb] Transcoding
X-BeenThere: rtcweb@ietf.org
X-Mailman-Version: 2.1.15
Precedence: list
List-Id: Real-Time Communication in WEB-browsers working group list <rtcweb.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/rtcweb>, <mailto:rtcweb-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/rtcweb/>
List-Post: <mailto:rtcweb@ietf.org>
List-Help: <mailto:rtcweb-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/rtcweb>, <mailto:rtcweb-request@ietf.org?subject=subscribe>
X-List-Received-Date: Wed, 15 Jan 2014 14:32:24 -0000

Cavigioli, Chris <chris.cavigioli@intel.com> wrote:
> 
> Yes, transcoding consumes significant compute resources, draws power
> and generates heat.

   All true...

> With good technology, that can be somewhat mitigated, but even if you
> had the world's largest supercomputer on the head of a pin, transcoding
> introduces speech frame buffer delays that are irreducible.

   Not sure what "speech frame buffer delays" means; but yes, transcoding
introduces video delays.

> Add to those codec latencies, another set of latencies for jitter
> buffer in the receiver side and then also another jitter buffer in
> the network for the transcoder's receiver side ... then add extra
> buffer delays when audio and video don't sync up and you slip to
> the next timeslot.

   Those latencies really shouldn't be blamed on transcoding.

> In speakerphone configurations, you add extra delay for acoustic echo
> cancellation.

   Agreed that echo-cancellation adds delay -- but again, that shouldn't
be blamed on transcoding. (BTW, I'm not a fan of echo-cancellation in
best-effort-Internet telephony.)

> Then you add the latencies going in the return path. Before you know
> it, you have significantly noticeable mouth-to-ear latencies.

  Agreed it's really easy to get noticeable mouth-to-ear latency.

> Once these the round-trip delay gets to a certain level, people begin
> to double-talk and it makes a conversational dialog very difficult,
> especially when you have a multiparty call to discuss a contentious
> topic.

   I have experienced such latencies. It's really painful. But recall
that there was useful earth-to-moon conferencing -- so I don't think
we should forbid such latencies: it feels like an application-level
consideration.

> Imagine you are a service provider.

   That's easy! ;^)

> If WebRTC becomes widespread, then service providers will need to
> invest capex to purchase & maintain transcoders to support "free"
> calls.

   Actually, we don't need to...

> Does this make financial business sense?

   In some cases, yes. In the other cases, we let someone else do the
investing.

> Will WebRTC gain a reputation for poor user experience due to
> round-trip delay?

   Not if there are a majority of cases where delay is no worse than
the PSTN.

> We may argue about H.264 and VP8 today, but in the blink of an eye,
> we'll be arguing about H.265 and VP9 ... but wait ... there will still
> be H.264 and VP8 infrastructure and devices out there.

   Yes!!!

> So you can either chose to transcode an ever-growing list of evolving
> codecs ... or better ... use OFFER/ANSWER to negotiate transcoder-free
> operation so that each session will narrow-down to a common codec for
> all participants

   Exactly!

> from a constrained list of mandatory codecs.

   Good lord no! We should never limit the choice to MTI codecs!

> Thus WebRTC systems must support both VP8 and H.264 today, then also
> VP9 and H.265 in the future ... as well as the voice channel with
> Opus, G.711, AMR, AMR-WB and probably eventually also EVS, G.722, etc.

   (I don't get it... Why would any sane engineer want to go _there_?

--
John Leslie <john@jlc.net>