Re: [codec] #15: Efficiently combine pre-encoded audio

Jean-Marc Valin <jean-marc.valin@octasic.com> Wed, 12 May 2010 16:23 UTC

Message-ID: <4BEAD147.8080307@octasic.com>
Date: Wed, 12 May 2010 12:03:19 -0400
From: Jean-Marc Valin <jean-marc.valin@octasic.com>
User-Agent: Thunderbird 2.0.0.24 (X11/20100317)
MIME-Version: 1.0
To: bens@alum.mit.edu
References: <062.bc75a3b3c4a980df34535f87c9484935@tools.ietf.org> <071.30b67e93d22f0bfedf46b5035d133441@tools.ietf.org> <1F68067D-33B9-4F0C-B31B-B3A56A72DBA4@cisco.com> <4BEAC888.50109@fas.harvard.edu> <4BEACCD7.8080401@octasic.com> <4BEACEBF.7080403@fas.harvard.edu>
In-Reply-To: <4BEACEBF.7080403@fas.harvard.edu>
Content-Type: text/plain; charset="ISO-8859-1"; format="flowed"
Content-Transfer-Encoding: 7bit
Cc: codec@ietf.org
Subject: Re: [codec] #15: Efficiently combine pre-encoded audio
Precedence: list

Benjamin M. Schwartz wrote:
>> I think you can do better than an encoder VAD.
> 
> I know that CELT makes decoder VAD very efficient, 

Not only CELT. You can do that with an LPC-based codec too.

> but how is decoder VAD
> better than encoder VAD?  Encoder VAD saves even more CPU, saves
> bandwidth, and enables easier jitter buffering.

There's a few reasons why I think decoder-side is better:
- The decision for an encoder-size VAD would take some amount of space in 
the bit-stream
- If we make an encode-size VAD mandatory, then all encoders will have to 
spend the CPU cycles, even when it's not needed. If it's not mandatory, 
then the decoder cannot rely on it, so it still needs to implement a VAD
- A decoder VAD does not need to be specified in an exact way, so 
implementers can choose different implementations depending on that 
information they need.
- You cannot "game" a decode-size VAD.

> Are you thinking about some sort of adaptive thresholding that requires
> knowing all streams' volume levels?

Well, knowing the relative amplitudes of each stream can allow you to take 
more intelligent decisions, e.g. when you have to choose the "most active 
speaker". That's something you can't really get from an encoder VAD.

> Anyway, VAD can run on both encode and decode sides at the same time.

That would just mean nobody would bother implementing the encode side.

	Jean-Marc

[codec] #15: Efficiently combine pre-encoded audio codec issue tracker
Re: [codec] #15: Efficiently combine pre-encoded … stephen botzko
Re: [codec] #15: Efficiently combine pre-encoded … Stephan Wenger
Re: [codec] #15: Efficiently combine pre-encoded … codec issue tracker
Re: [codec] #15: Efficiently combine pre-encoded … stephen botzko
Re: [codec] #15: Efficiently combine pre-encoded … Jean-Marc Valin
Re: [codec] #15: Efficiently combine pre-encoded … Cullen Jennings
Re: [codec] #15: Efficiently combine pre-encoded … Brian Rosen
Re: [codec] #15: Efficiently combine pre-encoded … Benjamin M. Schwartz
Re: [codec] #15: Efficiently combine pre-encoded … Jean-Marc Valin
Re: [codec] #15: Efficiently combine pre-encoded … stephen botzko
Re: [codec] #15: Efficiently combine pre-encoded … Benjamin M. Schwartz
Re: [codec] #15: Efficiently combine pre-encoded … Benjamin M. Schwartz
Re: [codec] #15: Efficiently combine pre-encoded … Jean-Marc Valin
Re: [codec] #15: Efficiently combine pre-encoded … Benjamin M. Schwartz
Re: [codec] #15: Efficiently combine pre-encoded … Roman Shpount
Re: [codec] #15: Efficiently combine pre-encoded … codec issue tracker
Re: [codec] #15: Efficiently combine pre-encoded … Roman Shpount
Re: [codec] #15: Efficiently combine pre-encoded … codec issue tracker
Re: [codec] #15: Efficiently combine pre-encoded … Christian Hoene
Re: [codec] #15: Efficiently combine pre-encoded … codec issue tracker