Re: [codec] #3: 2.2. Conferencing: Support of binaural audio?

Gregory Maxwell <gmaxwell@juniper.net> Tue, 23 March 2010 19:01 UTC

From: Gregory Maxwell <gmaxwell@juniper.net>
To: Christian Hoene <hoene@uni-tuebingen.de>, 'Slava Borilin' <Borilin@spiritdsp.com>, "codec@ietf.org" <codec@ietf.org>
Date: Tue, 23 Mar 2010 11:59:24 -0700
Thread-Topic: [codec] #3: 2.2. Conferencing: Support of binaural audio?
Thread-Index: AcrKrvuIACd3YdARQgWZAA5s7Y3ufgAAmG9wAAA/4OAAAbpg/g==
Message-ID: <BCB3F026FAC4C145A4A3330806FEFDA93A5C458BE7@EMBX01-HQ.jnpr.net>
References: <062.a837f2ff7647f7cb184f0c86b7e65747@tools.ietf.org> <5A3D7E7076F5DF42990A8C164308F810547717@mail-srv.spiritcorp.com>, <003001cacab3$4e9ded90$ebd9c8b0$@de>
In-Reply-To: <003001cacab3$4e9ded90$ebd9c8b0$@de>
Accept-Language: en-US
Content-Language: en-US
acceptlanguage: en-US
Content-Type: text/plain; charset="us-ascii"
Content-Transfer-Encoding: quoted-printable
MIME-Version: 1.0
Subject: Re: [codec] #3: 2.2. Conferencing: Support of binaural audio?
Precedence: list

Christian Hoene [hoene@uni-tuebingen.de] wrote:
> (a) does not have any impact on the requirements but in case of (b) codec requirements are the support of stereo speech transmission and support for efficient mixing.

I agree that (b) requires support for stereo.

Since you're coming from mono you're going to do the panning/auralization to virtually position participants at different locations. Physiologically/acoustically correct positioning is likely going to defeat any mixing short-cuts that a codec provides. E.g. with CELT (and presumably other codecs) you can use the independent frame mechanisms to eliminate or minimize conferencing server computation when hard switching between sources, but you can't do this if you're converting a mono stream to positioned stereo. So I don't think that it requires any more than support for stereo.

I suppose we could ask that codecs support a mode where a mono stream is sent with a positioning wrapper, instead of using stereo over the link... But for that case, I think that could be provided equally or better by another protocol or wrapper also running on RTP in a codec independent manner.

Of course, minimizing the codec computational burden is helpful for scaling conferencing systems.

Can anyone share some typical conference scaling numbers which they consider interesting? It's been my view that all comers are fast enough that conferencing on commercial scale isn't much of an issue: That, yes, it might take some serious processing grunt to handle 10,000 users on a conferencing system, but if you're running at that kind of scale you could afford the required hardware. I've not see any numbers to suggest that this isn't the case.

[codec] #3: 2.2. Conferencing: Support of binaura… codec issue tracker
Re: [codec] #3: 2.2. Conferencing: Support of bin… Slava Borilin
Re: [codec] #3: 2.2. Conferencing: Support of bin… Christian Hoene
Re: [codec] #3: 2.2. Conferencing: Support of bin… Michael Knappe
Re: [codec] #3: 2.2. Conferencing: Support of bin… Marc Petit-Huguenin
Re: [codec] #3: 2.2. Conferencing: Support of bin… Gregory Maxwell
Re: [codec] #3: 2.2. Conferencing: Support of bin… Slava Borilin
Re: [codec] #3: 2.2. Conferencing: Support of bin… Stefan Sayer
Re: [codec] #3: 2.2. Conferencing: Support of bin… codec issue tracker
Re: [codec] requirements #3 (new): 2.2. Conferenc… codec issue tracker
Re: [codec] #3: 2.2. Conferencing: Support of bin… codec issue tracker