Re: [codec] WG Review: Internet Wideband Audio Codec (codec)

Daniel Petrie <dpetrie@sipez.com> Thu, 14 January 2010 01:16 UTC

Return-Path: <dpetrie@sipez.com>
X-Original-To: codec@core3.amsl.com
Delivered-To: codec@core3.amsl.com
Received: from localhost (localhost [127.0.0.1]) by core3.amsl.com (Postfix) with ESMTP id 743AA3A6896 for <codec@core3.amsl.com>; Wed, 13 Jan 2010 17:16:03 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.599
X-Spam-Level:
X-Spam-Status: No, score=-2.599 tagged_above=-999 required=5 tests=[BAYES_00=-2.599]
Received: from mail.ietf.org ([64.170.98.32]) by localhost (core3.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id FbJ+3hGS0vzK for <codec@core3.amsl.com>; Wed, 13 Jan 2010 17:16:01 -0800 (PST)
Received: from web37105.mail.mud.yahoo.com (web37105.mail.mud.yahoo.com [209.191.85.107]) by core3.amsl.com (Postfix) with SMTP id 935983A682E for <codec@ietf.org>; Wed, 13 Jan 2010 17:16:01 -0800 (PST)
Received: (qmail 76420 invoked by uid 60001); 14 Jan 2010 01:15:57 -0000
Message-ID: <923058.72035.qm@web37105.mail.mud.yahoo.com>
X-YMail-OSG: aHMiswoVM1k6pwFz4AiBs9Wi6xBMvMZHMs62tkXhDv9Q3_Y2v00mQrC2GNdiUOmeyLgXh0mi0HFf3poNEeGzgwFU.hatCh39LkRy.ZVkhs0daq9R37qNfV4oruq2R7c9tbWkUDaV6VXnfD7McgGZ9tCQUeMr_0xcD3sJsuUGNuGju7x2YSKgNEmWFHQLgPz7O3EDxX08esox.n0KKfLsnWoJl.mxeuN4bVWHEym0kJw2F21Fm.aJFgNCYT08kxazQ9C7r6SzPYLnlf4cUsL4n__yJjHtHFJ5b0IUE4CHd72xjubkLeMu5wYZfjortd7V.DkCYjOaqp4naXOShQW6HAgya_44p5UYFIHsdFI-
Received: from [24.61.83.127] by web37105.mail.mud.yahoo.com via HTTP; Wed, 13 Jan 2010 17:15:56 PST
X-RocketYMMF: dgpetrie
X-Mailer: YahooMailClassic/9.0.20 YahooMailWebService/0.8.100.260964
Date: Wed, 13 Jan 2010 17:15:56 -0800
From: Daniel Petrie <dpetrie@sipez.com>
To: 'Alexander Chemeris' <Alexander.Chemeris@sipez.com>, 'Christian Hoene' <hoene@uni-tuebingen.de>, Roni Even <ron.even.tlv@gmail.com>
In-Reply-To: <4b4e3315.0f345e0a.1940.79ac@mx.google.com>
MIME-Version: 1.0
Content-Type: text/plain; charset="iso-8859-1"
Content-Transfer-Encoding: quoted-printable
Cc: codec@ietf.org
Subject: Re: [codec] WG Review: Internet Wideband Audio Codec (codec)
X-BeenThere: codec@ietf.org
X-Mailman-Version: 2.1.9
Precedence: list
Reply-To: dpetrie@sipez.com
List-Id: "Should the IETF standardize wideband Internet codec\(s\)? " <codec.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/listinfo/codec>, <mailto:codec-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/codec>
List-Post: <mailto:codec@ietf.org>
List-Help: <mailto:codec-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/codec>, <mailto:codec-request@ietf.org?subject=subscribe>
X-List-Received-Date: Thu, 14 Jan 2010 01:16:03 -0000

Hi Roni:
I agree that transporting this information is code independent information.  However calculating these as part of the encoding may be possible significant performance savings depending upon the codec algorithm.

Cheers,
Dan

--- On Wed, 1/13/10, Roni Even <ron.even.tlv@gmail.com> wrote:

> From: Roni Even <ron.even.tlv@gmail.com>
> Subject: RE: [codec] WG Review: Internet Wideband Audio Codec (codec)
> To: dpetrie@sipez.com, "'Alexander Chemeris'" <Alexander.Chemeris@sipez.com>, "'Christian Hoene'" <hoene@uni-tuebingen.de>
> Cc: codec@ietf.org
> Date: Wednesday, January 13, 2010, 3:53 PM
> Daniel,
> These are not a codec specific information and there is
> work on it in AVT.
> See
> http://tools.ietf.org/id/draft-lennox-avt-rtp-audio-level-exthdr-01.txt
> and
> there is also
> http://tools.ietf.org/id/draft-ivov-avt-slic-02.txt
> which provides
> information to the receiver.
> 
> Roni Even
> 
> > -----Original Message-----
> > From: codec-bounces@ietf.org
> [mailto:codec-bounces@ietf.org]
> On Behalf
> > Of Daniel Petrie
> > Sent: Wednesday, January 13, 2010 7:54 PM
> > To: 'Alexander Chemeris'; Christian Hoene
> > Cc: codec@ietf.org
> > Subject: Re: [codec] WG Review: Internet Wideband
> Audio Codec (codec)
> > 
> > While we are discussing conferencing, I would like to
> add my wish list
> > of requirements:
> > 
> > I would want the following information included in the
> RTP packet so
> > that I can do speaker selection on the streams without
> having to have
> > the expense of decoding them and run the algorithms to
> calculate these
> > values.  This is useful in all 3 of Christian's
> conferencing scenarios.
> > 
> > 1) Voice activity indication (minimally binary,
> ideally magnitude)
> > 2) gain control either already applied or a multiplier
> to apply to
> > achieve a consistent gain.
> > 3) Energy level
> > 
> > --- On Wed, 1/13/10, Christian Hoene <hoene@uni-tuebingen.de>
> wrote:
> > 
> > > From: Christian Hoene <hoene@uni-tuebingen.de>
> > > Subject: Re: [codec] WG Review: Internet Wideband
> Audio Codec (codec)
> > > To: "'Alexander Chemeris'" <Alexander.Chemeris@sipez.com>
> > > Cc: codec@ietf.org
> > > Date: Wednesday, January 13, 2010, 5:55 AM
> > > > This codec should be also useful
> > > in
> > > > conference
> > > > scenarios, where you have to decode it at
> conference
> > > server and encode
> > > > mixed signal again. And if I haven't missed
> something,
> > > this use-case
> > > > was
> > > > explicitly listed in codec requirements.
> > >
> > > The conferencing scenario is not yet well
> described in the
> > > current codec requirements document. Different
> kinds of
> > > systems are possible:
> > >
> > > 1) A central conferencing server that serves all
> clients.
> > > - a) which does a efficient single changing
> mixing - still
> > > a challenge if you have multiple operational
> modes in a
> > > codec.
> > > - b) which does a more intelligent mixing using
> (at least)
> > > two channels to support spatial hearing - this
> increases the
> > > quality of conference call significant. Thus,
> stereo modes
> > > must be supported if the codec shall support
> state of the
> > > art conferencing technologies.
> > > 2) A mesh-like N-to-N transmission conference
> call. Then
> > > you might benefit from layered coding because you
> have at a
> > > client you have to encode only once. But then
> again, how to
> > > support multiple coding modes?
> > > 3) One client acts as conference server (similar
> to 1)
> > >
> > > In total, the support of conferencing calls will
> require:
> > > - Layered coding
> > > - Efficient mixing of streams
> > > - Support of stereo
> > > - Good (self-)transcoding
> > > Is this all achievable?
> > >
> > > Christian
> > >
> > >
> > > _______________________________________________
> > > codec mailing list
> > > codec@ietf.org
> > > https://www.ietf.org/mailman/listinfo/codec
> > >
> > _______________________________________________
> > codec mailing list
> > codec@ietf.org
> > https://www.ietf.org/mailman/listinfo/codec
> 
>