Re: [codec] draft test and processing plan for the IETF Codec

Anisse Taleb <anisse.taleb@huawei.com> Tue, 19 April 2011 09:58 UTC

Return-Path: <anisse.taleb@huawei.com>
X-Original-To: codec@ietfc.amsl.com
Delivered-To: codec@ietfc.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfc.amsl.com (Postfix) with ESMTP id E2DF5E0680 for <codec@ietfc.amsl.com>; Tue, 19 Apr 2011 02:58:35 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -6.445
X-Spam-Level:
X-Spam-Status: No, score=-6.445 tagged_above=-999 required=5 tests=[AWL=0.154, BAYES_00=-2.599, RCVD_IN_DNSWL_MED=-4]
Received: from mail.ietf.org ([208.66.40.236]) by localhost (ietfc.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id WCM2zTnoFSkS for <codec@ietfc.amsl.com>; Tue, 19 Apr 2011 02:58:34 -0700 (PDT)
Received: from lhrga02-in.huawei.com (lhrga02-in.huawei.com [195.33.106.143]) by ietfc.amsl.com (Postfix) with ESMTP id 7F889E0665 for <codec@ietf.org>; Tue, 19 Apr 2011 02:58:34 -0700 (PDT)
Received: from huawei.com (lhrga02-in [172.18.7.45]) by lhrga02-in.huawei.com (iPlanet Messaging Server 5.2 HotFix 2.14 (built Aug 8 2006)) with ESMTP id <0LJW00B4L91LR2@lhrga02-in.huawei.com> for codec@ietf.org; Tue, 19 Apr 2011 10:58:33 +0100 (BST)
Received: from LHREML202-EDG.china.huawei.com ([172.18.7.118]) by lhrga02-in.huawei.com (iPlanet Messaging Server 5.2 HotFix 2.14 (built Aug 8 2006)) with ESMTPS id <0LJW0038291KLT@lhrga02-in.huawei.com> for codec@ietf.org; Tue, 19 Apr 2011 10:58:32 +0100 (BST)
Received: from LHREML401-HUB.china.huawei.com (10.201.5.30) by LHREML202-EDG.china.huawei.com (172.18.7.189) with Microsoft SMTP Server (TLS) id 14.1.270.1; Tue, 19 Apr 2011 10:58:27 +0100
Received: from LHREML503-MBX.china.huawei.com ([fe80::f93f:958b:5b06:4f36]) by LHREML401-HUB.china.huawei.com ([::1]) with mapi id 14.01.0270.001; Tue, 19 Apr 2011 10:58:32 +0100
Date: Tue, 19 Apr 2011 09:58:31 +0000
From: Anisse Taleb <anisse.taleb@huawei.com>
In-reply-to: <364969800.304668.1303205984931.JavaMail.root@lu2-zimbra>
X-Originating-IP: [10.220.139.57]
To: Koen Vos <koen.vos@skype.net>
Message-id: <F5AD4C2E5FBF304ABAE7394E9979AF7C26BC910B@LHREML503-MBX.china.huawei.com>
MIME-version: 1.0
Content-type: text/plain; charset="utf-8"
Content-language: en-US
Content-transfer-encoding: 7bit
Accept-Language: en-GB, en-US
Thread-topic: [codec] draft test and processing plan for the IETF Codec
Thread-index: AQHL+/O4JQgDHmbkU0q3uka00W6W65RgTfQAgACNAoCAAFjtAIAASHuAgALk1RCAAIGPAIAAFHPA
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
x-cr-hashedpuzzle: ACGp Af9v BbWM GMxf JrfX Lzoi MkJ7 NwKc Qe00 TxFy UKiG VhPA W8r+ cvdS eZ3a fhW/; 3; YwBvAGQAZQBjAEAAaQBlAHQAZgAuAG8AcgBnADsAYwBvAHYAZQByAGQAYQBsAGUAQABzAHkAbQBwAGEAdABpAGMAbwAuAGMAYQA7AGsAbwBlAG4ALgB2AG8AcwBAAHMAawB5AHAAZQAuAG4AZQB0AA==; Sosha1_v1; 7; {42BC4B8A-D584-45AB-9C06-F3DA90D2E6A8}; YQBuAGkAcwBzAGUALgB0AGEAbABlAGIAQABoAHUAYQB3AGUAaQAuAGMAbwBtAA==; Tue, 19 Apr 2011 09:58:24 GMT; UgBFADoAIABbAGMAbwBkAGUAYwBdACAAZAByAGEAZgB0ACAAdABlAHMAdAAgAGEAbgBkACAAcAByAG8AYwBlAHMAcwBpAG4AZwAgAHAAbABhAG4AIABmAG8AcgAgAHQAaABlACAASQBFAFQARgAgAEMAbwBkAGUAYwA=
x-cr-puzzleid: {42BC4B8A-D584-45AB-9C06-F3DA90D2E6A8}
References: <F5AD4C2E5FBF304ABAE7394E9979AF7C26BC8C62@LHREML503-MBX.china.huawei.com> <364969800.304668.1303205984931.JavaMail.root@lu2-zimbra>
Cc: "codec@ietf.org" <codec@ietf.org>
Subject: Re: [codec] draft test and processing plan for the IETF Codec
X-BeenThere: codec@ietf.org
X-Mailman-Version: 2.1.12
Precedence: list
List-Id: Codec WG <codec.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/codec>, <mailto:codec-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/codec>
List-Post: <mailto:codec@ietf.org>
List-Help: <mailto:codec-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/codec>, <mailto:codec-request@ietf.org?subject=subscribe>
X-List-Received-Date: Tue, 19 Apr 2011 09:58:36 -0000

Dear Koen,
Thanks for the explanation. When it comes to the second part of my question, i.e. interaction with other effects, such as channel quality, call drops, bandwidth, I was curious about the statistical methodology you used. Any info on that?

Kind regards,
/Anisse 

> -----Original Message-----
> From: Koen Vos [mailto:koen.vos@skype.net]
> Sent: Tuesday, April 19, 2011 11:40 AM
> To: Anisse Taleb
> Cc: codec@ietf.org; Paul Coverdale
> Subject: Re: [codec] draft test and processing plan for the IETF Codec
> 
> Hi Anisse,
> 
> The reason that SILK-SWB has a much smaller confidence interval is simple:
> we had many more data points for that case (about a million, versus just a
> couple thousand for each of the other ones).  This difference in numbers
> was caused by the fact that in the randomized testing we only allowed a
> small fraction of calls to switch to anything else than the default.
> 
> best,
> koen.
> 
> 
> ----- Original Message -----
> From: "Anisse Taleb" <anisse.taleb@huawei.com>
> To: "Koen Vos" <koen.vos@skype.net>, "Paul Coverdale"
> <coverdale@sympatico.ca>
> Cc: codec@ietf.org
> Sent: Monday, April 18, 2011 6:03:09 PM
> Subject: RE: [codec] draft test and processing plan for the IETF Codec
> 
> Dear Koen,
> 
> Regarding point 3. This is quite interesting results, just for my
> understanding, I was wondering why the confidence intervals for SILK-SWB
> were small in comparison with the other alternatives? My understanding of
> this experiment is that the audio bandwidth is not the only factor
> affecting quality and call time. How did you isolate the other effect?
> 
> Do you have any more information about the experimental setup and
> statistical analysis conducted to derive these results.
> 
> Kind regards,
> /Anisse
> 
> 
> > -----Original Message-----
> > From: Koen Vos [mailto:koen.vos@skype.net]
> > Sent: Sunday, April 17, 2011 7:44 AM
> > To: Paul Coverdale
> > Cc: codec@ietf.org; Anisse Taleb
> > Subject: Re: [codec] draft test and processing plan for the IETF Codec
> >
> > Hi Paul,
> >
> > > The filtering described in the test plan [..] is there to establish
> > > a common bandwidth (and equalization characteristic in some cases)
> > > for the audio chain (be it NB, WB, SWB) so that subjects can focus
> > > on comparing the distortion introduced by each of the codecs in the
> > > test, without confounding it with bandwidth effects.
> >
> > I believe it would be a mistake to test with band-limited signals, for
> > these reasons:
> >
> > 1. Band-limited test signals are atypical of real-world usage.  People
> > in this WG have always emphasized that we should test with realistic
> > scenarios (like network traces for packet loss), and the proposal goes
> > against that philosophy.
> >
> > 2. Band limiting the input hurts a codec's performance.  In the Google
> > test for instance, Opus-WB@20 kbps outperformed the LP7 anchor --
> > surely that wouldn't happen if Opus ran on an LP7 signal.  That makes
> > the proposed testing procedure less relevant for deciding whether this
> > codec will be of value on the Internet.
> >
> > 3. Audio bandwidth matters to end users.  Real-life experiments show
> > that codecs with more bandwidth boost user ratings and call durations.
> > (E.g. see slides 2, 3 of http://www.ietf.org/proceedings/77/slides/codec-
> > 3.pdf)
> > So if a codec scores higher "just" because it encodes more bandwidth,
> > that's still a real benefit to users.  And the testing procedure
> > proposed already reduces the impact of differing bandwidths, by using
> > MOS scores without pairwise comparisons.
> >
> > 4. Testing with band-limited signals risks perpetuating crippled codec
> > design.  In order to do well in the tests, a codec designer would be
> > "wise" to downsample the input or otherwise optimize towards the
> > artificial test signals.  This actually lowers the performance for
> > real-world signals, and usually adds complexity.  And as long as
> > people design codecs with a band-limited response, they'll argue to
> > test with one as well.  Let's break this circle.
> >
> > I also found it interesting how the chosen bandwidths magically match
> > those of ITU standards, while potentially hurting Opus.  For instance,
> > Opus-SWB has only 12 kHz bandwidth, but would still be tested with a
> > 14 kHz signal.
> >
> > best,
> > koen.
> >
> >
> > ----- Original Message -----
> > From: "Paul Coverdale" <coverdale@sympatico.ca>
> > To: "Koen Vos" <koen.vos@skype.net>
> > Cc: codec@ietf.org, "Anisse Taleb" <anisse.taleb@huawei.com>
> > Sent: Saturday, April 16, 2011 6:25:04 PM
> > Subject: RE: [codec] draft test and processing plan for the IETF Codec
> >
> > Hi Koen and Jean-Marc,
> >
> > The filtering described in the test plan is not meant to be for anti-
> > aliassing, it is there to establish a common bandwidth (and equalization
> > characteristic in some cases) for the audio chain (be it NB, WB, SWB) so
> > that subjects can focus on comparing the distortion introduced by each of
> > the codecs in the test, without confounding it with bandwidth effects.
> >
> > Regards,
> >
> > ...Paul
> >
> > >-----Original Message-----
> > >From: Koen Vos [mailto:koen.vos@skype.net]
> > >Sent: Saturday, April 16, 2011 4:07 PM
> > >To: Paul Coverdale
> > >Cc: codec@ietf.org; Anisse Taleb
> > >Subject: Re: [codec] draft test and processing plan for the IETF Codec
> > >
> > >Paul Coverdale wrote:
> > >> You mean that VoIP applications have no filtering at all, not even
> > >> anti-aliassing?
> > >
> > >The bandpass filter in the test plan runs on the downsampled signal,
> > >so it's not an anti-aliasing filter.
> > >
> > >Also, the plan's bandpass for narrowband goes all the way up to Nyquist
> > >(4000 Hz), whereas for wideband it goes only to 7000 Hz.  So if the
> > >bandpass filters were to somehow deal with aliasing, they are not being
> > >used consistently.
> > >
> > >I presume the resamplers in the plan use proper anti-aliasing filters
> > >representative of those in VoIP applications (and described in
> > >Jean-Marc's post).
> > >
> > >best,
> > >koen.
> > >
> > >
> > >----- Original Message -----
> > >From: "Paul Coverdale" <coverdale@sympatico.ca>
> > >To: "Koen Vos" <koen.vos@skype.net>, "Anisse Taleb"
> > ><anisse.taleb@huawei.com>
> > >Cc: codec@ietf.org
> > >Sent: Saturday, April 16, 2011 4:42:06 AM
> > >Subject: RE: [codec] draft test and processing plan for the IETF Codec
> > >
> > >Hi Koen,
> > >
> > >You mean that VoIP applications have no filtering at all, not even
> > >anti-aliassing?
> > >
> > >...Paul
> > >
> > >>-----Original Message-----
> > >>From: codec-bounces@ietf.org [mailto:codec-bounces@ietf.org] On Behalf
> > >>Of Koen Vos
> > >>Sent: Saturday, April 16, 2011 1:04 AM
> > >>To: Anisse Taleb
> > >>Cc: codec@ietf.org
> > >>Subject: Re: [codec] draft test and processing plan for the IETF Codec
> > >>
> > >>Hi Anisse,
> > >>
> > >>I noticed your plan tests with band-limited signals: Narrowband signals
> > >>are
> > >>filtered from 300-4000 Hz, Wideband from 50-7000 Hz, Superwideband from
> > >>50-14000 Hz.
> > >>
> > >>However, VoIP applications have no such band-pass filters (which
> > >degrade
> > >>quality and add complexity).  So results will be more informative to
> > >the
> > >>WG
> > >>and potential adopters of the codec if the testing avoids band-pass
> > >>filtering as well.  We want test conditions to mimic the real world as
> > >>closely as possible.
> > >>
> > >>Instead of band-pass filtering, tests on speech could use a simple
> > >high-
> > >>pass
> > >>filter with a cutoff around 50 Hz, as many VoIP applications do indeed
> > >>have
> > >>such a filter.
> > >>
> > >>best,
> > >>koen.
> > >>
> > >>
> > >
> >