Re: [codec] draft test and processing plan for the IETF Codec

Anisse Taleb <anisse.taleb@huawei.com> Wed, 20 April 2011 12:57 UTC

Return-Path: <anisse.taleb@huawei.com>
X-Original-To: codec@ietfc.amsl.com
Delivered-To: codec@ietfc.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfc.amsl.com (Postfix) with ESMTP id 4E57FE00BE for <codec@ietfc.amsl.com>; Wed, 20 Apr 2011 05:57:25 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -6.46
X-Spam-Level:
X-Spam-Status: No, score=-6.46 tagged_above=-999 required=5 tests=[AWL=0.139, BAYES_00=-2.599, RCVD_IN_DNSWL_MED=-4]
Received: from mail.ietf.org ([208.66.40.236]) by localhost (ietfc.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 7hjABxAo+AZB for <codec@ietfc.amsl.com>; Wed, 20 Apr 2011 05:57:24 -0700 (PDT)
Received: from lhrga04-in.huawei.com (lhrga04-in.huawei.com [195.33.106.149]) by ietfc.amsl.com (Postfix) with ESMTP id C5891E0665 for <codec@ietf.org>; Wed, 20 Apr 2011 05:57:23 -0700 (PDT)
Received: from huawei.com (localhost [127.0.0.1]) by lhrga04-in.huawei.com (iPlanet Messaging Server 5.2 HotFix 2.14 (built Aug 8 2006)) with ESMTP id <0LJY00E30BZEPQ@lhrga04-in.huawei.com> for codec@ietf.org; Wed, 20 Apr 2011 13:57:15 +0100 (BST)
Received: from LHREML201-EDG.china.huawei.com ([172.18.7.118]) by lhrga04-in.huawei.com (iPlanet Messaging Server 5.2 HotFix 2.14 (built Aug 8 2006)) with ESMTPS id <0LJY00D4WBZEHS@lhrga04-in.huawei.com> for codec@ietf.org; Wed, 20 Apr 2011 13:57:14 +0100 (BST)
Received: from LHREML401-HUB.china.huawei.com (10.201.5.30) by LHREML201-EDG.china.huawei.com (172.18.7.188) with Microsoft SMTP Server (TLS) id 14.1.270.1; Wed, 20 Apr 2011 13:57:06 +0100
Received: from LHREML503-MBX.china.huawei.com ([fe80::f93f:958b:5b06:4f36]) by LHREML401-HUB.china.huawei.com ([::1]) with mapi id 14.01.0270.001; Wed, 20 Apr 2011 13:57:13 +0100
Date: Wed, 20 Apr 2011 12:57:12 +0000
From: Anisse Taleb <anisse.taleb@huawei.com>
In-reply-to: <686052936.342214.1303247959195.JavaMail.root@lu2-zimbra>
X-Originating-IP: [10.220.139.57]
To: Koen Vos <koen.vos@skype.net>
Message-id: <F5AD4C2E5FBF304ABAE7394E9979AF7C26BC9A34@LHREML503-MBX.china.huawei.com>
MIME-version: 1.0
Content-type: text/plain; charset="utf-8"
Content-language: en-US
Content-transfer-encoding: base64
Accept-Language: en-GB, en-US
Thread-topic: [codec] draft test and processing plan for the IETF Codec
Thread-index: AQHL+/O4JQgDHmbkU0q3uka00W6W65RgTfQAgACNAoCAAFjtAIAASHuAgALk1RCAAIGPAIAAFHPAgACvA4CAAPn3EA==
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
x-cr-hashedpuzzle: CYgh D2t6 FgDF HNrS K56a M3Or M9mP QUfO RnGj TESF VoZ+ XLf/ XwrH YGog a+hN cokS; 3; YwBvAGQAZQBjAEAAaQBlAHQAZgAuAG8AcgBnADsAYwBvAHYAZQByAGQAYQBsAGUAQABzAHkAbQBwAGEAdABpAGMAbwAuAGMAYQA7AGsAbwBlAG4ALgB2AG8AcwBAAHMAawB5AHAAZQAuAG4AZQB0AA==; Sosha1_v1; 7; {37E95A72-2A05-4B5F-BA54-47F860290EC8}; YQBuAGkAcwBzAGUALgB0AGEAbABlAGIAQABoAHUAYQB3AGUAaQAuAGMAbwBtAA==; Wed, 20 Apr 2011 12:57:05 GMT; UgBFADoAIABbAGMAbwBkAGUAYwBdACAAZAByAGEAZgB0ACAAdABlAHMAdAAgAGEAbgBkACAAcAByAG8AYwBlAHMAcwBpAG4AZwAgAHAAbABhAG4AIABmAG8AcgAgAHQAaABlACAASQBFAFQARgAgAEMAbwBkAGUAYwA=
x-cr-puzzleid: {37E95A72-2A05-4B5F-BA54-47F860290EC8}
References: <F5AD4C2E5FBF304ABAE7394E9979AF7C26BC910B@LHREML503-MBX.china.huawei.com> <686052936.342214.1303247959195.JavaMail.root@lu2-zimbra>
Cc: "codec@ietf.org" <codec@ietf.org>
Subject: Re: [codec] draft test and processing plan for the IETF Codec
X-BeenThere: codec@ietf.org
X-Mailman-Version: 2.1.12
Precedence: list
List-Id: Codec WG <codec.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/codec>, <mailto:codec-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/codec>
List-Post: <mailto:codec@ietf.org>
List-Help: <mailto:codec-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/codec>, <mailto:codec-request@ietf.org?subject=subscribe>
X-List-Received-Date: Wed, 20 Apr 2011 12:57:25 -0000

Hi Koen,

The reason for me asking is that the ACD numbers (regardless of which codec is used ) you report seem to be quite high compared to what is reported in (*). It could be, as you mention, that by filtering the bad channel conditions the overall average goes up. Are your results published somewhere else? With the detailed experimental setup and filtering procedure? 

(*) “An Experimental Study of the Skype Peer-to-Peer VoIP System,” by S. Guha, N. Daswani and R. Jain. 

Kind regards,
/Anisse

> -----Original Message-----
> From: Koen Vos [mailto:koen.vos@skype.net]
> Sent: Tuesday, April 19, 2011 11:19 PM
> To: Anisse Taleb
> Cc: codec@ietf.org; Paul Coverdale
> Subject: Re: [codec] draft test and processing plan for the IETF Codec
> 
> Hi Anisse,
> 
> Yes for sure, we carefully filtered out anything that could interfere with
> the test.  So we kept only good network channels and so on.  You know,
> regular rigorous testing, but without the intentional biases :P.
> 
> The most amazing thing was: coinciding with the randomized testing we saw
> an increase in bug reports about audio quality..  People complaining about
> muffled audio etc.
> 
> best,
> koen.
> 
> 
> ----- Original Message -----
> From: "Anisse Taleb" <anisse.taleb@huawei.com>
> To: "Koen Vos" <koen.vos@skype.net>
> Cc: codec@ietf.org, "Paul Coverdale" <coverdale@sympatico.ca>
> Sent: Tuesday, April 19, 2011 11:58:31 AM
> Subject: RE: [codec] draft test and processing plan for the IETF Codec
> 
> Dear Koen,
> Thanks for the explanation. When it comes to the second part of my
> question, i.e. interaction with other effects, such as channel quality,
> call drops, bandwidth, I was curious about the statistical methodology you
> used. Any info on that?
> 
> Kind regards,
> /Anisse
> 
> > -----Original Message-----
> > From: Koen Vos [mailto:koen.vos@skype.net]
> > Sent: Tuesday, April 19, 2011 11:40 AM
> > To: Anisse Taleb
> > Cc: codec@ietf.org; Paul Coverdale
> > Subject: Re: [codec] draft test and processing plan for the IETF Codec
> >
> > Hi Anisse,
> >
> > The reason that SILK-SWB has a much smaller confidence interval is
> simple:
> > we had many more data points for that case (about a million, versus just
> a
> > couple thousand for each of the other ones).  This difference in numbers
> > was caused by the fact that in the randomized testing we only allowed a
> > small fraction of calls to switch to anything else than the default.
> >
> > best,
> > koen.
> >
> >
> > ----- Original Message -----
> > From: "Anisse Taleb" <anisse.taleb@huawei.com>
> > To: "Koen Vos" <koen.vos@skype.net>, "Paul Coverdale"
> > <coverdale@sympatico.ca>
> > Cc: codec@ietf.org
> > Sent: Monday, April 18, 2011 6:03:09 PM
> > Subject: RE: [codec] draft test and processing plan for the IETF Codec
> >
> > Dear Koen,
> >
> > Regarding point 3. This is quite interesting results, just for my
> > understanding, I was wondering why the confidence intervals for SILK-SWB
> > were small in comparison with the other alternatives? My understanding of
> > this experiment is that the audio bandwidth is not the only factor
> > affecting quality and call time. How did you isolate the other effect?
> >
> > Do you have any more information about the experimental setup and
> > statistical analysis conducted to derive these results.
> >
> > Kind regards,
> > /Anisse
> >
> >
> > > -----Original Message-----
> > > From: Koen Vos [mailto:koen.vos@skype.net]
> > > Sent: Sunday, April 17, 2011 7:44 AM
> > > To: Paul Coverdale
> > > Cc: codec@ietf.org; Anisse Taleb
> > > Subject: Re: [codec] draft test and processing plan for the IETF Codec
> > >
> > > Hi Paul,
> > >
> > > > The filtering described in the test plan [..] is there to establish
> > > > a common bandwidth (and equalization characteristic in some cases)
> > > > for the audio chain (be it NB, WB, SWB) so that subjects can focus
> > > > on comparing the distortion introduced by each of the codecs in the
> > > > test, without confounding it with bandwidth effects.
> > >
> > > I believe it would be a mistake to test with band-limited signals, for
> > > these reasons:
> > >
> > > 1. Band-limited test signals are atypical of real-world usage.  People
> > > in this WG have always emphasized that we should test with realistic
> > > scenarios (like network traces for packet loss), and the proposal goes
> > > against that philosophy.
> > >
> > > 2. Band limiting the input hurts a codec's performance.  In the Google
> > > test for instance, Opus-WB@20 kbps outperformed the LP7 anchor --
> > > surely that wouldn't happen if Opus ran on an LP7 signal.  That makes
> > > the proposed testing procedure less relevant for deciding whether this
> > > codec will be of value on the Internet.
> > >
> > > 3. Audio bandwidth matters to end users.  Real-life experiments show
> > > that codecs with more bandwidth boost user ratings and call durations.
> > > (E.g. see slides 2, 3 of
> http://www.ietf.org/proceedings/77/slides/codec-
> > > 3.pdf)
> > > So if a codec scores higher "just" because it encodes more bandwidth,
> > > that's still a real benefit to users.  And the testing procedure
> > > proposed already reduces the impact of differing bandwidths, by using
> > > MOS scores without pairwise comparisons.
> > >
> > > 4. Testing with band-limited signals risks perpetuating crippled codec
> > > design.  In order to do well in the tests, a codec designer would be
> > > "wise" to downsample the input or otherwise optimize towards the
> > > artificial test signals.  This actually lowers the performance for
> > > real-world signals, and usually adds complexity.  And as long as
> > > people design codecs with a band-limited response, they'll argue to
> > > test with one as well.  Let's break this circle.
> > >
> > > I also found it interesting how the chosen bandwidths magically match
> > > those of ITU standards, while potentially hurting Opus.  For instance,
> > > Opus-SWB has only 12 kHz bandwidth, but would still be tested with a
> > > 14 kHz signal.
> > >
> > > best,
> > > koen.
> > >
> > >
> > > ----- Original Message -----
> > > From: "Paul Coverdale" <coverdale@sympatico.ca>
> > > To: "Koen Vos" <koen.vos@skype.net>
> > > Cc: codec@ietf.org, "Anisse Taleb" <anisse.taleb@huawei.com>
> > > Sent: Saturday, April 16, 2011 6:25:04 PM
> > > Subject: RE: [codec] draft test and processing plan for the IETF Codec
> > >
> > > Hi Koen and Jean-Marc,
> > >
> > > The filtering described in the test plan is not meant to be for anti-
> > > aliassing, it is there to establish a common bandwidth (and
> equalization
> > > characteristic in some cases) for the audio chain (be it NB, WB, SWB)
> so
> > > that subjects can focus on comparing the distortion introduced by each
> of
> > > the codecs in the test, without confounding it with bandwidth effects.
> > >
> > > Regards,
> > >
> > > ...Paul
> > >
> > > >-----Original Message-----
> > > >From: Koen Vos [mailto:koen.vos@skype.net]
> > > >Sent: Saturday, April 16, 2011 4:07 PM
> > > >To: Paul Coverdale
> > > >Cc: codec@ietf.org; Anisse Taleb
> > > >Subject: Re: [codec] draft test and processing plan for the IETF Codec
> > > >
> > > >Paul Coverdale wrote:
> > > >> You mean that VoIP applications have no filtering at all, not even
> > > >> anti-aliassing?
> > > >
> > > >The bandpass filter in the test plan runs on the downsampled signal,
> > > >so it's not an anti-aliasing filter.
> > > >
> > > >Also, the plan's bandpass for narrowband goes all the way up to
> Nyquist
> > > >(4000 Hz), whereas for wideband it goes only to 7000 Hz.  So if the
> > > >bandpass filters were to somehow deal with aliasing, they are not
> being
> > > >used consistently.
> > > >
> > > >I presume the resamplers in the plan use proper anti-aliasing filters
> > > >representative of those in VoIP applications (and described in
> > > >Jean-Marc's post).
> > > >
> > > >best,
> > > >koen.
> > > >
> > > >
> > > >----- Original Message -----
> > > >From: "Paul Coverdale" <coverdale@sympatico.ca>
> > > >To: "Koen Vos" <koen.vos@skype.net>, "Anisse Taleb"
> > > ><anisse.taleb@huawei.com>
> > > >Cc: codec@ietf.org
> > > >Sent: Saturday, April 16, 2011 4:42:06 AM
> > > >Subject: RE: [codec] draft test and processing plan for the IETF Codec
> > > >
> > > >Hi Koen,
> > > >
> > > >You mean that VoIP applications have no filtering at all, not even
> > > >anti-aliassing?
> > > >
> > > >...Paul
> > > >
> > > >>-----Original Message-----
> > > >>From: codec-bounces@ietf.org [mailto:codec-bounces@ietf.org] On
> Behalf
> > > >>Of Koen Vos
> > > >>Sent: Saturday, April 16, 2011 1:04 AM
> > > >>To: Anisse Taleb
> > > >>Cc: codec@ietf.org
> > > >>Subject: Re: [codec] draft test and processing plan for the IETF
> Codec
> > > >>
> > > >>Hi Anisse,
> > > >>
> > > >>I noticed your plan tests with band-limited signals: Narrowband
> signals
> > > >>are
> > > >>filtered from 300-4000 Hz, Wideband from 50-7000 Hz, Superwideband
> from
> > > >>50-14000 Hz.
> > > >>
> > > >>However, VoIP applications have no such band-pass filters (which
> > > >degrade
> > > >>quality and add complexity).  So results will be more informative to
> > > >the
> > > >>WG
> > > >>and potential adopters of the codec if the testing avoids band-pass
> > > >>filtering as well.  We want test conditions to mimic the real world
> as
> > > >>closely as possible.
> > > >>
> > > >>Instead of band-pass filtering, tests on speech could use a simple
> > > >high-
> > > >>pass
> > > >>filter with a cutoff around 50 Hz, as many VoIP applications do
> indeed
> > > >>have
> > > >>such a filter.
> > > >>
> > > >>best,
> > > >>koen.
> > > >>
> > > >>
> > > >
> > >