Re: [codec] McGill university speech database

Anisse Taleb <anisse.taleb@huawei.com> Fri, 08 October 2010 15:21 UTC

Return-Path: <anisse.taleb@huawei.com>
X-Original-To: codec@core3.amsl.com
Delivered-To: codec@core3.amsl.com
Received: from localhost (localhost [127.0.0.1]) by core3.amsl.com (Postfix) with ESMTP id 1E15C3A68A7 for <codec@core3.amsl.com>; Fri, 8 Oct 2010 08:21:34 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: 1.364
X-Spam-Level: *
X-Spam-Status: No, score=1.364 tagged_above=-999 required=5 tests=[BAYES_20=-0.74, FH_RELAY_NODNS=1.451, HELO_MISMATCH_COM=0.553, RDNS_NONE=0.1]
Received: from mail.ietf.org ([64.170.98.32]) by localhost (core3.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id XYDrZZcRzSeL for <codec@core3.amsl.com>; Fri, 8 Oct 2010 08:21:33 -0700 (PDT)
Received: from szxga02-in.huawei.com (unknown [119.145.14.65]) by core3.amsl.com (Postfix) with ESMTP id 13E8F3A68D4 for <codec@ietf.org>; Fri, 8 Oct 2010 08:21:33 -0700 (PDT)
Received: from huawei.com (szxga02-in [172.24.2.6]) by szxga02-in.huawei.com (iPlanet Messaging Server 5.2 HotFix 2.14 (built Aug 8 2006)) with ESMTP id <0L9Z00LWW9CVDX@szxga02-in.huawei.com> for codec@ietf.org; Fri, 08 Oct 2010 23:22:07 +0800 (CST)
Received: from szxeml201-edg.china.huawei.com ([172.24.2.119]) by szxga02-in.huawei.com (iPlanet Messaging Server 5.2 HotFix 2.14 (built Aug 8 2006)) with ESMTP id <0L9Z00EW39CVW8@szxga02-in.huawei.com> for codec@ietf.org; Fri, 08 Oct 2010 23:22:07 +0800 (CST)
Received: from SZXEML402-HUB.china.huawei.com (10.82.67.32) by szxeml201-edg.china.huawei.com (172.24.2.39) with Microsoft SMTP Server (TLS) id 14.1.218.12; Fri, 08 Oct 2010 23:22:06 +0800
Received: from szxeml502-mbx.china.huawei.com ([fe80::1c80:305a:245:1afe]) by szxeml402-hub.china.huawei.com ([fe80::952d:2bf3:cb35:cafa%21]) with mapi id 14.01.0218.012; Fri, 08 Oct 2010 23:22:06 +0800
Date: Fri, 08 Oct 2010 15:22:06 +0000
From: Anisse Taleb <anisse.taleb@huawei.com>
In-reply-to: <C8C903D9.1DAA4%mknappe@juniper.net>
X-Originating-IP: [10.200.70.166]
To: Michael Knappe <mknappe@juniper.net>, "codec@ietf.org" <codec@ietf.org>
Message-id: <31A06A2BB2D2AB4F9EF23A695455C007943DCB@szxeml502-mbx.china.huawei.com>
MIME-version: 1.0
Content-type: text/plain; charset="us-ascii"
Content-language: en-US
Content-transfer-encoding: 7bit
Accept-Language: en-GB, zh-CN, en-US
Thread-topic: [codec] McGill university speech database
Thread-index: ActgIQL1rV2vEHjq5UuWF93xCyy/iQG1c4xQ
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
References: <C8C903D9.1DAA4%mknappe@juniper.net>
Subject: Re: [codec] McGill university speech database
X-BeenThere: codec@ietf.org
X-Mailman-Version: 2.1.9
Precedence: list
List-Id: Codec WG <codec.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/listinfo/codec>, <mailto:codec-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/codec>
List-Post: <mailto:codec@ietf.org>
List-Help: <mailto:codec-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/codec>, <mailto:codec-request@ietf.org?subject=subscribe>
X-List-Received-Date: Fri, 08 Oct 2010 15:21:34 -0000

Dear Michael,

Just a few comments.

The TSP database is indeed quite extensive, however, if I am not mistaken, it contains English language only (?).

Regarding the balancing of items, one of the main use cases of this codec are conversational applications, as such, the testing should include as much speech items as possible in different conditions, including noisy, reverberant rooms, error conditions/packet losses and more. Furthermore, It is well known that codecs performance may depend on the language of the items used in testing and it is not unheard-of to have a codec pass a quality performance requirement in a certain language and utterly fails in some others. The need for language diversity in testing is even more important given the intended wide distribution of the codec.

Kind regards,
/Anisse

-----Original Message-----
From: codec-bounces@ietf.org [mailto:codec-bounces@ietf.org] On Behalf Of Michael Knappe
Sent: Wednesday, September 29, 2010 11:55 PM
To: codec@ietf.org
Subject: [codec] McGill university speech database

Just received permission from Dr. Peter Kabal at McGill University to use their 1400 utterance, 24 different talker (12 male, 12 female) Harvard sentence speech database for our codec testing efforts in the IETF codec WG. Includes the original 48 kHz files. My preference is just to put the McGill link to the 539 MB CD ISO image file up on the codec wiki, if that sounds ok with everyone I will get final permission to post the link from Dr. Kabal and get them up on the wiki asap.

Next step is to work on getting rights to representative music content. A capella vocals (e.g. Tom's Diner) , orchestral crescendo's, castanets, solo violin, jazz trumpet/ensembles, rock/electronica etc would all be good to include for testing, please reply with any suggestions.

Thanks to Jean-Marc for the pointer to Dr. Kabal and the speech database!

Cheers,

Mike
_______________________________________________
codec mailing list
codec@ietf.org
https://www.ietf.org/mailman/listinfo/codec