Re: [codec] #15: Efficiently combine pre-encoded audio

"Benjamin M. Schwartz" <bmschwar@fas.harvard.edu> Wed, 12 May 2010 17:02 UTC

Return-Path: <bmschwar@fas.harvard.edu>
X-Original-To: codec@core3.amsl.com
Delivered-To: codec@core3.amsl.com
Received: from localhost (localhost [127.0.0.1]) by core3.amsl.com (Postfix) with ESMTP id 10DB13A6D08 for <codec@core3.amsl.com>; Wed, 12 May 2010 10:02:33 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -5.283
X-Spam-Level:
X-Spam-Status: No, score=-5.283 tagged_above=-999 required=5 tests=[AWL=0.697, BAYES_00=-2.599, RCVD_IN_DNSWL_MED=-4, RCVD_IN_SORBS_WEB=0.619]
Received: from mail.ietf.org ([64.170.98.32]) by localhost (core3.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id mvx5ADbo87UN for <codec@core3.amsl.com>; Wed, 12 May 2010 10:02:32 -0700 (PDT)
Received: from us12.unix.fas.harvard.edu (us12.unix.fas.harvard.edu [140.247.35.203]) by core3.amsl.com (Postfix) with ESMTP id 4F95728C1EA for <codec@ietf.org>; Wed, 12 May 2010 09:44:08 -0700 (PDT)
Received: from us12.unix.fas.harvard.edu (localhost.localdomain [127.0.0.1]) by us12.unix.fas.harvard.edu (Postfix) with ESMTP id EA164665269; Wed, 12 May 2010 12:43:57 -0400 (EDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed; d=fas.harvard.edu; h= message-id:date:from:reply-to:mime-version:to:cc:subject :references:in-reply-to:content-type:content-transfer-encoding; s=mail; bh=E0dGisNq4u7rUj7TacP0PO02WugxTAd00Sik9Xx50jo=; b=OBVj eDCW8bdFfRhuSk+Pmg2ZVg3Ki5EkkV5H4WgN4hLYx2Fvrk20htZDc3QMlzdBc27J KrffOeUdfj3VTvdbJOB6qOjXfBQnLY5BF94cKPwpj9Tnfd0UBthjM62/XQjLPUNN jXxYBhSgbCsQJVHNhFmZro+tD/uSxUgE0m3xJlc=
DomainKey-Signature: a=rsa-sha1; c=simple; d=fas.harvard.edu; h= message-id:date:from:reply-to:mime-version:to:cc:subject :references:in-reply-to:content-type:content-transfer-encoding; q=dns; s=mail; b=WEaJs2ksL7U89GoY1wX7qwXldVXqVO7IzYlrZ4ki+OCWIs +BWVY61lqkeIyUyQQOqod3hUp/gnVVZoPAApSQumoaA1D0JPDCjh32h9wUc6LSYG eCJTHEetFvdFQ17Z7AKab0PxrKWmfqL0T9m1KcluwgDo9iuHniJMRIxcSkNkM=
Received: from [172.23.141.103] (bwhmaincampuspat25.partners.org [170.223.207.25]) (using TLSv1 with cipher DHE-RSA-AES256-SHA (256/256 bits)) (No client certificate requested) (Authenticated sender: bmschwar@fas) by us12.unix.fas.harvard.edu (Postfix) with ESMTPSA id E1DDE664FAB; Wed, 12 May 2010 12:43:57 -0400 (EDT)
Message-ID: <4BEADACD.4080609@fas.harvard.edu>
Date: Wed, 12 May 2010 12:43:57 -0400
From: "Benjamin M. Schwartz" <bmschwar@fas.harvard.edu>
User-Agent: Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.9.1.9) Gecko/20100423 Thunderbird/3.0.4
MIME-Version: 1.0
To: Jean-Marc Valin <jean-marc.valin@octasic.com>
References: <062.bc75a3b3c4a980df34535f87c9484935@tools.ietf.org> <071.30b67e93d22f0bfedf46b5035d133441@tools.ietf.org> <1F68067D-33B9-4F0C-B31B-B3A56A72DBA4@cisco.com> <4BEAC888.50109@fas.harvard.edu> <4BEACCD7.8080401@octasic.com> <4BEACEBF.7080403@fas.harvard.edu> <4BEAD147.8080307@octasic.com> <4BEAD5C1.4000802@fas.harvard.edu> <4BEAD963.4010300@octasic.com>
In-Reply-To: <4BEAD963.4010300@octasic.com>
Content-Type: text/plain; charset="ISO-8859-1"; format="flowed"
Content-Transfer-Encoding: 7bit
Cc: codec@ietf.org
Subject: Re: [codec] #15: Efficiently combine pre-encoded audio
X-BeenThere: codec@ietf.org
X-Mailman-Version: 2.1.9
Precedence: list
Reply-To: bens@alum.mit.edu
List-Id: Codec WG <codec.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/listinfo/codec>, <mailto:codec-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/codec>
List-Post: <mailto:codec@ietf.org>
List-Help: <mailto:codec-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/codec>, <mailto:codec-request@ietf.org?subject=subscribe>
X-List-Received-Date: Wed, 12 May 2010 17:02:33 -0000

On 05/12/2010 12:37 PM, Jean-Marc Valin wrote:
> Benjamin M. Schwartz wrote:
>> I think I failed to communicate that by VAD I mean _not sending packets_
>> during inactivity. For the packets that are sent, the overhead should
>> average much less than 1 bit per frame.
>
> What you're describing is called DTX (discontinuous transmission).

Oops. Right.  What I'm trying to say is that DTX, based on encoder-side 
VAD, also greatly reduces the (average) computational burden on a 
conference mixer.  Of course, if everyone's really talking at once then 
VAD can't help.

--Ben