Re: [codec] #8: Sample rates? 44.1kHz?

"Christian Hoene" <hoene@uni-tuebingen.de> Fri, 16 April 2010 07:44 UTC

Return-Path: <hoene@uni-tuebingen.de>
X-Original-To: codec@core3.amsl.com
Delivered-To: codec@core3.amsl.com
Received: from localhost (localhost [127.0.0.1]) by core3.amsl.com (Postfix) with ESMTP id A60193A68E9 for <codec@core3.amsl.com>; Fri, 16 Apr 2010 00:44:30 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -4.818
X-Spam-Level:
X-Spam-Status: No, score=-4.818 tagged_above=-999 required=5 tests=[AWL=-0.428, BAYES_20=-0.74, HELO_EQ_DE=0.35, RCVD_IN_DNSWL_MED=-4]
Received: from mail.ietf.org ([64.170.98.32]) by localhost (core3.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id udlOqdkPPyvl for <codec@core3.amsl.com>; Fri, 16 Apr 2010 00:44:29 -0700 (PDT)
Received: from mx05.uni-tuebingen.de (mx05.uni-tuebingen.de [134.2.3.4]) by core3.amsl.com (Postfix) with ESMTP id 6849F3A685E for <codec@ietf.org>; Fri, 16 Apr 2010 00:44:22 -0700 (PDT)
Received: from hoeneT60 (u-173-c009.cs.uni-tuebingen.de [134.2.173.9]) (authenticated bits=0) by mx05.uni-tuebingen.de (8.13.6/8.13.6) with ESMTP id o3G7i8K8027693 (version=TLSv1/SSLv3 cipher=AES128-SHA bits=128 verify=NO); Fri, 16 Apr 2010 09:44:09 +0200
From: Christian Hoene <hoene@uni-tuebingen.de>
To: 'Koen Vos' <koen.vos@skype.net>
References: <062.89d7aa91c79b145b798b83610e45ce71@tools.ietf.org> <071.0bc6655c98ff0335ad26ee705d9f5ce9@tools.ietf.org> <002a01cadac8$68dbf380$3a93da80$@de> <002301cadd1f$8b936da0$a2ba48e0$@de> <20100415233531.265740nulcub9elv@mail.skype.net>
In-Reply-To: <20100415233531.265740nulcub9elv@mail.skype.net>
Date: Fri, 16 Apr 2010 09:44:08 +0200
Message-ID: <000601cadd38$9b8210e0$d28632a0$@de>
MIME-Version: 1.0
Content-Type: text/plain; charset="us-ascii"
Content-Transfer-Encoding: 7bit
X-Mailer: Microsoft Office Outlook 12.0
Thread-Index: AcrdLwm8QOZN9Q0pT6OEdJcYvFhP0AABR6RQ
Content-Language: de
X-AntiVirus-Spam-Check: clean (checked by Avira MailGate: version: 3.0.0-4; spam filter version: 3.0.0/2.0; host: mx05)
X-AntiVirus: checked by Avira MailGate (version: 3.0.0-4; AVE: 8.2.1.220; VDF: 7.10.6.110; host: mx05); id=12376-JuX2TI
Cc: codec@ietf.org
Subject: Re: [codec] #8: Sample rates? 44.1kHz?
X-BeenThere: codec@ietf.org
X-Mailman-Version: 2.1.9
Precedence: list
List-Id: Codec WG <codec.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/listinfo/codec>, <mailto:codec-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/codec>
List-Post: <mailto:codec@ietf.org>
List-Help: <mailto:codec-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/codec>, <mailto:codec-request@ietf.org?subject=subscribe>
X-List-Received-Date: Fri, 16 Apr 2010 07:44:30 -0000

Hi Koen,

>> Supporting 44.1 kHz would require fractions of time stamp units
>
>It would be better to base the RTP time stamp on a fixed sampling
>rate, independent of the sampling rates used at the API level and
>internally.  The reason is that the API and internal sampling rates
>may change at any time.  My proposal would be to count RTP time stamps
>at 48 kHz always.

Using 48 kHz as RTP time stamp basis might be difficult for a 32 kHz internal sampling rate because you then might need to count 1,5
units of time. However, this might not be a problem if the frame size is a multiple of 3 or if we define a proper, bijective
rounding mechanisms for RTP time stamps.

In case of 44.1 kHz mapping to 48 kHz, a similar round function must be defined. 

Actually, some time ago I had a similar discussion regarding the support of multiple sampling rates in the draft-hoene-sbc.  The AVT
guys suggested me to either use the least common multiple of the sampling rates as RTP time stamp rate (which would be 96 kHz if
44.1 is not support), or to use different RTP payload types for different sampling rates.

A rounding function for RTP time stamps has not yet been defined in any RTP payload RFC. Thus, AVT guys might complain if we
introduce a rounding function. But then again, if it is well defined it should not be a technical problem...

Best,
 Christian




>
>best,
>koen.
>
>
>
>> Hi all,
>>
>>> It might be useful to state that it is not recommended to use 44100
>>> Hz, because the conversion to
>>> 16kHz and 32kHz is computational more demanding than 48kHz and
>>> requires more power/time.
>>
>> Currently, proposed codec run at sampling rates of
>>
>> BroadVoice: 8, 16 kHz
>> SILK: 8, 12, 16, 24 kHz
>> CELT: 32 kHz to 96 kHz.
>>
>> I do have a question. In order to make the RTP handling and the
>> resampling simpler, it might be useful to skip the 44.1kHz
>> compression mode. If only 8, 12, 16, 24, 32, and 48 kHz sampling
>> rates are used, then the RTP timestamp can be easily counted in 96
>> kHz units. Supporting 44.1 kHz would require fractions of time stamp
>> units that are difficult to handle with. Also, the resampling take
>> more computational resource if it has to be done at high quality.
>>
>> Thus, my question: Is 44.1 kHz really much-needed or can we use 32
>> or 48 kHz instead?
>>
>> Christian
>>
>>
>> _______________________________________________
>> codec mailing list
>> codec@ietf.org
>> https://www.ietf.org/mailman/listinfo/codec
>>