Re: [codec] Comments on draft-ietf-codec-ambisonics-01

"Timothy B. Terriberry" <tterribe@xiph.org> Mon, 13 March 2017 23:07 UTC

Return-Path: <tterribe@xiph.org>
X-Original-To: codec@ietfa.amsl.com
Delivered-To: codec@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 409BF129BEB for <codec@ietfa.amsl.com>; Mon, 13 Mar 2017 16:07:06 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -6.9
X-Spam-Level:
X-Spam-Status: No, score=-6.9 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, RCVD_IN_DNSWL_HI=-5] autolearn=ham autolearn_force=no
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id X1hlWWOIHZyU for <codec@ietfa.amsl.com>; Mon, 13 Mar 2017 16:07:04 -0700 (PDT)
Received: from smtp.mozilla.org (mx2.scl3.mozilla.com [63.245.214.156]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 8794D129BCE for <codec@ietf.org>; Mon, 13 Mar 2017 16:06:55 -0700 (PDT)
Received: from localhost (localhost6.localdomain [127.0.0.1]) by mx2.mail.scl3.mozilla.com (Postfix) with ESMTP id 60C84C1792; Mon, 13 Mar 2017 23:06:55 +0000 (UTC)
X-Virus-Scanned: amavisd-new at mozilla.org
Received: from smtp.mozilla.org ([127.0.0.1]) by localhost (mx2.mail.scl3.mozilla.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id YEi3ODegR-xn; Mon, 13 Mar 2017 23:06:55 +0000 (UTC)
Received: from [10.252.26.5] (corp.mtv2.mozilla.com [63.245.221.32]) (Authenticated sender: tterriberry@mozilla.com) by mx2.mail.scl3.mozilla.com (Postfix) with ESMTPSA id 3951CC1764; Mon, 13 Mar 2017 23:06:55 +0000 (UTC)
Message-ID: <58C7260F.6030506@xiph.org>
Date: Mon, 13 Mar 2017 16:06:55 -0700
From: "Timothy B. Terriberry" <tterribe@xiph.org>
User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:29.0) Gecko/20100101 SeaMonkey/2.26
MIME-Version: 1.0
To: Jean-Marc Valin <jmvalin@mozilla.com>, Jan Skoglund <jks@google.com>, Jean-Marc Valin <jmvalin@jmvalin.ca>, Drew Allen <bitllama@google.com>, Mark Harris <mark.hsj@gmail.com>, "codec@ietf.org" <codec@ietf.org>
References: <2f534e1b-b1af-266a-50ef-36f1739d878b@jmvalin.ca> <CAMdZqKGzdndiwpdXsYcHS7+r8Ega5LcQmAvcjiuHTHJgtTUwDg@mail.gmail.com> <CA+KMCSXhS2m4Dkous=4RkOibYWuoi+V_zBrhi1+anm-c+syQ1Q@mail.gmail.com> <CAMdZqKFDtD684HMkoO9bXi-c+g+8R+ay9kPdWSQOtHFDbC3ZLA@mail.gmail.com> <CABQ9DcuD+Et6+rBG-rCnWX-Dk-9STZMeYs-6fQWTk1kyjigRhw@mail.gmail.com> <52f5a570-e9f4-ea49-515e-498f0ed4f1bb@mozilla.com> <CABQ9Dcu0JVuAFvThSOgiBzxa+QOD4-1zpLzX6i-RKG7SRJnkNg@mail.gmail.com> <CABQ9Dct0d4id7wnzyu4sQHU=HZFVjCOXHCTO_F5RHcfE7HdH1Q@mail.gmail.com> <17622007-e5ce-0a08-67df-98c30a51e5a8@mozilla.com> <CA+KMCSVPHoav7QzdnvV5_TB2wFidkML0Z2+kp4VpJCU16N1+5Q@mail.gmail.com> <ebae7987-a8c2-befc-0d95-9f2b131916a6@jmvalin.ca> <CA+KMCSVdAG=q_LDhFg7OcKqBbN+6Dbaz-1Bjv6tRji9+d_PktA@mail.gmail.com> <2480dd55-eaef-9376-c88a-d764d777dfad@mozilla.com>
In-Reply-To: <2480dd55-eaef-9376-c88a-d764d777dfad@mozilla.com>
Content-Type: text/plain; charset="ISO-8859-1"; format="flowed"
Content-Transfer-Encoding: 7bit
Archived-At: <https://mailarchive.ietf.org/arch/msg/codec/hI_Dg54AdHiHeai2awfoUroQ3XU>
Subject: Re: [codec] Comments on draft-ietf-codec-ambisonics-01
X-BeenThere: codec@ietf.org
X-Mailman-Version: 2.1.17
Precedence: list
List-Id: Codec WG <codec.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/codec>, <mailto:codec-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/codec/>
List-Post: <mailto:codec@ietf.org>
List-Help: <mailto:codec-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/codec>, <mailto:codec-request@ietf.org?subject=subscribe>
X-List-Received-Date: Mon, 13 Mar 2017 23:07:06 -0000

Jean-Marc Valin wrote:
> For family 3, if you already have a Cx(N+M) weight matrix, then the
> mapping table becomes completely redundant.

I think the issue boils down to whether it's worth it to have a table of 
size C to allow a matrix of size Kx(N+M) instead of Cx(N+M) in the 
headers where D is some number smaller than C (probably D=N+M). If more 
than 80% of your matrix is filled with zeros (100 out of 121 rows for 
10th order with only horizontal channels), that could be seen as a 
little wasteful.

That said, we very intentionally chose the direction of the channel 
mapping table for RFC 7845: one value per output channel that says which 
decoded channel to use. That makes it impossible to send multiple 
decoded channels (mixed channels after multiplying by the weight matrix 
in this draft) to the same speaker (ACN number in this draft). As Mark 
Harris points out, this is a failure case the current draft doesn't 
specify how to deal with. By doing the mapping in the same direction as 
RFC 7845, the worst failure case is that someone specifies a channel 
index larger than what's actually available, but that's easily 
detectable (i.e., you can tell without having to look at any other 
values in the mapping table). You also don't need to handle the diegetic 
channels separately. There is a special case for silent channels, but as 
a benefit it become easy (O(1)) to tell if a channel isn't used.

Having the mapping work in this direction would also be more consistent 
with Channel Mapping Family 2.