Re: [video-codec] Strategy for an RF video codec

John Koleszar <jkoleszar@gmail.com> Wed, 16 January 2013 16:42 UTC

MIME-Version: 1.0
In-Reply-To: <C5E08FE080ACFD4DAE31E4BDBF944EB1133804E9@xmb-aln-x02.cisco.com>
References: <C5E08FE080ACFD4DAE31E4BDBF944EB11337D2B5@xmb-aln-x02.cisco.com> <CABcZeBNSrKGGr_nG1hTu07d-mGpKZGsp6uNu=w=Sm2YWs45YMg@mail.gmail.com> <C5E08FE080ACFD4DAE31E4BDBF944EB1133804E9@xmb-aln-x02.cisco.com>
Date: Wed, 16 Jan 2013 08:42:20 -0800
Message-ID: <CAPzd0H4C5Gdh5daRzaJXr8X7YBGOP5PWL195jmPJ1xGY8-04Fg@mail.gmail.com>
From: John Koleszar <jkoleszar@gmail.com>
To: "Cullen Jennings (fluffy)" <fluffy@cisco.com>
Content-Type: multipart/alternative; boundary="20cf303b40c3e299ad04d36a8f74"
Cc: Eric Rescorla <ekr@rtfm.com>, "video-codec@ietf.org" <video-codec@ietf.org>
Subject: Re: [video-codec] Strategy for an RF video codec
Precedence: list

On Wed, Jan 16, 2013 at 7:36 AM, Cullen Jennings (fluffy)<fluffy@cisco.com>
 wrote:

>
> On Jan 15, 2013, at 8:03 AM, Eric Rescorla <ekr@rtfm.com> wrote:
>
> > Cullen,
> >
> > This is a really interesting idea.
> >
> > Can you give me some sense of how you envisioned the negotiation
> happening?
> > Traditionally, IETF uses one of three types of negotiation:
> >
> > - One side offers some list of non-categorized features and the other
> side picks
> > a subset. ("extensions")
> > - We have a bunch of feature categories and one side offers a list of
> which
> > ones if supports in each category. The other side picks one from each
> column
> > ("chinese menu}")
> > - We have a fixed number of feature combinations and one side offers
> > a list of the combinations it supports and the other side picks one
> ("suites")
> > - We have a list of features and one side lists the combinations it would
> > accept ("profiles" (?))
> >
> > What did you have in mind here? I ask because it impacts the relationship
> > between the features and the required level of engineering
> > work.
>
> I had not really put too much thought into which style would work best but
> I was imaging "extensions" for interactive and "suites" for stored video.
>
> For interactive, the SDP would have the list of features supported by the
> receiver. The side encoding the video would look at that, and encode
> appropriately. The actual information of what the encoding was would be in
> the bitstream so any device could decode just from looking at the bitstream.
>

This seems like a testing nightmare to me, in addition to the implicit
costs of carrying around fallback code and silicon that never gets used.

Also, with this proposed granularity of features, even if some given
feature level is interoperable, it doesn't mean that the resulting codec as
a whole satisfies all the other requirements. If the minimal feature is
uncompressed video, that doesn't make it a practical choice for the
applications we're trying to cover. If used, features should degrade
quality, not baseline functionality (which includes some lower bound on
quality).

>
> For non interactive media - something more like vimeo, I would imagine
> that we would define a small number of suites defined in IETF specs and the
> video is encoded to one or more of the suites and the receiver either can
> deal with the suite or it can't. More or less how interactive video works
> today. It does allow new suites to be defined with existing features which
> would allow for fairly rapid roll out of new suites if a problem was found
> with existing suite set.
>
>
I fail to see the distinction between codecs and suites -- couldn't each
suite be considered a codec in and of itself?

I think the underlying issue you're addressing here is how do you iterate
on codecs and how quickly can you do so, in response to new research or in
your case as an IPR risk mitigation strategy. I think there are some
straightforward approches to doing this in software, where you can just
download the codec definition at runtime and run it in a sandbox, but doing
so in hardware is much harder. You can take an approach where you define
algorithmic blocks and then the definition of how those blocks connect can
be described in the bitstream as is done with MPEG's RVC effort, but as far
as I know it hasn't been shown to be practical yet.

In other words, even if we only do one codec now, do we do anything
differently because we know there'll be another sometime soon after? I'd
argue that it's outside the scope of a codec definition effort, but
anything that can get the cycle time on codecs reduced so it can be
measured in months, not years, is a worthy goal in my opinion, so maybe
it's worth trying. It's a much harder problem than defining a new codec,
certainly.

[video-codec] Strategy for an RF video codec Cullen Jennings (fluffy)
Re: [video-codec] Strategy for an RF video codec Eric Rescorla
Re: [video-codec] Strategy for an RF video codec Monty Montgomery
Re: [video-codec] Strategy for an RF video codec Cullen Jennings (fluffy)
Re: [video-codec] Strategy for an RF video codec John Koleszar
Re: [video-codec] Strategy for an RF video codec Timothy B. Terriberry
Re: [video-codec] Strategy for an RF video codec John Koleszar