Re: [video-codec] Roman Danyliw's Discuss on draft-ietf-netvc-requirements-09: (with DISCUSS and COMMENT)

Filippov Alexey <Alexey.Filippov@huawei.com> Thu, 13 June 2019 17:15 UTC

Return-Path: <Alexey.Filippov@huawei.com>
X-Original-To: video-codec@ietfa.amsl.com
Delivered-To: video-codec@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 8CF24120120; Thu, 13 Jun 2019 10:15:03 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -4.2
X-Spam-Level:
X-Spam-Status: No, score=-4.2 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, RCVD_IN_DNSWL_MED=-2.3, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=ham autolearn_force=no
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id WHW58FpU-Cyx; Thu, 13 Jun 2019 10:15:00 -0700 (PDT)
Received: from huawei.com (lhrrgout.huawei.com [185.176.76.210]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 8DA8F12011C; Thu, 13 Jun 2019 10:15:00 -0700 (PDT)
Received: from lhreml704-cah.china.huawei.com (unknown [172.18.7.106]) by Forcepoint Email with ESMTP id 2219132B076A8A28C085; Thu, 13 Jun 2019 18:14:58 +0100 (IST)
Received: from fraeml703-chm.china.huawei.com (10.206.15.52) by lhreml704-cah.china.huawei.com (10.201.108.45) with Microsoft SMTP Server (TLS) id 14.3.408.0; Thu, 13 Jun 2019 18:14:57 +0100
Received: from fraeml705-chm.china.huawei.com (10.206.15.54) by fraeml703-chm.china.huawei.com (10.206.15.52) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_128_CBC_SHA256_P256) id 15.1.1713.5; Thu, 13 Jun 2019 19:14:57 +0200
Received: from fraeml705-chm.china.huawei.com ([10.206.112.183]) by fraeml705-chm.china.huawei.com ([10.206.112.183]) with mapi id 15.01.1713.004; Thu, 13 Jun 2019 19:14:57 +0200
From: Filippov Alexey <Alexey.Filippov@huawei.com>
To: Roman Danyliw <rdd@cert.org>, Adam Roach <adam@nostrum.com>, The IESG <iesg@ietf.org>
CC: "draft-ietf-netvc-requirements@ietf.org" <draft-ietf-netvc-requirements@ietf.org>, Mo Zanaty <mzanaty@cisco.com>, "netvc-chairs@ietf.org" <netvc-chairs@ietf.org>, "video-codec@ietf.org" <video-codec@ietf.org>
Thread-Topic: Roman Danyliw's Discuss on draft-ietf-netvc-requirements-09: (with DISCUSS and COMMENT)
Thread-Index: AQHVIL2iTyqJ65ndE0u1t8hrSi8/oaaZ1RMw
Date: Thu, 13 Jun 2019 17:14:56 +0000
Message-ID: <8820dcb9272f4de7b3ebb7cba68052f4@huawei.com>
References: <156030268519.5895.7315446863069831893.idtracker@ietfa.amsl.com>
In-Reply-To: <156030268519.5895.7315446863069831893.idtracker@ietfa.amsl.com>
Accept-Language: en-US
Content-Language: en-US
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
x-originating-ip: [10.198.51.251]
Content-Type: text/plain; charset="utf-8"
Content-Transfer-Encoding: base64
MIME-Version: 1.0
X-CFilter-Loop: Reflected
Archived-At: <https://mailarchive.ietf.org/arch/msg/video-codec/r2gvjj0YZ8C1QCX_KBun6ewzZPM>
Subject: Re: [video-codec] Roman Danyliw's Discuss on draft-ietf-netvc-requirements-09: (with DISCUSS and COMMENT)
X-BeenThere: video-codec@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: Video codec BoF discussion list <video-codec.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/video-codec>, <mailto:video-codec-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/video-codec/>
List-Post: <mailto:video-codec@ietf.org>
List-Help: <mailto:video-codec-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/video-codec>, <mailto:video-codec-request@ietf.org?subject=subscribe>
X-List-Received-Date: Thu, 13 Jun 2019 17:15:04 -0000

Dear Roman,

Thank you a lot for your comments and raising the important questions.

> How should a set of QPs be specified?
As you probably know, a set of available QP values can be codec specific. Usually, a QP set is selected to cover medium bit-rate range that is considered to be the most complex for compressing. We proposed to set up the assessment process not only for this range but also for low and high QP ranges to cover a wider range of applications. According to my understanding, concrete QP values should be specified for candidate codecs in draft-ietf-netvc-testing.

> How should the quality values be calculated?
It is explained in Section 4.1, namely: “To assess the quality of output (decoded) sequences, two indexes, PSNR [3] and MS-SSIM [3,11] are separately computed. In the case of the YCbCr color format, PSNR should be calculated for each color plane whereas MS-SSIM is calculated for luma channel only. In the case of the RGB color format, both metrics are computed for R, G and B channels. Thus, for each sequence, 30 RD-points for PSNR (i.e. three RD-curves, one for each channel) and 10 RD-points for MS-SSIM (i.e. one RD-curve, for luma channel only) should be calculated in the case of YCbCr. If content is encoded as RGB, 60 RD-points (30 for PSNR and 30 for MS-SSIM) should be calculated, i.e. three RD-curves (one for each channel) are computed for PSNR as well as three RD-curves (one for each channel) for MS-SSIM.” In references [3, 11], these 2 quality assessment metrics and the ways of how to calculate them are described in detail.

>-- What does the text “(in a separate document on Internet video codec testing)” mean?
> Per “A list of video sequences that should be used for testing as well as the 10 QP values for the reference codec are defined in a separate document ", what document is that?  Is it draft-ietf-netvc-testing?
According to my understanding, it is draft-ietf-netvc-testing.

>What does “codec implementation (for both an encoder and a decoder) should cover the worst case of computational complexity, memory bandwidth, and physical memory size” mean?
I’d like to thank Adam for his comment. I also think that his formulation ("...should take into consideration the worst-case...") is clearer. 
A more detailed explanation on this Section can be found in my response to the secdir reviewer: https://mailarchive.ietf.org/arch/msg/secdir/QYx-hl07aec5XhgqkldqgWFG3Is 

>Please add additional language that codec should be written in a defensive style as they will be processing untrusted input.
Good point. Thank you. I’ll add it.

--
Best regards,
Alexey Filippov

-----Original Message-----
From: Roman Danyliw via Datatracker [mailto:noreply@ietf.org] 
Sent: Wednesday, June 12, 2019 4:25 AM
To: The IESG <iesg@ietf.org>
Cc: draft-ietf-netvc-requirements@ietf.org; Mo Zanaty <mzanaty@cisco.com>; netvc-chairs@ietf.org; mzanaty@cisco.com; video-codec@ietf.org
Subject: Roman Danyliw's Discuss on draft-ietf-netvc-requirements-09: (with DISCUSS and COMMENT)

Roman Danyliw has entered the following ballot position for
draft-ietf-netvc-requirements-09: Discuss

When responding, please keep the subject line intact and reply to all email addresses included in the To and CC lines. (Feel free to cut this introductory paragraph, however.)


Please refer to https://www.ietf.org/iesg/statement/discuss-criteria.html
for more information about IESG DISCUSS and COMMENT positions.


The document, along with other ballot positions, can be found here:
https://datatracker.ietf.org/doc/draft-ietf-netvc-requirements/



----------------------------------------------------------------------
DISCUSS:
----------------------------------------------------------------------

(1) I worry that the level of detail in the in the Compression Performance Evaluation (Section 4.1) is insufficient for implementation.  Specifically:

(a) Per “Initially, for the codec selected as a reference one (e.g., HEVC or VP9), a set of 10 QP quantization parameter) values should be specified (in a separate document on Internet video codec testing) and corresponding quality values should be calculated.”

-- How should a set of QPs be specified?

--How should the quality values be calculated?

-- What does the text “(in a separate document on Internet video codec testing)” mean?

(b) Per “A list of video sequences that should be used for testing as well as the 10 QP values for the reference codec are defined in a separate document ", what document is that?  Is it draft-ietf-netvc-testing?

(2) Per the Security Considerations Section (Section 5)

-- What does “codec implementation (for both an encoder and a decoder) should cover the worst case of computational complexity, memory bandwidth, and physical memory size” mean?

-- Please add additional language that codec should be written in a defensive style as they will be processing untrusted input.


----------------------------------------------------------------------
COMMENT:
----------------------------------------------------------------------

(1) There is something odd about the document formatting – the title and the first author last name in the footer is wrapped in “< … >”

(2) It would be helpful to forward reference that acronyms are explained in Appendix A.

(3) This draft uses the words should and must to prescribe action.  Why wasn’t
RFC2119 cited to explain these words?

(4) Section 2.0.  A reference to explain “YCbCr 4:2:0” would be helpful

(5) Section 2.1. Per “high encoder complexity” and “decoding complexity”, I initiate read that as a qualitative measure.  However, the text says “up to 10x and more” so that implies some quantitative measure.  What is that?

(6) Section 2.1.  Expand QP values on first use

(7) Section 2.x.  The language around content doesn’t appear to be consist. 
For example:

-- Section 2.1, Internet Video Streaming says “movies, TV-series and shows, and animation.”

-- Section 2.2, IPTV says “television content”

-- Section 2.5, Screen casting says “business presentations …, animation (cartoons), gaming content, data visualization, …, virtual desktop infrastructure (VDI), screen/desktop sharing and collaboration, supervisory control and data acquisition (SCADA) display, automotive/navigation display, cloud gaming, factory automation display, wireless display, display wall, digital operating room (DiOR), etc.

What the difference between Section 2.1’s animation and Section 2.5’s cartoons?

What’s the different between Section 2.1’s “movies, TV series …” and Section 2.2’s “television content”?

(8) Section 2.5.  The sentence “Currently, …” is very challenging to parse as it includes inline “i.e.,” and “etc”.

(9) Section 2.5.  Per “powerpoint, word documents”, these are specific Microsoft products.  I recommend using more generic names.

(10) Section 4.  I found it confusing that an evaluation methodology was in a requirements document.  I would have expected it in the draft-ietf-netvc-testing

(11) Section 4.1.  VP9 needs a reference.

(12) Section 6.  I don’t think this entire section is necessary.

(13) Editorial Nits:
-- Section 3.  Style nit.  s/chapter/section/

-- Section 4.1. Typo.  s/computged/computed