Re: [AVTCORE] Comments on draft-ietf-avtcore-rtp-vvc-05.txt

Stephan Wenger <stewe@stewe.org> Thu, 05 November 2020 18:50 UTC

Return-Path: <stewe@stewe.org>
X-Original-To: avt@ietfa.amsl.com
Delivered-To: avt@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 50A5E3A1960 for <avt@ietfa.amsl.com>; Thu, 5 Nov 2020 10:50:05 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.9
X-Spam-Level:
X-Spam-Status: No, score=-1.9 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, HTML_MESSAGE=0.001, RCVD_IN_MSPIKE_H2=-0.001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (1024-bit key) header.d=steweorg.onmicrosoft.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id cYXBUo64-F-E for <avt@ietfa.amsl.com>; Thu, 5 Nov 2020 10:50:02 -0800 (PST)
Received: from NAM10-DM6-obe.outbound.protection.outlook.com (mail-dm6nam10on2101.outbound.protection.outlook.com [40.107.93.101]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 46DD33A1954 for <avt@ietf.org>; Thu, 5 Nov 2020 10:50:01 -0800 (PST)
ARC-Seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=C8nSy8gzeY0AkmywIpeGDAPtMFfQ/x60To4XwpCVc+dgPIu2z+2PmqlH6+fnFzXbUJYuEJYJAeKHhrFVpM1nJOgl0b2Ryw96G4dmjbKr5VJRKjebWsf1cSNxCxSjD49JBxMKVI33z/h2zJy+CJoV9A2Gty/6XztSkYWAtahoOVraoKjzHGSHKGYZLX+G7R9AWSv3olrHhdusLMwQajDSmGVPABjlpVGg4E4YtKCIdBmEt1/UU1cjn4EEfqFPeN4Iwijzsor5lL2ZvIDSeTu7ba58ifcb42F4xDJ2UzfsteQ/irUTtDvzdZN3j+9fiuwjqEZnHRfMEiW/jQDCPhnhxg==
ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=LvXnYm5fpigBD9gE4gdp1DT8u8vGH51YbQthP4WV0+M=; b=P0qmXv/9rdGlft4UTAWp8wY7DID0L5RX6vfXfHi1tPsqJdztPlfQjaVsR58pfmdm1HyhOdHrV2gyqW32PIpYkz/i2e4hHGbC2kWyq1Q4s1u4k69YoftfCVfUmz+hMIRx1sEAtJQpIFXTzlJv//3+UQbiVYK7cNSqT0y6sCxsEZkzW11lx3R68OnWW05lw+JilJplNMxLFmP1LukG7JKMSY7cVrDXvHP5I1lwFZTIeo7iWK6lPN2gz9tLw8pLVI9Dpue4Rl4x7G2AfVubySol/LPZMAWEqnLvz0buG5DmgRrYwYjV4CycSt82oqaf8tbZkdS+bugOY8tTY6dsMEQGPA==
ARC-Authentication-Results: i=1; mx.microsoft.com 1; spf=pass smtp.mailfrom=stewe.org; dmarc=pass action=none header.from=stewe.org; dkim=pass header.d=stewe.org; arc=none
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=steweorg.onmicrosoft.com; s=selector2-steweorg-onmicrosoft-com; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=LvXnYm5fpigBD9gE4gdp1DT8u8vGH51YbQthP4WV0+M=; b=rum50qegRkwb1Yx3xAabs9DdU8y84mNfOEs5kg/TDB60tjodTO5vWAESmUH5e1NzF/WEFwjl8opWzljASJudfgggpyxScaki7h7FzotPRyjiilkW/sOb6CDdP255HRKIWoxeNEyHKuLCfdhGchsd5eiiVwiKbyT8KzgK2UleZ7g=
Received: from DM6PR17MB3036.namprd17.prod.outlook.com (2603:10b6:5:12e::14) by DM6PR17MB3020.namprd17.prod.outlook.com (2603:10b6:5:131::29) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.3541.21; Thu, 5 Nov 2020 18:49:59 +0000
Received: from DM6PR17MB3036.namprd17.prod.outlook.com ([fe80::c47c:6a25:195c:3a31]) by DM6PR17MB3036.namprd17.prod.outlook.com ([fe80::c47c:6a25:195c:3a31%6]) with mapi id 15.20.3499.032; Thu, 5 Nov 2020 18:49:59 +0000
From: Stephan Wenger <stewe@stewe.org>
To: Martin Pettersson M <martin.m.pettersson=40ericsson.com@dmarc.ietf.org>, "shuaiizhao(Shuai Zhao)" <shuaiizhao@tencent.com>, "avt@ietf.org" <avt@ietf.org>
Thread-Topic: [AVTCORE] Comments on draft-ietf-avtcore-rtp-vvc-05.txt
Thread-Index: Adax+3XG1UYuenjlQSmSlfNmJ+iHbQBZfGAA
Date: Thu, 05 Nov 2020 18:49:58 +0000
Message-ID: <3584D9AA-D447-4F5C-9302-AD07629B838D@stewe.org>
References: <HE1PR0702MB36425058B8AEE97940A0C736CA110@HE1PR0702MB3642.eurprd07.prod.outlook.com>
In-Reply-To: <HE1PR0702MB36425058B8AEE97940A0C736CA110@HE1PR0702MB3642.eurprd07.prod.outlook.com>
Accept-Language: en-US
Content-Language: en-US
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
user-agent: Microsoft-MacOutlook/16.42.20101102
authentication-results: dmarc.ietf.org; dkim=none (message not signed) header.d=none;dmarc.ietf.org; dmarc=none action=none header.from=stewe.org;
x-originating-ip: [67.161.32.220]
x-ms-publictraffictype: Email
x-ms-office365-filtering-correlation-id: 8f78c57f-6ff0-40fe-4f1a-08d881bb989c
x-ms-traffictypediagnostic: DM6PR17MB3020:
x-microsoft-antispam-prvs: <DM6PR17MB3020993F8ABFC43C85B34B9AAEEE0@DM6PR17MB3020.namprd17.prod.outlook.com>
x-ms-oob-tlc-oobclassifiers: OLM:10000;
x-ms-exchange-senderadcheck: 1
x-microsoft-antispam: BCL:0;
x-microsoft-antispam-message-info: DM1j7OlGe5i/D7PQYH0hMSb6ycVwBKwSZZycbDrG8U5JZ1RtSoU+VAO6w+1vBrktj78+Xvna2TKK5kzE/64TAal0pFF9Urz5qh3sqEm1FrITCZFYDquIFTpaeRQtBnPjcIOcV4iJ9FSH0hxEyzzZFFFa5HI+tqusVA8YsKiZDWJpd/qMtU0VElN0Y0D+STz+xmv0lMfswziDB5bp0tvuedstdypWRZQvhtg4Py3XKLnhqsWmi+bpFir/bsCRoS/hcpLcdhiELBUjtPvsvQbHEhIfaqfu8bSH1EZ2Lz53alHtTMF5uuyW/+dSN0TGLk6H3DkTJa4wbOdukfXn7+sxBSgm7xKf2igkVW2zkfmej1Tnlqvnwlvv7Kn8ZlJ+cR2RpiUUjfOCSVRKtcVl4jjSqg==
x-forefront-antispam-report: CIP:255.255.255.255; CTRY:; LANG:en; SCL:1; SRV:; IPV:NLI; SFV:NSPM; H:DM6PR17MB3036.namprd17.prod.outlook.com; PTR:; CAT:NONE; SFS:(376002)(136003)(39830400003)(396003)(346002)(366004)(9326002)(6506007)(33656002)(5660300002)(966005)(8936002)(86362001)(53546011)(91956017)(71200400001)(76116006)(6486002)(64756008)(66556008)(66446008)(36756003)(66946007)(478600001)(2906002)(66476007)(186003)(66574015)(83380400001)(6512007)(110136005)(26005)(21615005)(2616005)(8676002)(166002)(316002); DIR:OUT; SFP:1102;
x-ms-exchange-antispam-messagedata: o6OvvzcUkq6rsrOETht/TCJ0YVeNvf2Mlp0Ac6naqUlgeP355JWCoSu7zPMLftSVwYrQTP6KRB851tcRj6OFb7KhhwmbIqZag5/vYNK8RFJjOFnqO7W0d8hZGYFl2zAXhsdcTqEY7Eo5eUNAlVw64f9Vu2fn7TnGh5jAgRTlbdnKPM4i1GSA7tCS2be9t25jrZR0hqdnCLrHGxYpHjEa/qTX+aZ4t9lMRB1XqxhdQHdcPLRkX3jMDfD0qAftCvCBKZg4HmbVQgqd6ByETLRwKjIIK+beU+d9xynMbqmO/Ob0EjHOppGBDZ9aYsQy5SDduNSuQ0+yt/lEAjlgAKjg9qVfQtFsoQOFSJrPYQj/q4fe+IiBRO9Uh+UnEmlC95j/igs3EWlNwm06kQbyBfxHA61nfDl39hriMWTTNPj9sVxoEPCpMBXBHC2WJcrhjIEq5SQJtIo3SCiTFTIIxi9aYhqTGhoUztY5H4XowJATf68aYeFI2kWiWf+08UIkHKxA6EcCsuNJM6chxDHTknPVftj8jZfb0XqzmIi3EDhgvzqmmutlG2hyZCVYOHUGWmWIj2zIgBuwy2qQQZeJdGSWaX8DZ8OwXdd/0ZpJBbZUc3u3Y+AT6GTibWOYNYqrykXB3tDHn7zuf5ZtbrG9IJBe9w==
x-ms-exchange-transport-forked: True
Content-Type: multipart/alternative; boundary="_000_3584D9AAD4474F5C9302AD07629B838Dsteweorg_"
MIME-Version: 1.0
X-OriginatorOrg: stewe.org
X-MS-Exchange-CrossTenant-AuthAs: Internal
X-MS-Exchange-CrossTenant-AuthSource: DM6PR17MB3036.namprd17.prod.outlook.com
X-MS-Exchange-CrossTenant-Network-Message-Id: 8f78c57f-6ff0-40fe-4f1a-08d881bb989c
X-MS-Exchange-CrossTenant-originalarrivaltime: 05 Nov 2020 18:49:59.0464 (UTC)
X-MS-Exchange-CrossTenant-fromentityheader: Hosted
X-MS-Exchange-CrossTenant-id: 865fc51c-5fae-4322-98ef-0121a85df0b6
X-MS-Exchange-CrossTenant-mailboxtype: HOSTED
X-MS-Exchange-CrossTenant-userprincipalname: 9sbRlVfAxr2C6icKsxIfixshIUs8iX0XOs/lxP+2/Lbw3Hwaq4QoCcSLzOYSSxRN
X-MS-Exchange-Transport-CrossTenantHeadersStamped: DM6PR17MB3020
Archived-At: <https://mailarchive.ietf.org/arch/msg/avt/HfS_e8WkvARQ0oRgUNt3f3xl4_k>
Subject: Re: [AVTCORE] Comments on draft-ietf-avtcore-rtp-vvc-05.txt
X-BeenThere: avt@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: Audio/Video Transport Core Maintenance <avt.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/avt>, <mailto:avt-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/avt/>
List-Post: <mailto:avt@ietf.org>
List-Help: <mailto:avt-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/avt>, <mailto:avt-request@ietf.org?subject=subscribe>
X-List-Received-Date: Thu, 05 Nov 2020 18:50:05 -0000

Hi Martin,

Thanks for those suggested changes, which I think would consistently implement the option to react to a “Full Intra Request” (FIR) with a gradual decoder refresh (GDR) series of pictures.

As to whether we should allow GDR as a reaction to a FIR: I’m a bit torn here.  Arguments can be made either way, please see below.  Would others in the WG please weigh in?

On one hand, the argument (somewhat rephrased here) that VVC’s GDR pictures are a fully specified replacement for traditional “all intra” pictures is a good one.  I concur that this is somewhat new in VVC, compared to older video coding standards.  Pretty much all of those could do some form of GDR (even good old H.261 and MPEG-2), but things were clumsy, results were not guaranteed, or one had to rely on SEI messages and similar exotics for implementation.  In the environments where FIR matters—video conferencing mostly—no one ever used GDR in any context except in those ca. 1990 H.261-based systems which didn’t implement full intra pictures at all, and relied on intra macroblock walk-around during the initial communication setup.

On the other hand, there’s a reason why FIR until now was consistently interpreted as a requirement of sending a single “all intra” picture (whatever that translates to in the various video coding standards and technologies).  That reason was related to the architecture of the MCUs that were around when RFC 5104 was written, back in the 2005-2008 timeframe.  What people requested then was that the internal architecture of an MCU should stay as independent of the codec in use as possible.  For FIR, that means that means: if an MCU sends out a FIR to a sending endpoint, it expects exactly one intra picture at the earliest opportunity that can be used to sync in added decoders of unknown state.  That logic would now have to change to receive either a single IDR picture or a series of pictures that make up a GDR.  A transcoding MCU would have to go further and include the decoding of those multiple pictures with all the tricky (though now fully specified!) stuff that goes on in VVC, before transcoding.

Stephan


From: avt <avt-bounces@ietf.org> on behalf of Martin Pettersson M <martin.m.pettersson=40ericsson.com@dmarc.ietf.org>
Date: Tuesday, November 3, 2020 at 08:38
To: "shuaiizhao(Shuai Zhao)" <shuaiizhao@tencent.com>, "avt@ietf.org" <avt@ietf.org>
Subject: [AVTCORE] Comments on draft-ietf-avtcore-rtp-vvc-05.txt

Hi,

Thanks for the good progress on the VVC RTP payload format. Below are some suggested modifications for your consideration:


  1.  In section 3.2, add “GDR                       Gradual Decoding Refresh”



  1.  In section 8.4, change “Upon reception of a FIR, a sender must send an IDR picture.” to “Upon reception of a FIR, a sender must send an IDR, a CRA or a GDR picture.”

Motivation:
One of the versatile features in VVC is its support for low-latency coding where the GDR picture is a key component to achieve low latency. Compared to AVC and HEVC where GDR is signaled in an SEI message with optional support by the decoder, the GDR picture in VVC is a normative part of the specification and the decoder must be able to tune in at a GDR picture. Therefore it makes sense to allow a sender to respond with a GDR picture upon receiving a FIR. Note also that a gradual decoding refresh point is mentioned as a possible Decoder Refresh Point in response to the FIR command in https://tools.ietf.org/html/rfc5104.

Sending a CRA picture as a response to FIR would be fine as well in my opinion. I don’t see the reason to exclude that.



  1.  In section 9.1, change “The I bit MUST be 1 when the NAL unit type is 7-9 (inclusive), otherwise it MUST be 0.” to “The I bit MUST be 1 when the NAL unit type is 7-10 (inclusive), otherwise it MUST be 0.”



In section 9.2, change “The I bit MUST be 1 when the NAL unit type is 7-9 (inclusive), otherwise it MUST be 0.” to “The I bit MUST be 1 when the NAL unit type is 7-10 (inclusive), otherwise it MUST be 0.”

Motivation:
NAL unit type 10 is GDR_NUT.

In https://tools.ietf.org/id/draft-ietf-avtext-framemarking-09.html the I bit is specified as:
I: Independent Frame (1 bit) - MUST be 1 for frames that can be decoded independent of temporally prior frames, e.g. intra-frame, VPX keyframe, H.264 IDR [RFC6184], H.265 IDR/CRA/BLA/RAP [RFC7798]; otherwise MUST be 0.

The GDR picture is typically not fully refreshed in one frame, but it does not need prior temporal pictures to start the decoding process, i.e. a bitstream that starts with a GDR picture in VVC is a valid bitstream.

Best regards,
Martin Pettersson


From: avt <avt-bounces@ietf.org> On Behalf Of shuaiizhao(Shuai Zhao)
Sent: den 2 november 2020 23:11
To: avt@ietf.org
Subject: [AVTCORE] FW: I-D Action: draft-ietf-avtcore-rtp-vvc-05.txt(Internet mail)

In this revision, Yago’s proposal for SDP parameters were implemented in section 7.2.1.

Editor’s notes were added for things we will provide clearfication in next revision.  So do review and critisize lightly. ☺

Best
SZ

From: avt <avt-bounces@ietf.org<mailto:avt-bounces@ietf.org>> on behalf of "internet-drafts@ietf.org<mailto:internet-drafts@ietf.org>" <internet-drafts@ietf.org<mailto:internet-drafts@ietf.org>>
Reply-To: "avt@ietf.org<mailto:avt@ietf.org>" <avt@ietf.org<mailto:avt@ietf.org>>
Date: Monday, November 2, 2020 at 14:07
To: "i-d-announce@ietf.org<mailto:i-d-announce@ietf.org>" <i-d-announce@ietf.org<mailto:i-d-announce@ietf.org>>
Cc: "avt@ietf.org<mailto:avt@ietf.org>" <avt@ietf.org<mailto:avt@ietf.org>>
Subject: [AVTCORE] I-D Action: draft-ietf-avtcore-rtp-vvc-05.txt(Internet mail)


A New Internet-Draft is available from the on-line Internet-Drafts directories.
This draft is a work item of the Audio/Video Transport Core Maintenance WG of the IETF.

        Title           : RTP Payload Format for Versatile Video Coding (VVC)
        Authors         : Shuai Zhao
                          Stephan Wenger
                          Yago Sanchez
                          Ye-Kui Wang
                Filename        : draft-ietf-avtcore-rtp-vvc-05.txt
                Pages           : 61
                Date            : 2020-11-02

Abstract:
   This memo describes an RTP payload format for the video coding
   standard ITU-T Recommendation H.266 and ISO/IEC International
   Standard 23090-3, both also known as Versatile Video Coding (VVC) and
   developed by the Joint Video Experts Team (JVET).  The RTP payload
   format allows for packetization of one or more Network Abstraction
   Layer (NAL) units in each RTP packet payload as well as fragmentation
   of a NAL unit into multiple RTP packets.  The payload format has wide
   applicability in videoconferencing, Internet video streaming, and
   high-bitrate entertainment-quality video, among other applications.


The IETF datatracker status page for this draft is:
https://datatracker.ietf.org/doc/draft-ietf-avtcore-rtp-vvc/

There are also htmlized versions available at:
https://tools.ietf.org/html/draft-ietf-avtcore-rtp-vvc-05
https://datatracker.ietf.org/doc/html/draft-ietf-avtcore-rtp-vvc-05

A diff from the previous version is available at:
https://www.ietf.org/rfcdiff?url2=draft-ietf-avtcore-rtp-vvc-05


Please note that it may take a couple of minutes from the time of submission
until the htmlized version and diff are available at tools.ietf.org.

Internet-Drafts are also available by anonymous FTP at:
ftp://ftp.ietf.org/internet-drafts/


_______________________________________________
Audio/Video Transport Core Maintenance
avt@ietf.org<mailto:avt@ietf.org>
https://www.ietf.org/mailman/listinfo/avt