Re: [Wish] WG Last Call for draft-ietf-wish-whip

Sergio Garcia Murillo <sergio.garcia.murillo@gmail.com> Tue, 28 February 2023 13:48 UTC

Return-Path: <sergio.garcia.murillo@gmail.com>
X-Original-To: wish@ietfa.amsl.com
Delivered-To: wish@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id C6786C14F5E0 for <wish@ietfa.amsl.com>; Tue, 28 Feb 2023 05:48:44 -0800 (PST)
X-Quarantine-ID: <U7dYZj7EXyUK>
X-Virus-Scanned: amavisd-new at amsl.com
X-Amavis-Alert: BANNED, message contains application/octet-stream,.dat,favicon.ico
X-Spam-Flag: NO
X-Spam-Score: -2.084
X-Spam-Level:
X-Spam-Status: No, score=-2.084 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, FREEMAIL_FROM=0.001, HTML_MESSAGE=0.001, RCVD_IN_DNSWL_NONE=-0.0001, RCVD_IN_ZEN_BLOCKED_OPENDNS=0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, T_REMOTE_IMAGE=0.01, URIBL_BLOCKED=0.001, URIBL_DBL_BLOCKED_OPENDNS=0.001, URIBL_ZEN_BLOCKED_OPENDNS=0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com
Received: from mail.ietf.org ([50.223.129.194]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id U7dYZj7EXyUK for <wish@ietfa.amsl.com>; Tue, 28 Feb 2023 05:48:40 -0800 (PST)
Received: from mail-ed1-x52c.google.com (mail-ed1-x52c.google.com [IPv6:2a00:1450:4864:20::52c]) (using TLSv1.3 with cipher TLS_AES_128_GCM_SHA256 (128/128 bits) key-exchange X25519 server-signature RSA-PSS (2048 bits) server-digest SHA256) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 5E52DC14CE25 for <wish@ietf.org>; Tue, 28 Feb 2023 05:48:40 -0800 (PST)
Received: by mail-ed1-x52c.google.com with SMTP id ee7so40312118edb.2 for <wish@ietf.org>; Tue, 28 Feb 2023 05:48:40 -0800 (PST)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20210112; h=cc:to:subject:message-id:date:from:in-reply-to:references :mime-version:from:to:cc:subject:date:message-id:reply-to; bh=LKjRbE3XKSOi0jlzrjEfN2D4uuemhIbbIu9qQJKAPiU=; b=lLG4pencI5cs9XIwGI02ipZWw+YmSPvrFfg2WtGA0wpNeluh1vgimi/+R9Y8qnH7rX IvccGMhhY0WEkSuTuZEHPXXU09J8pcMUEFuavB4BCTq3RuTLntyK3cpiFCc/EEYGpXnr nAAwqk4c9Usk6+u89sn9HjXSIg0i0615TVeIVloz/FWLlh+J+RBWEUk8wUcFFtT8soD1 gxkYpTiA1YTvhGUnGfVkm1EHzo1L3znM3s43FLRFhZsLKV0MaResLGibyJqiU+ja46HC iqY2Wnf9d0Omawf03U0XA57S4QgsTtusdQAz9/rVjiS/HuW9P3VkHrKoP9LRrsXF9QPK JsHQ==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=cc:to:subject:message-id:date:from:in-reply-to:references :mime-version:x-gm-message-state:from:to:cc:subject:date:message-id :reply-to; bh=LKjRbE3XKSOi0jlzrjEfN2D4uuemhIbbIu9qQJKAPiU=; b=tuUQhBww5354nugtQhoTSNjW3x4R7UZ0qV+aBCsFzwg+5zVaIRLMBRYClj1n/MEjwU +sDT68Cn69Xz+VO1buo0PhcgmBg4hcN5AtNxHyxZk6/MQ8Sac8k3UNXiBSAIbR/Y+WbZ cbZClwuyB6U15ahusS2CVE+F74yJSX8fF2D5bdTM7Yp3q9NXKqaNZaZtih9CW/0FyLA4 iMzYyrvGpYaFjuJoQmwyw/y/M09qUNC4fKp2eMAdaqAF3BnQBjDUgZcyfCoEKnW6vMa1 wuoZ0BoiRzfYH9tC6z6v3/es/mvXpOWVhQGbuY4GvqemeUisNUfRY6SpabFzBPzk9jl/ T7XQ==
X-Gm-Message-State: AO0yUKV6XWImuBIais25fqhib5QbsD8DCf5gbjaMSDI0fM+GBH+BkAgT ukjGeDAP7ZEUkq8rxNfRgIPut8DO1ABVN4LbMY8gt0W7aVFrgAs+
X-Google-Smtp-Source: AK7set9VeQlPNhK1gyu7ui0+ijUdpHY/GrmHOIX1DbnZFCpxKNsvfru+k4BOriZqtoRGJAU+kDypYjR+AUXoZMrN1eI=
X-Received: by 2002:a17:906:d9ca:b0:8af:b63:b4ba with SMTP id qk10-20020a170906d9ca00b008af0b63b4bamr1232344ejb.3.1677592118721; Tue, 28 Feb 2023 05:48:38 -0800 (PST)
MIME-Version: 1.0
References: <8D708B68-2947-40B8-A7C4-5B9697B1C58D@gmail.com>
In-Reply-To: <8D708B68-2947-40B8-A7C4-5B9697B1C58D@gmail.com>
From: Sergio Garcia Murillo <sergio.garcia.murillo@gmail.com>
Date: Tue, 28 Feb 2023 14:48:27 +0100
Message-ID: <CA+ag07afe3oYQW=UzCruQtqntNhLuQsAi9XmfUNEbRCWBNPEaw@mail.gmail.com>
To: Bernard Aboba <bernard.aboba@gmail.com>
Cc: wish@ietf.org
Content-Type: multipart/related; boundary="000000000000509a5c05f5c2da78"
Archived-At: <https://mailarchive.ietf.org/arch/msg/wish/VFEBxqsMTm4_6FzFMgbku7C5hLA>
Subject: Re: [Wish] WG Last Call for draft-ietf-wish-whip
X-BeenThere: wish@ietf.org
X-Mailman-Version: 2.1.39
Precedence: list
List-Id: WebRTC Ingest Signaling over HTTPS <wish.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/wish>, <mailto:wish-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/wish/>
List-Post: <mailto:wish@ietf.org>
List-Help: <mailto:wish-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/wish>, <mailto:wish-request@ietf.org?subject=subscribe>
X-List-Received-Date: Tue, 28 Feb 2023 13:48:44 -0000

Hi Bernard,

Thank you very much for your feedback, comments below inline.

On Fri, Feb 24, 2023 at 10:56 PM Bernard Aboba <bernard.aboba@gmail.com>
wrote:

> Here is my (belated) review.
>
> Section 1
>
> “ While WebRTC has been very successful in a wide range of scenarios,
>
>    its adoption in the broadcasting/streaming industry is lagging
>
>    behind.”
>
> [BA] I recently saw a survey indicating that WHIP is now the second most
> implemented ingestion protocol, second only to RTMP.  So while this
> sentence may have been correct at one time, it seems out of date now.  Can
> we delete this sentence?
>
> Also, overall Section 1 seems like it could be shortened considerably by
> highlighting the major points. Here is my suggestion:
>
>
I am ok with changing the introduction, will review your text and propose a
PR with your changes.


> Section 2
>
>
> I do not see a definition of “track”.  I think this is important to clarify.
>
>
 I already had a PR related to that adding a reference to RFC8830.

https://github.com/wish-wg/webrtc-http-ingest-protocol/pull/95

Do you think it would be enough?



> Section 4.2
>
> “  While this version of the specification only supports a single audio
>
>    and video track, in order to ensure forward compatibility, if the
>    number of audio and or video tracks or number streams is not
>    supported by the WHIP Endpoint, it MUST reject the HTTP POST request
>    with a "406 Not Acceptable" error response.”
>
>
> [BA] Support for stereo and surround-sound is becoming increasingly popular.  Can you clarify whether this is supported in this version of the specification?  I assume it can be (e.g. it is possible to have multiple channels per track) but the lack of a definition for “track” creates some uncertainty.
>
>
>
Yes, this is definitely possible, see me comment about using track as
defined in RFC8830. Given that stereo and surround sound are SDP
negotiated, do you think that we need to include any further clarification
apart of the track one?


> 4.6.  Simulcast and scalable video coding
>
>    Both Simulcast [RFC8853] and Scalable Video Coding (SVC), including
>    K-SVC (also known as "S modes", in which multiple encodings are sent
>    on the same SSRC), MAY be supported by both the Media Servers and
>    WHIP clients through negotiation in the SDP offer/answer.
>
> [BA] K-SVC and “S” modes are different.  K-SVC denotes the KEY and KEY_SHIFT modes, whereas “S” modes denote the encapsulation of multiple encodings within a single SSRC, as is supported in VP9 and AV1.   Diagrams of the various modes are included in the WebRTC-SVC specification: https://w3c.github.io/webrtc-svc/
>
>
> Also, SVC is *not* negotiated within Offer/Answer in WebRTC.  It is something that the encoder can just turn on.  At least for temporal modes (L1T2, L1T3) ingester support can often be taken for granted.  While the VP9 and AV1 specifications require a compliant decoder to be able to decode any mode than an encoder can encode, in practice there are VP9 and AV1 hardware decoders that cannot decode spatial scalability because they do not support spatial references (e.g. a P-frame at a higher resolution than the P or I frame that it references).
>
>
> This nasty little “wrinkle” required us to define the “spatialScalability” attribute in Media Capabilities, to allow applications to discover whether a (hardware) decoder supports spatial scalability or not:
>
> Media Capabilities <https://www.w3.org/TR/media-capabilities/#dom-videoconfiguration-spatialscalability>
> w3.org <https://www.w3.org/TR/media-capabilities/#dom-videoconfiguration-spatialscalability>
> [image: favicon.ico] <https://www.w3.org/TR/media-capabilities/#dom-videoconfiguration-spatialscalability> <https://www.w3.org/TR/media-capabilities/#dom-videoconfiguration-spatialscalability>
>
> Not sure if you want to get into this in the specification.  Issues are most likely to be encountered where hardware-based transcoders are used (but in that scenario, spatial scalability probably wouldn’t be relevant anyway),
>
>
>
 I agree with you, should we remove the reference to SVC completely and
only leave simulcast support in that section?

Best regards
Sergio