Re: [AVTCORE] Magnus Westerlund's Discuss on draft-ietf-payload-rtp-ttml-03: (with DISCUSS and COMMENT)

James Sandford <james.sandford@bbc.co.uk> Wed, 16 October 2019 08:52 UTC

Return-Path: <james.sandford@bbc.co.uk>
X-Original-To: avt@ietfa.amsl.com
Delivered-To: avt@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 955F712004F; Wed, 16 Oct 2019 01:52:54 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -4.201
X-Spam-Level:
X-Spam-Status: No, score=-4.201 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, RCVD_IN_DNSWL_MED=-2.3, SPF_PASS=-0.001] autolearn=ham autolearn_force=no
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id gVhKk-a8gHeQ; Wed, 16 Oct 2019 01:52:48 -0700 (PDT)
Received: from mailout1.cwwtf.bbc.co.uk (mailout1.cwwtf.bbc.co.uk [132.185.160.180]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 6FB49120882; Wed, 16 Oct 2019 01:52:45 -0700 (PDT)
Received: from BGB01XI1006.national.core.bbc.co.uk ([10.184.50.56]) by mailout1.cwwtf.bbc.co.uk (8.15.2/8.15.2) with ESMTP id x9G8qQnP001724; Wed, 16 Oct 2019 09:52:26 +0100 (BST)
Received: from BGB01XI1004.national.core.bbc.co.uk (10.184.50.54) by BGB01XI1006.national.core.bbc.co.uk (10.184.50.56) with Microsoft SMTP Server (TLS) id 14.3.408.0; Wed, 16 Oct 2019 09:52:25 +0100
Received: from BGB01XUD1001.national.core.bbc.co.uk ([10.184.52.80]) by BGB01XI1004.national.core.bbc.co.uk ([10.184.50.54]) with mapi id 14.03.0408.000; Wed, 16 Oct 2019 09:52:25 +0100
From: James Sandford <james.sandford@bbc.co.uk>
To: Magnus Westerlund <magnus.westerlund@ericsson.com>, The IESG <iesg@ietf.org>
CC: "draft-ietf-payload-rtp-ttml@ietf.org" <draft-ietf-payload-rtp-ttml@ietf.org>, Roni Even <roni.even@huawei.com>, "avtcore-chairs@ietf.org" <avtcore-chairs@ietf.org>, "avt@ietf.org" <avt@ietf.org>
Thread-Topic: Magnus Westerlund's Discuss on draft-ietf-payload-rtp-ttml-03: (with DISCUSS and COMMENT)
Thread-Index: AQHVg2afvBBZi0nYEE6tAWxVrjQIBqdc9f6U
Date: Wed, 16 Oct 2019 08:52:24 +0000
Message-ID: <734752AF0E88364D983373FE5CEFED5770DE3D18@bgb01xud1001>
References: <157115048176.18158.4077040057321391690.idtracker@ietfa.amsl.com>
In-Reply-To: <157115048176.18158.4077040057321391690.idtracker@ietfa.amsl.com>
Accept-Language: en-GB, en-US
Content-Language: en-GB
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
x-originating-ip: [132.185.132.13]
x-exclaimer-md-config: c91d45b2-6e10-4209-9543-d9970fac71b7
X-TM-AS-Product-Ver: SMEX-12.5.0.1300-8.2.1013-24052.007
X-TM-AS-Result: No-15.029300-8.000000-10
X-TMASE-MatchedRID: /77LoUQXvQ8DJrf2+hNOhY+YSzwl92XTgb9qWjtzZTfkMnUVL5d0E51K lx+BMy/CNrm8RN3LwcKrBusPs2ySug/hBq4zXMEir3X9gdfSLWUfXzVgO0hVqiJ8zskw0dbrMiV 1iNEHsLnSJzmc3rxUZVXB+cbUcBe3Ua2wwar6R/nXRKunW8upJyFq4bKNOR/1q8z7POX8FJMC8r Q71d/Yfu+wSELyt95al2S2b6+Hj3gFv3lB8CU6yFVN8laWo90MfrTt+hmA5bKYfLu5qIysvmpHK tkQBynKDfLFCF+SnFmdwgj4663pKshmZqvNSIL5y7TSWcbz49Yax3hQAS8V1VXzges/EJ9sUi92 jluIqXjenQuJPjV51NOnHyaPQakp7Hz728uiamAQ9/tMNQ4aiqI0K26z6c862By6TaLG/FYYdUd bjZ0BXHa9KDzVotlqqhVinVPTWKYKICokG5MKNZMSBMTQNiSAeJ1OirYwzAPaFXyMyYYsNFxFYZ qEedYUxCgk8Xh/vW65D9cIqkcbwaIzmWemVB4ssyw+ZJnFumQZKp0SZ4P+dXHV7c2GNf0CdD4iB X0ZrOi1l782J6QRwZUQp3D2MOCnxc4xaTZpubSV9hHQSQP5WhlgDfyCPcHE0JvbqYYPaMtEfxdU +sPp0kpjaufAdrq2fyYDewMOrQDkwjHXXC/4I5BlLa6MK1y4
X-TM-AS-User-Approved-Sender: Yes
X-TM-AS-User-Blocked-Sender: No
X-TMASE-Result: 10--15.029300-8.000000
X-TMASE-Version: SMEX-12.5.0.1300-8.2.1013-24052.007
Content-Type: text/plain; charset="us-ascii"
Content-Transfer-Encoding: quoted-printable
MIME-Version: 1.0
Archived-At: <https://mailarchive.ietf.org/arch/msg/avt/pEzo8mr5pxpDbzBnw-dfp048zGU>
Subject: Re: [AVTCORE] Magnus Westerlund's Discuss on draft-ietf-payload-rtp-ttml-03: (with DISCUSS and COMMENT)
X-BeenThere: avt@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: Audio/Video Transport Core Maintenance <avt.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/avt>, <mailto:avt-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/avt/>
List-Post: <mailto:avt@ietf.org>
List-Help: <mailto:avt-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/avt>, <mailto:avt-request@ietf.org?subject=subscribe>
X-List-Received-Date: Wed, 16 Oct 2019 08:52:55 -0000

Hello,
Sorry for a slight non-response and I am conscious of the urgency of a reply. But I wanted to assure you that I am preparing a response but I feel some of the points are quite nuanced. I should have it to you later today but I may reply to Adam first as I feel some of his suggestions address your concerns. 

Regards,
James



----------------------------------------------------------------------
DISCUSS:
----------------------------------------------------------------------

James and WG,

I do have a couple of issues I want to have your feedback on if they should be
corrected or not before proceeding to publication. Note they are for discussion
and in cases where things have been discussed and there is consensus please
reference that so that I can take that into consideration when we resolve these.

1. Section 4.1:
        Timestamp:
        The RTP Timestamp encodes the time of the text in the packet.

        As timed text is a media that has duration, from a start time to an end
        time, and the RTP timestmap is a single time tick in the chose clock
        resolution the above text is not clear. I would think the start time of
        the document would be the most useful to include?

        I think the text in 4.2.1.2 combined with the above attempts to imply
        that the RTP timestamp will be the 0 reference for the time-expression?

        I think this needs a bit more clarification. Not having detailed
        studied TTML2/1 I might be missing important details. But some more
        information how the document timebase:media time line connects to the
        RTP timestamp appears necessary.

2. A Discuss Discuss: As Timed Text is directly associated with one or more
video and audio streams and requires synchronization with these other media
streams to function correct. This leads to two questions.

        First of all is application/ttml+xml actually the right top-level media
        type? If using SDP that forces one unless one have BUNDLE to use a
        different RTP session. Many media types having this type of properties
        of being associated with some other media types have registered media
        types in all relevant top-level media types.

        Secondly, this payload format may need some references to mechanisms in
        RTP and signalling that has the purpose of associating media streams? I
        also assume that we have the interesting cases with localization that
        different languages have different time lines for the text and how long
        it shows as there are different tranditions in different countries and
        languages for how one makes subtitles.

        This may also point to the need for discussing the pick one out of n
        mechanism that a manifest may need.

3. Section 7.1:

        It may be appropriate to use the same Synchronization
   Source and Clock Rate as the related media.

        Using the same SSRC as another media stream in the same RTP Session is
        no-no. If you meant to use multiple RTP sessions and associate them
        using the same SSRC in diffiernt, yes it works but is not recommended.
        This points to the need for a clearer discussion of how to achieve
        linkage and the reasons for why same RTP timestamp may be useful or not.

4. Fragmentation:
        I think the fragmentation of an TTML document across multiple RTP
        payloads are a bit insufficiently described. I have the impression that
        it is hard to do something more clever than to fill each RTP payload to
        MTU limtiation, and send them out insequence. However, I think a firm
        requirement to apply RTP sequence number for a single document in
        consecutive numbers. Also the re-assebly process appear to have to
        parts for detecting what belongs together, same timestamp and last
        packet of document should have marker bit set. As a receiver can loose
        the last packet in the previous document, still know that it has
        received everything for the following document. However, if the losses
        are multiple, inspection of the re-assemblied document will be
        necessary to determine if the correct beginning is present. I have the
        impression that a proper section discussing these matter of
        fragmentation and re-assembly are necessary for good interoperability
        and function.


----------------------------------------------------------------------
COMMENT:
----------------------------------------------------------------------

A. Section 6.
        To my understanding the TTML document is basically not possible to
        encode better. A poor generator can create unnecessary verbose XML
        which could be shorter, but there are no possibility here to trade-off
        media quality for lower bit-rate. I think that should be made more
        explicit in Section 6.

B. Section 7.
        Wouldn't using 90kHz be the better default? 1kHz is the minimal from
        RTCP report that will work decently. However, if the timed text is
        primarily going to be synchronized with video 90k do ensure that
        (sub-)frame precise timing is possible to express. I don't see any need
        raster line specific for time text so the SMPTE 27 MHz clock is not
        needed. And using non default for subtitling radio etc appears fine.

C. Repair operations and relation to documents. Based on basic properties of
TTML documents, I do think the repair operations should be highly targeting
single documents as there is likely seconds between documents, while the
fragments of a document will be sent in a rather short interval. That
recommendation would be good to include.