[MMUSIC] Open source SIP/WebRTC real-time text integration

Lorenzo Miniero <lorenzo@meetecho.com> Wed, 18 December 2019 11:05 UTC

Date: Wed, 18 Dec 2019 12:04:51 +0100
From: Lorenzo Miniero <lorenzo@meetecho.com>
To: mmusic@ietf.org
Message-ID: <20191218120451.7dd7093b@lminiero>
Organization: Meetecho
Archived-At: <https://mailarchive.ietf.org/arch/msg/mmusic/IgYUYcAZKeNLoKIJR0lU9k8J2RA>

Hi all,

As I mentioned in another post, I've recently started working with
real-time text (RTT): mostly for fun (I've always been intrigued by the
protocol), but also because I wanted to get it working with WebRTC,
especially in light of its importance for the upcoming NG emergency
services. Considering Christer's recent efforts on standardizing how to
use RTT on WebRTC, I thought I'd share this here, so that it can
hopefully provide a testing framework: it is not complete, and doesn't
adhere 100% to the specification yet (more on that in a minute), but I
think it might be helpful as a running code reference anyway.

Specifically, I've integrated support for real-time text in our open
source Janus WebRTC server. Janus is a modular component, and one of
the plugins implements a SIP gateway: this is where I implemented
support for T.140 and RED. The way it works is "straightforward":

	1. Any time we get an incoming call on the SIP side with an
	m=text line, we translate that to an m=application line, so
	that we can negotiate data channels on the WebRTC side.

	2. Conversely, if the WebRTC peer negotiates an
	m=application line, we turn that into an m=text line to
	negotiate real-time text on the SIP side.

	3. Then, if we receive T.140 packets from SIP, we relay them
	via data channels using binary data (we use a Uint8Array in
	the web application); if we receive RED packets, we parse them
	in order to get the T.140 blocks we need, and relay those via
	data channels (since the draft explains that redundancy is not
	needed there, thanks to SCTP).

	4. When we get T.140 blocks via data channels (which again the
	web application sends as a Uint8Array), we either send them as
	they are in RTP packets (if RED was not negotiated), or we
	create a RED RTP packet with redundant info on previously sent
	packets (if RED was negotiated instead).
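
To illustrate steps 3 and 4, here is a minimal Python sketch of how
T.140 blocks can be packed into, and extracted from, a RED payload
using the RFC 2198 header layout. It is not the Janus code (which is
written in C): the function names, the payload type 98 and the
timestamp offsets are assumptions made up for the example, and only the
RTP payload is handled, not the RTP header.

```python
from typing import List, Tuple

# Hypothetical dynamic payload type for t140; the real value comes from the SDP.
T140_PT = 98


def build_red_payload(pt: int, redundant: List[Tuple[int, bytes]],
                      primary: bytes) -> bytes:
    """Pack redundant blocks (oldest first) and the primary block.

    Each redundant entry is (timestamp_offset, block). Per RFC 2198 its
    4-byte header is: F=1 (1 bit), block PT (7), offset (14), length (10).
    The primary block gets a 1-byte header: F=0 (1 bit), block PT (7).
    """
    headers = bytearray()
    body = bytearray()
    for ts_offset, block in redundant:
        if not 0 <= ts_offset < (1 << 14):
            raise ValueError("timestamp offset does not fit in 14 bits")
        if len(block) >= (1 << 10):
            raise ValueError("block length does not fit in 10 bits")
        word = (1 << 31) | ((pt & 0x7F) << 24) | (ts_offset << 10) | len(block)
        headers += word.to_bytes(4, "big")
        body += block
    headers.append(pt & 0x7F)  # final header, F bit cleared
    body += primary
    return bytes(headers + body)


def parse_red_payload(payload: bytes):
    """Return ([(pt, ts_offset, block), ...], (primary_pt, primary_block))."""
    pos = 0
    entries = []
    while True:
        if pos >= len(payload):
            raise ValueError("truncated RED headers")
        if payload[pos] & 0x80 == 0:
            # F=0: this is the 1-byte header of the primary block.
            primary_pt = payload[pos] & 0x7F
            pos += 1
            break
        if pos + 4 > len(payload):
            raise ValueError("truncated RED headers")
        word = int.from_bytes(payload[pos:pos + 4], "big")
        entries.append(((word >> 24) & 0x7F, (word >> 10) & 0x3FFF, word & 0x3FF))
        pos += 4
    redundant = []
    for block_pt, ts_offset, length in entries:
        if pos + length > len(payload):
            raise ValueError("truncated RED block")
        redundant.append((block_pt, ts_offset, payload[pos:pos + length]))
        pos += length
    # Whatever is left after the redundant blocks is the primary block.
    return redundant, (primary_pt, payload[pos:])


# Example: "l" is the new text, "h" and "e" were sent 600 and 300 ticks ago.
packet = build_red_payload(T140_PT, [(600, b"h"), (300, b"e")], b"l")
redundant, primary = parse_red_payload(packet)
```

In a gateway like the one described above, the primary block is what
would normally be relayed over the data channel, while the redundant
blocks only matter when an earlier packet was lost.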

This is the effort in a nutshell, and from my simple tests it seems to
be working as expected so far. You can find a more comprehensive
description of the whole effort in this blog post:

https://www.meetecho.com/blog/realtime-text-sip-and-webrtc/

The Janus branch supporting real-time text is available as a pull
request here:

https://github.com/meetecho/janus-gateway/pull/1898

As explained in the blog post, the effort is not complete, and there
are some things that are either missing, or need to be improved, namely:

	1. We're not using the dcmap and dcsa attributes on the WebRTC
	side yet: the reason is that browsers don't support them at
	the moment, so adding them may result in the SDP being
	rejected and the session failing. This means that,
	currently, we use a default label for exchanging T.140 blocks
	with the server.

	2. While we support RED, we're not using the redundant info for
	packets we receive yet, and we're not properly handling packet
	loss or out-of-order packets yet either. This is something we
	plan to work on in the future, as I was more interested in
	getting the specification in general to work first.

	3. On the client side in WebRTC, we're not doing any buffering,
	meaning we send every character as soon as it's typed, which is
	clearly suboptimal. Again, something we plan to fix later on,
	either in the web application (buffer there), or on the server
	side (buffer incoming T.140 blocks there, before crafting RTP
	packets).
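
As an illustration of the buffering mentioned in point 3, here is a
minimal Python sketch of a buffer that groups typed characters into a
single T.140 block per interval. It is only a sketch under stated
assumptions, not what the web application or Janus does: the class
name, its methods, and the explicit now_ms argument are all
hypothetical, and the 300 ms default is taken from the buffering time
that RFC 4103 recommends.

```python
from typing import Optional


class T140Buffer:
    """Collects typed text and releases at most one block per interval."""

    def __init__(self, interval_ms: int = 300):
        self.interval_ms = interval_ms
        self._pending = bytearray()
        self._last_flush_ms: Optional[int] = None

    def push(self, text: str) -> None:
        # T.140 text is carried UTF-8 encoded.
        self._pending += text.encode("utf-8")

    def poll(self, now_ms: int) -> Optional[bytes]:
        """Return a block to send now, or None if nothing is due yet."""
        if not self._pending:
            return None
        if (self._last_flush_ms is not None
                and now_ms - self._last_flush_ms < self.interval_ms):
            return None
        block = bytes(self._pending)
        self._pending.clear()
        self._last_flush_ms = now_ms
        return block


# Example: three keystrokes result in two blocks instead of three.
buf = T140Buffer(interval_ms=300)
buf.push("h")
first = buf.poll(now_ms=0)       # sent immediately, nothing sent before
buf.push("e")
buf.push("l")
too_early = buf.poll(now_ms=100)  # still within the 300 ms window
second = buf.poll(now_ms=300)     # "e" and "l" leave together
```

The same logic could live either in the web application (polling from
a timer before writing to the data channel) or on the server side
(before crafting RTP packets), which are the two options listed above.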

I hope this will be considered useful, and I'm looking forward to
continuing to work on this as the specification moves forward. If you
have any questions or doubts, please don't hesitate to ask; besides,
I'll be in Vancouver for the next IETF, so in case you want to talk
about it in person there, see a demo, or run some interoperability
tests, I'll be glad to do that as well.

Thanks,
Lorenzo

-- 
I'm getting older but, unlike whisky, I'm not getting any better
https://twitter.com/elminiero
