Re: [rtcweb] WebRTC and Real-time Translation

Bernard Aboba <bernard.aboba@gmail.com> Thu, 27 September 2018 04:58 UTC

Return-Path: <bernard.aboba@gmail.com>
X-Original-To: rtcweb@ietfa.amsl.com
Delivered-To: rtcweb@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 1392B130DF0 for <rtcweb@ietfa.amsl.com>; Wed, 26 Sep 2018 21:58:24 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.998
X-Spam-Level:
X-Spam-Status: No, score=-1.998 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, FREEMAIL_FROM=0.001, HTML_MESSAGE=0.001, RCVD_IN_DNSWL_NONE=-0.0001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id YCnWkzNk2RTV for <rtcweb@ietfa.amsl.com>; Wed, 26 Sep 2018 21:58:21 -0700 (PDT)
Received: from mail-vs1-xe31.google.com (mail-vs1-xe31.google.com [IPv6:2607:f8b0:4864:20::e31]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 4941A130DEE for <rtcweb@ietf.org>; Wed, 26 Sep 2018 21:58:21 -0700 (PDT)
Received: by mail-vs1-xe31.google.com with SMTP id y11-v6so759232vso.5 for <rtcweb@ietf.org>; Wed, 26 Sep 2018 21:58:21 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=dKKC7A7TCt7q29RycOs4e0YDfQW455WCcFtQV1LOZBc=; b=V/urth0Q+WGx2oC6SQzIdnwA/1Te063zbfH5hfBLuzuzWoFDvlIHVDXR0eAxlBnpoP 1fBO9DlR15ORhcZ3+okN1gsPYlJDQuO3XvuDcoxfsGoIg/xVGqiV7HKW6e4tzt7igIeC 3FxlbytydJBPfeBju6GPc0yKEQkV/dt6/cJUeBaWvifDbI6DLiD0q/z2P+FV7ULh80eq qBg9wNhFHg652WWOZ4dl0Pfl8nfXsj9KJyiT8aZluzJ17XkB38MNwU5acq2zXGCOZuUV OSpJOxsfR0+9XyYe5v2Vm+vW1Ellr9vQTP1nJjcLa9hSmJdSCgFYLmAmAQ0X/xpspPTT xEmA==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=dKKC7A7TCt7q29RycOs4e0YDfQW455WCcFtQV1LOZBc=; b=ICwNnMsg6gY/J8jpRnBVDr4HSzwNrnEfAqtuwi3ZxiuooUyHOVKdYnWIERDIZ+Vxz8 OL2PL+O/CNUz+y4ZONVIj2DkveLH7ykHz2mCO3NrpOw7JyCwNLh0cMCxoMMqurDmmhDI k2tUybkIMNolFySFCVEKIZoKSqq37BFnTrMYFb+HO0wVPmxgYD3PtQSE0DfQzmHXImJB /bICC4SuVuHZ/Ss7VKf0jWY2PdwDE1gsZSbmBwld7lCv5U/1xpLDYFFYM2EdQZqnIYzE fwEr5ZX/fvElAXD0GMEqlaKWamS5YtpXHwynrRjtflyMfuK00cSn39yqxELehSLlUoEG 9WEw==
X-Gm-Message-State: ABuFfoh5DErdjHPTjVOjxzUv+byiSGKKgVE76uO3LiQrcWOAijFs2W3w KpTymWbmnClUtugybJgDBQETdIEY6CfTmw7IMRE=
X-Google-Smtp-Source: ACcGV63m7IEcaXxvTRjZu71jOrdr4EVm2G2Jv8ljXD3Gb9K1XprDZcHf7Cn/OYDQBF+ro6T6NQMSFzFFVPN6Y0Uzd1w=
X-Received: by 2002:a67:d388:: with SMTP id b8-v6mr2879415vsj.144.1538024299883; Wed, 26 Sep 2018 21:58:19 -0700 (PDT)
MIME-Version: 1.0
References: <CY4PR0101MB309521AF4EF436C0D1503741C5150@CY4PR0101MB3095.prod.exchangelabs.com>
In-Reply-To: <CY4PR0101MB309521AF4EF436C0D1503741C5150@CY4PR0101MB3095.prod.exchangelabs.com>
From: Bernard Aboba <bernard.aboba@gmail.com>
Date: Wed, 26 Sep 2018 21:58:08 -0700
Message-ID: <CAOW+2dvkgpWp6h+MY1YY4jDG3=KG-WPes-A1WXW6yuxRG6f9vg@mail.gmail.com>
To: adamsobieski@hotmail.com
Cc: RTCWeb IETF <rtcweb@ietf.org>
Content-Type: multipart/alternative; boundary="0000000000000cf2c10576d3301e"
Archived-At: <https://mailarchive.ietf.org/arch/msg/rtcweb/AahIzK8zb2UPf9bTAjQcKhogqKI>
Subject: Re: [rtcweb] WebRTC and Real-time Translation
X-BeenThere: rtcweb@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: Real-Time Communication in WEB-browsers working group list <rtcweb.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/rtcweb>, <mailto:rtcweb-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/rtcweb/>
List-Post: <mailto:rtcweb@ietf.org>
List-Help: <mailto:rtcweb-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/rtcweb>, <mailto:rtcweb-request@ietf.org?subject=subscribe>
X-List-Received-Date: Thu, 27 Sep 2018 04:58:24 -0000

One of the key questions for "Next Version Use Cases" is what
WebRTC-deficiencies are preventing these use cases from being
satisfactorily implemented today.

For example, speech transcription cloud services have been implemented over
Websockets, where a snippet of speech is uploaded, and a transcription is
provided in reply.  The latency is satisfactory for some uses cases.
Improvements can perhaps be made by sending an audio stream and receiving a
transcription via the data channel, but this is also within the
capabilities of the existing RTCWEB protocols and WebRTC-PC API.

What seems to differentiate *next version* scenarios are situations where
the processing is best done on the device, in order to lower latency or
enhance privacy.  On-device processing brings in discussion of
workers/worklets, access to raw audio/video, etc.  However, so far I'm not
aware of on-device implementations of transcription or translation.

On Wed, Sep 26, 2018 at 6:10 PM Adam Sobieski <adamsobieski@hotmail.com>
wrote:

> IETF RTCWEB Working Group,
>
>
>
> Greetings. I opened an issue on *WebRTC and Real-time Translation* at the
> GitHub repository for WebRTC version next use cases (
> https://github.com/w3c/webrtc-nv-use-cases/issues/2).
>
>
> Introduction
>
> Real-time translation is both an interesting and important use case for a
> next version of WebRTC.
> Speech Recognition, Translation and Speech Synthesis
>
> Approaches to real-time speech-to-speech machine translation include those
> which interconnect speech recognition, translation and speech synthesis
> components and services. In that regard, we can consider client-side,
> on-prem, server-side, third-party and cloud-based components and services.
> In that regard, we can also consider both free and priced components and
> services.
>
> We can envision *post-text* speech technology and machine translation
> components and services. Speech recognition need not output to text; we can
> consider speech-to-SSML. Machine translation need not input from nor output
> to text; we can consider SSML-to-SSML machine translation. Components and
> services may provide various options with respect to their input and output
> data formats.
> Connecting Components and Services by Constructing Graphs
>
> We can consider APIs which facilitate the construction of graphs which
> represent the flow of data between components and services. As these graphs
> are constructed, users could be apprised of relevant notifications,
> requests for permissions and options for payments. As these constructed
> graphs are activated, a number of protocols could be utilized to
> interconnect the components and services which, together, provide users
> with real-time translation.
> Hyperlinks
>
> WebRTC Translator Demo <https://www.youtube.com/watch?v=Tv8ilBOKS2o>
> Real Time Translation in WebRTC
> <https://www.youtube.com/watch?v=EPBWR_GNY9U>
>
>
>
>
>
> Best regards,
>
> Adam Sobieski
>
> http://www.phoster.com/contents/
>
>
> _______________________________________________
> rtcweb mailing list
> rtcweb@ietf.org
> https://www.ietf.org/mailman/listinfo/rtcweb
>