Re: [vmeet] Remote Attendance: Simultaneous Translation Software

John C Klensin <john-ietf@jck.com> Sat, 09 April 2016 20:18 UTC

Date: Sat, 09 Apr 2016 16:18:31 -0400
From: John C Klensin <john-ietf@jck.com>
To: "Alvaro Retana (aretana)" <aretana@cisco.com>, Dhruv Dhody <dhruv.ietf@gmail.com>, Carlos Marcelo Martinez Cagnazzo <carlosm3011@gmail.com>
Archived-At: <http://mailarchive.ietf.org/arch/msg/vmeet/X9jXfPGcY0gLBWykzGZSJBD88bE>
Cc: mentoring-coordinators@ietf.org, vmeet@ietf.org


--On Saturday, April 09, 2016 14:03 +0000 "Alvaro Retana
(aretana)" <aretana@cisco.com> wrote:

> If it is regarding interpretation, I think the first step
> would be - good speech to text (english), with a possible
> input to etherpad like tool, that could be corrected by minute
> taker.
> 
> I really like Dhruv's suggestion.  The result would help even
> the native English speakers and wouldn't be limited to remote
> participants.

Folks, fwiw, I've run IETF sessions through good-quality speech
to text programs.  The results have not been very helpful even
when the systems have been trained with dictionaries built by
scanning most of the RFCs I've written and a selection of
others.  When run with generic voices (rather than trained to
my peculiar voice and speech patterns), the systems basically go
nuts and become useless when confronted with heavily-accented
English or "unusual" sentence rhythms.  Those unsatisfactory
results aren't real-time either -- the systems are reasonable
for recording speech and delivering the text later, with
considerable and cumulative lag.

One could improve on that with more powerful software,
dictionaries, and computers, but, for example, I don't think
IASA or AMS have a supply of free supercomputer cycles.  The
ICANN transcription efforts, at least when I was last paying a
lot of attention, were different because they involved highly
skilled human transcribers and, at least as important, the
fraction of the ICANN discussions that involve highly technical
material tends to be quite small.

I'd be happy to have written transcripts too and I'm a native
English speaker.  However, please, let's try to keep this real
and, if we have expensive ideas, to be sure we understand where
the resources might come from and what would have to be pushed
to lower priority.

    john