Lecture translator speech translation framework for simultaneous lecture translation

Markus Müller, Thai Son Nguyen, Jan Niehues, Eunah Cho, Bastian Krüger, Thanh Le Ha, Kevin Kilgour, Matthias Sperber, Mohammed Mediani, Sebastian Stüker, Alex Waibel

Research output: Contribution to conference, Paper, peer-review

17 Citations (Scopus)

Abstract

Foreign students at German universities often have difficulties following lectures, as they are often held in German. Since human interpreters are too expensive for universities, we are addressing this problem via speech translation technology deployed in KIT's lecture halls. Our simultaneous lecture translation system automatically translates lectures from German to English in real time. Other supported language directions are English to Spanish, English to French, English to German, and German to French. Automatic simultaneous translation is more than just the concatenation of automatic speech recognition and machine translation technology, as the input is an unsegmented, practically infinite stream of spontaneous speech. The lack of segmentation and the spontaneous nature of the speech make it especially difficult to recognize and translate with sufficient quality. In addition to quality, speed and latency are of the utmost importance if the system is to enable students to follow lectures. In this paper we present our system, which performs simultaneous speech translation of university lectures on a stream of audio in real time and with low latency. The system features several techniques beyond the basic speech translation task that make it fit for real-world use. Examples of these features are continuous-stream speech recognition without any prior segmentation of the input audio, punctuation prediction, and run-on decoding and run-on translation with continuously updating displays in order to keep the latency as low as possible.

Original language: English
Pages: 82-86
Number of pages: 5
Publication status: Published - 2016
Externally published: Yes
Event: 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2016 - San Diego, United States
Duration: Jun 12 2016 to Jun 17 2016

Conference

Conference: 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2016
Country/Territory: United States
City: San Diego
Period: 6/12/16 to 6/17/16

ASJC Scopus subject areas

  • Computer Science Applications
  • Artificial Intelligence
  • Language and Linguistics
  • Linguistics and Language
