The KIT English-French Translation systems for IWSLT 2011

Mohammed Mediani, Eunah Cho, Jan Niehues, Teresa Herrmann, Alex Waibel

Research output: Contribution to conferencePaperpeer-review

17 Citations (Scopus)

Abstract

This paper presents the KIT system participating in the English→French TALK Translation tasks in the framework of the IWSLT 2011 machine translation evaluation. Our system is a phrase-based translation system using POS-based reordering extended with many additional features. First of all, a special preprocessing is devoted to the Giga corpus in order to minimize the effect of the great amount of noise it contains. In addition, the system gives more importance to the in-domain data by adapting the translation and the language models as well as by using a word-cluster language model. Furthermore, the system is extended by a bilingual language model and a discriminative word lexicon. The automatic speech transcription input usually has no or wrong punctuation marks, therefore these marks were especially removed from the source training data for the SLT system training.

Original languageEnglish
Pages73-78
Number of pages6
Publication statusPublished - 2011
Externally publishedYes
Event8th International Workshop on Spoken Language Translation, IWSLT 2011 - San Francisco, United States
Duration: Dec 8 2011Dec 9 2011

Conference

Conference8th International Workshop on Spoken Language Translation, IWSLT 2011
Country/TerritoryUnited States
CitySan Francisco
Period12/8/1112/9/11

ASJC Scopus subject areas

  • Language and Linguistics
  • Human-Computer Interaction
  • Linguistics and Language

Fingerprint

Dive into the research topics of 'The KIT English-French Translation systems for IWSLT 2011'. Together they form a unique fingerprint.

Cite this