Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Matúš Žilinec

Lost in Interpreting: Speech Translation from Source or Interpreter?

Jun 17, 2021

Dominik Macháček, Matúš Žilinec, Ondřej Bojar

Figure 1 for Lost in Interpreting: Speech Translation from Source or Interpreter?

Figure 2 for Lost in Interpreting: Speech Translation from Source or Interpreter?

Figure 3 for Lost in Interpreting: Speech Translation from Source or Interpreter?

Figure 4 for Lost in Interpreting: Speech Translation from Source or Interpreter?

Abstract:Interpreters facilitate multi-lingual meetings but the affordable set of languages is often smaller than what is needed. Automatic simultaneous speech translation can extend the set of provided languages. We investigate if such an automatic system should rather follow the original speaker, or an interpreter to achieve better translation quality at the cost of increased delay. To answer the question, we release Europarl Simultaneous Interpreting Corpus (ESIC), 10 hours of recordings and transcripts of European Parliament speeches in English, with simultaneous interpreting into Czech and German. We evaluate quality and latency of speaker-based and interpreter-based spoken translation systems from English to Czech. We study the differences in implicit simplification and summarization of the human interpreter compared to a machine translation system trained to shorten the output to some extent. Finally, we perform human evaluation to measure information loss of each of these approaches.

* to be published at INTERSPEECH 2021

Via

Access Paper or Ask Questions

Backtranslation Feedback Improves User Confidence in MT, Not Quality

Apr 12, 2021

Vilém Zouhar, Michal Novák, Matúš Žilinec, Ondřej Bojar, Mateo Obregón, Robin L. Hill, Frédéric Blain, Marina Fomicheva, Lucia Specia, Lisa Yankovskaya

Figure 1 for Backtranslation Feedback Improves User Confidence in MT, Not Quality

Figure 2 for Backtranslation Feedback Improves User Confidence in MT, Not Quality

Figure 3 for Backtranslation Feedback Improves User Confidence in MT, Not Quality

Figure 4 for Backtranslation Feedback Improves User Confidence in MT, Not Quality

Abstract:Translating text into a language unknown to the text's author, dubbed outbound translation, is a modern need for which the user experience has significant room for improvement, beyond the basic machine translation facility. We demonstrate this by showing three ways in which user confidence in the outbound translation, as well as its overall final quality, can be affected: backward translation, quality estimation (with alignment) and source paraphrasing. In this paper, we describe an experiment on outbound translation from English to Czech and Estonian. We examine the effects of each proposed feedback module and further focus on how the quality of machine translation systems influence these findings and the user perception of success. We show that backward translation feedback has a mixed effect on the whole process: it increases user confidence in the produced translation, but not the objective quality.

* 9 pages (excluding references); to appear at NAACL-HWT 2021

Via

Access Paper or Ask Questions

ELITR Non-Native Speech Translation at IWSLT 2020

Jun 05, 2020

Dominik Macháček, Jonáš Kratochvíl, Sangeet Sagar, Matúš Žilinec, Ondřej Bojar, Thai-Son Nguyen, Felix Schneider, Philip Williams, Yuekun Yao

Figure 1 for ELITR Non-Native Speech Translation at IWSLT 2020

Figure 2 for ELITR Non-Native Speech Translation at IWSLT 2020

Figure 3 for ELITR Non-Native Speech Translation at IWSLT 2020

Abstract:This paper is an ELITR system submission for the non-native speech translation task at IWSLT 2020. We describe systems for offline ASR, real-time ASR, and our cascaded approach to offline SLT and real-time SLT. We select our primary candidates from a pool of pre-existing systems, develop a new end-to-end general ASR system, and a hybrid ASR trained on non-native speech. The provided small validation set prevents us from carrying out a complex validation, but we submit all the unselected candidates for contrastive evaluation on the test set.

* IWSLT 2020

Via

Access Paper or Ask Questions