Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Julian Mäder

Assessing Evaluation Metrics for Speech-to-Speech Translation

Oct 26, 2021

Elizabeth Salesky, Julian Mäder, Severin Klinger

Figure 1 for Assessing Evaluation Metrics for Speech-to-Speech Translation

Figure 2 for Assessing Evaluation Metrics for Speech-to-Speech Translation

Figure 3 for Assessing Evaluation Metrics for Speech-to-Speech Translation

Figure 4 for Assessing Evaluation Metrics for Speech-to-Speech Translation

Abstract:Speech-to-speech translation combines machine translation with speech synthesis, introducing evaluation challenges not present in either task alone. How to automatically evaluate speech-to-speech translation is an open question which has not previously been explored. Translating to speech rather than to text is often motivated by unwritten languages or languages without standardized orthographies. However, we show that the previously used automatic metric for this task is best equipped for standardized high-resource languages only. In this work, we first evaluate current metrics for speech-to-speech translation, and second assess how translation to dialectal variants rather than to standardized languages impacts various evaluation methods.

* ASRU 2021

Via

Access Paper or Ask Questions

SwissDial: Parallel Multidialectal Corpus of Spoken Swiss German

Mar 21, 2021

Pelin Dogan-Schönberger, Julian Mäder, Thomas Hofmann

Figure 1 for SwissDial: Parallel Multidialectal Corpus of Spoken Swiss German

Figure 2 for SwissDial: Parallel Multidialectal Corpus of Spoken Swiss German

Figure 3 for SwissDial: Parallel Multidialectal Corpus of Spoken Swiss German

Figure 4 for SwissDial: Parallel Multidialectal Corpus of Spoken Swiss German

Abstract:Swiss German is a dialect continuum whose natively acquired dialects significantly differ from the formal variety of the language. These dialects are mostly used for verbal communication and do not have standard orthography. This has led to a lack of annotated datasets, rendering the use of many NLP methods infeasible. In this paper, we introduce the first annotated parallel corpus of spoken Swiss German across 8 major dialects, plus a Standard German reference. Our goal has been to create and to make available a basic dataset for employing data-driven NLP applications in Swiss German. We present our data collection procedure in detail and validate the quality of our corpus by conducting experiments with the recent neural models for speech synthesis.

Via

Access Paper or Ask Questions