Title: Learning Multilingual Expressive Speech Representation for Prosody Prediction without Parallel Data

Jun 29, 2023

Jarod Duret, Titouan Parcollet, Yannick Estève

Abstract: We propose a method for emotion-preserving speech-to-speech translation that operates at the level of discrete speech units. Our approach relies on a multilingual emotion embedding that captures affective information in a language-independent manner. We show that this embedding can be used to predict the pitch and duration of speech units in a target language, allowing us to resynthesize the source speech signal with the same emotional content. We evaluate our approach on English and French speech signals and show that it outperforms a baseline method that does not use emotional information, including when the emotion embedding is extracted from a different language. Although this preliminary study does not directly address the machine translation problem, our results demonstrate the effectiveness of our approach for cross-lingual emotion preservation in the context of speech resynthesis.

* Speech Synthesis Workshop (SSW), Aug 2023, Grenoble, France
