Title: Improving Cross-lingual Speech Synthesis with Triplet Training Scheme

Feb 22, 2022

Jianhao Ye, Hongbin Zhou, Zhiba Su, Wendi He, Kaimeng Ren, Lin Li, Heng Lu

Abstract: Recent advances in cross-lingual text-to-speech (TTS) have made it possible to synthesize speech in a language foreign to a monolingual speaker. However, there is still a large gap between the pronunciation of generated cross-lingual speech and that of native speakers in terms of naturalness and intelligibility. In this paper, a triplet training scheme is proposed to enhance cross-lingual pronunciation by allowing previously unseen content and speaker combinations to be seen during training. The proposed method introduces an extra fine-tuning stage with triplet loss during training, which efficiently draws the pronunciation of the synthesized foreign speech closer to that of the native anchor speaker, while preserving the non-native speaker's timbre. Experiments are conducted based on a state-of-the-art baseline cross-lingual TTS system and its enhanced variants. Objective and subjective evaluations all show that the proposed method brings significant improvement in both the intelligibility and naturalness of the synthesized cross-lingual speech.
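
For readers unfamiliar with triplet loss, the following is a minimal sketch of the generic triplet margin loss on plain feature vectors. It is not the paper's implementation: the abstract does not state how triplets are built, which distance is used, or what margin is chosen, so the function names, the Euclidean distance, and the default margin of 1.0 are all assumptions made for illustration.

```python
import math


def euclidean(u, v):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def triplet_margin_loss(anchor, positive, negative, margin=1.0):
    """Generic triplet margin loss: max(0, d(a, p) - d(a, n) + margin).

    The loss is zero once the positive is closer to the anchor than the
    negative by at least `margin`; otherwise it penalizes the shortfall.
    The margin value and distance function here are illustrative
    assumptions, not values taken from the paper.
    """
    d_pos = euclidean(anchor, positive)
    d_neg = euclidean(anchor, negative)
    return max(0.0, d_pos - d_neg + margin)


# Hypothetical 2-D feature vectors, for illustration only.
anchor = [0.0, 0.0]
close_sample = [0.0, 0.0]
far_sample = [3.0, 4.0]

# Positive already closer than the negative by more than the margin.
satisfied = triplet_margin_loss(anchor, close_sample, far_sample)
# Roles swapped: the positive is far away, so the loss is positive.
violated = triplet_margin_loss(anchor, far_sample, close_sample)
```

In a setting like the one the abstract describes, the anchor would presumably be derived from the native speaker's speech, with the fine-tuning stage pulling the synthesized foreign speech toward it. That mapping is an interpretation of the abstract rather than a detail it confirms.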
