Title: A New GAN-based End-to-End TTS Training Algorithm

Apr 09, 2019

Haohan Guo, Frank K. Soong, Lei He, Lei Xie

Abstract: End-to-end, autoregressive model-based TTS has shown significant performance improvements over conventional TTS. However, training of the autoregressive module is affected by exposure bias, the mismatch between the distributions of real and predicted data: real data is available in training, but in testing only predicted data is available to feed the autoregressive module. By introducing both real and generated data sequences in training, we can alleviate the effects of exposure bias. We propose to use a Generative Adversarial Network (GAN) along with the key idea of Professor Forcing in training. A discriminator in the GAN is jointly trained to equalize the difference between real and predicted data. In an AB subjective listening test, the new approach is preferred over standard transfer learning, with a CMOS improvement of 0.1. Sentence-level intelligibility tests show significant improvement on a pathological test set. The new GAN-trained model is also more stable than the baseline and produces better alignments for the Tacotron output.

* Submitted to Interspeech 2019, Graz, Austria
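
The exposure-bias mismatch described in the abstract can be illustrated with a toy sketch. This is not the paper's Tacotron system or its actual discriminator; it only assumes a scalar autoregressive predictor and a linear discriminator, and the names `rollout`, `discriminator`, and `gan_losses` are hypothetical. It shows that the same model drifts further from the real sequence when fed its own predictions, and how a Professor Forcing style adversarial loss could be formed from the two kinds of rollout.

```python
import math

def rollout(real, w, teacher_forced):
    """Toy autoregressive predictor y_t = w * prev (illustrative only)."""
    outputs = []
    prev = real[0]
    for t in range(1, len(real)):
        pred = w * prev
        outputs.append(pred)
        # Training-style: feed the real frame; test-style: feed the prediction.
        prev = real[t] if teacher_forced else pred
    return outputs

def mean_abs_error(pred, real):
    # Predictions start at t = 1, so compare against real[1:].
    return sum(abs(p - r) for p, r in zip(pred, real[1:])) / len(pred)

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def discriminator(seq, v, c):
    """Hypothetical linear discriminator acting on the sequence mean."""
    return sigmoid(v * (sum(seq) / len(seq)) + c)

def gan_losses(tf_seq, fr_seq, v, c, eps=1e-12):
    """Standard GAN cross-entropy losses (an assumed formulation)."""
    d_tf = discriminator(tf_seq, v, c)   # teacher-forced rollout, label 1
    d_fr = discriminator(fr_seq, v, c)   # free-running rollout, label 0
    d_loss = -(math.log(d_tf + eps) + math.log(1.0 - d_fr + eps))
    g_loss = -math.log(d_fr + eps)       # generator tries to fool the discriminator
    return d_loss, g_loss

# Real sequence decays by 0.9 per step; the imperfect model uses 0.8.
real = [0.9 ** t for t in range(20)]
w = 0.8
tf_seq = rollout(real, w, teacher_forced=True)
fr_seq = rollout(real, w, teacher_forced=False)
tf_err = mean_abs_error(tf_seq, real)
fr_err = mean_abs_error(fr_seq, real)
d_loss, g_loss = gan_losses(tf_seq, fr_seq, v=1.0, c=0.0)
print(f"teacher-forced error: {tf_err:.4f}, free-running error: {fr_err:.4f}")
print(f"discriminator loss: {d_loss:.4f}, generator loss: {g_loss:.4f}")
```

In this sketch the free-running error is larger because each prediction error is fed back into the next step. Minimizing the generator loss would push free-running rollouts toward the teacher-forced ones, which is the intuition the abstract attributes to combining a GAN with Professor Forcing.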
