Title: Waveform generation for text-to-speech synthesis using pitch-synchronous multi-scale generative adversarial networks

Oct 30, 2018

Lauri Juvela, Bajibabu Bollepalli, Junichi Yamagishi, Paavo Alku

Abstract: The state-of-the-art in text-to-speech synthesis has recently improved considerably due to novel neural waveform generation methods, such as WaveNet. However, these methods suffer from their slow sequential inference process, while their parallel versions are difficult to train and even more expensive computationally. Meanwhile, generative adversarial networks (GANs) have achieved impressive results in image generation and are making their way into audio applications; parallel inference is among their lucrative properties. By adopting recent advances in GAN training techniques, this investigation studies waveform generation for TTS in two domains (speech signal and glottal excitation). Listening test results show that while direct waveform generation with GAN is still far behind WaveNet, a GAN-based glottal excitation model can achieve quality and voice similarity on par with a WaveNet vocoder.

* Submitted to ICASSP 2019
