Title: Parallel WaveGAN: A fast waveform generation model based on generative adversarial networks with multi-resolution spectrogram

Oct 25, 2019

Ryuichi Yamamoto, Eunwoo Song, Jae-Min Kim

Abstract: We propose Parallel WaveGAN, a distillation-free, fast, and small-footprint waveform generation method using a generative adversarial network. In the proposed method, a non-autoregressive WaveNet is trained by jointly optimizing multi-resolution spectrogram and adversarial loss functions, which can effectively capture the time-frequency distribution of realistic speech waveforms. As our method does not require the density distillation used in the conventional teacher-student framework, the entire model can be easily trained even with a small number of parameters. In particular, the proposed Parallel WaveGAN has only 1.44 M parameters and can generate 24 kHz speech waveforms 28.68 times faster than real time in a single-GPU environment. Perceptual listening test results verify that our proposed method achieves a mean opinion score of 4.16 within a Transformer-based text-to-speech framework, which is comparable to the best distillation-based Parallel WaveNet system.

* submitted to ICASSP 2020
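
The abstract names a multi-resolution spectrogram loss but does not define it. As a rough illustration, the sketch below assumes one common formulation: at each STFT resolution, a spectral convergence term (relative Frobenius error of the magnitudes) is added to a mean absolute log-magnitude difference, and the result is averaged over resolutions. The function names, the Hann window, the `eps` floor, and the `(fft_size, hop_size, win_length)` triples in `RESOLUTIONS` are assumptions chosen for illustration, not details taken from the abstract. The adversarial term that the abstract says is optimized jointly is omitted here.

```python
import numpy as np

# Hypothetical (fft_size, hop_size, win_length) triples; illustrative values only.
RESOLUTIONS = ((1024, 120, 600), (2048, 240, 1200), (512, 50, 240))


def stft_magnitude(x, fft_size, hop_size, win_length, eps=1e-7):
    """Magnitude STFT of a 1-D signal with a Hann window and no edge padding."""
    if len(x) < win_length:
        raise ValueError("signal is shorter than the analysis window")
    window = np.hanning(win_length)
    n_frames = 1 + (len(x) - win_length) // hop_size
    frames = np.stack([
        x[i * hop_size: i * hop_size + win_length] * window
        for i in range(n_frames)
    ])
    # rfft zero-pads each frame from win_length up to fft_size.
    spec = np.fft.rfft(frames, n=fft_size, axis=-1)
    # Floor the magnitude so the logarithm below stays finite.
    return np.maximum(np.abs(spec), eps)


def stft_loss(x, x_hat, fft_size, hop_size, win_length):
    """Spectral convergence plus log-magnitude loss at a single resolution."""
    mag = stft_magnitude(x, fft_size, hop_size, win_length)
    mag_hat = stft_magnitude(x_hat, fft_size, hop_size, win_length)
    # Relative Frobenius-norm error between the two magnitude spectrograms.
    spectral_convergence = np.linalg.norm(mag - mag_hat) / np.linalg.norm(mag)
    # Mean absolute difference between the log magnitudes.
    log_magnitude = np.mean(np.abs(np.log(mag) - np.log(mag_hat)))
    return spectral_convergence + log_magnitude


def multi_resolution_stft_loss(x, x_hat, resolutions=RESOLUTIONS):
    """Average the single-resolution loss over all STFT resolutions."""
    return float(np.mean([stft_loss(x, x_hat, *r) for r in resolutions]))


# Example inputs: one second of a 440 Hz tone at 24 kHz and a noisy copy.
rng = np.random.default_rng(0)
t = np.arange(24000) / 24000.0
target = np.sin(2.0 * np.pi * 440.0 * t)
generated = target + 0.05 * rng.standard_normal(target.shape)
loss_value = multi_resolution_stft_loss(target, generated)
```

Mixing short and long analysis windows means that no single time-frequency trade-off dominates the objective, which is one plausible reading of the abstract's claim that the loss captures the time-frequency distribution of speech. In actual training the same computation would be written in a differentiable framework so that gradients can reach the generator.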
