Title: HumanGAN: generative adversarial network with human-based discriminator and its evaluation in speech perception modeling

Sep 25, 2019

Kazuki Fujii, Yuki Saito, Shinnosuke Takamichi, Yukino Baba, Hiroshi Saruwatari


Abstract: We propose HumanGAN, a generative adversarial network (GAN) that incorporates human perception as the discriminator. A basic GAN trains a generator to represent a real-data distribution by fooling a discriminator that distinguishes real data from generated data. Consequently, the basic GAN cannot represent anything outside the real-data distribution. In speech perception, however, humans recognize not only real human voices but also processed voices (i.e., voices of a non-existent human) as human voices. Such a human-acceptable distribution is typically wider than the real-data distribution and cannot be modeled by the basic GAN. To model the human-acceptable distribution, we formulate a backpropagation-based generator training algorithm that treats human perception as a black-box discriminator. Training efficiently alternates between generator updates performed on a computer and discrimination performed through crowdsourcing. We evaluate HumanGAN on speech naturalness modeling and demonstrate that it can represent a human-acceptable distribution that is wider than the real-data distribution.
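The idea of backpropagating through a black-box discriminator can be illustrated with a small sketch. The abstract does not say how the gradient of the human judgment is obtained, so the perturbation-based estimator below is an assumption chosen for illustration, not the authors' stated method. The names `simulated_listener`, `estimate_gradient`, and `train_generator` are hypothetical, the linear generator is a toy, and a smooth function stands in for crowdsourced listeners.

```python
import numpy as np

def simulated_listener(x, center=0.0, width=2.0):
    """Hypothetical stand-in for crowdsourced judgments.

    Returns an acceptance score in (0, 1]; only its outputs are queried,
    never its derivatives, mimicking a black-box discriminator.
    """
    return np.exp(-0.5 * np.sum(((x - center) / width) ** 2, axis=-1))

def estimate_gradient(x, discriminator, rng, n_perturb=50, sigma=0.1):
    """Estimate dD/dx from queries alone (assumed estimator).

    Uses antithetic Gaussian perturbations: each pair (x + s*e, x - s*e)
    is scored, and the score differences weight the directions e.
    """
    eps = rng.standard_normal((n_perturb, x.shape[0]))
    diff = discriminator(x + sigma * eps) - discriminator(x - sigma * eps)
    return (diff[:, None] * eps).mean(axis=0) / (2.0 * sigma)

def train_generator(steps=200, lr=0.5, batch=16, dim=2, seed=0):
    """Gradient ascent on acceptance for a toy linear generator x = W z + b."""
    rng = np.random.default_rng(seed)
    W = 0.1 * rng.standard_normal((dim, dim))
    b = np.full(dim, 3.0)  # start far from the accepted region
    for _ in range(steps):
        z = rng.standard_normal((batch, dim))
        x = z @ W.T + b
        # "Discrimination" step: query the black box around each sample.
        grads = np.stack(
            [estimate_gradient(xi, simulated_listener, rng) for xi in x]
        )
        # "Generator" step: chain rule through x = W z + b.
        W += lr * grads.T @ z / batch
        b += lr * grads.mean(axis=0)
    return W, b

if __name__ == "__main__":
    W, b = train_generator()
    print("acceptance before:", simulated_listener(np.full(2, 3.0)))
    print("acceptance after: ", simulated_listener(b))
```

In this sketch the only information flowing from the discriminator to the generator is a set of scores for perturbed samples, which is the kind of answer a human rater could in principle provide. The generator update itself remains ordinary backpropagation.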

* Submitted to IEEE ICASSP 2020
