Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Unsupervised Any-to-Many Audiovisual Synthesis via Exemplar Autoencoders

Jan 13, 2020

Kangle Deng, Aayush Bansal, Deva Ramanan

Figure 1 for Unsupervised Any-to-Many Audiovisual Synthesis via Exemplar Autoencoders

Figure 2 for Unsupervised Any-to-Many Audiovisual Synthesis via Exemplar Autoencoders

Figure 3 for Unsupervised Any-to-Many Audiovisual Synthesis via Exemplar Autoencoders

Figure 4 for Unsupervised Any-to-Many Audiovisual Synthesis via Exemplar Autoencoders

Share this with someone who'll enjoy it:

Abstract:We present an unsupervised approach that enables us to convert the speech input of any one individual to an output set of potentially-infinitely many speakers. One can stand in front of a mic and be able to make their favorite celebrity say the same words. Our approach builds on simple autoencoders that project out-of-sample data to the distribution of the training set (motivated by PCA/linear autoencoders). We use an exemplar autoencoder to learn the voice and specific style (emotions and ambiance) of a target speaker. In contrast to existing methods, the proposed approach can be easily extended to an arbitrarily large number of speakers in a very little time using only two-three minutes of audio data from a speaker. We also exhibit the usefulness of our approach for generating video from audio signals and vice-versa. We suggest the reader to check out our project webpage for various synthesized examples: https://dunbar12138.github.io/projectpage/Audiovisual/

* (1) Project summary is available at: https://www.youtube.com/watch?v=7BO0-Q3TLfI ; (2) Code is available at https://github.com/dunbar12138/Audiovisual-Synthesis ; and (3) A beta web-demo is available at: https://scs00197.sp.cs.cmu.edu/

View paper on

OpenReview

Share this with someone who'll enjoy it:

Title:Unsupervised Any-to-Many Audiovisual Synthesis via Exemplar Autoencoders

Paper and Code