Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Voice Conversion with Conditional SampleRNN

Aug 24, 2018

Cong Zhou, Michael Horgan, Vivek Kumar, Cristina Vasco, Dan Darcy

Figure 1 for Voice Conversion with Conditional SampleRNN

Figure 2 for Voice Conversion with Conditional SampleRNN

Figure 3 for Voice Conversion with Conditional SampleRNN

Figure 4 for Voice Conversion with Conditional SampleRNN

Share this with someone who'll enjoy it:

Abstract:Here we present a novel approach to conditioning the SampleRNN generative model for voice conversion (VC). Conventional methods for VC modify the perceived speaker identity by converting between source and target acoustic features. Our approach focuses on preserving voice content and depends on the generative network to learn voice style. We first train a multi-speaker SampleRNN model conditioned on linguistic features, pitch contour, and speaker identity using a multi-speaker speech corpus. Voice-converted speech is generated using linguistic features and pitch contour extracted from the source speaker, and the target speaker identity. We demonstrate that our system is capable of many-to-many voice conversion without requiring parallel data, enabling broad applications. Subjective evaluation demonstrates that our approach outperforms conventional VC methods.

* Accepted at Interspeech 2018, Hyderabad, India. This version matches the final version submitted to the conference

View paper on

Share this with someone who'll enjoy it:

Title:Voice Conversion with Conditional SampleRNN

Paper and Code