Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:DreamVoice: Text-Guided Voice Conversion

Jun 24, 2024

Jiarui Hai, Karan Thakkar, Helin Wang, Zengyi Qin, Mounya Elhilali

Figure 1 for DreamVoice: Text-Guided Voice Conversion

Figure 2 for DreamVoice: Text-Guided Voice Conversion

Figure 3 for DreamVoice: Text-Guided Voice Conversion

Share this with someone who'll enjoy it:

Abstract:Generative voice technologies are rapidly evolving, offering opportunities for more personalized and inclusive experiences. Traditional one-shot voice conversion (VC) requires a target recording during inference, limiting ease of usage in generating desired voice timbres. Text-guided generation offers an intuitive solution to convert voices to desired "DreamVoices" according to the users' needs. Our paper presents two major contributions to VC technology: (1) DreamVoiceDB, a robust dataset of voice timbre annotations for 900 speakers from VCTK and LibriTTS. (2) Two text-guided VC methods: DreamVC, an end-to-end diffusion-based text-guided VC model; and DreamVG, a versatile text-to-voice generation plugin that can be combined with any one-shot VC models. The experimental results demonstrate that our proposed methods trained on the DreamVoiceDB dataset generate voice timbres accurately aligned with the text prompt and achieve high-quality VC.

* Accepted at INTERSPEECH 2024

View paper on

Share this with someone who'll enjoy it:

Title:DreamVoice: Text-Guided Voice Conversion

Paper and Code