Picture for Zongyang Du

Zongyang Du

Towards Naturalistic Voice Conversion: NaturalVoices Dataset with an Automatic Processing Pipeline

Add code
Jun 06, 2024
Viaarxiv icon

Exploring speech style spaces with language models: Emotional TTS without emotion labels

Add code
May 18, 2024
Figure 1 for Exploring speech style spaces with language models: Emotional TTS without emotion labels
Figure 2 for Exploring speech style spaces with language models: Emotional TTS without emotion labels
Figure 3 for Exploring speech style spaces with language models: Emotional TTS without emotion labels
Figure 4 for Exploring speech style spaces with language models: Emotional TTS without emotion labels
Viaarxiv icon

Converting Anyone's Voice: End-to-End Expressive Voice Conversion with a Conditional Diffusion Model

Add code
May 02, 2024
Figure 1 for Converting Anyone's Voice: End-to-End Expressive Voice Conversion with a Conditional Diffusion Model
Figure 2 for Converting Anyone's Voice: End-to-End Expressive Voice Conversion with a Conditional Diffusion Model
Figure 3 for Converting Anyone's Voice: End-to-End Expressive Voice Conversion with a Conditional Diffusion Model
Figure 4 for Converting Anyone's Voice: End-to-End Expressive Voice Conversion with a Conditional Diffusion Model
Viaarxiv icon

Revealing Emotional Clusters in Speaker Embeddings: A Contrastive Learning Strategy for Speech Emotion Recognition

Add code
Jan 19, 2024
Viaarxiv icon

Identity Conversion for Emotional Speakers: A Study for Disentanglement of Emotion Style and Speaker Identity

Add code
Oct 20, 2021
Figure 1 for Identity Conversion for Emotional Speakers: A Study for Disentanglement of Emotion Style and Speaker Identity
Figure 2 for Identity Conversion for Emotional Speakers: A Study for Disentanglement of Emotion Style and Speaker Identity
Figure 3 for Identity Conversion for Emotional Speakers: A Study for Disentanglement of Emotion Style and Speaker Identity
Figure 4 for Identity Conversion for Emotional Speakers: A Study for Disentanglement of Emotion Style and Speaker Identity
Viaarxiv icon

Expressive Voice Conversion: A Joint Framework for Speaker Identity and Emotional Style Transfer

Add code
Jul 08, 2021
Figure 1 for Expressive Voice Conversion: A Joint Framework for Speaker Identity and Emotional Style Transfer
Figure 2 for Expressive Voice Conversion: A Joint Framework for Speaker Identity and Emotional Style Transfer
Figure 3 for Expressive Voice Conversion: A Joint Framework for Speaker Identity and Emotional Style Transfer
Figure 4 for Expressive Voice Conversion: A Joint Framework for Speaker Identity and Emotional Style Transfer
Viaarxiv icon

Spectrum and Prosody Conversion for Cross-lingual Voice Conversion with CycleGAN

Add code
Aug 12, 2020
Figure 1 for Spectrum and Prosody Conversion for Cross-lingual Voice Conversion with CycleGAN
Figure 2 for Spectrum and Prosody Conversion for Cross-lingual Voice Conversion with CycleGAN
Figure 3 for Spectrum and Prosody Conversion for Cross-lingual Voice Conversion with CycleGAN
Figure 4 for Spectrum and Prosody Conversion for Cross-lingual Voice Conversion with CycleGAN
Viaarxiv icon