Zhenhua Ling

DiffStyleTTS: Diffusion-based Hierarchical Prosody Modeling for Text-to-Speech with Diverse and Controllable Styles

Dec 04, 2024

Refining Self-Supervised Learnt Speech Representation using Brain Activations

Jun 12, 2024

Adversarial speech for voice privacy protection from Personalized Speech generation

Jan 22, 2024

Pre-training Language Model as a Multi-perspective Course Learner

May 06, 2023

Decoupled Pronunciation and Prosody Modeling in Meta-Learning-Based Multilingual Speech Synthesis

Sep 14, 2022

Speaker Adaption with Intuitive Prosodic Features for Statistical Parametric Speech Synthesis

Mar 02, 2022

Cognitive Diagnosis with Explicit Student Vector Estimation and Unsupervised Question Matrix Learning

Mar 01, 2022

Using multiple reference audios and style embedding constraints for speech synthesis

Oct 09, 2021

SemEval-2021 Task 4: Reading Comprehension of Abstract Meaning

Jun 01, 2021

A Spoofing Benchmark for the 2018 Voice Conversion Challenge: Leveraging from Spoofing Countermeasures for Speech Artifact Assessment

Sep 04, 2018