Picture for Wangjin Zhou

Wangjin Zhou

Disentangling Age and Identity with a Mutual Information Minimization Approach for Cross-Age Speaker Verification

Add code
Sep 24, 2024
Figure 1 for Disentangling Age and Identity with a Mutual Information Minimization Approach for Cross-Age Speaker Verification
Figure 2 for Disentangling Age and Identity with a Mutual Information Minimization Approach for Cross-Age Speaker Verification
Figure 3 for Disentangling Age and Identity with a Mutual Information Minimization Approach for Cross-Age Speaker Verification
Figure 4 for Disentangling Age and Identity with a Mutual Information Minimization Approach for Cross-Age Speaker Verification
Viaarxiv icon

Zero-Shot Sing Voice Conversion: built upon clustering-based phoneme representations

Add code
Sep 12, 2024
Figure 1 for Zero-Shot Sing Voice Conversion: built upon clustering-based phoneme representations
Figure 2 for Zero-Shot Sing Voice Conversion: built upon clustering-based phoneme representations
Figure 3 for Zero-Shot Sing Voice Conversion: built upon clustering-based phoneme representations
Figure 4 for Zero-Shot Sing Voice Conversion: built upon clustering-based phoneme representations
Viaarxiv icon

LAFMA: A Latent Flow Matching Model for Text-to-Audio Generation

Add code
Jun 12, 2024
Viaarxiv icon

MOS-FAD: Improving Fake Audio Detection Via Automatic Mean Opinion Score Prediction

Add code
Jan 25, 2024
Viaarxiv icon

LE-SSL-MOS: Self-Supervised Learning MOS Prediction with Listener Enhancement

Add code
Nov 17, 2023
Viaarxiv icon

Fusion of Self-supervised Learned Models for MOS Prediction

Add code
Apr 11, 2022
Figure 1 for Fusion of Self-supervised Learned Models for MOS Prediction
Figure 2 for Fusion of Self-supervised Learned Models for MOS Prediction
Figure 3 for Fusion of Self-supervised Learned Models for MOS Prediction
Figure 4 for Fusion of Self-supervised Learned Models for MOS Prediction
Viaarxiv icon