Picture for Wendi He

Wendi He

Takin: A Cohort of Superior Quality Zero-shot Speech Generation Models

Add code
Sep 18, 2024
Viaarxiv icon

Vec-Tok-VC+: Residual-enhanced Robust Zero-shot Voice Conversion with Progressive Constraints in a Dual-mode Training Strategy

Add code
Jun 14, 2024
Viaarxiv icon

Vec-Tok Speech: speech vectorization and tokenization for neural speech generation

Add code
Oct 12, 2023
Viaarxiv icon

Improving Cross-lingual Speech Synthesis with Triplet Training Scheme

Add code
Feb 22, 2022
Figure 1 for Improving Cross-lingual Speech Synthesis with Triplet Training Scheme
Figure 2 for Improving Cross-lingual Speech Synthesis with Triplet Training Scheme
Figure 3 for Improving Cross-lingual Speech Synthesis with Triplet Training Scheme
Viaarxiv icon