Picture for Haowei Lou

Haowei Lou

Aligner-Guided Training Paradigm: Advancing Text-to-Speech Models with Aligner Guided Duration

Add code
Dec 11, 2024
Viaarxiv icon

LatentSpeech: Latent Diffusion for Text-To-Speech Generation

Add code
Dec 11, 2024
Viaarxiv icon

StyleSpeech: Parameter-efficient Fine Tuning for Pre-trained Controllable Text-to-Speech

Add code
Aug 27, 2024
Viaarxiv icon