Picture for Dong Won Kim

Dong Won Kim

DiTTo-TTS: Efficient and Scalable Zero-Shot Text-to-Speech with Diffusion Transformer

Add code
Jun 17, 2024
Viaarxiv icon