Roy Fejgin

Single-stage TTS with Masked Audio Token Modeling and Semantic Knowledge Distillation

Sep 17, 2024
Via arXiv