Picture for Ashishkumar Gudmalwar

Ashishkumar Gudmalwar

DubWise: Video-Guided Speech Duration Control in Multimodal LLM-based Text-to-Speech for Dubbing

Add code
Jun 13, 2024
Viaarxiv icon

VECL-TTS: Voice identity and Emotional style controllable Cross-Lingual Text-to-Speech

Add code
Jun 12, 2024
Viaarxiv icon