Picture for Chunghsin Yeh

Chunghsin Yeh

Single-stage TTS with Masked Audio Token Modeling and Semantic Knowledge Distillation

Add code
Sep 17, 2024
Viaarxiv icon

Masked Generative Video-to-Audio Transformers with Enhanced Synchronicity

Add code
Jul 15, 2024
Viaarxiv icon

Sequential Contrastive Audio-Visual Learning

Add code
Jul 08, 2024
Viaarxiv icon

Full-band General Audio Synthesis with Score-based Diffusion

Add code
Oct 26, 2022
Viaarxiv icon