Picture for Chaeyoung Jung

Chaeyoung Jung

VoiceDiT: Dual-Condition Diffusion Transformer for Environment-Aware Speech Synthesis

Add code
Dec 26, 2024
Viaarxiv icon

FlowAVSE: Efficient Audio-Visual Speech Enhancement with Conditional Flow Matching

Add code
Jun 13, 2024
Viaarxiv icon

Seeing Through the Conversation: Audio-Visual Speech Separation based on Diffusion Model

Add code
Oct 30, 2023
Viaarxiv icon

TalkNCE: Improving Active Speaker Detection with Talk-Aware Contrastive Learning

Add code
Sep 21, 2023
Viaarxiv icon