Picture for Youngjoon Jang

Youngjoon Jang

Let Me Finish My Sentence: Video Temporal Grounding with Holistic Text Understanding

Add code
Oct 17, 2024
Figure 1 for Let Me Finish My Sentence: Video Temporal Grounding with Holistic Text Understanding
Figure 2 for Let Me Finish My Sentence: Video Temporal Grounding with Holistic Text Understanding
Figure 3 for Let Me Finish My Sentence: Video Temporal Grounding with Holistic Text Understanding
Figure 4 for Let Me Finish My Sentence: Video Temporal Grounding with Holistic Text Understanding
Viaarxiv icon

Faces that Speak: Jointly Synthesising Talking Face and Speech from Text

Add code
May 16, 2024
Viaarxiv icon

FreGrad: Lightweight and Fast Frequency-aware Diffusion Vocoder

Add code
Jan 18, 2024
Viaarxiv icon

Seeing Through the Conversation: Audio-Visual Speech Separation based on Diffusion Model

Add code
Oct 30, 2023
Viaarxiv icon

SlowFast Network for Continuous Sign Language Recognition

Add code
Sep 21, 2023
Viaarxiv icon

TalkNCE: Improving Active Speaker Detection with Talk-Aware Contrastive Learning

Add code
Sep 21, 2023
Viaarxiv icon

That's What I Said: Fully-Controllable Talking Face Generation

Add code
Apr 06, 2023
Viaarxiv icon

Self-Sufficient Framework for Continuous Sign Language Recognition

Add code
Mar 21, 2023
Viaarxiv icon

Metric Learning for User-defined Keyword Spotting

Add code
Nov 01, 2022
Viaarxiv icon

Signing Outside the Studio: Benchmarking Background Robustness for Continuous Sign Language Recognition

Add code
Nov 01, 2022
Figure 1 for Signing Outside the Studio: Benchmarking Background Robustness for Continuous Sign Language Recognition
Figure 2 for Signing Outside the Studio: Benchmarking Background Robustness for Continuous Sign Language Recognition
Figure 3 for Signing Outside the Studio: Benchmarking Background Robustness for Continuous Sign Language Recognition
Figure 4 for Signing Outside the Studio: Benchmarking Background Robustness for Continuous Sign Language Recognition
Viaarxiv icon