Picture for Jaesung Huh

Jaesung Huh

Character-aware audio-visual subtitling in context

Add code
Oct 14, 2024
Viaarxiv icon

The VoxCeleb Speaker Recognition Challenge: A Retrospective

Add code
Aug 27, 2024
Viaarxiv icon

TIM: A Time Interval Machine for Audio-Visual Action Recognition

Add code
Apr 09, 2024
Viaarxiv icon

Look, Listen and Recognise: Character-Aware Audio-Visual Subtitling

Add code
Jan 22, 2024
Figure 1 for Look, Listen and Recognise: Character-Aware Audio-Visual Subtitling
Figure 2 for Look, Listen and Recognise: Character-Aware Audio-Visual Subtitling
Figure 3 for Look, Listen and Recognise: Character-Aware Audio-Visual Subtitling
Figure 4 for Look, Listen and Recognise: Character-Aware Audio-Visual Subtitling
Viaarxiv icon

OxfordVGG Submission to the EGO4D AV Transcription Challenge

Add code
Jul 18, 2023
Viaarxiv icon

VoxSRC 2022: The Fourth VoxCeleb Speaker Recognition Challenge

Add code
Mar 06, 2023
Viaarxiv icon

WhisperX: Time-Accurate Speech Transcription of Long-Form Audio

Add code
Mar 01, 2023
Viaarxiv icon

Epic-Sounds: A Large-scale Dataset of Actions That Sound

Add code
Feb 01, 2023
Viaarxiv icon

In search of strong embedding extractors for speaker diarisation

Add code
Oct 26, 2022
Viaarxiv icon

VoxSRC 2021: The Third VoxCeleb Speaker Recognition Challenge

Add code
Jan 12, 2022
Figure 1 for VoxSRC 2021: The Third VoxCeleb Speaker Recognition Challenge
Figure 2 for VoxSRC 2021: The Third VoxCeleb Speaker Recognition Challenge
Figure 3 for VoxSRC 2021: The Third VoxCeleb Speaker Recognition Challenge
Figure 4 for VoxSRC 2021: The Third VoxCeleb Speaker Recognition Challenge
Viaarxiv icon