Jaesung Huh

Character-aware audio-visual subtitling in context

Oct 14, 2024

The VoxCeleb Speaker Recognition Challenge: A Retrospective

Aug 27, 2024

TIM: A Time Interval Machine for Audio-Visual Action Recognition

Apr 09, 2024

Look, Listen and Recognise: Character-Aware Audio-Visual Subtitling

Jan 22, 2024

OxfordVGG Submission to the EGO4D AV Transcription Challenge

Jul 18, 2023

VoxSRC 2022: The Fourth VoxCeleb Speaker Recognition Challenge

Mar 06, 2023

WhisperX: Time-Accurate Speech Transcription of Long-Form Audio

Mar 01, 2023

Epic-Sounds: A Large-scale Dataset of Actions That Sound

Feb 01, 2023

In search of strong embedding extractors for speaker diarisation

Oct 26, 2022

VoxSRC 2021: The Third VoxCeleb Speaker Recognition Challenge

Jan 12, 2022