Picture for Shota Nakada

Shota Nakada

DETECLAP: Enhancing Audio-Visual Representation Learning with Object Information

Add code
Sep 18, 2024
Viaarxiv icon

Lighthouse: A User-Friendly Library for Reproducible Video Moment Retrieval and Highlight Detection

Add code
Aug 06, 2024
Viaarxiv icon

On the Audio Hallucinations in Large Audio-Video Language Models

Add code
Jan 18, 2024
Viaarxiv icon

Large-scale Vision-Language Models Learn Super Images for Efficient and High-Performance Partially Relevant Video Retrieval

Add code
Dec 01, 2023
Viaarxiv icon