Picture for Taichi Nishimura

Taichi Nishimura

EgoOops: A Dataset for Mistake Action Detection from Egocentric Videos with Procedural Texts

Add code
Oct 07, 2024
Figure 1 for EgoOops: A Dataset for Mistake Action Detection from Egocentric Videos with Procedural Texts
Figure 2 for EgoOops: A Dataset for Mistake Action Detection from Egocentric Videos with Procedural Texts
Figure 3 for EgoOops: A Dataset for Mistake Action Detection from Egocentric Videos with Procedural Texts
Figure 4 for EgoOops: A Dataset for Mistake Action Detection from Egocentric Videos with Procedural Texts
Viaarxiv icon

DETECLAP: Enhancing Audio-Visual Representation Learning with Object Information

Add code
Sep 18, 2024
Viaarxiv icon

Lighthouse: A User-Friendly Library for Reproducible Video Moment Retrieval and Highlight Detection

Add code
Aug 06, 2024
Viaarxiv icon

BioVL-QR: Egocentric Biochemical Video-and-Language Dataset Using Micro QR Codes

Add code
Apr 04, 2024
Viaarxiv icon

Text-driven Affordance Learning from Egocentric Vision

Add code
Apr 03, 2024
Viaarxiv icon

Automatic Construction of a Large-Scale Corpus for Geoparsing Using Wikipedia Hyperlinks

Add code
Mar 25, 2024
Viaarxiv icon

On the Audio Hallucinations in Large Audio-Video Language Models

Add code
Jan 18, 2024
Viaarxiv icon

Large-scale Vision-Language Models Learn Super Images for Efficient and High-Performance Partially Relevant Video Retrieval

Add code
Dec 01, 2023
Viaarxiv icon

Exo2EgoDVC: Dense Video Captioning of Egocentric Procedural Activities Using Web Instructional Videos

Add code
Nov 29, 2023
Viaarxiv icon

Recipe Generation from Unsegmented Cooking Videos

Add code
Sep 21, 2022
Figure 1 for Recipe Generation from Unsegmented Cooking Videos
Figure 2 for Recipe Generation from Unsegmented Cooking Videos
Figure 3 for Recipe Generation from Unsegmented Cooking Videos
Figure 4 for Recipe Generation from Unsegmented Cooking Videos
Viaarxiv icon