Picture for Atsushi Hashimoto

Atsushi Hashimoto

Visuo-Tactile Zero-Shot Object Recognition with Vision-Language Model

Add code
Sep 14, 2024
Viaarxiv icon

COM Kitchens: An Unedited Overhead-view Video Dataset as a Vision-Language Benchmark

Add code
Aug 05, 2024
Viaarxiv icon

AdaCoder: Adaptive Prompt Compression for Programmatic Visual Question Answering

Add code
Jul 28, 2024
Viaarxiv icon

Exo2EgoDVC: Dense Video Captioning of Egocentric Procedural Activities Using Web Instructional Videos

Add code
Nov 29, 2023
Viaarxiv icon

Vision-Language Interpreter for Robot Task Planning

Add code
Nov 02, 2023
Viaarxiv icon

WeaveNet for Approximating Two-sided Matching Problems

Add code
Oct 19, 2023
Viaarxiv icon

A Critical Look at the Current Usage of Foundation Model for Dense Recognition Task

Add code
Aug 01, 2023
Viaarxiv icon

Noisy Universal Domain Adaptation via Divergence Optimization for Visual Recognition

Add code
Apr 20, 2023
Viaarxiv icon

Recipe Generation from Unsegmented Cooking Videos

Add code
Sep 21, 2022
Figure 1 for Recipe Generation from Unsegmented Cooking Videos
Figure 2 for Recipe Generation from Unsegmented Cooking Videos
Figure 3 for Recipe Generation from Unsegmented Cooking Videos
Figure 4 for Recipe Generation from Unsegmented Cooking Videos
Viaarxiv icon

Visual Recipe Flow: A Dataset for Learning Visual State Changes of Objects with Recipe Flows

Add code
Sep 13, 2022
Figure 1 for Visual Recipe Flow: A Dataset for Learning Visual State Changes of Objects with Recipe Flows
Figure 2 for Visual Recipe Flow: A Dataset for Learning Visual State Changes of Objects with Recipe Flows
Figure 3 for Visual Recipe Flow: A Dataset for Learning Visual State Changes of Objects with Recipe Flows
Figure 4 for Visual Recipe Flow: A Dataset for Learning Visual State Changes of Objects with Recipe Flows
Viaarxiv icon