Picture for Ryo Hachiuma

Ryo Hachiuma

Omni-RGPT: Unifying Image and Video Region-level Understanding via Token Marks

Add code
Jan 14, 2025
Viaarxiv icon

VLsI: Verbalized Layers-to-Interactions from Large to Small Vision Language Models

Add code
Dec 02, 2024
Viaarxiv icon

RealTraj: Towards Real-World Pedestrian Trajectory Forecasting

Add code
Nov 26, 2024
Figure 1 for RealTraj: Towards Real-World Pedestrian Trajectory Forecasting
Figure 2 for RealTraj: Towards Real-World Pedestrian Trajectory Forecasting
Figure 3 for RealTraj: Towards Real-World Pedestrian Trajectory Forecasting
Figure 4 for RealTraj: Towards Real-World Pedestrian Trajectory Forecasting
Viaarxiv icon

SANER: Annotation-free Societal Attribute Neutralizer for Debiasing CLIP

Add code
Aug 19, 2024
Viaarxiv icon

CrowdMAC: Masked Crowd Density Completion for Robust Crowd Density Forecasting

Add code
Jul 20, 2024
Viaarxiv icon

From Descriptive Richness to Bias: Unveiling the Dark Side of Generative Image Caption Enrichment

Add code
Jun 20, 2024
Viaarxiv icon

Multimodal Cross-Domain Few-Shot Learning for Egocentric Action Recognition

Add code
May 31, 2024
Viaarxiv icon

EMAG: Ego-motion Aware and Generalizable 2D Hand Forecasting from Egocentric Videos

Add code
May 30, 2024
Viaarxiv icon

Weakly Semi-supervised Tool Detection in Minimally Invasive Surgery Videos

Add code
Jan 08, 2024
Viaarxiv icon

Surgical tool classification and localization: results and methods from the MICCAI 2022 SurgToolLoc challenge

Add code
May 11, 2023
Viaarxiv icon