Minseo Yoon

ST-VLM: Kinematic Instruction Tuning for Spatio-Temporal Reasoning in Vision-Language Models

Mar 26, 2025
Via arXiv