Picture for Yuzhuo Tian

Yuzhuo Tian

SegAgent: Exploring Pixel Understanding Capabilities in MLLMs by Imitating Human Annotator Trajectories

Add code
Mar 11, 2025
Viaarxiv icon