Picture for Yutao Fan

Yutao Fan

Visual-O1: Understanding Ambiguous Instructions via Multi-modal Multi-turn Chain-of-thoughts Reasoning

Add code
Oct 04, 2024
Viaarxiv icon