Picture for Weizhen Wang

Weizhen Wang

Embodied Scene Understanding for Vision Language Models via MetaVQA

Add code
Jan 15, 2025
Viaarxiv icon