Picture for Yanjia Huang

Yanjia Huang

Can Large Vision Language Models Read Maps Like a Human?

Add code
Mar 18, 2025
Viaarxiv icon

PANDORA: Diffusion Policy Learning for Dexterous Robotic Piano Playing

Add code
Mar 17, 2025
Viaarxiv icon

Zero-shot Object Navigation with Vision-Language Models Reasoning

Add code
Oct 24, 2024
Figure 1 for Zero-shot Object Navigation with Vision-Language Models Reasoning
Figure 2 for Zero-shot Object Navigation with Vision-Language Models Reasoning
Figure 3 for Zero-shot Object Navigation with Vision-Language Models Reasoning
Figure 4 for Zero-shot Object Navigation with Vision-Language Models Reasoning
Viaarxiv icon