Picture for Dhruv Shah

Dhruv Shah

Vision Language Models are In-Context Value Learners

Add code
Nov 07, 2024
Viaarxiv icon

STEER: Flexible Robotic Manipulation via Dense Language Grounding

Add code
Nov 05, 2024
Viaarxiv icon

Traversability-Aware Legged Navigation by Learning from Real-World Visual Data

Add code
Oct 14, 2024
Viaarxiv icon

LeLaN: Learning A Language-Conditioned Navigation Policy from In-the-Wild Videos

Add code
Oct 04, 2024
Viaarxiv icon

Gen2Act: Human Video Generation in Novel Scenarios enables Generalizable Robot Manipulation

Add code
Sep 24, 2024
Viaarxiv icon

A System and Benchmark for LLM-based Q&A on Heterogeneous Data

Add code
Sep 10, 2024
Viaarxiv icon

Mobility VLA: Multimodal Instruction Navigation with Long-Context VLMs and Topological Graphs

Add code
Jul 10, 2024
Figure 1 for Mobility VLA: Multimodal Instruction Navigation with Long-Context VLMs and Topological Graphs
Figure 2 for Mobility VLA: Multimodal Instruction Navigation with Long-Context VLMs and Topological Graphs
Figure 3 for Mobility VLA: Multimodal Instruction Navigation with Long-Context VLMs and Topological Graphs
Figure 4 for Mobility VLA: Multimodal Instruction Navigation with Long-Context VLMs and Topological Graphs
Viaarxiv icon

SELFI: Autonomous Self-Improvement with Reinforcement Learning for Social Navigation

Add code
Mar 01, 2024
Viaarxiv icon

Pushing the Limits of Cross-Embodiment Learning for Manipulation and Navigation

Add code
Feb 29, 2024
Viaarxiv icon

GOAT: GO to Any Thing

Add code
Nov 10, 2023
Viaarxiv icon