Picture for Dhruv Shah

Dhruv Shah

Gemini Robotics: Bringing AI into the Physical World

Add code
Mar 25, 2025
Viaarxiv icon

A Taxonomy for Evaluating Generalist Robot Policies

Add code
Mar 03, 2025
Viaarxiv icon

Robot Data Curation with Mutual Information Estimators

Add code
Feb 12, 2025
Viaarxiv icon

Vision Language Models are In-Context Value Learners

Add code
Nov 07, 2024
Figure 1 for Vision Language Models are In-Context Value Learners
Figure 2 for Vision Language Models are In-Context Value Learners
Figure 3 for Vision Language Models are In-Context Value Learners
Figure 4 for Vision Language Models are In-Context Value Learners
Viaarxiv icon

STEER: Flexible Robotic Manipulation via Dense Language Grounding

Add code
Nov 05, 2024
Figure 1 for STEER: Flexible Robotic Manipulation via Dense Language Grounding
Figure 2 for STEER: Flexible Robotic Manipulation via Dense Language Grounding
Figure 3 for STEER: Flexible Robotic Manipulation via Dense Language Grounding
Figure 4 for STEER: Flexible Robotic Manipulation via Dense Language Grounding
Viaarxiv icon

Traversability-Aware Legged Navigation by Learning from Real-World Visual Data

Add code
Oct 14, 2024
Viaarxiv icon

LeLaN: Learning A Language-Conditioned Navigation Policy from In-the-Wild Videos

Add code
Oct 04, 2024
Viaarxiv icon

Gen2Act: Human Video Generation in Novel Scenarios enables Generalizable Robot Manipulation

Add code
Sep 24, 2024
Figure 1 for Gen2Act: Human Video Generation in Novel Scenarios enables Generalizable Robot Manipulation
Figure 2 for Gen2Act: Human Video Generation in Novel Scenarios enables Generalizable Robot Manipulation
Figure 3 for Gen2Act: Human Video Generation in Novel Scenarios enables Generalizable Robot Manipulation
Figure 4 for Gen2Act: Human Video Generation in Novel Scenarios enables Generalizable Robot Manipulation
Viaarxiv icon

A System and Benchmark for LLM-based Q&A on Heterogeneous Data

Add code
Sep 10, 2024
Viaarxiv icon

Mobility VLA: Multimodal Instruction Navigation with Long-Context VLMs and Topological Graphs

Add code
Jul 10, 2024
Figure 1 for Mobility VLA: Multimodal Instruction Navigation with Long-Context VLMs and Topological Graphs
Figure 2 for Mobility VLA: Multimodal Instruction Navigation with Long-Context VLMs and Topological Graphs
Figure 3 for Mobility VLA: Multimodal Instruction Navigation with Long-Context VLMs and Topological Graphs
Figure 4 for Mobility VLA: Multimodal Instruction Navigation with Long-Context VLMs and Topological Graphs
Viaarxiv icon