Picture for Zhuo Xu

Zhuo Xu

Vision Language Models are In-Context Value Learners

Add code
Nov 07, 2024
Viaarxiv icon

Imagined Potential Games: A Framework for Simulating, Learning and Evaluating Interactive Behaviors

Add code
Nov 06, 2024
Viaarxiv icon

Mobility VLA: Multimodal Instruction Navigation with Long-Context VLMs and Topological Graphs

Add code
Jul 10, 2024
Figure 1 for Mobility VLA: Multimodal Instruction Navigation with Long-Context VLMs and Topological Graphs
Figure 2 for Mobility VLA: Multimodal Instruction Navigation with Long-Context VLMs and Topological Graphs
Figure 3 for Mobility VLA: Multimodal Instruction Navigation with Long-Context VLMs and Topological Graphs
Figure 4 for Mobility VLA: Multimodal Instruction Navigation with Long-Context VLMs and Topological Graphs
Viaarxiv icon

Ghost imaging-based Non-contact Heart Rate Detection

Add code
Jun 04, 2024
Viaarxiv icon

HC-GAE: The Hierarchical Cluster-based Graph Auto-Encoder for Graph Representation Learning

Add code
May 23, 2024
Viaarxiv icon

SSHPool: The Separated Subgraph-based Hierarchical Pooling

Add code
Mar 24, 2024
Viaarxiv icon

MATRIX: Multi-Agent Trajectory Generation with Diverse Contexts

Add code
Mar 09, 2024
Viaarxiv icon

PIVOT: Iterative Visual Prompting Elicits Actionable Knowledge for VLMs

Add code
Feb 12, 2024
Figure 1 for PIVOT: Iterative Visual Prompting Elicits Actionable Knowledge for VLMs
Figure 2 for PIVOT: Iterative Visual Prompting Elicits Actionable Knowledge for VLMs
Figure 3 for PIVOT: Iterative Visual Prompting Elicits Actionable Knowledge for VLMs
Figure 4 for PIVOT: Iterative Visual Prompting Elicits Actionable Knowledge for VLMs
Viaarxiv icon

Generative Expressive Robot Behaviors using Large Language Models

Add code
Jan 30, 2024
Viaarxiv icon

AutoRT: Embodied Foundation Models for Large Scale Orchestration of Robotic Agents

Add code
Jan 23, 2024
Viaarxiv icon