Picture for Xueyan Zou

Xueyan Zou

NaVILA: Legged Robot Vision-Language-Action Model for Navigation

Add code
Dec 05, 2024
Viaarxiv icon

WildLMa: Long Horizon Loco-Manipulation in the Wild

Add code
Nov 22, 2024
Figure 1 for WildLMa: Long Horizon Loco-Manipulation in the Wild
Figure 2 for WildLMa: Long Horizon Loco-Manipulation in the Wild
Figure 3 for WildLMa: Long Horizon Loco-Manipulation in the Wild
Figure 4 for WildLMa: Long Horizon Loco-Manipulation in the Wild
Viaarxiv icon

GraspSplats: Efficient Manipulation with 3D Feature Splatting

Add code
Sep 03, 2024
Viaarxiv icon

PLUM: Preference Learning Plus Test Cases Yields Better Code Language Models

Add code
Jun 11, 2024
Viaarxiv icon

Interfacing Foundation Models' Embeddings

Add code
Dec 12, 2023
Viaarxiv icon

LLaVA-Grounding: Grounded Visual Chat with Large Multimodal Models

Add code
Dec 05, 2023
Viaarxiv icon

Visual In-Context Prompting

Add code
Nov 22, 2023
Viaarxiv icon

LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents

Add code
Nov 09, 2023
Viaarxiv icon

Set-of-Mark Prompting Unleashes Extraordinary Visual Grounding in GPT-4V

Add code
Oct 17, 2023
Viaarxiv icon

Semantic-SAM: Segment and Recognize Anything at Any Granularity

Add code
Jul 10, 2023
Viaarxiv icon