Picture for Haoang Li

Haoang Li

Rethinking the Practicality of Vision-language-action Model: A Comprehensive Benchmark and An Improved Baseline

Add code
Feb 26, 2026
Viaarxiv icon

Dream-SLAM: Dreaming the Unseen for Active SLAM in Dynamic Environments

Add code
Feb 25, 2026
Viaarxiv icon

FRAPPE: Infusing World Modeling into Generalist Policies via Multiple Future Representation Alignment

Add code
Feb 19, 2026
Viaarxiv icon

Advances in Global Solvers for 3D Vision

Add code
Feb 16, 2026
Viaarxiv icon

Neural Predictor-Corrector: Solving Homotopy Problems with Reinforcement Learning

Add code
Feb 03, 2026
Viaarxiv icon

The RoboSense Challenge: Sense Anything, Navigate Anywhere, Adapt Across Platforms

Add code
Jan 08, 2026
Viaarxiv icon

Embodied Robot Manipulation in the Era of Foundation Models: Planning and Learning Perspectives

Add code
Dec 28, 2025
Viaarxiv icon

FlowVLA: Thinking in Motion with a Visual Chain of Thought

Add code
Aug 25, 2025
Viaarxiv icon

ReconVLA: Reconstructive Vision-Language-Action Model as Effective Robot Perceiver

Add code
Aug 14, 2025
Viaarxiv icon

SkyVLN: Vision-and-Language Navigation and NMPC Control for UAVs in Urban Environments

Add code
Jul 09, 2025
Figure 1 for SkyVLN: Vision-and-Language Navigation and NMPC Control for UAVs in Urban Environments
Figure 2 for SkyVLN: Vision-and-Language Navigation and NMPC Control for UAVs in Urban Environments
Figure 3 for SkyVLN: Vision-and-Language Navigation and NMPC Control for UAVs in Urban Environments
Figure 4 for SkyVLN: Vision-and-Language Navigation and NMPC Control for UAVs in Urban Environments
Viaarxiv icon