Picture for Shuanghao Bai

Shuanghao Bai

Latent Reasoning VLA: Latent Thinking and Prediction for Vision-Language-Action Models

Add code
Feb 01, 2026
Viaarxiv icon

RoboMirror: Understand Before You Imitate for Video to Humanoid Locomotion

Add code
Dec 30, 2025
Viaarxiv icon

Embodied Robot Manipulation in the Era of Foundation Models: Planning and Learning Perspectives

Add code
Dec 28, 2025
Viaarxiv icon

Long-VLA: Unleashing Long-Horizon Capability of Vision Language Action Model for Robot Manipulation

Add code
Aug 28, 2025
Viaarxiv icon

Dual-Path Stable Soft Prompt Generation for Domain Generalization

Add code
May 24, 2025
Viaarxiv icon

OpenHelix: A Short Survey, Empirical Analysis, and Open-Source Dual-System VLA Model for Robotic Manipulation

Add code
May 06, 2025
Figure 1 for OpenHelix: A Short Survey, Empirical Analysis, and Open-Source Dual-System VLA Model for Robotic Manipulation
Figure 2 for OpenHelix: A Short Survey, Empirical Analysis, and Open-Source Dual-System VLA Model for Robotic Manipulation
Figure 3 for OpenHelix: A Short Survey, Empirical Analysis, and Open-Source Dual-System VLA Model for Robotic Manipulation
Figure 4 for OpenHelix: A Short Survey, Empirical Analysis, and Open-Source Dual-System VLA Model for Robotic Manipulation
Viaarxiv icon

VLAS: Vision-Language-Action Model With Speech Instructions For Customized Robot Manipulation

Add code
Feb 19, 2025
Viaarxiv icon

PromptTA: Prompt-driven Text Adapter for Source-free Domain Generalization

Add code
Sep 21, 2024
Viaarxiv icon

Jacobian Regularizer-based Neural Granger Causality

Add code
May 14, 2024
Figure 1 for Jacobian Regularizer-based Neural Granger Causality
Figure 2 for Jacobian Regularizer-based Neural Granger Causality
Figure 3 for Jacobian Regularizer-based Neural Granger Causality
Figure 4 for Jacobian Regularizer-based Neural Granger Causality
Viaarxiv icon

Revisiting the Adversarial Robustness of Vision Language Models: a Multimodal Perspective

Add code
Apr 30, 2024
Viaarxiv icon