Cheng Chi

Reshaping Action Error Distributions for Reliable Vision-Language-Action Models

Feb 04, 2026
Via arXiv

Latent Reasoning VLA: Latent Thinking and Prediction for Vision-Language-Action Models

Feb 01, 2026
Via arXiv

RoboBrain 2.5: Depth in Sight, Time in Mind

Jan 20, 2026
Via arXiv

Action-Sketcher: From Reasoning to Action via Visual Sketches for Long-Horizon Robotic Manipulation

Jan 04, 2026
Via arXiv

RoboMirror: Understand Before You Imitate for Video to Humanoid Locomotion

Dec 30, 2025
Via arXiv

Robo-Dopamine: General Process Reward Modeling for High-Precision Robotic Manipulation

Dec 29, 2025
Via arXiv

Do You Have Freestyle? Expressive Humanoid Locomotion via Audio Control

Dec 29, 2025
Via arXiv

Embodied Robot Manipulation in the Era of Foundation Models: Planning and Learning Perspectives

Dec 28, 2025
Via arXiv

RoboTracer: Mastering Spatial Trace with Reasoning in Vision-Language Models for Robotics

Dec 15, 2025
Via arXiv

PIGEON: VLM-Driven Object Navigation via Points of Interest Selection

Nov 17, 2025
Via arXiv