Pengwei Wang

Reshaping Action Error Distributions for Reliable Vision-Language-Action Models

Feb 04, 2026

Latent Reasoning VLA: Latent Thinking and Prediction for Vision-Language-Action Models

Feb 01, 2026

RoboBrain 2.5: Depth in Sight, Time in Mind

Jan 20, 2026

Action-Sketcher: From Reasoning to Action via Visual Sketches for Long-Horizon Robotic Manipulation

Jan 04, 2026

RoboMirror: Understand Before You Imitate for Video to Humanoid Locomotion

Dec 30, 2025

Robo-Dopamine: General Process Reward Modeling for High-Precision Robotic Manipulation

Dec 29, 2025

Do You Have Freestyle? Expressive Humanoid Locomotion via Audio Control

Dec 29, 2025

RoboTracer: Mastering Spatial Trace with Reasoning in Vision-Language Models for Robotics

Dec 15, 2025

PIGEON: VLM-Driven Object Navigation via Points of Interest Selection

Nov 17, 2025

GridPrune: From "Where to Look" to "What to Select" in Visual Token Pruning for MLLMs

Nov 13, 2025