Picture for Pengwei Wang

Pengwei Wang

From Language to Locomotion: Retargeting-free Humanoid Control via Motion Latent Guidance

Add code
Oct 16, 2025
Viaarxiv icon

TIGeR: Tool-Integrated Geometric Reasoning in Vision-Language Models for Robotics

Add code
Oct 08, 2025
Viaarxiv icon

MathSticks: A Benchmark for Visual Symbolic Compositional Reasoning with Matchstick Puzzles

Add code
Oct 01, 2025
Viaarxiv icon

$NavA^3$: Understanding Any Instruction, Navigating Anywhere, Finding Anything

Add code
Aug 06, 2025
Viaarxiv icon

AC-DiT: Adaptive Coordination Diffusion Transformer for Mobile Manipulation

Add code
Jul 02, 2025
Viaarxiv icon

RoboBrain 2.0 Technical Report

Add code
Jul 02, 2025
Viaarxiv icon

SafeMap: Robust HD Map Construction from Incomplete Observations

Add code
Jul 01, 2025
Viaarxiv icon

Latent Anomaly Detection: Masked VQ-GAN for Unsupervised Segmentation in Medical CBCT

Add code
Jun 17, 2025
Viaarxiv icon

Video-CoT: A Comprehensive Dataset for Spatiotemporal Understanding of Videos Based on Chain-of-Thought

Add code
Jun 12, 2025
Viaarxiv icon

RoboRefer: Towards Spatial Referring with Reasoning in Vision-Language Models for Robotics

Add code
Jun 04, 2025
Viaarxiv icon