Picture for Pengwei Wang

Pengwei Wang

RoboBrain 2.0 Technical Report

Add code
Jul 02, 2025
Viaarxiv icon

AC-DiT: Adaptive Coordination Diffusion Transformer for Mobile Manipulation

Add code
Jul 02, 2025
Viaarxiv icon

SafeMap: Robust HD Map Construction from Incomplete Observations

Add code
Jul 01, 2025
Viaarxiv icon

Latent Anomaly Detection: Masked VQ-GAN for Unsupervised Segmentation in Medical CBCT

Add code
Jun 17, 2025
Viaarxiv icon

Video-CoT: A Comprehensive Dataset for Spatiotemporal Understanding of Videos Based on Chain-of-Thought

Add code
Jun 12, 2025
Viaarxiv icon

RoboRefer: Towards Spatial Referring with Reasoning in Vision-Language Models for Robotics

Add code
Jun 04, 2025
Viaarxiv icon

RoboOS: A Hierarchical Embodied Framework for Cross-Embodiment and Multi-Agent Collaboration

Add code
May 06, 2025
Viaarxiv icon

Token Communication-Driven Multimodal Large Models in Resource-Constrained Multiuser Networks

Add code
May 06, 2025
Viaarxiv icon

Reason-RFT: Reinforcement Fine-Tuning for Visual Reasoning

Add code
Mar 27, 2025
Viaarxiv icon

Modeling Variants of Prompts for Vision-Language Models

Add code
Mar 11, 2025
Viaarxiv icon