Pengxiang Ding

Unicorn: Text-Only Data Synthesis for Vision Language Model Training

Mar 28, 2025
Via arXiv

Exploring the Evolution of Physics Cognition in Video Generation: A Survey

Mar 27, 2025
Via arXiv

MoRE: Unlocking Scalability in Reinforcement Learning for Quadruped Vision-Language-Action Models

Mar 11, 2025
Via arXiv

Accelerating Vision-Language-Action Model Integrated with Action Chunking via Parallel Decoding

Mar 04, 2025
Via arXiv

Humanoid-VLA: Towards Universal Humanoid Control with Visual Integration

Feb 21, 2025
Via arXiv

VLAS: Vision-Language-Action Model With Speech Instructions For Customized Robot Manipulation

Feb 19, 2025
Via arXiv

Score-Based Diffusion Policy Compatible with Reinforcement Learning via Optimal Transport

Feb 18, 2025
Via arXiv

GEVRM: Goal-Expressive Video Generation Model For Robust Visual Manipulation

Feb 13, 2025
Via arXiv

Rethinking Latent Representations in Behavior Cloning: An Information Bottleneck Approach for Robot Manipulation

Feb 05, 2025
Via arXiv

QUART-Online: Latency-Free Large Multimodal Language Model for Quadruped Robot Learning

Dec 23, 2024
Via arXiv