Picture for Peng Li

Peng Li

DJI Innovations Inc

Sparse Forcing: Native Trainable Sparse Attention for Real-time Autoregressive Diffusion Video Generation

Add code
Apr 23, 2026
Viaarxiv icon

SPPO: Sequence-Level PPO for Long-Horizon Reasoning Tasks

Add code
Apr 10, 2026
Viaarxiv icon

Know3D: Prompting 3D Generation with Knowledge from Vision-Language Models

Add code
Mar 24, 2026
Viaarxiv icon

GigaWorld-Policy: An Efficient Action-Centered World--Action Model

Add code
Mar 18, 2026
Viaarxiv icon

Evaluating Time Awareness and Cross-modal Active Perception of Large Models via 4D Escape Room Task

Add code
Mar 16, 2026
Viaarxiv icon

TacMamba: A Tactile History Compression Adapter Bridging Fast Reflexes and Slow VLA Reasoning

Add code
Mar 02, 2026
Viaarxiv icon

Beyond Words: Evaluating and Bridging Epistemic Divergence in User-Agent Interaction via Theory of Mind

Add code
Feb 14, 2026
Viaarxiv icon

FlexAM: Flexible Appearance-Motion Decomposition for Versatile Video Generation Control

Add code
Feb 13, 2026
Viaarxiv icon

GigaBrain-0.5M*: a VLA That Learns From World Model-Based Reinforcement Learning

Add code
Feb 12, 2026
Viaarxiv icon

Reasoning and Tool-use Compete in Agentic RL:From Quantifying Interference to Disentangled Tuning

Add code
Feb 01, 2026
Viaarxiv icon