Yuming Li

HPSv3++: Scaling Reward Models Across the Full Spectrum of Diffusion Model Capabilities

Jun 12, 2026
Via arXiv

Ultra Flash: Scaling Real-Time Streaming Video Generation to High Resolutions

Jun 08, 2026
Via arXiv

Echo-Memory: A Controlled Study of Memory in Action World Models

Jun 08, 2026
Via arXiv

OmniNFT: Modality-wise Omni Diffusion Reinforcement for Joint Audio-Video Generation

May 12, 2026
Via arXiv

A Systematic Post-Train Framework for Video Generation

Apr 28, 2026
Via arXiv

Beyond the Golden Data: Resolving the Motion-Vision Quality Dilemma via Timestep Selective Training

Mar 26, 2026
Via arXiv

OmniForcing: Unleashing Real-time Joint Audio-Visual Generation

Mar 12, 2026
Via arXiv

EchoTorrent: Towards Swift, Sustained, and Streaming Multi-Modal Video Generation

Feb 14, 2026
Via arXiv

AEGPO: Adaptive Entropy-Guided Policy Optimization for Diffusion Models

Feb 06, 2026
Via arXiv

QVLA: Not All Channels Are Equal in Vision-Language-Action Model's Quantization

Feb 03, 2026
Via arXiv