Picture for Zijing Hu

Zijing Hu

Towards Better Alignment: Training Diffusion Models with Reinforcement Learning Against Sparse Rewards

Add code
Mar 14, 2025
Viaarxiv icon