Jiesong Lian

Euphonium: Steering Video Flow Matching via Process Reward Gradient Guided Stochastic Dynamics

Feb 04, 2026
Via arXiv

SoliReward: Mitigating Susceptibility to Reward Hacking and Annotation Noise in Video Generation Reward Models

Dec 17, 2025
Via arXiv

Fusion-PSRO: Nash Policy Fusion for Policy Space Response Oracles

Jun 03, 2024
Via arXiv