Picture for Yujie Zhou

Yujie Zhou

CapRL++: Unified Reinforcement Learning with Verifiable Rewards for Dense Image and Video Captioning

Add code
Jun 08, 2026
Viaarxiv icon

AdaGRPO: A Capability-Aware Adaptive Enhancement for Flow-based GRPO

Add code
Jun 05, 2026
Viaarxiv icon

Pave-GRPO: Beyond Instantaneous Guidance through Principled Average Velocity Decomposition

Add code
Jun 01, 2026
Viaarxiv icon

SANEmerg: An Emergent Communication Framework for Semantic-aware Agentic AI Networking

Add code
May 07, 2026
Viaarxiv icon

Uni-Classifier: Leveraging Video Diffusion Priors for Universal Guidance Classifier

Add code
Mar 20, 2026
Viaarxiv icon

From Sparse to Dense: Multi-View GRPO for Flow Models via Augmented Condition Space

Add code
Mar 13, 2026
Viaarxiv icon

EndoCoT: Scaling Endogenous Chain-of-Thought Reasoning in Diffusion Models

Add code
Mar 12, 2026
Viaarxiv icon

Unified Personalized Reward Model for Vision Generation

Add code
Feb 02, 2026
Viaarxiv icon

DynaDrag: Dynamic Drag-Style Image Editing by Motion Prediction

Add code
Jan 02, 2026
Viaarxiv icon

$\text{G}^2$RPO: Granular GRPO for Precise Reward in Flow Models

Add code
Oct 02, 2025
Figure 1 for $\text{G}^2$RPO: Granular GRPO for Precise Reward in Flow Models
Figure 2 for $\text{G}^2$RPO: Granular GRPO for Precise Reward in Flow Models
Figure 3 for $\text{G}^2$RPO: Granular GRPO for Precise Reward in Flow Models
Figure 4 for $\text{G}^2$RPO: Granular GRPO for Precise Reward in Flow Models
Viaarxiv icon