Xingzhong Xu

UCPO: Uncertainty-Aware Policy Optimization

Jan 30, 2026
Via arXiv

LMM-R1: Empowering 3B LMMs with Strong Reasoning Abilities Through Two-Stage Rule-Based RL

Mar 11, 2025
Via arXiv