Picture for Yongxuan Lv

Yongxuan Lv

PRPO: Aligning Process Reward with Outcome Reward in Policy Optimization

Add code
Jan 13, 2026
Viaarxiv icon

You Only Click Once: Single Point Weakly Supervised 3D Instance Segmentation for Autonomous Driving

Add code
Feb 28, 2025
Viaarxiv icon