Zhouyi Qian

GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning

Apr 01, 2025
Via arXiv