Heeju Kim

VersaPRM: Multi-Domain Process Reward Model via Synthetic Reasoning Data

Feb 10, 2025
Via arXiv