Christian Classen

VersaPRM: Multi-Domain Process Reward Model via Synthetic Reasoning Data

Feb 10, 2025
Via arXiv