Picture for Huimu Yu

Huimu Yu

CodePMP: Scalable Preference Model Pretraining for Large Language Model Reasoning

Add code
Oct 03, 2024
Viaarxiv icon