Jiangming Shu

OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning

Dec 22, 2024
Via arXiv

o1-Coder: an o1 Replication for Coding

Nov 29, 2024
Via arXiv