Picture for Jinlin Xiao

Jinlin Xiao

OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning

Add code
Dec 22, 2024
Viaarxiv icon

o1-Coder: an o1 Replication for Coding

Add code
Nov 29, 2024
Figure 1 for o1-Coder: an o1 Replication for Coding
Figure 2 for o1-Coder: an o1 Replication for Coding
Figure 3 for o1-Coder: an o1 Replication for Coding
Figure 4 for o1-Coder: an o1 Replication for Coding
Viaarxiv icon

Debiasing Vison-Language Models with Text-Only Training

Add code
Oct 12, 2024
Viaarxiv icon

KG-FPQ: Evaluating Factuality Hallucination in LLMs with Knowledge Graph-based False Premise Questions

Add code
Jul 08, 2024
Figure 1 for KG-FPQ: Evaluating Factuality Hallucination in LLMs with Knowledge Graph-based False Premise Questions
Figure 2 for KG-FPQ: Evaluating Factuality Hallucination in LLMs with Knowledge Graph-based False Premise Questions
Figure 3 for KG-FPQ: Evaluating Factuality Hallucination in LLMs with Knowledge Graph-based False Premise Questions
Figure 4 for KG-FPQ: Evaluating Factuality Hallucination in LLMs with Knowledge Graph-based False Premise Questions
Viaarxiv icon

Improving Weak-to-Strong Generalization with Scalable Oversight and Ensemble Learning

Add code
Feb 01, 2024
Viaarxiv icon