Picture for Yafu Li

Yafu Li

A Survey of Reinforcement Learning for Large Reasoning Models

Add code
Sep 10, 2025
Viaarxiv icon

Synthesizing Sheet Music Problems for Evaluation and Reinforcement Learning

Add code
Sep 04, 2025
Viaarxiv icon

SafeWork-R1: Coevolving Safety and Intelligence under the AI-45$^{\circ}$ Law

Add code
Jul 24, 2025
Viaarxiv icon

Advancing Multimodal Reasoning: From Optimized Cold Start to Staged Reinforcement Learning

Add code
Jun 04, 2025
Viaarxiv icon

Scaling Reasoning, Losing Control: Evaluating Instruction Following in Large Reasoning Models

Add code
May 20, 2025
Viaarxiv icon

Learning to Reason under Off-Policy Guidance

Add code
Apr 22, 2025
Viaarxiv icon

SEE: Continual Fine-tuning with Sequential Ensemble of Experts

Add code
Apr 09, 2025
Viaarxiv icon

A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond

Add code
Mar 27, 2025
Viaarxiv icon

Lost in Literalism: How Supervised Training Shapes Translationese in LLMs

Add code
Mar 06, 2025
Figure 1 for Lost in Literalism: How Supervised Training Shapes Translationese in LLMs
Figure 2 for Lost in Literalism: How Supervised Training Shapes Translationese in LLMs
Figure 3 for Lost in Literalism: How Supervised Training Shapes Translationese in LLMs
Figure 4 for Lost in Literalism: How Supervised Training Shapes Translationese in LLMs
Viaarxiv icon

Multi-LLM Collaborative Search for Complex Problem Solving

Add code
Feb 26, 2025
Viaarxiv icon