Pengfei Liu

DeepResearcher: Scaling Deep Research via Reinforcement Learning in Real-world Environments

Apr 07, 2025, via arXiv

Rethinking RL Scaling for Vision Language Models: A Transparent, From-Scratch Framework and Comprehensive Evaluation Scheme

Apr 03, 2025, via arXiv

ToRL: Scaling Tool-Integrated RL

Mar 30, 2025, via arXiv

RepoST: Scalable Repository-Level Coding Environment Construction with Sandbox Testing

Mar 10, 2025, via arXiv

LIMR: Less is More for RL Scaling

Feb 17, 2025, via arXiv

LIMO: Less is More for Reasoning

Feb 05, 2025, via arXiv

Survey and Improvement Strategies for Gene Prioritization with Large Language Models

Jan 30, 2025, via arXiv

O1 Replication Journey -- Part 3: Inference-time Scaling for Medical Reasoning

Jan 11, 2025, via arXiv

DIVE: Diversified Iterative Self-Improvement

Jan 01, 2025, via arXiv

Libra-Leaderboard: Towards Responsible AI through a Balanced Leaderboard of Safety and Capability

Dec 24, 2024, via arXiv