Huatong Song

SWE-Master: Unleashing the Potential of Software Engineering Agents via Post-Training

Feb 03, 2026

SWE-World: Building Software Engineering Agents in Docker-Free Environments

Feb 03, 2026

LLM-in-Sandbox Elicits General Agentic Intelligence

Jan 22, 2026

R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning

May 22, 2025

SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory Synthesis

May 22, 2025

R1-Searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning

Mar 07, 2025

YuLan-Mini: An Open Data-efficient Language Model

Dec 24, 2024

Imitate, Explore, and Self-Improve: A Reproduction Report on Slow-thinking Reasoning Systems

Dec 12, 2024