
Jinhao Jiang

R1-Searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning

Mar 07, 2025

An Empirical Study on Eliciting and Improving R1-like Reasoning Models

Mar 06, 2025

LongReD: Mitigating Short-Text Degradation of Long-Context Large Language Models via Restoration Distillation

Feb 11, 2025

Holistically Guided Monte Carlo Tree Search for Intricate Information Seeking

Feb 07, 2025

YuLan-Mini: An Open Data-efficient Language Model

Dec 24, 2024

RAG-Star: Enhancing Deliberative Reasoning with Retrieval Augmented Verification and Refinement

Dec 17, 2024

Imitate, Explore, and Self-Improve: A Reproduction Report on Slow-thinking Reasoning Systems

Dec 12, 2024

Technical Report: Enhancing LLM Reasoning with Reward-guided Tree Search

Nov 18, 2024

Towards Effective and Efficient Continual Pre-training of Large Language Models

Jul 26, 2024

Mix-CPT: A Domain Adaptation Framework via Decoupling Knowledge Learning and Format Alignment

Jul 15, 2024