Picture for Zhihong Shao

Zhihong Shao

DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search

Add code
Aug 15, 2024
Figure 1 for DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search
Figure 2 for DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search
Figure 3 for DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search
Figure 4 for DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search
Viaarxiv icon

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

Add code
Jun 17, 2024
Figure 1 for DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
Figure 2 for DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
Figure 3 for DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
Figure 4 for DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
Viaarxiv icon

Learning Task Decomposition to Assist Humans in Competitive Programming

Add code
Jun 07, 2024
Figure 1 for Learning Task Decomposition to Assist Humans in Competitive Programming
Figure 2 for Learning Task Decomposition to Assist Humans in Competitive Programming
Figure 3 for Learning Task Decomposition to Assist Humans in Competitive Programming
Figure 4 for Learning Task Decomposition to Assist Humans in Competitive Programming
Viaarxiv icon

DeepSeek-Prover: Advancing Theorem Proving in LLMs through Large-Scale Synthetic Data

Add code
May 23, 2024
Viaarxiv icon

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Add code
Feb 06, 2024
Figure 1 for DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
Figure 2 for DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
Figure 3 for DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
Figure 4 for DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
Viaarxiv icon

DeepSeek LLM: Scaling Open-Source Language Models with Longtermism

Add code
Jan 05, 2024
Figure 1 for DeepSeek LLM: Scaling Open-Source Language Models with Longtermism
Figure 2 for DeepSeek LLM: Scaling Open-Source Language Models with Longtermism
Figure 3 for DeepSeek LLM: Scaling Open-Source Language Models with Longtermism
Figure 4 for DeepSeek LLM: Scaling Open-Source Language Models with Longtermism
Viaarxiv icon

Math-Shepherd: Verify and Reinforce LLMs Step-by-step without Human Annotations

Add code
Dec 28, 2023
Figure 1 for Math-Shepherd: Verify and Reinforce LLMs Step-by-step without Human Annotations
Figure 2 for Math-Shepherd: Verify and Reinforce LLMs Step-by-step without Human Annotations
Figure 3 for Math-Shepherd: Verify and Reinforce LLMs Step-by-step without Human Annotations
Figure 4 for Math-Shepherd: Verify and Reinforce LLMs Step-by-step without Human Annotations
Viaarxiv icon

ToRA: A Tool-Integrated Reasoning Agent for Mathematical Problem Solving

Add code
Oct 04, 2023
Viaarxiv icon

Enhancing Retrieval-Augmented Large Language Models with Iterative Retrieval-Generation Synergy

Add code
May 24, 2023
Viaarxiv icon

CRITIC: Large Language Models Can Self-Correct with Tool-Interactive Critiquing

Add code
May 19, 2023
Viaarxiv icon