Junxiao Song

DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search

Aug 15, 2024
Via arXiv

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

Jun 17, 2024
Via arXiv

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Feb 06, 2024
Via arXiv

DeepSeek LLM: Scaling Open-Source Language Models with Longtermism

Jan 05, 2024
Via arXiv

SCC: an efficient deep reinforcement learning agent mastering the game of StarCraft II

Dec 24, 2020
Via arXiv

Sparse Generalized Eigenvalue Problem via Smooth Optimization

Nov 18, 2014
Via arXiv