Picture for Zhibin Gou

Zhibin Gou

DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search

Add code
Aug 15, 2024
Figure 1 for DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search
Figure 2 for DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search
Figure 3 for DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search
Figure 4 for DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search
Viaarxiv icon

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

Add code
Jun 17, 2024
Figure 1 for DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
Figure 2 for DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
Figure 3 for DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
Figure 4 for DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
Viaarxiv icon

Rho-1: Not All Tokens Are What You Need

Add code
Apr 11, 2024
Figure 1 for Rho-1: Not All Tokens Are What You Need
Figure 2 for Rho-1: Not All Tokens Are What You Need
Figure 3 for Rho-1: Not All Tokens Are What You Need
Figure 4 for Rho-1: Not All Tokens Are What You Need
Viaarxiv icon

Exploring the Mystery of Influential Data for Mathematical Reasoning

Add code
Apr 01, 2024
Figure 1 for Exploring the Mystery of Influential Data for Mathematical Reasoning
Figure 2 for Exploring the Mystery of Influential Data for Mathematical Reasoning
Figure 3 for Exploring the Mystery of Influential Data for Mathematical Reasoning
Figure 4 for Exploring the Mystery of Influential Data for Mathematical Reasoning
Viaarxiv icon

CriticBench: Benchmarking LLMs for Critique-Correct Reasoning

Add code
Mar 08, 2024
Viaarxiv icon

Key-Point-Driven Data Synthesis with its Enhancement on Mathematical Reasoning

Add code
Mar 04, 2024
Viaarxiv icon

SciAgent: Tool-augmented Language Models for Scientific Reasoning

Add code
Feb 21, 2024
Viaarxiv icon

ToRA: A Tool-Integrated Reasoning Agent for Mathematical Problem Solving

Add code
Oct 04, 2023
Viaarxiv icon

MvP: Multi-view Prompting Improves Aspect Sentiment Tuple Prediction

Add code
May 22, 2023
Viaarxiv icon

CRITIC: Large Language Models Can Self-Correct with Tool-Interactive Critiquing

Add code
May 19, 2023
Viaarxiv icon