Picture for Chengpeng Li

Chengpeng Li

Qwen2.5-Math Technical Report: Toward Mathematical Expert Model via Self-Improvement

Add code
Sep 18, 2024
Viaarxiv icon

Qwen2 Technical Report

Add code
Jul 16, 2024
Figure 1 for Qwen2 Technical Report
Figure 2 for Qwen2 Technical Report
Figure 3 for Qwen2 Technical Report
Figure 4 for Qwen2 Technical Report
Viaarxiv icon

DotaMath: Decomposition of Thought with Code Assistance and Self-correction for Mathematical Reasoning

Add code
Jul 04, 2024
Viaarxiv icon

Self-play with Execution Feedback: Improving Instruction-following Capabilities of Large Language Models

Add code
Jun 19, 2024
Viaarxiv icon

Model-enhanced Contrastive Reinforcement Learning for Sequential Recommendation

Add code
Oct 25, 2023
Viaarxiv icon

Query and Response Augmentation Cannot Help Out-of-domain Math Reasoning Generalization

Add code
Oct 09, 2023
Figure 1 for Query and Response Augmentation Cannot Help Out-of-domain Math Reasoning Generalization
Figure 2 for Query and Response Augmentation Cannot Help Out-of-domain Math Reasoning Generalization
Figure 3 for Query and Response Augmentation Cannot Help Out-of-domain Math Reasoning Generalization
Figure 4 for Query and Response Augmentation Cannot Help Out-of-domain Math Reasoning Generalization
Viaarxiv icon

How Abilities in Large Language Models are Affected by Supervised Fine-tuning Data Composition

Add code
Oct 09, 2023
Figure 1 for How Abilities in Large Language Models are Affected by Supervised Fine-tuning Data Composition
Figure 2 for How Abilities in Large Language Models are Affected by Supervised Fine-tuning Data Composition
Figure 3 for How Abilities in Large Language Models are Affected by Supervised Fine-tuning Data Composition
Figure 4 for How Abilities in Large Language Models are Affected by Supervised Fine-tuning Data Composition
Viaarxiv icon

Scaling Relationship on Learning Mathematical Reasoning with Large Language Models

Add code
Aug 03, 2023
Viaarxiv icon