Qiufeng Yin

Scaling Data Difficulty: Improving Coding Models via Reinforcement Learning on Fresh and Challenging Problems

Mar 08, 2026
Via arXiv

MMLU-CF: A Contamination-free Multi-task Language Understanding Benchmark

Dec 19, 2024
Via arXiv

RedStone: Curating General, Code, Math, and QA Data for Large Language Models

Dec 04, 2024
Via arXiv

WaveCoder: Widespread And Versatile Enhanced Instruction Tuning with Refined Data Generation

Jan 11, 2024
Via arXiv