Picture for Zhexu Wang

Zhexu Wang

CodeCriticBench: A Holistic Code Critique Benchmark for Large Language Models

Add code
Feb 23, 2025
Viaarxiv icon

How Do Your Code LLMs Perform? Empowering Code Instruction Tuning with High-Quality Data

Add code
Sep 05, 2024
Viaarxiv icon

CS-Bench: A Comprehensive Benchmark for Large Language Models towards Computer Science Mastery

Add code
Jun 12, 2024
Figure 1 for CS-Bench: A Comprehensive Benchmark for Large Language Models towards Computer Science Mastery
Figure 2 for CS-Bench: A Comprehensive Benchmark for Large Language Models towards Computer Science Mastery
Figure 3 for CS-Bench: A Comprehensive Benchmark for Large Language Models towards Computer Science Mastery
Figure 4 for CS-Bench: A Comprehensive Benchmark for Large Language Models towards Computer Science Mastery
Viaarxiv icon