Picture for Kele Huang

Kele Huang

Less is more: Not all samples are effective for evaluation

Add code
Dec 22, 2025
Viaarxiv icon

CWEval: Outcome-driven Evaluation on Functionality and Security of LLM Code Generation

Add code
Jan 14, 2025
Figure 1 for CWEval: Outcome-driven Evaluation on Functionality and Security of LLM Code Generation
Figure 2 for CWEval: Outcome-driven Evaluation on Functionality and Security of LLM Code Generation
Figure 3 for CWEval: Outcome-driven Evaluation on Functionality and Security of LLM Code Generation
Figure 4 for CWEval: Outcome-driven Evaluation on Functionality and Security of LLM Code Generation
Viaarxiv icon