Picture for Chao Peng

Chao Peng

Evaluating LLM-Based 0-to-1 Software Generation in End-to-End CLI Tool Scenarios

Add code
Apr 08, 2026
Viaarxiv icon

Evaluating Repository-level Software Documentation via Question Answering and Feature-Driven Development

Add code
Apr 08, 2026
Viaarxiv icon

Rethinking the Value of Agent-Generated Tests for LLM-Based Software Engineering Agents

Add code
Feb 08, 2026
Viaarxiv icon

Yunjue Agent Tech Report: A Fully Reproducible, Zero-Start In-Situ Self-Evolving Agent System for Open-Ended Tasks

Add code
Jan 26, 2026
Viaarxiv icon

Self-Augmented Mixture-of-Experts for QoS Prediction

Add code
Jan 16, 2026
Viaarxiv icon

Combating Spurious Correlations in Graph Interpretability via Self-Reflection

Add code
Jan 16, 2026
Viaarxiv icon

AdaGReS:Adaptive Greedy Context Selection via Redundancy-Aware Scoring for Token-Budgeted RAG

Add code
Dec 31, 2025
Viaarxiv icon

Benchmarking LLMs for Fine-Grained Code Review with Enriched Context in Practice

Add code
Nov 10, 2025
Figure 1 for Benchmarking LLMs for Fine-Grained Code Review with Enriched Context in Practice
Figure 2 for Benchmarking LLMs for Fine-Grained Code Review with Enriched Context in Practice
Figure 3 for Benchmarking LLMs for Fine-Grained Code Review with Enriched Context in Practice
Figure 4 for Benchmarking LLMs for Fine-Grained Code Review with Enriched Context in Practice
Viaarxiv icon

Scalable Supervising Software Agents with Patch Reasoner

Add code
Oct 26, 2025
Figure 1 for Scalable Supervising Software Agents with Patch Reasoner
Figure 2 for Scalable Supervising Software Agents with Patch Reasoner
Figure 3 for Scalable Supervising Software Agents with Patch Reasoner
Figure 4 for Scalable Supervising Software Agents with Patch Reasoner
Viaarxiv icon

SafeGenBench: A Benchmark Framework for Security Vulnerability Detection in LLM-Generated Code

Add code
Jun 06, 2025
Viaarxiv icon