Xuanming Zhang

Seeker: Enhancing Exception Handling in Code with LLM-based Multi-Agent Approach

Oct 09, 2024

DECOR: Improving Coherence in L2 English Writing with a Novel Benchmark for Incoherence Detection, Reasoning, and Rewriting

Jun 28, 2024

VarBench: Robust Language Model Benchmarking Through Dynamic Variable Perturbation

Jun 26, 2024

DevEval: A Manually-Annotated Code Generation Benchmark Aligned with Real-World Code Repositories

May 30, 2024

EvoCodeBench: An Evolving Code Generation Benchmark Aligned with Real-World Code Repositories

Mar 31, 2024

DevEval: Evaluating Code Generation in Practical Software Projects

Jan 26, 2024

ProLex: A Benchmark for Language Proficiency-oriented Lexical Substitution

Jan 21, 2024

Carbon Emission Prediction and Clean Industry Transformation Based on Machine Learning: A Case Study of Sichuan Province

Sep 03, 2023

A High-fidelity, Machine-learning Enhanced Queueing Network Simulation Model for Hospital Ultrasound Operations

Apr 12, 2021