Chen Xing

ProjectTest: A Project-level LLM Unit Test Generation Benchmark and Impact of Error Fixing Mechanisms

Feb 11, 2025
Via arXiv

ProjectTest: A Project-level Unit Test Generation Benchmark and Impact of Error Fixing Mechanisms

Feb 10, 2025
Via arXiv

MultiChallenge: A Realistic Multi-Turn Conversation Evaluation Benchmark Challenging to Frontier LLMs

Jan 29, 2025
Via arXiv

ReGenesis: LLMs can Grow into Reasoning Generalists via Self-Improvement

Oct 03, 2024
Via arXiv

LLMs Assist NLP Researchers: Critique Paper (Meta-)Reviewing

Jun 25, 2024
Via arXiv

FOFO: A Benchmark to Evaluate LLMs' Format-Following Capability

Feb 28, 2024
Via arXiv

Lemur: Harmonizing Natural Language and Code for Language Agents

Oct 10, 2023
Via arXiv

XGen-7B Technical Report

Sep 07, 2023
Via arXiv

Mask-free OVIS: Open-Vocabulary Instance Segmentation without Manual Mask Annotations

Mar 29, 2023
Via arXiv

GlueGen: Plug and Play Multi-modal Encoders for X-to-image Generation

Mar 17, 2023
Via arXiv