Picture for Wenting Zhao

Wenting Zhao

Multi-Turn Code Generation Through Single-Step Rewards

Add code
Feb 27, 2025
Viaarxiv icon

ProjectTest: A Project-level LLM Unit Test Generation Benchmark and Impact of Error Fixing Mechanisms

Add code
Feb 11, 2025
Figure 1 for ProjectTest: A Project-level LLM Unit Test Generation Benchmark and Impact of Error Fixing Mechanisms
Figure 2 for ProjectTest: A Project-level LLM Unit Test Generation Benchmark and Impact of Error Fixing Mechanisms
Figure 3 for ProjectTest: A Project-level LLM Unit Test Generation Benchmark and Impact of Error Fixing Mechanisms
Figure 4 for ProjectTest: A Project-level LLM Unit Test Generation Benchmark and Impact of Error Fixing Mechanisms
Viaarxiv icon

ProjectTest: A Project-level Unit Test Generation Benchmark and Impact of Error Fixing Mechanisms

Add code
Feb 10, 2025
Figure 1 for ProjectTest: A Project-level Unit Test Generation Benchmark and Impact of Error Fixing Mechanisms
Figure 2 for ProjectTest: A Project-level Unit Test Generation Benchmark and Impact of Error Fixing Mechanisms
Figure 3 for ProjectTest: A Project-level Unit Test Generation Benchmark and Impact of Error Fixing Mechanisms
Figure 4 for ProjectTest: A Project-level Unit Test Generation Benchmark and Impact of Error Fixing Mechanisms
Viaarxiv icon

Commit0: Library Generation from Scratch

Add code
Dec 02, 2024
Viaarxiv icon

Are Triggers Needed for Document-Level Event Extraction?

Add code
Nov 13, 2024
Viaarxiv icon

A Controlled Study on Long Context Extension and Generalization in LLMs

Add code
Sep 18, 2024
Figure 1 for A Controlled Study on Long Context Extension and Generalization in LLMs
Figure 2 for A Controlled Study on Long Context Extension and Generalization in LLMs
Figure 3 for A Controlled Study on Long Context Extension and Generalization in LLMs
Figure 4 for A Controlled Study on Long Context Extension and Generalization in LLMs
Viaarxiv icon

WildVis: Open Source Visualizer for Million-Scale Chat Logs in the Wild

Add code
Sep 05, 2024
Viaarxiv icon

Great Memory, Shallow Reasoning: Limits of $k$NN-LMs

Add code
Aug 21, 2024
Figure 1 for Great Memory, Shallow Reasoning: Limits of $k$NN-LMs
Figure 2 for Great Memory, Shallow Reasoning: Limits of $k$NN-LMs
Figure 3 for Great Memory, Shallow Reasoning: Limits of $k$NN-LMs
Figure 4 for Great Memory, Shallow Reasoning: Limits of $k$NN-LMs
Viaarxiv icon

WildHallucinations: Evaluating Long-form Factuality in LLMs with Real-World Entity Queries

Add code
Jul 24, 2024
Figure 1 for WildHallucinations: Evaluating Long-form Factuality in LLMs with Real-World Entity Queries
Figure 2 for WildHallucinations: Evaluating Long-form Factuality in LLMs with Real-World Entity Queries
Figure 3 for WildHallucinations: Evaluating Long-form Factuality in LLMs with Real-World Entity Queries
Figure 4 for WildHallucinations: Evaluating Long-form Factuality in LLMs with Real-World Entity Queries
Viaarxiv icon

I Could've Asked That: Reformulating Unanswerable Questions

Add code
Jul 24, 2024
Figure 1 for I Could've Asked That: Reformulating Unanswerable Questions
Figure 2 for I Could've Asked That: Reformulating Unanswerable Questions
Figure 3 for I Could've Asked That: Reformulating Unanswerable Questions
Figure 4 for I Could've Asked That: Reformulating Unanswerable Questions
Viaarxiv icon