Baishakhi Ray

CWEval: Outcome-driven Evaluation on Functionality and Security of LLM Code Generation

Jan 14, 2025
Via arXiv

Can LLM Prompting Serve as a Proxy for Static Analysis in Vulnerability Detection

Dec 16, 2024
Via arXiv

On Mitigating Code LLM Hallucinations with API Documentation

Jul 13, 2024
Via arXiv

Solving Zebra Puzzles Using Constraint-Guided Multi-Agent Systems

Jul 04, 2024
Via arXiv

Reasoning in Token Economies: Budget-Aware Evaluation of LLM Reasoning Strategies

Jun 11, 2024
Via arXiv

SemCoder: Training Code Language Models with Comprehensive Semantics

Jun 03, 2024
Via arXiv

Training LLMs to Better Self-Debug and Explain Code

May 28, 2024
Via arXiv

Automatic Programming: Large Language Models and Beyond

May 03, 2024
Via arXiv

Vulnerability Detection with Code Language Models: How Far Are We?

Mar 27, 2024
Via arXiv

CYCLE: Learning to Self-Refine the Code Generation

Mar 27, 2024
Via arXiv