Picture for Maliheh Izadi

Maliheh Izadi

Automated Attention Pattern Discovery at Scale in Large Language Models

Add code
Apr 04, 2026
Viaarxiv icon

Investigating Autonomous Agent Contributions in the Wild: Activity Patterns and Code Change over Time

Add code
Apr 01, 2026
Viaarxiv icon

TriCEGAR: A Trace-Driven Abstraction Mechanism for Agentic AI

Add code
Jan 30, 2026
Viaarxiv icon

Model See, Model Do? Exposure-Aware Evaluation of Bug-vs-Fix Preference in Code LLMs

Add code
Jan 15, 2026
Viaarxiv icon

Does In-IDE Calibration of Large Language Models work at Scale?

Add code
Oct 26, 2025
Figure 1 for Does In-IDE Calibration of Large Language Models work at Scale?
Figure 2 for Does In-IDE Calibration of Large Language Models work at Scale?
Figure 3 for Does In-IDE Calibration of Large Language Models work at Scale?
Figure 4 for Does In-IDE Calibration of Large Language Models work at Scale?
Viaarxiv icon

A Qualitative Investigation into LLM-Generated Multilingual Code Comments and Automatic Evaluation Metrics

Add code
May 21, 2025
Viaarxiv icon

Code Red! On the Harmfulness of Applying Off-the-shelf Large Language Models to Programming Tasks

Add code
Apr 02, 2025
Figure 1 for Code Red! On the Harmfulness of Applying Off-the-shelf Large Language Models to Programming Tasks
Figure 2 for Code Red! On the Harmfulness of Applying Off-the-shelf Large Language Models to Programming Tasks
Figure 3 for Code Red! On the Harmfulness of Applying Off-the-shelf Large Language Models to Programming Tasks
Figure 4 for Code Red! On the Harmfulness of Applying Off-the-shelf Large Language Models to Programming Tasks
Viaarxiv icon

Human-AI Experience in Integrated Development Environments: A Systematic Literature Review

Add code
Mar 08, 2025
Viaarxiv icon

Benchmarking AI Models in Software Engineering: A Review, Search Tool, and Enhancement Protocol

Add code
Mar 07, 2025
Figure 1 for Benchmarking AI Models in Software Engineering: A Review, Search Tool, and Enhancement Protocol
Figure 2 for Benchmarking AI Models in Software Engineering: A Review, Search Tool, and Enhancement Protocol
Figure 3 for Benchmarking AI Models in Software Engineering: A Review, Search Tool, and Enhancement Protocol
Figure 4 for Benchmarking AI Models in Software Engineering: A Review, Search Tool, and Enhancement Protocol
Viaarxiv icon

The Heap: A Contamination-Free Multilingual Code Dataset for Evaluating Large Language Models

Add code
Jan 16, 2025
Figure 1 for The Heap: A Contamination-Free Multilingual Code Dataset for Evaluating Large Language Models
Figure 2 for The Heap: A Contamination-Free Multilingual Code Dataset for Evaluating Large Language Models
Figure 3 for The Heap: A Contamination-Free Multilingual Code Dataset for Evaluating Large Language Models
Figure 4 for The Heap: A Contamination-Free Multilingual Code Dataset for Evaluating Large Language Models
Viaarxiv icon