Amin Nikanjam

Jack

Prism: Dynamic and Flexible Benchmarking of LLMs Code Generation with Monte Carlo Tree Search

Apr 10, 2025
Via arXiv

An Efficient Model Maintenance Approach for MLOps

Dec 05, 2024
Via arXiv

Fault Localization in Deep Learning-based Software: A System-level Approach

Nov 12, 2024
Via arXiv

Toward Debugging Deep Reinforcement Learning Programs with RLExplorer

Oct 06, 2024
Via arXiv

Assessing Programming Task Difficulty for Efficient Evaluation of Large Language Models

Jul 30, 2024
Via arXiv

DeepCodeProbe: Towards Understanding What Models Trained on Code Learn

Jul 11, 2024
Via arXiv

Introducing v0.5 of the AI Safety Benchmark from MLCommons

Apr 18, 2024
Via arXiv

Bugs in Large Language Models Generated Code: An Empirical Study

Mar 18, 2024
Via arXiv

Trained Without My Consent: Detecting Code Inclusion In Language Models Trained on Code

Feb 14, 2024
Via arXiv

Data Cleaning and Machine Learning: A Systematic Literature Review

Oct 03, 2023
Via arXiv