Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Aditya Mittal

Rubric Is All You Need: Enhancing LLM-based Code Evaluation With Question-Specific Rubrics

Mar 31, 2025

Aditya Pathak, Rachit Gandhi, Vaibhav Uttam, Devansh, Yashwanth Nakka, Aaryan Raj Jindal, Pratyush Ghosh, Arnav Ramamoorthy, Shreyash Verma, Aditya Mittal(+4 more)

Abstract:Since the disruption in LLM technology brought about by the release of GPT-3 and ChatGPT, LLMs have shown remarkable promise in programming-related tasks. While code generation remains a popular field of research, code evaluation using LLMs remains a problem with no conclusive solution. In this paper, we focus on LLM-based code evaluation and attempt to fill in the existing gaps. We propose multi-agentic novel approaches using question-specific rubrics tailored to the problem statement, arguing that these perform better for logical assessment than the existing approaches that use question-agnostic rubrics. To address the lack of suitable evaluation datasets, we introduce two datasets: a Data Structures and Algorithms dataset containing 150 student submissions from a popular Data Structures and Algorithms practice website, and an Object Oriented Programming dataset comprising 80 student submissions from undergraduate computer science courses. In addition to using standard metrics (Spearman Correlation, Cohen's Kappa), we additionally propose a new metric called as Leniency, which quantifies evaluation strictness relative to expert assessment. Our comprehensive analysis demonstrates that question-specific rubrics significantly enhance logical assessment of code in educational settings, providing better feedback aligned with instructional goals beyond mere syntactic correctness.

* Under Review

Via

Access Paper or Ask Questions

TowerDebias: A Novel Debiasing Method based on the Tower Property

Nov 13, 2024

Norman Matloff, Aditya Mittal

Figure 1 for TowerDebias: A Novel Debiasing Method based on the Tower Property

Figure 2 for TowerDebias: A Novel Debiasing Method based on the Tower Property

Figure 3 for TowerDebias: A Novel Debiasing Method based on the Tower Property

Figure 4 for TowerDebias: A Novel Debiasing Method based on the Tower Property

Abstract:Decision-making processes have increasingly come to rely on sophisticated machine learning tools, raising concerns about the fairness of their predictions with respect to any sensitive groups. The widespread use of commercial black-box machine learning models necessitates careful consideration of their legal and ethical implications on consumers. In situations where users have access to these "black-box" models, a key question emerges: how can we mitigate or eliminate the influence of sensitive attributes, such as race or gender? We propose towerDebias (tDB), a novel approach designed to reduce the influence of sensitive variables in predictions made by black-box models. Using the Tower Property from probability theory, tDB aims to improve prediction fairness during the post-processing stage in a manner amenable to the Fairness-Utility Tradeoff. This method is highly flexible, requiring no prior knowledge of the original model's internal structure, and can be extended to a range of different applications. We provide a formal improvement theorem for tDB and demonstrate its effectiveness in both regression and classification tasks, underscoring its impact on the fairness-utility tradeoff.

* To be submitted to a journal soon

Via

Access Paper or Ask Questions

dsld: A Socially Relevant Tool for Teaching Statistics

Nov 06, 2024

Taha Abdullah, Arjun Ashok, Brandon Estrada, Norman Matloff, Aditya Mittal

Figure 1 for dsld: A Socially Relevant Tool for Teaching Statistics

Figure 2 for dsld: A Socially Relevant Tool for Teaching Statistics

Figure 3 for dsld: A Socially Relevant Tool for Teaching Statistics

Figure 4 for dsld: A Socially Relevant Tool for Teaching Statistics

Abstract:The growing power of data science can play a crucial role in addressing social discrimination, necessitating nuanced understanding and effective mitigation strategies of potential biases. Data Science Looks At Discrimination (dsld) is an R and Python package designed to provide users with a comprehensive toolkit of statistical and graphical methods for assessing possible discrimination related to protected groups, such as race, gender, and age. Our software offers techniques for discrimination analysis by identifying and mitigating confounding variables, along with methods for reducing bias in predictive models. In educational settings, dsld offers instructors powerful tools to teach important statistical principles through motivating real world examples of discrimination analysis. The inclusion of an 80-page Quarto book further supports users, from statistics educators to legal professionals, in effectively applying these analytical tools to real world scenarios.

* To be submitted to the Journal of Statistics and Data Science Education

Via

Access Paper or Ask Questions