Abstract: Language models today are widely used in education, yet their ability to tailor responses for learners with varied informational needs and knowledge backgrounds remains under-explored. To this end, we introduce ELI-Why, a benchmark of 13.4K "Why" questions to evaluate the pedagogical capabilities of language models. We then conduct two extensive human studies to assess the utility of language model-generated explanatory answers (explanations) on our benchmark, tailored to three distinct educational grades: elementary school, high school, and graduate school. In our first study, human raters assume the role of an "educator" to assess how well model explanations fit different educational grades. We find that GPT-4-generated explanations match their intended educational background only 50% of the time, compared to 79% for lay human-curated explanations. In our second study, human raters assume the role of a learner to assess whether an explanation fits their own informational needs. Across all educational backgrounds, users deemed GPT-4-generated explanations on average 20% less suited to their informational needs than explanations curated by lay people. Additionally, automated evaluation metrics reveal that explanations generated by different language model families for different informational needs remain indistinguishable in their grade level, limiting their pedagogical effectiveness.
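The ELI-Why abstract does not say which automated metrics were used to judge grade level; as a minimal, purely illustrative sketch of how grade-level scoring of explanations can be automated, one could apply a standard readability index such as Flesch-Kincaid. The textstat package and the sample explanations below are assumptions for illustration, not artifacts of the paper.

import textstat  # pip install textstat

# Hypothetical explanations written for two different audiences (illustrative only).
explanations = {
    "elementary": "The sky looks blue because sunlight bounces off tiny bits of air.",
    "graduate": "Rayleigh scattering attenuates longer wavelengths less than shorter ones, so scattered skylight is dominated by blue light.",
}

for audience, text in explanations.items():
    # Flesch-Kincaid maps a text to an approximate U.S. school grade level;
    # explanations tailored to different audiences should ideally separate here.
    print(audience, textstat.flesch_kincaid_grade(text))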
Abstract: PyMilo is an open-source Python package that addresses the limitations of existing Machine Learning (ML) model storage formats by providing a transparent, reliable, and safe method for exporting and deploying trained models. Current formats, such as pickle and other binary serializations, suffer from reliability, safety, and transparency issues. In contrast, PyMilo serializes ML models in a transparent, non-executable format, enabling straightforward and safe model exchange while also facilitating the deserialization and deployment of exported models in production environments. The package aims to provide a seamless, end-to-end solution for exporting and importing pre-trained ML models, simplifying the model development and deployment pipeline.
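A usage sketch of the PyMilo workflow described above, assuming the Export/Import classes and save/to_model methods from the package's documentation; these names may differ across versions and should be treated as assumptions rather than a definitive API reference.

from sklearn.linear_model import LinearRegression
from pymilo import Export, Import  # assumed public API; check your installed version

# Train a simple scikit-learn model.
model = LinearRegression().fit([[0.0], [1.0], [2.0]], [0.0, 1.0, 2.0])

# Serialize the trained model to a transparent, non-executable JSON file.
Export(model).save("model.json")

# Elsewhere (e.g., in production): rebuild an equivalent estimator from the JSON file.
restored = Import("model.json").to_model()
print(restored.predict([[3.0]]))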
Abstract: Fair graph clustering is crucial for ensuring equitable representation and treatment of diverse communities in network analysis. Traditional methods often ignore disparities among social, economic, and demographic groups, perpetuating biased outcomes and reinforcing inequalities. This study introduces fair graph clustering within the framework of the disparate impact doctrine, treating it as a joint optimization problem that integrates clustering quality and fairness constraints. Given the NP-hard nature of this problem, we employ a semidefinite relaxation to approximate the underlying optimization problem. For small to medium-sized graphs, we utilize a singular value decomposition-based algorithm, while for larger graphs, we propose a novel algorithm based on the alternating direction method of multipliers. Unlike existing methods, our formulation allows for tuning the trade-off between clustering quality and fairness. Experimental results on graphs generated from the standard stochastic block model demonstrate the superiority of our approach in achieving an optimal accuracy-fairness trade-off compared to state-of-the-art methods.
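One plausible way to write such a semidefinite relaxation, shown only as an illustrative sketch and not as the authors' exact formulation: let $A$ be the adjacency matrix, $k$ the number of clusters, $F$ a matrix whose $s$-th column is $f_s - (n_s/n)\mathbf{1}$ for protected group $s$ (so $F^\top X = 0$ encodes a proportional, disparate-impact-style balance condition), and $\lambda \ge 0$ a trade-off parameter:
\[
\max_{X \in \mathbb{S}^n} \;\langle A, X\rangle \;-\; \lambda\,\lVert F^\top X\rVert_F^2
\quad\text{s.t.}\quad X \succeq 0,\; X \ge 0,\; X\mathbf{1} = \mathbf{1},\; \operatorname{tr}(X) = k,
\]
where $X$ relaxes the normalized cluster-membership matrix $H(H^\top H)^{-1}H^\top$. Setting $\lambda = 0$ recovers a standard clustering relaxation, while larger $\lambda$ enforces the balance condition more strictly, matching the tunable quality-fairness trade-off mentioned in the abstract.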
Abstract: Large corpora of textual data are a crucial requirement for training deep models such as transformer-based ones, and this need is even more pronounced for lower-resource languages like Farsi. We propose naab, the largest cleaned and ready-to-use open-source textual corpus in Farsi. It contains about 130GB of data, 250 million paragraphs, and 15 billion words. The project name is derived from the Farsi word ناب (naab), which means pure and high-grade. We also provide the raw version of the corpus, called naab-raw, together with an easy-to-use preprocessor that can be employed by those who want to build a customized corpus.
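For readers who want to try naab, a minimal loading sketch using the Hugging Face datasets library; the dataset identifier "SLPL/naab" and the "text" field name are assumptions based on the public release and may need adjusting.

from datasets import load_dataset

# Stream the corpus so the ~130GB of text is not downloaded up front.
naab = load_dataset("SLPL/naab", split="train", streaming=True)  # dataset ID is an assumption

for i, record in enumerate(naab):
    print(record["text"][:100])  # "text" field name is also an assumption
    if i == 2:
        break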