Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Mélina Verger

A Fair Post-Processing Method based on the MADD Metric for Predictive Student Models

Jul 07, 2024

Mélina Verger, Chunyang Fan, Sébastien Lallé, François Bouchet, Vanda Luengo

Figure 1 for A Fair Post-Processing Method based on the MADD Metric for Predictive Student Models

Figure 2 for A Fair Post-Processing Method based on the MADD Metric for Predictive Student Models

Figure 3 for A Fair Post-Processing Method based on the MADD Metric for Predictive Student Models

Figure 4 for A Fair Post-Processing Method based on the MADD Metric for Predictive Student Models

Abstract:Predictive student models are increasingly used in learning environments. However, due to the rising social impact of their usage, it is now all the more important for these models to be both sufficiently accurate and fair in their predictions. To evaluate algorithmic fairness, a new metric has been developed in education, namely the Model Absolute Density Distance (MADD). This metric enables us to measure how different a predictive model behaves regarding two groups of students, in order to quantify its algorithmic unfairness. In this paper, we thus develop a post-processing method based on this metric, that aims at improving the fairness while preserving the accuracy of relevant predictive models' results. We experiment with our approach on the task of predicting student success in an online course, using both simulated and real-world educational data, and obtain successful results. Our source code and data are in open access at https://github.com/melinaverger/MADD .

* 1st International Tutorial and Workshop on Responsible Knowledge Discovery in Education (RKDE 2023) at ECML PKDD 2023, September 2023, Turino, Italy

Via

Access Paper or Ask Questions

Evaluating Algorithmic Bias in Models for Predicting Academic Performance of Filipino Students

May 16, 2024

Valdemar Švábenský, Mélina Verger, Maria Mercedes T. Rodrigo, Clarence James G. Monterozo, Ryan S. Baker, Miguel Zenon Nicanor Lerias Saavedra, Sébastien Lallé, Atsushi Shimada

Figure 1 for Evaluating Algorithmic Bias in Models for Predicting Academic Performance of Filipino Students

Figure 2 for Evaluating Algorithmic Bias in Models for Predicting Academic Performance of Filipino Students

Figure 3 for Evaluating Algorithmic Bias in Models for Predicting Academic Performance of Filipino Students

Abstract:Algorithmic bias is a major issue in machine learning models in educational contexts. However, it has not yet been studied thoroughly in Asian learning contexts, and only limited work has considered algorithmic bias based on regional (sub-national) background. As a step towards addressing this gap, this paper examines the population of 5,986 students at a large university in the Philippines, investigating algorithmic bias based on students' regional background. The university used the Canvas learning management system (LMS) in its online courses across a broad range of domains. Over the period of three semesters, we collected 48.7 million log records of the students' activity in Canvas. We used these logs to train binary classification models that predict student grades from the LMS activity. The best-performing model reached AUC of 0.75 and weighted F1-score of 0.79. Subsequently, we examined the data for bias based on students' region. Evaluation using three metrics: AUC, weighted F1-score, and MADD showed consistent results across all demographic groups. Thus, no unfairness was observed against a particular student group in the grade predictions.

* Published in proceedings of the 17th Educational Data Mining Conference (EDM 2024)

Via

Access Paper or Ask Questions

Is Your Model "MADD"? A Novel Metric to Evaluate Algorithmic Fairness for Predictive Student Models

May 24, 2023

Mélina Verger, Sébastien Lallé, François Bouchet, Vanda Luengo

Figure 1 for Is Your Model "MADD"? A Novel Metric to Evaluate Algorithmic Fairness for Predictive Student Models

Figure 2 for Is Your Model "MADD"? A Novel Metric to Evaluate Algorithmic Fairness for Predictive Student Models

Figure 3 for Is Your Model "MADD"? A Novel Metric to Evaluate Algorithmic Fairness for Predictive Student Models

Figure 4 for Is Your Model "MADD"? A Novel Metric to Evaluate Algorithmic Fairness for Predictive Student Models

Abstract:Predictive student models are increasingly used in learning environments due to their ability to enhance educational outcomes and support stakeholders in making informed decisions. However, predictive models can be biased and produce unfair outcomes, leading to potential discrimination against some students and possible harmful long-term implications. This has prompted research on fairness metrics meant to capture and quantify such biases. Nonetheless, so far, existing fairness metrics used in education are predictive performance-oriented, focusing on assessing biased outcomes across groups of students, without considering the behaviors of the models nor the severity of the biases in the outcomes. Therefore, we propose a novel metric, the Model Absolute Density Distance (MADD), to analyze models' discriminatory behaviors independently from their predictive performance. We also provide a complementary visualization-based analysis to enable fine-grained human assessment of how the models discriminate between groups of students. We evaluate our approach on the common task of predicting student success in online courses, using several common predictive classification models on an open educational dataset. We also compare our metric to the only predictive performance-oriented fairness metric developed in education, ABROCA. Results on this dataset show that: (1) fair predictive performance does not guarantee fair models' behaviors and thus fair outcomes, (2) there is no direct relationship between data bias and predictive performance bias nor discriminatory behaviors bias, and (3) trained on the same data, models exhibit different discriminatory behaviors, according to different sensitive features too. We thus recommend using the MADD on models that show satisfying predictive performance, to gain a finer-grained understanding on how they behave and to refine models selection and their usage.

* 12 pages, conference

Via

Access Paper or Ask Questions

Predicting students' performance in online courses using multiple data sources

Sep 07, 2021

Mélina Verger, Hugo Jair Escalante

Figure 1 for Predicting students' performance in online courses using multiple data sources

Figure 2 for Predicting students' performance in online courses using multiple data sources

Figure 3 for Predicting students' performance in online courses using multiple data sources

Figure 4 for Predicting students' performance in online courses using multiple data sources

Abstract:Data-driven decision making is serving and transforming education. We approached the problem of predicting students' performance by using multiple data sources which came from online courses, including one we created. Experimental results show preliminary conclusions towards which data are to be considered for the task.

Via

Access Paper or Ask Questions