Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Jerzy Stefanowski

Explaining Concept Drift through the Evolution of Group Counterfactuals

Sep 11, 2025

Ignacy Stępka, Jerzy Stefanowski

Abstract:Machine learning models in dynamic environments often suffer from concept drift, where changes in the data distribution degrade performance. While detecting this drift is a well-studied topic, explaining how and why the model's decision-making logic changes still remains a significant challenge. In this paper, we introduce a novel methodology to explain concept drift by analyzing the temporal evolution of group-based counterfactual explanations (GCEs). Our approach tracks shifts in the GCEs' cluster centroids and their associated counterfactual action vectors before and after a drift. These evolving GCEs act as an interpretable proxy, revealing structural changes in the model's decision boundary and its underlying rationale. We operationalize this analysis within a three-layer framework that synergistically combines insights from the data layer (distributional shifts), the model layer (prediction disagreement), and our proposed explanation layer. We show that such holistic view allows for a more comprehensive diagnosis of drift, making it possible to distinguish between different root causes, such as a spatial data shift versus a re-labeling of concepts.

* TempXAI Workshop @ ECML PKDD 2025

Via

Access Paper or Ask Questions

This part looks alike this: identifying important parts of explained instances and prototypes

May 08, 2025

Jacek Karolczak, Jerzy Stefanowski

Figure 1 for This part looks alike this: identifying important parts of explained instances and prototypes

Figure 2 for This part looks alike this: identifying important parts of explained instances and prototypes

Figure 3 for This part looks alike this: identifying important parts of explained instances and prototypes

Figure 4 for This part looks alike this: identifying important parts of explained instances and prototypes

Abstract:Although prototype-based explanations provide a human-understandable way of representing model predictions they often fail to direct user attention to the most relevant features. We propose a novel approach to identify the most informative features within prototypes, termed alike parts. Using feature importance scores derived from an agnostic explanation method, it emphasizes the most relevant overlapping features between an instance and its nearest prototype. Furthermore, the feature importance score is incorporated into the objective function of the prototype selection algorithms to promote global prototypes diversity. Through experiments on six benchmark datasets, we demonstrate that the proposed approach improves user comprehension while maintaining or even increasing predictive accuracy.

Via

Access Paper or Ask Questions

DetoxAI: a Python Toolkit for Debiasing Deep Learning Models in Computer Vision

May 02, 2025

Ignacy Stępka, Lukasz Sztukiewicz, Michał Wiliński, Jerzy Stefanowski

Abstract:While machine learning fairness has made significant progress in recent years, most existing solutions focus on tabular data and are poorly suited for vision-based classification tasks, which rely heavily on deep learning. To bridge this gap, we introduce DetoxAI, an open-source Python library for improving fairness in deep learning vision classifiers through post-hoc debiasing. DetoxAI implements state-of-the-art debiasing algorithms, fairness metrics, and visualization tools. It supports debiasing via interventions in internal representations and includes attribution-based visualization tools and quantitative algorithmic fairness metrics to show how bias is mitigated. This paper presents the motivation, design, and use cases of DetoxAI, demonstrating its tangible value to engineers and researchers.

Via

Access Paper or Ask Questions

Properties of fairness measures in the context of varying class imbalance and protected group ratios

Nov 13, 2024

Dariusz Brzezinski, Julia Stachowiak, Jerzy Stefanowski, Izabela Szczech, Robert Susmaga, Sofya Aksenyuk, Uladzimir Ivashka, Oleksandr Yasinskyi

Figure 1 for Properties of fairness measures in the context of varying class imbalance and protected group ratios

Figure 2 for Properties of fairness measures in the context of varying class imbalance and protected group ratios

Figure 3 for Properties of fairness measures in the context of varying class imbalance and protected group ratios

Figure 4 for Properties of fairness measures in the context of varying class imbalance and protected group ratios

Abstract:Society is increasingly relying on predictive models in fields like criminal justice, credit risk management, or hiring. To prevent such automated systems from discriminating against people belonging to certain groups, fairness measures have become a crucial component in socially relevant applications of machine learning. However, existing fairness measures have been designed to assess the bias between predictions for protected groups without considering the imbalance in the classes of the target variable. Current research on the potential effect of class imbalance on fairness focuses on practical applications rather than dataset-independent measure properties. In this paper, we study the general properties of fairness measures for changing class and protected group proportions. For this purpose, we analyze the probability mass functions of six of the most popular group fairness measures. We also measure how the probability of achieving perfect fairness changes for varying class imbalance ratios. Moreover, we relate the dataset-independent properties of fairness measures described in this paper to classifier fairness in real-life tasks. Our results show that measures such as Equal Opportunity and Positive Predictive Parity are more sensitive to changes in class imbalance than Accuracy Equality. These findings can help guide researchers and practitioners in choosing the most appropriate fairness measures for their classification problems.

Via

Access Paper or Ask Questions

Improving Online Bagging for Complex Imbalanced Data Stream

Oct 04, 2024

Bartosz Przybyl, Jerzy Stefanowski

Abstract:Learning classifiers from imbalanced and concept drifting data streams is still a challenge. Most of the current proposals focus on taking into account changes in the global imbalance ratio only and ignore the local difficulty factors, such as the minority class decomposition into sub-concepts and the presence of unsafe types of examples (borderline or rare ones). As the above factors present in the stream may deteriorate the performance of popular online classifiers, we propose extensions of resampling online bagging, namely Neighbourhood Undersampling or Oversampling Online Bagging to take better account of the presence of unsafe minority examples. The performed computational experiments with synthetic complex imbalanced data streams have shown their advantage over earlier variants of online bagging resampling ensembles.

* 16 pages, 4 figures

Via

Access Paper or Ask Questions

Counterfactual Explanations with Probabilistic Guarantees on their Robustness to Model Change

Aug 09, 2024

Ignacy Stępka, Mateusz Lango, Jerzy Stefanowski

Figure 1 for Counterfactual Explanations with Probabilistic Guarantees on their Robustness to Model Change

Figure 2 for Counterfactual Explanations with Probabilistic Guarantees on their Robustness to Model Change

Figure 3 for Counterfactual Explanations with Probabilistic Guarantees on their Robustness to Model Change

Figure 4 for Counterfactual Explanations with Probabilistic Guarantees on their Robustness to Model Change

Abstract:Counterfactual explanations (CFEs) guide users on how to adjust inputs to machine learning models to achieve desired outputs. While existing research primarily addresses static scenarios, real-world applications often involve data or model changes, potentially invalidating previously generated CFEs and rendering user-induced input changes ineffective. Current methods addressing this issue often support only specific models or change types, require extensive hyperparameter tuning, or fail to provide probabilistic guarantees on CFE robustness to model changes. This paper proposes a novel approach for generating CFEs that provides probabilistic guarantees for any model and change type, while offering interpretable and easy-to-select hyperparameters. We establish a theoretical framework for probabilistically defining robustness to model change and demonstrate how our BetaRCE method directly stems from it. BetaRCE is a post-hoc method applied alongside a chosen base CFE generation method to enhance the quality of the explanation beyond robustness. It facilitates a transition from the base explanation to a more robust one with user-adjusted probability bounds. Through experimental comparisons with baselines, we show that BetaRCE yields robust, most plausible, and closest to baseline counterfactual explanations.

Via

Access Paper or Ask Questions

A-PETE: Adaptive Prototype Explanations of Tree Ensembles

May 31, 2024

Jacek Karolczak, Jerzy Stefanowski

Figure 1 for A-PETE: Adaptive Prototype Explanations of Tree Ensembles

Figure 2 for A-PETE: Adaptive Prototype Explanations of Tree Ensembles

Abstract:The need for interpreting machine learning models is addressed through prototype explanations within the context of tree ensembles. An algorithm named Adaptive Prototype Explanations of Tree Ensembles (A-PETE) is proposed to automatise the selection of prototypes for these classifiers. Its unique characteristics is using a specialised distance measure and a modified k-medoid approach. Experiments demonstrated its competitive predictive accuracy with respect to earlier explanation algorithms. It also provides a a sufficient number of prototypes for the purpose of interpreting the random forest classifier.

Via

Access Paper or Ask Questions

Unifying Perspectives: Plausible Counterfactual Explanations on Global, Group-wise, and Local Levels

May 27, 2024

Patryk Wielopolski, Oleksii Furman, Jerzy Stefanowski, Maciej Zięba

Abstract:Growing regulatory and societal pressures demand increased transparency in AI, particularly in understanding the decisions made by complex machine learning models. Counterfactual Explanations (CFs) have emerged as a promising technique within Explainable AI (xAI), offering insights into individual model predictions. However, to understand the systemic biases and disparate impacts of AI models, it is crucial to move beyond local CFs and embrace global explanations, which offer a~holistic view across diverse scenarios and populations. Unfortunately, generating Global Counterfactual Explanations (GCEs) faces challenges in computational complexity, defining the scope of "global," and ensuring the explanations are both globally representative and locally plausible. We introduce a novel unified approach for generating Local, Group-wise, and Global Counterfactual Explanations for differentiable classification models via gradient-based optimization to address these challenges. This framework aims to bridge the gap between individual and systemic insights, enabling a deeper understanding of model decisions and their potential impact on diverse populations. Our approach further innovates by incorporating a probabilistic plausibility criterion, enhancing actionability and trustworthiness. By offering a cohesive solution to the optimization and plausibility challenges in GCEs, our work significantly advances the interpretability and accountability of AI models, marking a step forward in the pursuit of transparent AI.

Via

Access Paper or Ask Questions

Probabilistically Plausible Counterfactual Explanations with Normalizing Flows

May 27, 2024

Patryk Wielopolski, Oleksii Furman, Jerzy Stefanowski, Maciej Zięba

Figure 1 for Probabilistically Plausible Counterfactual Explanations with Normalizing Flows

Figure 2 for Probabilistically Plausible Counterfactual Explanations with Normalizing Flows

Figure 3 for Probabilistically Plausible Counterfactual Explanations with Normalizing Flows

Figure 4 for Probabilistically Plausible Counterfactual Explanations with Normalizing Flows

Abstract:We present PPCEF, a novel method for generating probabilistically plausible counterfactual explanations (CFs). PPCEF advances beyond existing methods by combining a probabilistic formulation that leverages the data distribution with the optimization of plausibility within a unified framework. Compared to reference approaches, our method enforces plausibility by directly optimizing the explicit density function without assuming a particular family of parametrized distributions. This ensures CFs are not only valid (i.e., achieve class change) but also align with the underlying data's probability density. For that purpose, our approach leverages normalizing flows as powerful density estimators to capture the complex high-dimensional data distribution. Furthermore, we introduce a novel loss that balances the trade-off between achieving class change and maintaining closeness to the original instance while also incorporating a probabilistic plausibility term. PPCEF's unconstrained formulation allows for efficient gradient-based optimization with batch processing, leading to orders of magnitude faster computation compared to prior methods. Moreover, the unconstrained formulation of PPCEF allows for the seamless integration of future constraints tailored to specific counterfactual properties. Finally, extensive evaluations demonstrate PPCEF's superiority in generating high-quality, probabilistically plausible counterfactual explanations in high-dimensional tabular settings. This makes PPCEF a powerful tool for not only interpreting complex machine learning models but also for improving fairness, accountability, and trust in AI systems.

Via

Access Paper or Ask Questions

Multi-criteria approach for selecting an explanation from the set of counterfactuals produced by an ensemble of explainers

Mar 20, 2024

Ignacy Stępka, Mateusz Lango, Jerzy Stefanowski

Abstract:Counterfactuals are widely used to explain ML model predictions by providing alternative scenarios for obtaining the more desired predictions. They can be generated by a variety of methods that optimize different, sometimes conflicting, quality measures and produce quite different solutions. However, choosing the most appropriate explanation method and one of the generated counterfactuals is not an easy task. Instead of forcing the user to test many different explanation methods and analysing conflicting solutions, in this paper, we propose to use a multi-stage ensemble approach that will select single counterfactual based on the multiple-criteria analysis. It offers a compromise solution that scores well on several popular quality measures. This approach exploits the dominance relation and the ideal point decision aid method, which selects one counterfactual from the Pareto front. The conducted experiments demonstrated that the proposed approach generates fully actionable counterfactuals with attractive compromise values of the considered quality measures.

* 17 pages, 2 figures

Via

Access Paper or Ask Questions