Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Stefan Zürn

Change Detection for Local Explainability in Evolving Data Streams

Sep 06, 2022

Johannes Haug, Alexander Braun, Stefan Zürn, Gjergji Kasneci

Figure 1 for Change Detection for Local Explainability in Evolving Data Streams

Figure 2 for Change Detection for Local Explainability in Evolving Data Streams

Figure 3 for Change Detection for Local Explainability in Evolving Data Streams

Figure 4 for Change Detection for Local Explainability in Evolving Data Streams

Abstract:As complex machine learning models are increasingly used in sensitive applications like banking, trading or credit scoring, there is a growing demand for reliable explanation mechanisms. Local feature attribution methods have become a popular technique for post-hoc and model-agnostic explanations. However, attribution methods typically assume a stationary environment in which the predictive model has been trained and remains stable. As a result, it is often unclear how local attributions behave in realistic, constantly evolving settings such as streaming and online applications. In this paper, we discuss the impact of temporal change on local feature attributions. In particular, we show that local attributions can become obsolete each time the predictive model is updated or concept drift alters the data generating distribution. Consequently, local feature attributions in data streams provide high explanatory power only when combined with a mechanism that allows us to detect and respond to local changes over time. To this end, we present CDLEEDS, a flexible and model-agnostic framework for detecting local change and concept drift. CDLEEDS serves as an intuitive extension of attribution-based explanation techniques to identify outdated local attributions and enable more targeted recalculations. In experiments, we also show that the proposed framework can reliably detect both local and global concept drift. Accordingly, our work contributes to a more meaningful and robust explainability in online machine learning.

* To be published in the proceedings of the 31st ACM International Conference on Information and Knowledge Management (CIKM 2022)

Via

Access Paper or Ask Questions

On Baselines for Local Feature Attributions

Jan 04, 2021

Johannes Haug, Stefan Zürn, Peter El-Jiz, Gjergji Kasneci

Figure 1 for On Baselines for Local Feature Attributions

Figure 2 for On Baselines for Local Feature Attributions

Figure 3 for On Baselines for Local Feature Attributions

Figure 4 for On Baselines for Local Feature Attributions

Abstract:High-performing predictive models, such as neural nets, usually operate as black boxes, which raises serious concerns about their interpretability. Local feature attribution methods help to explain black box models and are therefore a powerful tool for assessing the reliability and fairness of predictions. To this end, most attribution models compare the importance of input features with a reference value, often called baseline. Recent studies show that the baseline can heavily impact the quality of feature attributions. Yet, we frequently find simplistic baselines, such as the zero vector, in practice. In this paper, we show empirically that baselines can significantly alter the discriminative power of feature attributions. We conduct our analysis on tabular data sets, thus complementing recent works on image data. Besides, we propose a new taxonomy of baseline methods. Our experimental study illustrates the sensitivity of popular attribution models to the baseline, thus laying the foundation for a more in-depth discussion on sensible baseline methods for tabular data.

* Accepted at the AAAI-21 Workshop on Explainable Agency in AI

Via

Access Paper or Ask Questions