Christina Karle

Relevant Irrelevance: Generating Alterfactual Explanations for Image Classifiers

May 08, 2024

Silvan Mertes, Tobias Huber, Christina Karle, Katharina Weitz, Ruben Schlagowski, Cristina Conati, Elisabeth André

Abstract: In this paper, we demonstrate the feasibility of alterfactual explanations for black box image classifiers. Traditional explanation mechanisms from the field of Counterfactual Thinking are a widely used paradigm for Explainable Artificial Intelligence (XAI), as they follow a natural way of reasoning that humans are familiar with. However, most common approaches from this field are based on communicating information about features or characteristics that are especially important for an AI's decision. To fully understand a decision, knowledge about relevant features is not enough: awareness of irrelevant information also contributes substantially to a user's mental model of an AI system. To this end, a novel approach for explaining AI systems called alterfactual explanations was recently proposed on a conceptual level. It is based on showing an alternative reality where irrelevant features of an AI's input are altered. By doing so, the user directly sees which input data characteristics can change arbitrarily without influencing the AI's decision. In this paper, we show for the first time that it is possible to apply this idea to black box models based on neural networks. We present a GAN-based approach to generate these alterfactual explanations for binary image classifiers. Further, we present a user study that offers insights into how alterfactual explanations can complement counterfactual explanations.

* Accepted at IJCAI 2024. arXiv admin note: text overlap with arXiv:2207.09374

Alterfactual Explanations -- The Relevance of Irrelevance for Explaining AI Systems

Jul 19, 2022

Silvan Mertes, Christina Karle, Tobias Huber, Katharina Weitz, Ruben Schlagowski, Elisabeth André

Abstract: Explanation mechanisms from the field of Counterfactual Thinking are a widely used paradigm for Explainable Artificial Intelligence (XAI), as they follow a natural way of reasoning that humans are familiar with. However, all common approaches from this field are based on communicating information about features or characteristics that are especially important for an AI's decision. We argue that, in order to fully understand a decision, knowledge about relevant features is not enough: awareness of irrelevant information also contributes substantially to a user's mental model of an AI system. Therefore, we introduce a new way of explaining AI systems. Our approach, which we call Alterfactual Explanations, is based on showing an alternative reality where irrelevant features of an AI's input are altered. By doing so, the user directly sees which characteristics of the input data can change arbitrarily without influencing the AI's decision. We evaluate our approach in an extensive user study, revealing that it is able to significantly contribute to the participants' understanding of an AI. We show that alterfactual explanations convey an understanding of different aspects of the AI's reasoning than established counterfactual explanation methods do.

* Accepted at IJCAI 2022 Workshop on XAI
