Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Mohammad Nokhbeh Zaeem

Cause and Effect: Concept-based Explanation of Neural Networks

May 14, 2021

Mohammad Nokhbeh Zaeem, Majid Komeili

Figure 1 for Cause and Effect: Concept-based Explanation of Neural Networks

Figure 2 for Cause and Effect: Concept-based Explanation of Neural Networks

Figure 3 for Cause and Effect: Concept-based Explanation of Neural Networks

Figure 4 for Cause and Effect: Concept-based Explanation of Neural Networks

Abstract:In many scenarios, human decisions are explained based on some high-level concepts. In this work, we take a step in the interpretability of neural networks by examining their internal representation or neuron's activations against concepts. A concept is characterized by a set of samples that have specific features in common. We propose a framework to check the existence of a causal relationship between a concept (or its negation) and task classes. While the previous methods focus on the importance of a concept to a task class, we go further and introduce four measures to quantitatively determine the order of causality. Through experiments, we demonstrate the effectiveness of the proposed method in explaining the relationship between a concept and the predictive behaviour of a neural network.

* 14 pages, 17 figures

Via

Access Paper or Ask Questions