Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Causality for Inherently Explainable Transformers: CAT-XPLAIN

Jun 29, 2022

Subash Khanal, Benjamin Brodie, Xin Xing, Ai-Ling Lin, Nathan Jacobs

Figure 1 for Causality for Inherently Explainable Transformers: CAT-XPLAIN

Figure 2 for Causality for Inherently Explainable Transformers: CAT-XPLAIN

Figure 3 for Causality for Inherently Explainable Transformers: CAT-XPLAIN

Figure 4 for Causality for Inherently Explainable Transformers: CAT-XPLAIN

Share this with someone who'll enjoy it:

Abstract:There have been several post-hoc explanation approaches developed to explain pre-trained black-box neural networks. However, there is still a gap in research efforts toward designing neural networks that are inherently explainable. In this paper, we utilize a recently proposed instance-wise post-hoc causal explanation method to make an existing transformer architecture inherently explainable. Once trained, our model provides an explanation in the form of top-$k$ regions in the input space of the given instance contributing to its decision. We evaluate our method on binary classification tasks using three image datasets: MNIST, FMNIST, and CIFAR. Our results demonstrate that compared to the causality-based post-hoc explainer model, our inherently explainable model achieves better explainability results while eliminating the need of training a separate explainer model. Our code is available at https://github.com/mvrl/CAT-XPLAIN.

* Accepted for spotlight presentation at the Explainable Artificial Intelligence for Computer Vision Workshop at CVPR 2022

View paper on

Share this with someone who'll enjoy it:

Title:Causality for Inherently Explainable Transformers: CAT-XPLAIN

Paper and Code