Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Gradient-free Post-hoc Explainability Using Distillation Aided Learnable Approach

Sep 17, 2024

Debarpan Bhattacharya, Amir H. Poorjam, Deepak Mittal, Sriram Ganapathy

Figure 1 for Gradient-free Post-hoc Explainability Using Distillation Aided Learnable Approach

Figure 2 for Gradient-free Post-hoc Explainability Using Distillation Aided Learnable Approach

Figure 3 for Gradient-free Post-hoc Explainability Using Distillation Aided Learnable Approach

Figure 4 for Gradient-free Post-hoc Explainability Using Distillation Aided Learnable Approach

Share this with someone who'll enjoy it:

Abstract:The recent advancements in artificial intelligence (AI), with the release of several large models having only query access, make a strong case for explainability of deep models in a post-hoc gradient free manner. In this paper, we propose a framework, named distillation aided explainability (DAX), that attempts to generate a saliency-based explanation in a model agnostic gradient free application. The DAX approach poses the problem of explanation in a learnable setting with a mask generation network and a distillation network. The mask generation network learns to generate the multiplier mask that finds the salient regions of the input, while the student distillation network aims to approximate the local behavior of the black-box model. We propose a joint optimization of the two networks in the DAX framework using the locally perturbed input samples, with the targets derived from input-output access to the black-box model. We extensively evaluate DAX across different modalities (image and audio), in a classification setting, using a diverse set of evaluations (intersection over union with ground truth, deletion based and subjective human evaluation based measures) and benchmark it with respect to $9$ different methods. In these evaluations, the DAX significantly outperforms the existing approaches on all modalities and evaluation metrics.

* 12 pages, 10 figures, Accepted in IEEE Journal of Selected Topics in Signal Processing (JSTSP), 2024

View paper on

Share this with someone who'll enjoy it:

Title:Gradient-free Post-hoc Explainability Using Distillation Aided Learnable Approach

Paper and Code