Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Bartosz Jura

Interpretability Benchmark for Evaluating Spatial Misalignment of Prototypical Parts Explanations

Aug 16, 2023

Mikołaj Sacha, Bartosz Jura, Dawid Rymarczyk, Łukasz Struski, Jacek Tabor, Bartosz Zieliński

Figure 1 for Interpretability Benchmark for Evaluating Spatial Misalignment of Prototypical Parts Explanations

Figure 2 for Interpretability Benchmark for Evaluating Spatial Misalignment of Prototypical Parts Explanations

Figure 3 for Interpretability Benchmark for Evaluating Spatial Misalignment of Prototypical Parts Explanations

Figure 4 for Interpretability Benchmark for Evaluating Spatial Misalignment of Prototypical Parts Explanations

Abstract:Prototypical parts-based networks are becoming increasingly popular due to their faithful self-explanations. However, their similarity maps are calculated in the penultimate network layer. Therefore, the receptive field of the prototype activation region often depends on parts of the image outside this region, which can lead to misleading interpretations. We name this undesired behavior a spatial explanation misalignment and introduce an interpretability benchmark with a set of dedicated metrics for quantifying this phenomenon. In addition, we propose a method for misalignment compensation and apply it to existing state-of-the-art models. We show the expressiveness of our benchmark and the effectiveness of the proposed compensation methodology through extensive empirical studies.

* Under review. Code will be release upon acceptance

Via

Access Paper or Ask Questions