Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Jeongyeol Choe

Why are Saliency Maps Noisy? Cause of and Solution to Noisy Saliency Maps

Feb 20, 2019

Beomsu Kim, Junghoon Seo, SeungHyun Jeon, Jamyoung Koo, Jeongyeol Choe, Taegyun Jeon

Figure 1 for Why are Saliency Maps Noisy? Cause of and Solution to Noisy Saliency Maps

Figure 2 for Why are Saliency Maps Noisy? Cause of and Solution to Noisy Saliency Maps

Figure 3 for Why are Saliency Maps Noisy? Cause of and Solution to Noisy Saliency Maps

Figure 4 for Why are Saliency Maps Noisy? Cause of and Solution to Noisy Saliency Maps

Abstract:Saliency Map, the gradient of the score function with respect to the input, is the most basic technique for interpreting deep neural network decisions. However, saliency maps are often visually noisy. Although several hypotheses were proposed to account for this phenomenon, there are few works that provide rigorous analyses of noisy saliency maps. In this paper, we identify that noise occurs in saliency maps when irrelevant features pass through ReLU activation functions. Then we propose Rectified Gradient, a method that solves this problem through layer-wise thresholding during backpropagation. Experiments with neural networks trained on CIFAR-10 and ImageNet showed effectiveness of our method and its superiority to other attribution methods.

Via

Access Paper or Ask Questions

Noise-adding Methods of Saliency Map as Series of Higher Order Partial Derivative

Jun 08, 2018

Junghoon Seo, Jeongyeol Choe, Jamyoung Koo, Seunghyeon Jeon, Beomsu Kim, Taegyun Jeon

Figure 1 for Noise-adding Methods of Saliency Map as Series of Higher Order Partial Derivative

Figure 2 for Noise-adding Methods of Saliency Map as Series of Higher Order Partial Derivative

Abstract:SmoothGrad and VarGrad are techniques that enhance the empirical quality of standard saliency maps by adding noise to input. However, there were few works that provide a rigorous theoretical interpretation of those methods. We analytically formalize the result of these noise-adding methods. As a result, we observe two interesting results from the existing noise-adding methods. First, SmoothGrad does not make the gradient of the score function smooth. Second, VarGrad is independent of the gradient of the score function. We believe that our findings provide a clue to reveal the relationship between local explanation methods of deep neural networks and higher-order partial derivatives of the score function.

* presented at 2018 ICML Workshop on Human Interpretability in Machine Learning (WHI 2018), Stockholm, Sweden

Via

Access Paper or Ask Questions