Title: Hierarchical interpretations for neural network predictions

Jun 14, 2018

Chandan Singh, W. James Murdoch, Bin Yu

Abstract: Deep neural networks (DNNs) have achieved impressive predictive performance due to their ability to learn complex, non-linear relationships between variables. However, the inability to effectively visualize these relationships has led to DNNs being characterized as black boxes and consequently limited their applications. To ameliorate this problem, we introduce the use of hierarchical interpretations to explain DNN predictions through our proposed method, agglomerative contextual decomposition (ACD). Given a prediction from a trained DNN, ACD produces a hierarchical clustering of the input features, along with the contribution of each cluster to the final prediction. This hierarchy is optimized to identify clusters of features that the DNN learned are predictive. Using examples from Stanford Sentiment Treebank and ImageNet, we show that ACD is effective at diagnosing incorrect predictions and identifying dataset bias. Through human experiments, we demonstrate that ACD enables users both to identify the more accurate of two DNNs and to better trust a DNN's outputs. We also find that ACD's hierarchy is largely robust to adversarial perturbations, implying that it captures fundamental aspects of the input and ignores spurious noise.

* main text: 8 pages, references: 2 pages, supplement: 13 pages
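
The abstract describes ACD as building a hierarchy of feature clusters, each with a contribution score. The sketch below only illustrates that general bottom-up idea and is not the authors' algorithm: the importance score here is a hypothetical toy function (`toy_score`), whereas the paper derives scores from a trained DNN via contextual decomposition. The function names, the restriction to adjacent groups, and the "merge the highest absolute score" rule are all assumptions made for illustration.

```python
from typing import Callable, List, Tuple

Group = Tuple[int, ...]
Level = List[Tuple[Group, float]]


def agglomerate(n_features: int, score_fn: Callable[[Group], float]) -> List[Level]:
    """Greedy bottom-up merging of adjacent feature groups.

    Returns one entry per level of the hierarchy; each entry lists
    (group, score) pairs for the groups present at that level.
    """
    groups: List[Group] = [(i,) for i in range(n_features)]
    levels: List[Level] = [[(g, score_fn(g)) for g in groups]]
    while len(groups) > 1:
        # Score every candidate merge of two neighbouring groups.
        candidates = [groups[i] + groups[i + 1] for i in range(len(groups) - 1)]
        scores = [score_fn(c) for c in candidates]
        # Assumed merge rule: keep the candidate with the largest |score|.
        best = max(range(len(candidates)), key=lambda i: abs(scores[i]))
        groups = groups[:best] + [candidates[best]] + groups[best + 2:]
        levels.append([(g, score_fn(g)) for g in groups])
    return levels


def toy_score(group: Group) -> float:
    """Hypothetical stand-in for a DNN-derived importance score."""
    weights = [0.1, -0.2, 1.0, 0.9, 0.05]
    total = sum(weights[i] for i in group)
    # Extra contribution when features 2 and 3 appear together,
    # mimicking a learned interaction between them.
    if 2 in group and 3 in group:
        total += 2.0
    return total


hierarchy = agglomerate(5, toy_score)
for depth, level in enumerate(hierarchy):
    print(depth, level)
```

In this toy setup the interacting features 2 and 3 are merged first, because their joint score exceeds what either contributes alone. That is the kind of structure a hierarchical interpretation is meant to surface.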

View paper on OpenReview
