Feature attribution methods are exceedingly popular in interpretable machine learning. They aim to compute the attribution of each input feature to represent its importance, but there is no consensus on the definition of "attribution", leading to many competing methods with little systematic evaluation. The lack of attribution ground truth further complicates evaluation, which has to rely on proxy metrics. To address this, we propose a dataset modification procedure such that models trained on the new dataset have ground truth attribution available. We evaluate three methods: saliency maps, rationales, and attention. We identify their deficiencies and add a new perspective to the growing body of evidence questioning their correctness and reliability in the wild. Our evaluation approach is model-agnostic and can be used to assess future feature attribution method proposals as well. Code is available at https://github.com/YilunZhou/feature-attribution-evaluation.