To what extent do human explanations of model behavior align with actual model behavior?

Dec 24, 2020


View paper on arXiv
