Leakage-Adjusted Simulatability: Can Models Generate Non-Trivial Explanations of Their Behavior in Natural Language?

Oct 08, 2020


