Hiwot Belay Tadesse

Directly Optimizing Explanations for Desired Properties

Oct 31, 2024

Hiwot Belay Tadesse, Alihan Hüyük, Weiwei Pan, Finale Doshi-Velez

Abstract: When explaining black-box machine learning models, it's often important for explanations to have certain desirable properties. Most existing methods "encourage" desirable properties in their construction of explanations. In this work, we demonstrate that these forms of encouragement do not consistently create explanations with the properties that are supposedly being targeted. Moreover, they do not allow for any control over which properties are prioritized when different properties are at odds with each other. We propose to directly optimize explanations for desired properties. Our direct approach not only produces explanations with optimal properties more consistently but also empowers users to control trade-offs between different properties, allowing them to create explanations with exactly what is needed for a particular task.
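
To make the idea of trading off properties concrete, here is a minimal sketch in Python. It is not the authors' formulation; it only illustrates what directly optimizing explanations under a user-chosen weight could look like. Every specific is an assumption made for illustration: the explanations are feature-attribution vectors `E_i`, "faithfulness" is squared distance to the model gradient `g_i`, "robustness" is squared difference between explanations of neighbouring inputs, and the names `optimize_explanations`, `faithfulness_loss`, `robustness_loss`, and `lam` are hypothetical.

```python
import numpy as np


def optimize_explanations(grads, edges, lam):
    """Minimise sum_i ||E_i - g_i||^2 + lam * sum_{(i,j)} ||E_i - E_j||^2.

    Assumed setup (not from the paper): `grads` holds one model gradient per
    input, `edges` lists pairs of neighbouring inputs, and `lam` is the
    user-chosen weight on robustness. The objective is quadratic, so the
    minimiser solves the linear system (I + lam * L) E = G, where L is the
    graph Laplacian built from `edges`.
    """
    n = grads.shape[0]
    lap = np.zeros((n, n))
    for i, j in edges:
        lap[i, i] += 1.0
        lap[j, j] += 1.0
        lap[i, j] -= 1.0
        lap[j, i] -= 1.0
    return np.linalg.solve(np.eye(n) + lam * lap, grads)


def faithfulness_loss(E, grads):
    # Squared distance between each explanation and the model gradient.
    return float(np.sum((E - grads) ** 2))


def robustness_loss(E, edges):
    # Squared difference between explanations of neighbouring inputs.
    return float(sum(np.sum((E[i] - E[j]) ** 2) for i, j in edges))


# Toy data: 5 inputs with 3 features each, neighbours arranged in a chain.
rng = np.random.default_rng(0)
grads = rng.normal(size=(5, 3))
edges = [(0, 1), (1, 2), (2, 3), (3, 4)]

for lam in (0.0, 1.0, 10.0):
    E = optimize_explanations(grads, edges, lam)
    print(lam, faithfulness_loss(E, grads), robustness_loss(E, edges))
```

In this toy setup, `lam = 0` reproduces the gradients exactly, and increasing `lam` buys smoother explanations at the cost of faithfulness, so the weight is an explicit control over which property is prioritized.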
