Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Eura Nofshin

A Sim2Real Approach for Identifying Task-Relevant Properties in Interpretable Machine Learning

May 31, 2024

Eura Nofshin, Esther Brown, Brian Lim, Weiwei Pan, Finale Doshi-Velez

Figure 1 for A Sim2Real Approach for Identifying Task-Relevant Properties in Interpretable Machine Learning

Figure 2 for A Sim2Real Approach for Identifying Task-Relevant Properties in Interpretable Machine Learning

Figure 3 for A Sim2Real Approach for Identifying Task-Relevant Properties in Interpretable Machine Learning

Figure 4 for A Sim2Real Approach for Identifying Task-Relevant Properties in Interpretable Machine Learning

Abstract:Existing user studies suggest that different tasks may require explanations with different properties. However, user studies are expensive. In this paper, we introduce a generalizable, cost-effective method for identifying task-relevant explanation properties in silico, which can guide the design of more expensive user studies. We use our approach to identify relevant proxies for three example tasks and validate our simulation with real user studies.

Via

Access Paper or Ask Questions

Reinforcement Learning Interventions on Boundedly Rational Human Agents in Frictionful Tasks

Jan 26, 2024

Eura Nofshin, Siddharth Swaroop, Weiwei Pan, Susan Murphy, Finale Doshi-Velez

Abstract:Many important behavior changes are frictionful; they require individuals to expend effort over a long period with little immediate gratification. Here, an artificial intelligence (AI) agent can provide personalized interventions to help individuals stick to their goals. In these settings, the AI agent must personalize rapidly (before the individual disengages) and interpretably, to help us understand the behavioral interventions. In this paper, we introduce Behavior Model Reinforcement Learning (BMRL), a framework in which an AI agent intervenes on the parameters of a Markov Decision Process (MDP) belonging to a boundedly rational human agent. Our formulation of the human decision-maker as a planning agent allows us to attribute undesirable human policies (ones that do not lead to the goal) to their maladapted MDP parameters, such as an extremely low discount factor. Furthermore, we propose a class of tractable human models that captures fundamental behaviors in frictionful tasks. Introducing a notion of MDP equivalence specific to BMRL, we theoretically and empirically show that AI planning with our human models can lead to helpful policies on a wide range of more complex, ground-truth humans.

* In AAMAS 2024

Via

Access Paper or Ask Questions