
Title: Using Machine Teaching to Investigate Human Assumptions when Teaching Reinforcement Learners

Sep 05, 2020

Yun-Shiuan Chuang, Xuezhou Zhang, Yuzhe Ma, Mark K. Ho, Joseph L. Austerweil, Xiaojin Zhu



Abstract: Successful teaching requires an assumption about how the learner learns, that is, how the learner uses experiences from the world to update its internal states. We investigate what expectations people have about a learner when they teach it online using rewards and punishments. We focus on a common reinforcement learning method, Q-learning, and examine people's assumptions in a behavioral experiment. To do so, we first establish a normative standard by formulating the problem as a machine teaching optimization problem. To solve this optimization problem, we use a deep learning approximation method that simulates learners in the environment and learns to predict how feedback affects the learner's internal states. What do people assume about a learner's learning rate and discount rate when they teach it an idealized exploration-exploitation task? In a behavioral experiment, we find that people can teach the task to Q-learners relatively efficiently and effectively when the learner uses a small discount rate and a large learning rate. However, their teaching is still suboptimal. We also find that giving people real-time updates on how possible feedback would affect the Q-learner's internal states helps them teach only weakly. Our results reveal how people teach using evaluative feedback and provide guidance on how engineers should design machine agents so that they are intuitive for people to teach.
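For readers unfamiliar with the two parameters the abstract refers to, the following is a minimal sketch of the standard tabular Q-learning update, with the teacher's feedback playing the role of the reward. It is not the authors' code: the function name `q_update`, the state and action labels, and the specific values of `alpha` (learning rate) and `gamma` (discount rate) are illustrative assumptions.

```python
from collections import defaultdict


def q_update(q, state, action, feedback, next_state, actions, alpha, gamma):
    """Apply one tabular Q-learning step.

    `feedback` is the teacher's reward (positive) or punishment (negative).
    `alpha` is the learning rate and `gamma` is the discount rate.
    """
    # Value of the best action available in the next state.
    best_next = max(q[(next_state, a)] for a in actions)
    # Temporal-difference target: immediate feedback plus discounted future value.
    td_target = feedback + gamma * best_next
    # Move the current estimate toward the target by a fraction alpha.
    q[(state, action)] += alpha * (td_target - q[(state, action)])
    return q[(state, action)]


# Hypothetical two-action task; all Q-values start at zero.
actions = ["left", "right"]

# Learner with a large learning rate and a small discount rate.
q_fast = defaultdict(float)
v_fast = q_update(q_fast, "s0", "right", 1.0, "s1", actions, alpha=0.9, gamma=0.1)

# Learner with a small learning rate and a large discount rate.
q_slow = defaultdict(float)
v_slow = q_update(q_slow, "s0", "right", 1.0, "s1", actions, alpha=0.1, gamma=0.9)

print(v_fast, v_slow)
```

In this sketch, a single unit of positive feedback moves the first learner's estimate most of the way to the feedback value, while the second learner's estimate moves only slightly. That gives one rough intuition for why a learner with a large learning rate responds more directly to each piece of feedback.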

* 21 pages, 4 figures
