Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Active Measure Reinforcement Learning for Observation Cost Minimization

May 26, 2020

Colin Bellinger, Rory Coles, Mark Crowley, Isaac Tamblyn

Figure 1 for Active Measure Reinforcement Learning for Observation Cost Minimization

Figure 2 for Active Measure Reinforcement Learning for Observation Cost Minimization

Figure 3 for Active Measure Reinforcement Learning for Observation Cost Minimization

Figure 4 for Active Measure Reinforcement Learning for Observation Cost Minimization

Share this with someone who'll enjoy it:

Abstract:Standard reinforcement learning (RL) algorithms assume that the observation of the next state comes instantaneously and at no cost. In a wide variety of sequential decision making tasks ranging from medical treatment to scientific discovery, however, multiple classes of state observations are possible, each of which has an associated cost. We propose the active measure RL framework (Amrl) as an initial solution to this problem where the agent learns to maximize the costed return, which we define as the discounted sum of rewards minus the sum of observation costs. Our empirical evaluation demonstrates that Amrl-Q agents are able to learn a policy and state estimator in parallel during online training. During training the agent naturally shifts from its reliance on costly measurements of the environment to its state estimator in order to increase its reward. It does this without harm to the learned policy. Our results show that the Amrl-Q agent learns at a rate similar to standard Q-learning and Dyna-Q. Critically, by utilizing an active strategy, Amrl-Q achieves a higher costed return.

* Under review at NeurIPS 2020

View paper on

Share this with someone who'll enjoy it:

Title:Active Measure Reinforcement Learning for Observation Cost Minimization

Paper and Code