Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Inapplicable Actions Learning for Knowledge Transfer in Reinforcement Learning

Nov 28, 2022

Leo Ardon, Alberto Pozanco, Daniel Borrajo, Sumitra Ganesh

Figure 1 for Inapplicable Actions Learning for Knowledge Transfer in Reinforcement Learning

Figure 2 for Inapplicable Actions Learning for Knowledge Transfer in Reinforcement Learning

Figure 3 for Inapplicable Actions Learning for Knowledge Transfer in Reinforcement Learning

Figure 4 for Inapplicable Actions Learning for Knowledge Transfer in Reinforcement Learning

Share this with someone who'll enjoy it:

Abstract:Reinforcement Learning (RL) algorithms are known to scale poorly to environments with many available actions, requiring numerous samples to learn an optimal policy. The traditional approach of considering the same fixed action space in every possible state implies that the agent must understand, while also learning to maximize its reward, to ignore irrelevant actions such as $\textit{inapplicable actions}$ (i.e. actions that have no effect on the environment when performed in a given state). Knowing this information can help reduce the sample complexity of RL algorithms by masking the inapplicable actions from the policy distribution to only explore actions relevant to finding an optimal policy. This is typically done in an ad-hoc manner with hand-crafted domain logic added to the RL algorithm. In this paper, we propose a more systematic approach to introduce this knowledge into the algorithm. We (i) standardize the way knowledge can be manually specified to the agent; and (ii) present a new framework to autonomously learn these state-dependent action constraints jointly with the policy. We show experimentally that learning inapplicable actions greatly improves the sample efficiency of the algorithm by providing a reliable signal to mask out irrelevant actions. Moreover, we demonstrate that thanks to the transferability of the knowledge acquired, it can be reused in other tasks to make the learning process more efficient.

View paper on

OpenReview

Share this with someone who'll enjoy it:

Title:Inapplicable Actions Learning for Knowledge Transfer in Reinforcement Learning

Paper and Code