Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Which Experiences Are Influential for RL Agents? Efficiently Estimating The Influence of Experiences

May 23, 2024

Takuya Hiraoka, Guanquan Wang, Takashi Onishi, Yoshimasa Tsuruoka

Figure 1 for Which Experiences Are Influential for RL Agents? Efficiently Estimating The Influence of Experiences

Figure 2 for Which Experiences Are Influential for RL Agents? Efficiently Estimating The Influence of Experiences

Figure 3 for Which Experiences Are Influential for RL Agents? Efficiently Estimating The Influence of Experiences

Figure 4 for Which Experiences Are Influential for RL Agents? Efficiently Estimating The Influence of Experiences

Share this with someone who'll enjoy it:

Abstract:In reinforcement learning (RL) with experience replay, experiences stored in a replay buffer influence the RL agent's performance. Information about the influence of these experiences is valuable for various purposes, such as identifying experiences that negatively influence poorly performing RL agents. One method for estimating the influence of experiences is the leave-one-out (LOO) method. However, this method is usually computationally prohibitive. In this paper, we present Policy Iteration with Turn-over Dropout (PIToD), which efficiently estimates the influence of experiences. We evaluate how accurately PIToD estimates the influence of experiences and its efficiency compared to LOO. We then apply PIToD to amend poorly performing RL agents, i.e., we use PIToD to estimate negatively influential experiences for the RL agents and to delete the influence of these experiences. We show that RL agents' performance is significantly improved via amendments with PIToD.

* Source code: https://github.com/TakuyaHiraoka/Which-Experiences-Are-Influential-for-RL-Agents

View paper on

Share this with someone who'll enjoy it:

Title:Which Experiences Are Influential for RL Agents? Efficiently Estimating The Influence of Experiences

Paper and Code