Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:SF-DQN: Provable Knowledge Transfer using Successor Feature for Deep Reinforcement Learning

May 24, 2024

Shuai Zhang, Heshan Devaka Fernando, Miao Liu, Keerthiram Murugesan, Songtao Lu, Pin-Yu Chen, Tianyi Chen, Meng Wang

Figure 1 for SF-DQN: Provable Knowledge Transfer using Successor Feature for Deep Reinforcement Learning

Figure 2 for SF-DQN: Provable Knowledge Transfer using Successor Feature for Deep Reinforcement Learning

Figure 3 for SF-DQN: Provable Knowledge Transfer using Successor Feature for Deep Reinforcement Learning

Figure 4 for SF-DQN: Provable Knowledge Transfer using Successor Feature for Deep Reinforcement Learning

Share this with someone who'll enjoy it:

Abstract:This paper studies the transfer reinforcement learning (RL) problem where multiple RL problems have different reward functions but share the same underlying transition dynamics. In this setting, the Q-function of each RL problem (task) can be decomposed into a successor feature (SF) and a reward mapping: the former characterizes the transition dynamics, and the latter characterizes the task-specific reward function. This Q-function decomposition, coupled with a policy improvement operator known as generalized policy improvement (GPI), reduces the sample complexity of finding the optimal Q-function, and thus the SF \& GPI framework exhibits promising empirical performance compared to traditional RL methods like Q-learning. However, its theoretical foundations remain largely unestablished, especially when learning the successor features using deep neural networks (SF-DQN). This paper studies the provable knowledge transfer using SFs-DQN in transfer RL problems. We establish the first convergence analysis with provable generalization guarantees for SF-DQN with GPI. The theory reveals that SF-DQN with GPI outperforms conventional RL approaches, such as deep Q-network, in terms of both faster convergence rate and better generalization. Numerical experiments on real and synthetic RL tasks support the superior performance of SF-DQN \& GPI, aligning with our theoretical findings.

* arXiv admin note: text overlap with arXiv:2310.16173

View paper on

Share this with someone who'll enjoy it:

Title:SF-DQN: Provable Knowledge Transfer using Successor Feature for Deep Reinforcement Learning

Paper and Code