Title: Learning Task-relevant Representations for Generalization via Characteristic Functions of Reward Sequence Distributions

May 20, 2022

Rui Yang, Jie Wang, Zijie Geng, Mingxuan Ye, Shuiwang Ji, Bin Li, Feng Wu

Abstract: Generalization across different environments with the same tasks is critical for successful applications of visual reinforcement learning (RL) in real scenarios. However, visual distractions, which are common in real scenes, can harm the representations learned from high-dimensional observations in visual RL, thus degrading generalization performance. To tackle this problem, we propose a novel approach, Characteristic Reward Sequence Prediction (CRESP), which extracts task-relevant information by learning reward sequence distributions (RSDs), since reward signals are task-relevant in RL and invariant to visual distractions. Specifically, to capture the task-relevant information in RSDs effectively, CRESP introduces an auxiliary task, predicting the characteristic functions of RSDs, to learn task-relevant representations, because high-dimensional distributions can be approximated well through their characteristic functions. Experiments demonstrate that CRESP significantly improves generalization performance on unseen environments, outperforming several state-of-the-art methods on DeepMind Control tasks with different visual distractions.
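To make the idea of comparing reward sequence distributions through their characteristic functions more concrete, here is a minimal sketch in plain Python. It is not the CRESP implementation: the function names (`empirical_cf`, `cf_distance`, `sample_sequences`), the synthetic Gaussian rewards, and the randomly drawn frequency vectors are all assumptions chosen for illustration. The only fact it relies on is the standard definition of the characteristic function of a random vector R, namely E[exp(i<omega, R>)], estimated here by averaging over sampled reward sequences.

```python
import cmath
import random


def empirical_cf(reward_seqs, omega):
    """Estimate E[exp(i * <omega, R>)] by averaging over sampled reward sequences."""
    total = 0j
    for seq in reward_seqs:
        dot = sum(w * r for w, r in zip(omega, seq))
        total += cmath.exp(1j * dot)
    return total / len(reward_seqs)


def cf_distance(seqs_a, seqs_b, omegas):
    """Mean squared gap between two empirical characteristic functions
    evaluated at a finite set of frequency vectors."""
    gaps = [
        abs(empirical_cf(seqs_a, w) - empirical_cf(seqs_b, w)) ** 2
        for w in omegas
    ]
    return sum(gaps) / len(gaps)


# Hypothetical setup: reward sequences of length T with Gaussian rewards.
rng = random.Random(0)
T = 3


def sample_sequences(mean, n):
    """Draw n synthetic reward sequences with i.i.d. N(mean, 1) rewards."""
    return [[rng.gauss(mean, 1.0) for _ in range(T)] for _ in range(n)]


same_a = sample_sequences(0.0, 2000)   # distribution A
same_b = sample_sequences(0.0, 2000)   # fresh samples from distribution A
shifted = sample_sequences(1.0, 2000)  # a different reward distribution

# Frequencies at which the characteristic functions are compared (assumed N(0, I)).
omegas = [[rng.gauss(0.0, 1.0) for _ in range(T)] for _ in range(32)]

d_same = cf_distance(same_a, same_b, omegas)
d_diff = cf_distance(same_a, shifted, omegas)
print("same distribution:     ", d_same)
print("different distribution:", d_diff)
```

In this toy setting the gap between two sample sets from the same distribution stays close to zero, while the gap to the shifted distribution is clearly larger. This is the property that makes a characteristic function a usable prediction target for a distribution over reward sequences.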

* Accepted to KDD'22
