Rewards play an essential role in reinforcement learning. In contrast to rule-based game environments with well-defined reward functions, complex real-world robotic applications, such as contact-rich manipulation, lack explicit and informative descriptions that can directly be used as a reward. Prior work has shown that it is possible to algorithmically extract dense rewards directly from multimodal observations. In this paper, we aim to extend this line of work by proposing a more efficient and robust way of sampling and learning. In particular, our sampling approach utilizes temporal variance to simulate the fluctuating state and action distributions of a manipulation task. We then propose a network architecture for self-supervised learning that better incorporates temporal information into the latent representations. We evaluate our approach on two experimental setups, namely joint assembly and door opening. Preliminary results show that our approach is effective and efficient at learning dense rewards, and that the learned rewards lead to faster convergence than baselines.