Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Optimal Transport for Offline Imitation Learning

Mar 24, 2023

Yicheng Luo, Zhengyao Jiang, Samuel Cohen, Edward Grefenstette, Marc Peter Deisenroth

Figure 1 for Optimal Transport for Offline Imitation Learning

Figure 2 for Optimal Transport for Offline Imitation Learning

Figure 3 for Optimal Transport for Offline Imitation Learning

Figure 4 for Optimal Transport for Offline Imitation Learning

Share this with someone who'll enjoy it:

Abstract:With the advent of large datasets, offline reinforcement learning (RL) is a promising framework for learning good decision-making policies without the need to interact with the real environment. However, offline RL requires the dataset to be reward-annotated, which presents practical challenges when reward engineering is difficult or when obtaining reward annotations is labor-intensive. In this paper, we introduce Optimal Transport Reward labeling (OTR), an algorithm that assigns rewards to offline trajectories, with a few high-quality demonstrations. OTR's key idea is to use optimal transport to compute an optimal alignment between an unlabeled trajectory in the dataset and an expert demonstration to obtain a similarity measure that can be interpreted as a reward, which can then be used by an offline RL algorithm to learn the policy. OTR is easy to implement and computationally efficient. On D4RL benchmarks, we show that OTR with a single demonstration can consistently match the performance of offline RL with ground-truth rewards.

* Published in ICLR 2023

View paper on

OpenReview

Share this with someone who'll enjoy it:

Title:Optimal Transport for Offline Imitation Learning

Paper and Code