It is often challenging to design reward functions by hand for complex, real-world tasks. Reward learning lets one instead infer a reward function from data. However, multiple reward functions often fit the data equally well, even in the infinite-data limit; the reward function is then only partially identifiable. Prior work often treats the reward function as uniquely recoverable by imposing additional assumptions on the data source. By contrast, we formally characterise the partial identifiability of the reward function under popular data sources, including expert demonstrations and trajectory preferences, for multiple common sets of assumptions. We analyse the impact of this partial identifiability on downstream tasks such as policy optimisation, including under changes in environment dynamics. We unify our results in a framework for comparing data sources and downstream tasks by their invariances, with implications for the design and selection of data sources for reward learning.
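To illustrate the kind of ambiguity at stake, consider a standard example (stated here under the assumption of a discounted MDP with a Boltzmann-rational demonstrator, not quoted from our results): for any potential function $\Phi$ over states and discount $\gamma$, the shaped reward
\[
R'(s, a, s') \;=\; R(s, a, s') + \gamma \Phi(s') - \Phi(s)
\]
shifts every optimal Q-value by $-\Phi(s)$, since the potential terms telescope along trajectories, and therefore leaves both the optimal policies and the Boltzmann-rational action distribution $\pi(a \mid s) \propto \exp\bigl(\beta\, Q^*(s, a)\bigr)$ unchanged (Ng et al., 1999). Demonstrations alone thus cannot distinguish $R$ from $R'$, no matter how much data is collected.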