Abstract: When solving long-horizon tasks, it is often beneficial to decompose the high-level task into subtasks. Decomposing experiences into reusable subtasks can improve data efficiency, accelerate policy generalization, and in general provide promising solutions to multi-task reinforcement learning and imitation learning problems. However, the concept of subtasks is not yet sufficiently understood or modeled, and existing works often overlook the true structure of the data generation process: subtasks are the results of a $\textit{selection}$ mechanism on actions, rather than possible underlying confounders or intermediates. Specifically, we provide a theory for identifying, and experiments verifying, the existence of selection variables in such data. These selections serve as subgoals that indicate subtasks and guide the policy. In light of this idea, we develop a sequential non-negative matrix factorization (seq-NMF) method to learn these subgoals and extract meaningful behavior patterns as subtasks. Our empirical results on a challenging Kitchen environment demonstrate that the learned subtasks effectively enhance generalization to new tasks in multi-task imitation learning scenarios. The code is provided at https://anonymous.4open.science/r/Identifying_Selections_for_Unsupervised_Subtask_Discovery/README.md.
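To give a rough sense of the sequential non-negative matrix factorization the abstract refers to, the sketch below implements a generic convolutive NMF with multiplicative updates on a non-negative data matrix. It is not the authors' released code (see the repository linked above); the matrix `X`, the number of motifs `K`, the motif length `L`, and the Frobenius-cost update rule are all illustrative assumptions.

```python
# Minimal sketch of sequential (convolutive) NMF with multiplicative updates.
# Assumptions: X is a non-negative N x T matrix; K motifs of length L are sought.
import numpy as np

def seq_nmf(X, K=3, L=5, n_iter=200, eps=1e-10, seed=0):
    """Factor X (N x T) into temporal motifs W (N x K x L) and activations H (K x T),
    so that X is approximately sum_l W[:, :, l] @ shift(H, l)."""
    rng = np.random.default_rng(seed)
    N, T = X.shape
    W = rng.random((N, K, L))
    H = rng.random((K, T))

    def shift(M, l):
        # Shift columns of M by l (right if l > 0, left if l < 0), zero-padding.
        out = np.zeros_like(M)
        if l == 0:
            out[:] = M
        elif l > 0:
            out[:, l:] = M[:, :-l]
        else:
            out[:, :l] = M[:, -l:]
        return out

    def reconstruct(W, H):
        # X_hat[:, t] = sum_l W[:, :, l] @ H[:, t - l]
        return sum(W[:, :, l] @ shift(H, l) for l in range(L))

    for _ in range(n_iter):
        # Multiplicative update for each temporal slice of W.
        X_hat = reconstruct(W, H)
        for l in range(L):
            Hl = shift(H, l)
            W[:, :, l] *= (X @ Hl.T) / (X_hat @ Hl.T + eps)
        # Multiplicative update for H, pooling contributions across lags.
        X_hat = reconstruct(W, H)
        num = sum(W[:, :, l].T @ shift(X, -l) for l in range(L))
        den = sum(W[:, :, l].T @ shift(X_hat, -l) for l in range(L))
        H *= num / (den + eps)

    return W, H
```

In the subtask-discovery setting described above, one would (under these assumptions) stack trajectory features as the columns of `X`, read the recurring temporal motifs in `W` as candidate subtask behavior patterns, and use the activations in `H` to indicate when each subtask (subgoal) is selected.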