Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:KeyIn: Discovering Subgoal Structure with Keyframe-based Video Prediction

Apr 11, 2019

Karl Pertsch, Oleh Rybkin, Jingyun Yang, Kosta Derpanis, Joseph Lim, Kostas Daniilidis, Andrew Jaegle

Figure 1 for KeyIn: Discovering Subgoal Structure with Keyframe-based Video Prediction

Figure 2 for KeyIn: Discovering Subgoal Structure with Keyframe-based Video Prediction

Figure 3 for KeyIn: Discovering Subgoal Structure with Keyframe-based Video Prediction

Figure 4 for KeyIn: Discovering Subgoal Structure with Keyframe-based Video Prediction

Share this with someone who'll enjoy it:

Abstract:Real-world image sequences can often be naturally decomposed into a small number of frames depicting interesting, highly stochastic moments (its $\textit{keyframes}$) and the low-variance frames in between them. In image sequences depicting trajectories to a goal, keyframes can be seen as capturing the $\textit{subgoals}$ of the sequence as they depict the high-variance moments of interest that ultimately led to the goal. In this paper, we introduce a video prediction model that discovers the keyframe structure of image sequences in an unsupervised fashion. We do so using a hierarchical Keyframe-Intermediate model (KeyIn) that stochastically predicts keyframes and their offsets in time and then uses these predictions to deterministically predict the intermediate frames. We propose a differentiable formulation of this problem that allows us to train the full hierarchical model using a sequence reconstruction loss. We show that our model is able to find meaningful keyframe structure in a simulated dataset of robotic demonstrations and that these keyframes can serve as subgoals for planning. Our model outperforms other hierarchical prediction approaches for planning on a simulated pushing task.

* 8 pages + 5 pages of references and appendices

View paper on

Share this with someone who'll enjoy it:

Title:KeyIn: Discovering Subgoal Structure with Keyframe-based Video Prediction

Paper and Code