Title: HARP: Autoregressive Latent Video Prediction with High-Fidelity Image Generator

Sep 15, 2022

Younggyo Seo, Kimin Lee, Fangchen Liu, Stephen James, Pieter Abbeel

Abstract: Video prediction is an important yet challenging problem, as it requires both generating future frames and learning environment dynamics. Recently, autoregressive latent video models have proved to be a powerful video prediction tool by separating video prediction into two sub-problems: pre-training an image generator model, followed by learning an autoregressive prediction model in the latent space of the image generator. However, successful generation of high-fidelity, high-resolution videos has yet to be demonstrated. In this work, we investigate how to train an autoregressive latent video prediction model capable of predicting high-fidelity future frames with minimal modification to existing models, and produce high-resolution (256x256) videos. Specifically, we scale up prior models by employing a high-fidelity image generator (VQ-GAN) with a causal transformer model, and introduce the additional techniques of top-k sampling and data augmentation to further improve video prediction quality. Despite its simplicity, the proposed method achieves performance competitive with state-of-the-art approaches on standard video prediction benchmarks with fewer parameters, and enables high-resolution video prediction on complex and large-scale datasets. Videos are available at https://sites.google.com/view/harp-videos/home.

* Extended draft of the paper accepted to ICIP 2022 conference
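
The abstract names top-k sampling as one of the techniques used when generating latent tokens autoregressively. As a rough illustration of the general idea only, the sketch below samples one token index from the k highest-scoring logits. It is not the authors' implementation: the function name `top_k_sample`, the plain-list logits, and the value of `k` are all assumptions made for this example.

```python
import math
import random


def top_k_sample(logits, k, rng=None):
    """Sample an index from the k highest-scoring entries of `logits`.

    Hypothetical helper for illustration; not taken from the HARP code.
    """
    if not 1 <= k <= len(logits):
        raise ValueError("k must be between 1 and len(logits)")
    rng = rng or random.Random()

    # Indices of the k largest logits, best first.
    top = sorted(range(len(logits)), key=lambda i: logits[i], reverse=True)[:k]

    # Softmax restricted to the kept entries; subtracting the largest
    # logit keeps the exponentials numerically stable.
    largest = logits[top[0]]
    weights = [math.exp(logits[i] - largest) for i in top]

    # Draw one index in proportion to the renormalized weights.
    return rng.choices(top, weights=weights, k=1)[0]


# Toy logits over a five-entry codebook (made-up numbers).
logits = [2.0, 0.5, -1.0, 3.0, 0.0]
token = top_k_sample(logits, k=2, rng=random.Random(0))
```

With `k=2`, only indices 3 and 0 (the two largest logits) can ever be drawn, so low-probability tokens that might degrade a predicted frame are excluded from sampling.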