Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:TGANv2: Efficient Training of Large Models for Video Generation with Multiple Subsampling Layers

Nov 22, 2018

Masaki Saito, Shunta Saito

Figure 1 for TGANv2: Efficient Training of Large Models for Video Generation with Multiple Subsampling Layers

Figure 2 for TGANv2: Efficient Training of Large Models for Video Generation with Multiple Subsampling Layers

Figure 3 for TGANv2: Efficient Training of Large Models for Video Generation with Multiple Subsampling Layers

Figure 4 for TGANv2: Efficient Training of Large Models for Video Generation with Multiple Subsampling Layers

Share this with someone who'll enjoy it:

Abstract:In this paper, we propose a novel method to efficiently train a Generative Adversarial Network (GAN) on high dimensional samples. The key idea is to introduce a differentiable subsampling layer which appropriately reduces the dimensionality of intermediate feature maps in the generator during training. In general, generators require large memory and computational costs in the latter stages of the network as the feature maps become larger, though the latter stages have relatively fewer parameters than the earlier stages. It makes training large models for video generation difficult due to the limited computational resource. We solve this problem by introducing a method that gradually reduces the dimensionality of feature maps in the generator with multiple subsampling layers. We also propose a network (Temporal GAN v2) with such layers and perform video generation experiments. As a consequence, our model trained on the UCF101 dataset at $192 \times 192$ pixels achieves an Inception Score (IS) of 24.34, which shows a significant improvement over the previous state-of-the-art score of 14.56.

* The code will be released soon

View paper on

Share this with someone who'll enjoy it:

Title:TGANv2: Efficient Training of Large Models for Video Generation with Multiple Subsampling Layers

Paper and Code