Dingding Wang

Frosting Weights for Better Continual Training

Jan 07, 2020

Xiaofeng Zhu, Feng Liu, Goce Trajcevski, Dingding Wang

Abstract: Training a neural network model can be a lifelong learning process and is computationally intensive. A severe adverse effect in deep neural network models is catastrophic forgetting: performance on previously learned data degrades when the model is retrained on new data. The additive nature of ensemble models is an appealing property for avoiding such disruptions in continual learning. In this paper, we propose two generic ensemble approaches, gradient boosting and meta-learning, to solve the catastrophic forgetting problem in tuning pre-trained neural network models.
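The additive idea behind the gradient-boosting approach can be illustrated with a toy sketch. This is not the authors' method: the abstract gives no implementation details, so the linear base model, the squared-feature correction learner, the synthetic data, and the `shrinkage` parameter below are all assumptions made for illustration. The only point shown is that a new learner can be fit to the residuals of a frozen base model, leaving the base parameters untouched.

```python
import numpy as np

def fit_linear(X, y):
    """Least-squares fit with a bias column; returns the weight vector."""
    Xb = np.hstack([X, np.ones((X.shape[0], 1))])
    w, *_ = np.linalg.lstsq(Xb, y, rcond=None)
    return w

def predict_linear(w, X):
    """Apply a weight vector produced by fit_linear."""
    Xb = np.hstack([X, np.ones((X.shape[0], 1))])
    return Xb @ w

def feats(X):
    """Hypothetical feature map for the correction learner (assumption)."""
    return X ** 2

rng = np.random.default_rng(0)
true_w = np.array([1.0, -2.0, 0.5])

# "Old" data: purely linear relationship.
X_old = rng.normal(size=(200, 3))
y_old = X_old @ true_w

# "New" data: same linear part plus a pattern the base model never saw.
X_new = rng.normal(size=(200, 3))
y_new = X_new @ true_w + 0.8 * X_new[:, 0] ** 2

# Base model is trained once and then frozen.
base_w = fit_linear(X_old, y_old)
base_w_snapshot = base_w.copy()

# Boosting-style step: fit a second learner to the residuals of the
# frozen base model on the new data instead of retraining the base.
residual = y_new - predict_linear(base_w, X_new)
corr_w = fit_linear(feats(X_new), residual)

def ensemble_predict(X, shrinkage=1.0):
    """Additive ensemble: frozen base plus scaled residual learner."""
    return predict_linear(base_w, X) + shrinkage * predict_linear(corr_w, feats(X))

base_mse_new = np.mean((predict_linear(base_w, X_new) - y_new) ** 2)
ens_mse_new = np.mean((ensemble_predict(X_new) - y_new) ** 2)
print(f"base MSE on new data:     {base_mse_new:.4f}")
print(f"ensemble MSE on new data: {ens_mse_new:.4f}")
```

Setting `shrinkage=0.0` recovers the original base model exactly, which is the sense in which the update is additive. The sketch does not address how the correction affects predictions on the old data; that trade-off is what the paper's approaches are concerned with.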

On Computation and Generalization of GANs with Spectrum Control

Dec 28, 2018

Haoming Jiang, Zhehui Chen, Minshuo Chen, Feng Liu, Dingding Wang, Tuo Zhao

Abstract: Generative Adversarial Networks (GANs), though powerful, are hard to train. Several recent works (Brock et al., 2016; Miyato et al., 2018) suggest that controlling the spectra of weight matrices in the discriminator can significantly improve the training of GANs. Motivated by their discovery, we propose a new framework for training GANs, which allows more flexible spectrum control (e.g., making the weight matrices of the discriminator have slow singular value decays). Specifically, we propose a new reparameterization approach for the weight matrices of the discriminator in GANs, which allows us to directly manipulate the spectra of the weight matrices through various regularizers and constraints, without computationally intensive singular value decompositions. Theoretically, we further show that spectrum control improves the generalization ability of GANs. Our experiments on the CIFAR-10, STL-10, and ImageNet datasets confirm that, compared to other methods, our proposed method is capable of generating images of competitive quality by utilizing spectral normalization and encouraging slow singular value decay.

* Seventh International Conference on Learning Representations, ICLR 2019
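A minimal sketch of what such a reparameterization might look like is given below. The abstract does not spell out the parameterization, so this assumes an SVD-like factorization W = U diag(s) Vᵀ in which `U`, `s`, and `V` are free parameters, with an orthogonality penalty on `U` and `V` standing in for the regularizers mentioned. The function names `orth_penalty` and `compose` are hypothetical. When `U` and `V` have orthonormal columns, the entries of `s` are the singular values of W, so the spectrum can be rescaled directly without running an SVD during training.

```python
import numpy as np

def orth_penalty(Q):
    """Squared Frobenius distance of Q^T Q from the identity.

    Assumed regularizer: it is zero exactly when Q has orthonormal columns.
    """
    G = Q.T @ Q - np.eye(Q.shape[1])
    return float(np.sum(G ** 2))

def compose(U, s, V):
    """Build W = U diag(s_norm) V^T with the largest |s| rescaled to 1.

    Rescaling s plays the role of spectral normalization in this sketch.
    """
    s_norm = s / np.max(np.abs(s))
    return (U * s_norm) @ V.T, s_norm

rng = np.random.default_rng(0)
m, n = 6, 4
r = min(m, n)

# For the demo only, start from exactly orthonormal factors via QR.
# In training, U and V would instead be kept near-orthonormal by
# adding orth_penalty(U) + orth_penalty(V) to the loss (assumption).
U, _ = np.linalg.qr(rng.normal(size=(m, r)))
V, _ = np.linalg.qr(rng.normal(size=(n, r)))
s = rng.uniform(0.5, 2.0, size=r)  # positive "singular value" parameters

W, s_norm = compose(U, s, V)

# Verification only: an explicit SVD confirms that s_norm is the spectrum.
sv = np.linalg.svd(W, compute_uv=False)
print("parameterized spectrum:", np.sort(s_norm)[::-1])
print("spectrum from SVD:     ", sv)
print("orthogonality penalty: ", orth_penalty(U) + orth_penalty(V))
```

A penalty that encourages slow singular value decay could then be written as a function of `s` alone; its exact form is not given in the abstract and is left out here.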
