Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:PatchUp: A Regularization Technique for Convolutional Neural Networks

Jun 14, 2020

Mojtaba Faramarzi, Mohammad Amini, Akilesh Badrinaaraayanan, Vikas Verma, Sarath Chandar

Figure 1 for PatchUp: A Regularization Technique for Convolutional Neural Networks

Figure 2 for PatchUp: A Regularization Technique for Convolutional Neural Networks

Figure 3 for PatchUp: A Regularization Technique for Convolutional Neural Networks

Figure 4 for PatchUp: A Regularization Technique for Convolutional Neural Networks

Share this with someone who'll enjoy it:

Abstract:Large capacity deep learning models are often prone to a high generalization gap when trained with a limited amount of labeled training data. A recent class of methods to address this problem uses various ways to construct a new training sample by mixing a pair (or more) of training samples. We propose PatchUp, a hidden state block-level regularization technique for Convolutional Neural Networks (CNNs), that is applied on selected contiguous blocks of feature maps from a random pair of samples. Our approach improves the robustness of CNN models against the manifold intrusion problem that may occur in other state-of-the-art mixing approaches like Mixup and CutMix. Moreover, since we are mixing the contiguous block of features in the hidden space, which has more dimensions than the input space, we obtain more diverse samples for training towards different dimensions. Our experiments on CIFAR-10, CIFAR-100, and SVHN datasets with PreactResnet18, PreactResnet34, and WideResnet-28-10 models show that PatchUp improves upon, or equals, the performance of current state-of-the-art regularizers for CNNs. We also show that PatchUp can provide better generalization to affine transformations of samples and is more robust against adversarial attacks.

View paper on

Share this with someone who'll enjoy it:

Title:PatchUp: A Regularization Technique for Convolutional Neural Networks

Paper and Code