Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Wonho Choo

Co-Mixup: Saliency Guided Joint Mixup with Supermodular Diversity

Feb 05, 2021

Jang-Hyun Kim, Wonho Choo, Hosan Jeong, Hyun Oh Song

Figure 1 for Co-Mixup: Saliency Guided Joint Mixup with Supermodular Diversity

Figure 2 for Co-Mixup: Saliency Guided Joint Mixup with Supermodular Diversity

Figure 3 for Co-Mixup: Saliency Guided Joint Mixup with Supermodular Diversity

Figure 4 for Co-Mixup: Saliency Guided Joint Mixup with Supermodular Diversity

Abstract:While deep neural networks show great performance on fitting to the training distribution, improving the networks' generalization performance to the test distribution and robustness to the sensitivity to input perturbations still remain as a challenge. Although a number of mixup based augmentation strategies have been proposed to partially address them, it remains unclear as to how to best utilize the supervisory signal within each input data for mixup from the optimization perspective. We propose a new perspective on batch mixup and formulate the optimal construction of a batch of mixup data maximizing the data saliency measure of each individual mixup data and encouraging the supermodular diversity among the constructed mixup data. This leads to a novel discrete optimization problem minimizing the difference between submodular functions. We also propose an efficient modular approximation based iterative submodular minimization algorithm for efficient mixup computation per each minibatch suitable for minibatch based neural network training. Our experiments show the proposed method achieves the state of the art generalization, calibration, and weakly supervised localization results compared to other mixup methods. The source code is available at https://github.com/snu-mllab/Co-Mixup.

* Published at ICLR 2021 (Oral)

Via

Access Paper or Ask Questions

Puzzle Mix: Exploiting Saliency and Local Statistics for Optimal Mixup

Sep 15, 2020

Jang-Hyun Kim, Wonho Choo, Hyun Oh Song

Figure 1 for Puzzle Mix: Exploiting Saliency and Local Statistics for Optimal Mixup

Figure 2 for Puzzle Mix: Exploiting Saliency and Local Statistics for Optimal Mixup

Figure 3 for Puzzle Mix: Exploiting Saliency and Local Statistics for Optimal Mixup

Figure 4 for Puzzle Mix: Exploiting Saliency and Local Statistics for Optimal Mixup

Abstract:While deep neural networks achieve great performance on fitting the training distribution, the learned networks are prone to overfitting and are susceptible to adversarial attacks. In this regard, a number of mixup based augmentation methods have been recently proposed. However, these approaches mainly focus on creating previously unseen virtual examples and can sometimes provide misleading supervisory signal to the network. To this end, we propose Puzzle Mix, a mixup method for explicitly utilizing the saliency information and the underlying statistics of the natural examples. This leads to an interesting optimization problem alternating between the multi-label objective for optimal mixing mask and saliency discounted optimal transport objective. Our experiments show Puzzle Mix achieves the state of the art generalization and the adversarial robustness results compared to other mixup methods on CIFAR-100, Tiny-ImageNet, and ImageNet datasets. The source code is available at https://github.com/snu-mllab/PuzzleMix.

* Published at ICML 2020

Via

Access Paper or Ask Questions