Title: Improving the Generalization of Adversarial Training with Domain Adaptation

Oct 24, 2018

Chuanbiao Song, Kun He, Liwei Wang, John E. Hopcroft

Abstract: By injecting adversarial examples into the training data, adversarial training is a promising way to improve the robustness of deep learning models. However, most existing adversarial training approaches are based on a specific type of adversarial attack, which may not provide sufficiently representative samples from the adversarial domain and leads to weak generalization on adversarial examples from other attacks. To scale to large datasets, the input perturbations used to generate adversarial examples are usually crafted with fast single-step attacks. This work focuses on adversarial training with the single-step yet efficient FGSM adversary. In this scenario, it is difficult to train a model that generalizes well because representative adversarial samples are lacking; that is, the samples do not accurately reflect the adversarial domain. To alleviate this problem, we propose a novel Adversarial Training with Domain Adaptation (ATDA) method. Our intuition is to regard adversarial training on the FGSM adversary as a domain adaptation task with a limited number of target domain samples. The main idea is to learn a representation that is semantically meaningful and domain invariant across both the clean domain and the adversarial domain. Empirical evaluations on Fashion-MNIST, SVHN, CIFAR-10 and CIFAR-100 demonstrate that ATDA greatly improves the generalization of adversarial training and outperforms state-of-the-art methods on standard benchmark datasets.
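
For readers unfamiliar with the single-step FGSM adversary mentioned in the abstract, the following is a minimal sketch of how such an attack perturbs an input: it moves each feature by a fixed step `eps` in the direction of the sign of the loss gradient. It is not the ATDA method and not the authors' code. The logistic-regression model, the helper names (`predict_proba`, `fgsm`), and all numeric values are assumptions chosen only to keep the example self-contained.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def sign(v):
    # Returns -1, 0, or 1.
    return (v > 0) - (v < 0)


def predict_proba(x, w, b):
    """Probability of class 1 under a toy logistic-regression model (assumed)."""
    return sigmoid(sum(wi * xi for wi, xi in zip(w, x)) + b)


def fgsm(x, y, w, b, eps):
    """Single-step FGSM: x_adv = x + eps * sign(grad_x loss).

    For logistic regression with cross-entropy loss and label y in {0, 1},
    the gradient of the loss with respect to the input is (p - y) * w.
    """
    p = predict_proba(x, w, b)
    grad = [(p - y) * wi for wi in w]
    return [xi + eps * sign(g) for xi, g in zip(x, grad)]


# Hypothetical toy model and input, for illustration only.
w = [2.0, -1.0, 0.5]
b = 0.0
x = [0.5, 0.2, 0.1]
y = 1

x_adv = fgsm(x, y, w, b, eps=0.1)
print("clean probability of true class:      ", predict_proba(x, w, b))
print("adversarial probability of true class:", predict_proba(x_adv, w, b))
```

Adversarial training would add examples such as `x_adv` to the training data. The abstract's point is that examples produced by this one attack alone may not represent the adversarial domain well, which is the gap ATDA is proposed to address.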

View paper on OpenReview
