
Chubiao Song

Robust Local Features for Improving the Generalization of Adversarial Training

Sep 23, 2019

Chubiao Song, Kun He, Jiadong Lin, Liwei Wang, John E. Hopcroft

Figures 1-4 for Robust Local Features for Improving the Generalization of Adversarial Training

Abstract: Adversarial training has been demonstrated to be one of the most effective methods for training robust models that defend against adversarial examples. However, adversarial training often lacks adversarially robust generalization on unseen data. Recent works show that adversarially trained models may be more biased towards global structure features. In this work, we instead investigate the relationship between the generalization of adversarial training and robust local features, as local features generalize well to unseen shape variation. To learn the robust local features, we develop a Random Block Shuffle (RBS) transformation to break up the global structure features on normal adversarial examples. We then propose a new approach called Robust Local Features for Adversarial Training (RLFAT), which first learns the robust local features by adversarial training on the RBS-transformed adversarial examples, and then transfers the robust local features into the training of normal adversarial examples. Finally, we implement RLFAT in two current state-of-the-art adversarial training frameworks. Extensive experiments on the STL-10, CIFAR-10, and CIFAR-100 datasets show that RLFAT improves the adversarially robust generalization as well as the standard generalization of adversarial training. Additionally, we demonstrate that our method captures more local features of the object, aligning better with human perception.
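The abstract does not spell out how the Random Block Shuffle transformation is computed, so the following is only a minimal NumPy sketch of one plausible reading: cut the image into `k` strips along its height, permute them at random, then do the same along its width. The function name `random_block_shuffle`, the parameter `k`, and the strip-wise scheme are assumptions made for illustration, not details confirmed by the text above.

```python
import numpy as np


def random_block_shuffle(image, k=2, rng=None):
    """Illustrative block shuffle (assumed scheme, not the authors' code).

    image: array of shape (H, W, C).
    k:     number of strips per spatial axis (hypothetical parameter).
    rng:   optional numpy.random.Generator for reproducibility.
    """
    rng = np.random.default_rng() if rng is None else rng
    out = image
    # Shuffle strips along the height axis first, then along the width axis.
    for axis in (0, 1):
        blocks = np.array_split(out, k, axis=axis)
        order = rng.permutation(k)
        out = np.concatenate([blocks[i] for i in order], axis=axis)
    return out
```

Under this reading, pixel values and local patches inside each block are untouched while the arrangement of blocks, and therefore the global shape of the object, is scrambled. In a training loop of the kind the abstract describes, such a function would be applied to adversarial examples before they are fed to the network in the local-feature learning stage.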
