Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Adversarial Feature Genome: a Data Driven Adversarial Examples Recognition Method

Dec 25, 2018

Li Chen, Hailun Ding, Qi Li, Jiawei Zhu, Haozhe Huang, Yifan Chang, Haifeng Li

Figure 1 for Adversarial Feature Genome: a Data Driven Adversarial Examples Recognition Method

Figure 2 for Adversarial Feature Genome: a Data Driven Adversarial Examples Recognition Method

Figure 3 for Adversarial Feature Genome: a Data Driven Adversarial Examples Recognition Method

Figure 4 for Adversarial Feature Genome: a Data Driven Adversarial Examples Recognition Method

Share this with someone who'll enjoy it:

Abstract:Convolutional neural networks (CNNs) are easily spoofed by adversarial examples which lead to wrong classification result. Most of the one-way defense methods focus only on how to improve the robustness of a CNN or to identify adversarial examples. They are incapable of identifying and correctly classifying adversarial examples simultaneously due to the lack of an effective way to quantitatively represent changes in the characteristics of the sample within the network. We find that adversarial examples and original ones have diverse representation in the feature space. Moreover, this difference grows as layers go deeper, which we call Adversarial Feature Separability (AFS). Inspired by AFS, we propose an Adversarial Feature Genome (AFG) based adversarial examples defense framework which can detect adversarial examples and correctly classify them into original category simultaneously. First, we extract the representations of adversarial examples and original ones with labels by the group visualization method. Then, we encode the representations into the feature database AFG. Finally, we model adversarial examples recognition as a multi-label classification or prediction problem by training a CNN for recognizing adversarial examples and original examples on the AFG. Experiments show that the proposed framework can not only effectively identify the adversarial examples in the defense process, but also correctly classify adversarial examples with mean accuracy up to 63\%. Our framework potentially gives a new perspective, i.e. data-driven way, to adversarial examples defense. We believe that adversarial examples defense research may benefit from a large scale AFG database which is similar to ImageNet. The database and source code can be visited at https://github.com/lehaifeng/Adversarial_Feature_Genome.

* 12 pages, 13 figures

View paper on

Share this with someone who'll enjoy it:

Title:Adversarial Feature Genome: a Data Driven Adversarial Examples Recognition Method

Paper and Code