Shixin Tian

Purifying Adversarial Perturbation with Adversarially Trained Auto-encoders

May 26, 2019

Hebi Li, Qi Xiao, Shixin Tian, Jin Tian

Figures 1-4 for Purifying Adversarial Perturbation with Adversarially Trained Auto-encoders

Abstract: Machine learning models are vulnerable to adversarial examples. Iterative adversarial training has shown promising results against strong white-box attacks. However, adversarial training is very expensive, and this costly training scheme must be repeated for every model that needs protection. In this paper, we propose applying an iterative adversarial training scheme to an external auto-encoder, which, once trained, can be used to protect other models directly. We empirically show that our model outperforms other purifying-based methods against white-box attacks and transfers well to directly protect other base models with different architectures.
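The idea in the abstract, training a purifier adversarially in front of a frozen base model, can be illustrated with a toy sketch. Everything concrete here is an assumption made for illustration and is not taken from the paper: the "auto-encoder" is a hypothetical linear map `A`, the base model is a fixed logistic-regression classifier, the attack is L-infinity projected gradient ascent (a common choice for iterative attacks), and the purifier is trained on the classifier's cross-entropy loss. The names `purify`, `classify`, `pgd`, and `train_purifier` are likewise hypothetical.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def purify(A, x):
    """Hypothetical linear stand-in for the auto-encoder: x' = A x."""
    return [sum(a * xi for a, xi in zip(row, x)) for row in A]

def classify(w, b, x):
    """Frozen base classifier (logistic regression); returns P(y = 1)."""
    return sigmoid(sum(wi * xi for wi, xi in zip(w, x)) + b)

def grad_x(A, w, b, x, y):
    """Gradient of cross-entropy w.r.t. the raw input: (p - y) * A^T w."""
    p = classify(w, b, purify(A, x))
    d = len(x)
    return [(p - y) * sum(A[i][j] * w[i] for i in range(d)) for j in range(d)]

def pgd(A, w, b, x, y, eps, step, iters):
    """Iterative white-box attack on the purifier + classifier pipeline."""
    x_adv = list(x)
    for _ in range(iters):
        g = grad_x(A, w, b, x_adv, y)
        # Ascend the loss along the sign of the gradient.
        x_adv = [xa + step * ((gi > 0) - (gi < 0)) for xa, gi in zip(x_adv, g)]
        # Project back into the L-infinity ball of radius eps around x.
        x_adv = [min(max(xa, xi - eps), xi + eps) for xa, xi in zip(x_adv, x)]
    return x_adv

def train_purifier(A, w, b, data, eps, lr, epochs):
    """Adversarially train A in place; the classifier (w, b) is never updated."""
    d = len(w)
    for _ in range(epochs):
        for x, y in data:
            x_adv = pgd(A, w, b, x, y, eps, eps / 4, 8)
            p = classify(w, b, purify(A, x_adv))
            # dL/dA[i][j] = (p - y) * w[i] * x_adv[j]
            for i in range(d):
                for j in range(d):
                    A[i][j] -= lr * (p - y) * w[i] * x_adv[j]
    return A

# Toy setup (all values are illustrative assumptions).
w, b = [1.0, 5.0], 0.0                       # classifier leans on feature 2
data = [([1.0, 0.1], 1), ([-1.0, -0.1], 0)]  # two labeled points
identity = [[1.0, 0.0], [0.0, 1.0]]          # "no purifier" baseline
trained = train_purifier([row[:] for row in identity], w, b, data,
                         eps=0.3, lr=0.05, epochs=200)
```

Calling `pgd(identity, ...)` attacks the unprotected classifier, while `pgd(trained, ...)` attacks the full pipeline with the trained purifier in place, which mirrors the white-box setting the abstract describes. Because only `A` is updated, the same training loop could in principle be pointed at a different frozen classifier, though this sketch does not demonstrate the cross-architecture transfer claimed in the abstract.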
