Adversarial training is one of the most effective defenses against adversarial attacks. Previous works suggest that overfitting is a dominant phenomenon in adversarial training, leading to a large gap between training and test accuracy in neural networks. In this work, we show that the observed generalization gap is closely related to the choice of the activation function. In particular, we show that using activation functions with low (exact or approximate) curvature has a regularization effect that significantly reduces both the standard and robust generalization gaps in adversarial training. We observe this effect both for differentiable, smooth activations such as Swish and for non-differentiable, non-smooth activations such as LeakyReLU; in the latter case, it is the approximate curvature of the activation that is low. Finally, we show that the double descent phenomenon does not occur in adversarially trained models when the activation function has low curvature.
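As a concrete illustration of what "curvature" of an activation can mean, the sketch below estimates it as the maximum absolute second derivative over an input grid, using central finite differences. This is one common proxy and an assumption on our part, not necessarily the measure used in the paper; the helper names (`swish`, `leaky_relu`, `max_abs_second_derivative`), the grid range, and the step size are all illustrative choices.

```python
# Minimal sketch (illustrative, not the paper's definition): quantify the curvature
# of a smooth activation as max_x |f''(x)|, estimated with central finite
# differences on a grid. Grid range, resolution, and step size are assumptions.
import numpy as np

def swish(x, beta=1.0):
    """Swish / SiLU: x * sigmoid(beta * x); smooth, with a small bounded second derivative."""
    return x / (1.0 + np.exp(-beta * x))

def leaky_relu(x, alpha=0.01):
    """LeakyReLU: piecewise linear with a kink at 0, where f'' is not defined."""
    return np.where(x >= 0.0, x, alpha * x)

def max_abs_second_derivative(f, lo=-6.0, hi=6.0, n=20001, h=1e-3):
    """Estimate max |f''(x)| on [lo, hi] via a central second difference of step h.

    For smooth activations this converges to the true maximum curvature; for
    non-smooth activations it diverges like 1/h near the kink, which is why a
    separate *approximate* curvature notion is needed there (not reproduced here).
    """
    x = np.linspace(lo, hi, n)
    second = (f(x + h) - 2.0 * f(x) + f(x - h)) / (h * h)
    return float(np.max(np.abs(second)))

if __name__ == "__main__":
    print("Swish:     max |f''| ~", max_abs_second_derivative(swish))       # ~0.5, bounded
    print("LeakyReLU: max |f''| ~", max_abs_second_derivative(leaky_relu))  # ~(1 - alpha)/h, diverges as h -> 0
```

The contrast in the output is the point of the sketch: the naive second-derivative estimate is small and stable for Swish but blows up at the LeakyReLU kink, which is why non-smooth activations call for an approximate curvature measure rather than the exact one.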