Abstract: Adversarial examples are imperceptible perturbations to the input of a neural model that cause misclassification. Generating adversarial examples for source code poses an additional challenge compared to the image and natural-language domains, because source code perturbations must adhere to strict semantic constraints so that the perturbed programs retain the functional meaning of the original code. We propose a simple and efficient black-box method for generating state-of-the-art adversarial examples on models of code. Our method generates both untargeted and targeted attacks, and empirically outperforms competing gradient-based methods while using less information and less computational effort. We also use adversarial training to construct a model robust to these attacks: while our attack reduces the F1 score of code2seq by 42%, adversarial training brings the F1 score on adversarial examples back up to 99% of the baseline.