Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Robustness Verification for Attention Networks using Mixed Integer Programming

Feb 08, 2022

Hsuan-Cheng Liao, Chih-Hong Cheng, Maximilian Kneissl, Alois Knoll

Figure 1 for Robustness Verification for Attention Networks using Mixed Integer Programming

Figure 2 for Robustness Verification for Attention Networks using Mixed Integer Programming

Figure 3 for Robustness Verification for Attention Networks using Mixed Integer Programming

Figure 4 for Robustness Verification for Attention Networks using Mixed Integer Programming

Share this with someone who'll enjoy it:

Abstract:Attention networks such as transformers have been shown powerful in many applications ranging from natural language processing to object recognition. This paper further considers their robustness properties from both theoretical and empirical perspectives. Theoretically, we formulate a variant of attention networks containing linearized layer normalization and sparsemax activation, and reduce its robustness verification to a Mixed Integer Programming problem. Apart from a na\"ive encoding, we derive tight intervals from admissible perturbation regions and examine several heuristics to speed up the verification process. More specifically, we find a novel bounding technique for sparsemax activation, which is also applicable to softmax activation in general neural networks. Empirically, we evaluate our proposed techniques with a case study on lane departure warning and demonstrate a performance gain of approximately an order of magnitude. Furthermore, although attention networks typically deliver higher accuracy than general neural networks, contrasting its robustness against a similar-sized multi-layer perceptron surprisingly shows that they are not necessarily more robust.

* Submitted to IROS 2022

View paper on

Share this with someone who'll enjoy it:

Title:Robustness Verification for Attention Networks using Mixed Integer Programming

Paper and Code