Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Nelson Manohar-Alers

Using Anomaly Feature Vectors for Detecting, Classifying and Warning of Outlier Adversarial Examples

Jul 01, 2021

Nelson Manohar-Alers, Ryan Feng, Sahib Singh, Jiguo Song, Atul Prakash

Figure 1 for Using Anomaly Feature Vectors for Detecting, Classifying and Warning of Outlier Adversarial Examples

Figure 2 for Using Anomaly Feature Vectors for Detecting, Classifying and Warning of Outlier Adversarial Examples

Figure 3 for Using Anomaly Feature Vectors for Detecting, Classifying and Warning of Outlier Adversarial Examples

Figure 4 for Using Anomaly Feature Vectors for Detecting, Classifying and Warning of Outlier Adversarial Examples

Abstract:We present DeClaW, a system for detecting, classifying, and warning of adversarial inputs presented to a classification neural network. In contrast to current state-of-the-art methods that, given an input, detect whether an input is clean or adversarial, we aim to also identify the types of adversarial attack (e.g., PGD, Carlini-Wagner or clean). To achieve this, we extract statistical profiles, which we term as anomaly feature vectors, from a set of latent features. Preliminary findings suggest that AFVs can help distinguish among several types of adversarial attacks (e.g., PGD versus Carlini-Wagner) with close to 93% accuracy on the CIFAR-10 dataset. The results open the door to using AFV-based methods for exploring not only adversarial attack detection but also classification of the attack type and then design of attack-specific mitigation strategies.

* ICML 2021 workshop on A Blessing in Disguise: The Prospects and Perils of Adversarial Machine Learning

Via

Access Paper or Ask Questions