Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Srikanth Krishnamurthy

Blackbox Attacks via Surrogate Ensemble Search

Aug 07, 2022

Zikui Cai, Chengyu Song, Srikanth Krishnamurthy, Amit Roy-Chowdhury, M. Salman Asif

Figure 1 for Blackbox Attacks via Surrogate Ensemble Search

Figure 2 for Blackbox Attacks via Surrogate Ensemble Search

Figure 3 for Blackbox Attacks via Surrogate Ensemble Search

Figure 4 for Blackbox Attacks via Surrogate Ensemble Search

Abstract:Blackbox adversarial attacks can be categorized into transfer- and query-based attacks. Transfer methods do not require any feedback from the victim model, but provide lower success rates compared to query-based methods. Query attacks often require a large number of queries for success. To achieve the best of both approaches, recent efforts have tried to combine them, but still require hundreds of queries to achieve high success rates (especially for targeted attacks). In this paper, we propose a novel method for blackbox attacks via surrogate ensemble search (BASES) that can generate highly successful blackbox attacks using an extremely small number of queries. We first define a perturbation machine that generates a perturbed image by minimizing a weighted loss function over a fixed set of surrogate models. To generate an attack for a given victim model, we search over the weights in the loss function using queries generated by the perturbation machine. Since the dimension of the search space is small (same as the number of surrogate models), the search requires a small number of queries. We demonstrate that our proposed method achieves better success rate with at least 30x fewer queries compared to state-of-the-art methods on different image classifiers trained with ImageNet (including VGG-19, DenseNet-121, and ResNext-50). In particular, our method requires as few as 3 queries per image (on average) to achieve more than a 90% success rate for targeted attacks and 1-2 queries per image for over a 99% success rate for non-targeted attacks. Our method is also effective on Google Cloud Vision API and achieved a 91% non-targeted attack success rate with 2.9 queries per image. We also show that the perturbations generated by our proposed method are highly transferable and can be adopted for hard-label blackbox attacks.

Via

Access Paper or Ask Questions

Adversarial Attacks on Black Box Video Classifiers: Leveraging the Power of Geometric Transformations

Oct 05, 2021

Shasha Li, Abhishek Aich, Shitong Zhu, M. Salman Asif, Chengyu Song, Amit K. Roy-Chowdhury, Srikanth Krishnamurthy

Figure 1 for Adversarial Attacks on Black Box Video Classifiers: Leveraging the Power of Geometric Transformations

Figure 2 for Adversarial Attacks on Black Box Video Classifiers: Leveraging the Power of Geometric Transformations

Figure 3 for Adversarial Attacks on Black Box Video Classifiers: Leveraging the Power of Geometric Transformations

Figure 4 for Adversarial Attacks on Black Box Video Classifiers: Leveraging the Power of Geometric Transformations

Abstract:When compared to the image classification models, black-box adversarial attacks against video classification models have been largely understudied. This could be possible because, with video, the temporal dimension poses significant additional challenges in gradient estimation. Query-efficient black-box attacks rely on effectively estimated gradients towards maximizing the probability of misclassifying the target video. In this work, we demonstrate that such effective gradients can be searched for by parameterizing the temporal structure of the search space with geometric transformations. Specifically, we design a novel iterative algorithm Geometric TRAnsformed Perturbations (GEO-TRAP), for attacking video classification models. GEO-TRAP employs standard geometric transformation operations to reduce the search space for effective gradients into searching for a small group of parameters that define these operations. This group of parameters describes the geometric progression of gradients, resulting in a reduced and structured search space. Our algorithm inherently leads to successful perturbations with surprisingly few queries. For example, adversarial examples generated from GEO-TRAP have better attack success rates with ~73.55% fewer queries compared to the state-of-the-art method for video adversarial attacks on the widely used Jester dataset. Overall, our algorithm exposes vulnerabilities of diverse video classification models and achieves new state-of-the-art results under black-box settings on two large datasets.

* Accepted at NeurIPS 2021; First two authors contributed equally; includes Supplementary Material

Via

Access Paper or Ask Questions

Connecting the Dots: Detecting Adversarial Perturbations Using Context Inconsistency

Jul 24, 2020

Shasha Li, Shitong Zhu, Sudipta Paul, Amit Roy-Chowdhury, Chengyu Song, Srikanth Krishnamurthy, Ananthram Swami, Kevin S Chan

Figure 1 for Connecting the Dots: Detecting Adversarial Perturbations Using Context Inconsistency

Figure 2 for Connecting the Dots: Detecting Adversarial Perturbations Using Context Inconsistency

Figure 3 for Connecting the Dots: Detecting Adversarial Perturbations Using Context Inconsistency

Figure 4 for Connecting the Dots: Detecting Adversarial Perturbations Using Context Inconsistency

Abstract:There has been a recent surge in research on adversarial perturbations that defeat Deep Neural Networks (DNNs) in machine vision; most of these perturbation-based attacks target object classifiers. Inspired by the observation that humans are able to recognize objects that appear out of place in a scene or along with other unlikely objects, we augment the DNN with a system that learns context consistency rules during training and checks for the violations of the same during testing. Our approach builds a set of auto-encoders, one for each object class, appropriately trained so as to output a discrepancy between the input and output if an added adversarial perturbation violates context consistency rules. Experiments on PASCAL VOC and MS COCO show that our method effectively detects various adversarial attacks and achieves high ROC-AUC (over 0.95 in most cases); this corresponds to over 20% improvement over a state-of-the-art context-agnostic method.

* The paper is accepted by ECCV 2020

Via

Access Paper or Ask Questions