Abstract: Deep neural networks for image classification are well-known to be vulnerable to adversarial attacks. One such attack that has garnered recent attention is the adversarial backdoor attack, which has demonstrated the capability to perform targeted misclassification of specific examples. In particular, backdoor attacks attempt to force a model to learn spurious relations between backdoor trigger patterns and false labels. In response to this threat, numerous defensive measures have been proposed; however, existing defenses focus primarily on backdoor pattern detection, which may be unreliable against novel or unexpected pattern designs. We introduce a novel re-contextualization of the adversarial setting, where the presence of an adversary implicitly admits the existence of multiple database contributors. Then, under the mild assumption of contributor awareness, it becomes possible to exploit this knowledge to defend against backdoor attacks by destroying the false label associations. We propose a contributor-aware universal defensive framework for learning in the presence of multiple, potentially adversarial data sources, which utilizes semi-supervised ensembles and learning from crowds to filter the false labels produced by adversarial triggers. Importantly, this defensive strategy is agnostic to backdoor pattern design, as it functions without needing -- or even attempting -- to perform either adversary identification or backdoor pattern detection during either training or inference. Our empirical studies demonstrate the robustness of the proposed framework against adversarial backdoor attacks from multiple simultaneous adversaries.
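
The abstract describes the contributor-aware framework only at a high level; the minimal sketch below illustrates one way such a filter could be organized, by relabeling each contributor's partition with models trained on the remaining contributors so that no single contributor's trigger-label association survives. The partition-wise ensemble, the choice of RandomForestClassifier, and the helper name filter_labels are assumptions made for illustration, not the paper's implementation.

```python
# Illustrative sketch of contributor-aware label filtering (assumes integer
# class labels and at least two contributors).
import numpy as np
from sklearn.ensemble import RandomForestClassifier

def filter_labels(X, y, contributor_ids):
    """Relabel each contributor's partition using models trained on the
    remaining contributors, breaking any single contributor's
    trigger-to-false-label association."""
    contributors = np.unique(contributor_ids)
    models = {}
    for c in contributors:
        mask = contributor_ids == c
        models[c] = RandomForestClassifier(n_estimators=100).fit(X[mask], y[mask])

    y_filtered = y.copy()
    for c in contributors:
        mask = contributor_ids == c
        # Majority vote over the *other* contributors' models.
        votes = np.stack([models[o].predict(X[mask])
                          for o in contributors if o != c])
        y_filtered[mask] = np.apply_along_axis(
            lambda col: np.bincount(col).argmax(), 0, votes)
    return y_filtered
```

A model trained on the filtered labels never sees the adversary's intended trigger-label pairing, which is the sense in which the defense avoids detecting the backdoor pattern itself.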
Abstract: Most studies on learning from noisy labels rely on unrealistic models of i.i.d. label noise, such as class-conditional transition matrices. More recent work on instance-dependent noise models is more realistic, but assumes a single generative process for label noise across the entire dataset. We propose a more principled model of label noise that generalizes instance-dependent noise to multiple labelers, based on the observation that modern datasets are typically annotated using distributed crowdsourcing methods. Under our labeler-dependent model, label noise manifests itself under two modalities: natural error of good-faith labelers, and adversarial labels provided by malicious actors. We present two adversarial attack vectors that more accurately reflect the label noise that may be encountered in real-world settings, and demonstrate that under our multimodal noisy labels model, state-of-the-art approaches for learning from noisy labels are defeated by adversarial label attacks. Finally, we propose a multi-stage, labeler-aware, model-agnostic framework that reliably filters noisy labels by leveraging knowledge about which data partitions were labeled by which labeler, and show that our proposed framework remains robust even in the presence of extreme adversarial label noise.
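
As a hedged illustration of the two noise modalities described above, the sketch below generates labels from a good-faith labeler with instance-dependent error and from a malicious labeler who systematically pushes labels toward a target class. The specific generative choices, parameter names, and function names are assumptions for illustration, not the paper's labeler-dependent model.

```python
# Illustrative sketch of the two labeler modalities (assumes integer labels
# with at least two classes).
import numpy as np

rng = np.random.default_rng(0)

def good_faith_labels(y_true, error_prob_fn):
    """Good-faith labeler: error_prob_fn(i) gives the instance-dependent
    probability of mislabeling example i."""
    y = y_true.copy()
    n_classes = y_true.max() + 1
    for i in range(len(y)):
        if rng.random() < error_prob_fn(i):
            # Honest mistake: a uniformly random *incorrect* class.
            y[i] = (y[i] + rng.integers(1, n_classes)) % n_classes
    return y

def adversarial_labels(y_true, target_class, flip_rate):
    """Malicious labeler: relabels a fraction of examples to a target class."""
    y = y_true.copy()
    flip = rng.random(len(y)) < flip_rate
    y[flip] = target_class
    return y
```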
Abstract: As larger and more comprehensive datasets become standard in contemporary machine learning, it becomes increasingly difficult to obtain reliable, trustworthy label information with which to train sophisticated models. To address this problem, crowdsourcing has emerged as a popular, inexpensive, and efficient data mining solution for performing distributed label collection. However, crowdsourced annotations are inherently untrustworthy, as the labels are provided by anonymous volunteers who may have varying, unreliable expertise. Worse yet, some participants on commonly used platforms such as Amazon Mechanical Turk may be adversarial, and provide intentionally incorrect label information without the end user's knowledge. We discuss three conventional models of the label generation process, describing their parameterizations and the model-based approaches used to solve them. We then propose OpinionRank, a model-free, interpretable, graph-based spectral algorithm for integrating crowdsourced annotations into reliable labels for performing supervised or semi-supervised learning. Our experiments show that OpinionRank performs favorably when compared against more highly parameterized algorithms. We also show that OpinionRank is scalable to very large datasets and numbers of label sources, and requires considerably fewer computational resources than previous approaches.
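
To make the general idea concrete, here is a minimal sketch of a graph-based annotator ranking followed by a weighted vote: annotators are scored by a PageRank-style iteration over their pairwise agreement graph, and those scores weight their votes on each item. The agreement graph, the power iteration, and the function name rank_and_aggregate are illustrative assumptions and should not be read as the OpinionRank algorithm itself.

```python
# Sketch of graph-based annotator ranking + weighted voting (illustrative only).
import numpy as np

def rank_and_aggregate(annotations, n_classes, n_iter=50, damping=0.85):
    """annotations: (n_items, n_labelers) int array, -1 where a labeler skipped."""
    n_items, n_labelers = annotations.shape

    # Agreement graph: edge weight = empirical agreement rate between labelers.
    W = np.zeros((n_labelers, n_labelers))
    for a in range(n_labelers):
        for b in range(n_labelers):
            if a == b:
                continue
            both = (annotations[:, a] >= 0) & (annotations[:, b] >= 0)
            if both.any():
                W[a, b] = np.mean(annotations[both, a] == annotations[both, b])

    # PageRank-style power iteration: labelers trusted by trusted labelers rank high.
    P = W / np.maximum(W.sum(axis=1, keepdims=True), 1e-12)
    r = np.full(n_labelers, 1.0 / n_labelers)
    for _ in range(n_iter):
        r = (1 - damping) / n_labelers + damping * (P.T @ r)

    # Weighted vote per item using the labeler scores.
    labels = np.zeros(n_items, dtype=int)
    for i in range(n_items):
        scores = np.zeros(n_classes)
        for j in range(n_labelers):
            if annotations[i, j] >= 0:
                scores[annotations[i, j]] += r[j]
        labels[i] = scores.argmax()
    return labels, r
```

The appeal of this style of aggregation is that it has no generative-model parameters to fit; the only computation is building the agreement graph and iterating a ranking over it, which is why such methods scale well in the number of items and label sources.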
Abstract: Artificial neural networks are well-known to be susceptible to catastrophic forgetting when continually learning from sequences of tasks. Various continual (or "incremental") learning approaches have been proposed to avoid catastrophic forgetting, but they are typically adversary agnostic, i.e., they do not consider the possibility of a malicious attack. In this effort, we explore the vulnerability of Elastic Weight Consolidation (EWC), a popular continual learning algorithm for avoiding catastrophic forgetting. We show that an intelligent adversary can bypass EWC's defenses, and instead cause gradual and deliberate forgetting by introducing small amounts of misinformation to the model during training. We demonstrate such an adversary's ability to assume control of the model via injection of "backdoor" attack samples on both permuted and split benchmark variants of the MNIST dataset. Importantly, once the model has learned the adversarial misinformation, the adversary can then control the amount of forgetting of any task. Equivalently, the malicious actor can create a "false memory" about any task by inserting carefully designed backdoor samples into any fraction of the test instances of that task. Perhaps most damaging, we show this vulnerability to be very acute; neural network memory can be easily compromised with the addition of backdoor samples into as little as 1% of the training data of even a single task.
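
The sketch below illustrates the kind of poisoning the abstract describes: stamping a small trigger onto roughly 1% of a single task's training images and relabeling them with the attacker's target class, so that the same trigger applied at test time steers the continually trained model's predictions. The trigger shape and location, the poisoning rate default, and the helper name poison_task are assumptions for illustration, not the paper's exact settings.

```python
# Illustrative backdoor poisoning of one continual-learning task.
import numpy as np

def poison_task(images, labels, target_label, poison_frac=0.01, seed=0):
    """images: (n, 28, 28) MNIST-style array in [0, 1]; labels: (n,) ints."""
    rng = np.random.default_rng(seed)
    images, labels = images.copy(), labels.copy()
    idx = rng.choice(len(images),
                     size=max(1, int(poison_frac * len(images))),
                     replace=False)
    images[idx, -4:, -4:] = 1.0   # small white trigger patch in one corner
    labels[idx] = target_label    # the "false memory" the adversary controls
    return images, labels

# At inference, stamping the same trigger onto any fraction of a task's test
# instances lets the adversary dictate how much of that task appears "forgotten".
```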
Abstract: As the prevalence and everyday use of machine learning algorithms, along with our reliance on these algorithms, grow dramatically, so do efforts to attack and undermine these algorithms with malicious intent, resulting in a growing interest in adversarial machine learning. A number of approaches have been developed that can render a machine learning algorithm ineffective through poisoning or other types of attacks. Most attack algorithms use sophisticated optimization approaches, whose objective function is designed to cause maximum damage to the accuracy and performance of the algorithm on some task. In this effort, we show that while such an objective function is indeed brutally effective in causing maximum damage on an embedded feature selection task, it often results in an attack mechanism that can be easily detected with an embarrassingly simple novelty or outlier detection algorithm. We then propose an equally simple yet elegant solution by adding a regularization term to the attacker's objective function that penalizes outlying attack points.
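
One plausible form of such a regularized attacker objective is sketched below; the exact loss, distance measure, and trade-off parameter are not specified in the abstract, so the symbols here are illustrative assumptions.

```latex
% Illustrative regularized attacker objective (symbols are assumptions).
\[
  \max_{x_a} \;
  \underbrace{\mathcal{L}\bigl(\mathcal{D} \cup \{x_a\}\bigr)}_{\text{damage to the learner}}
  \;-\;
  \lambda \, \underbrace{d\bigl(x_a, \mathcal{D}\bigr)}_{\text{outlier penalty}}
\]
```

Here the first term measures the damage the attack point x_a inflicts on the model trained on the poisoned data, d measures how far x_a lies from the clean data D, and lambda trades raw damage against detectability by simple novelty or outlier detectors.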