Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Peter Bajcsy

National Center for Supercomputing Applications, University of Illinois at Urbana-Champaign

Interactive Simulations of Backdoors in Neural Networks

May 21, 2024

Peter Bajcsy, Maxime Bros

Figure 1 for Interactive Simulations of Backdoors in Neural Networks

Figure 2 for Interactive Simulations of Backdoors in Neural Networks

Figure 3 for Interactive Simulations of Backdoors in Neural Networks

Figure 4 for Interactive Simulations of Backdoors in Neural Networks

Abstract:This work addresses the problem of planting and defending cryptographic-based backdoors in artificial intelligence (AI) models. The motivation comes from our lack of understanding and the implications of using cryptographic techniques for planting undetectable backdoors under theoretical assumptions in the large AI model systems deployed in practice. Our approach is based on designing a web-based simulation playground that enables planting, activating, and defending cryptographic backdoors in neural networks (NN). Simulations of planting and activating backdoors are enabled for two scenarios: in the extension of NN model architecture to support digital signature verification and in the modified architectural block for non-linear operators. Simulations of backdoor defense against backdoors are available based on proximity analysis and provide a playground for a game of planting and defending against backdoors. The simulations are available at https://pages.nist.gov/nn-calculator

* 13 pages, 7 figures, 1 Table

Via

Access Paper or Ask Questions

AI Model Utilization Measurements For Finding Class Encoding Patterns

Dec 12, 2022

Peter Bajcsy, Antonio Cardone, Chenyi Ling, Philippe Dessauw, Michael Majurski, Tim Blattner, Derek Juba, Walid Keyrouz

Figure 1 for AI Model Utilization Measurements For Finding Class Encoding Patterns

Figure 2 for AI Model Utilization Measurements For Finding Class Encoding Patterns

Figure 3 for AI Model Utilization Measurements For Finding Class Encoding Patterns

Figure 4 for AI Model Utilization Measurements For Finding Class Encoding Patterns

Abstract:This work addresses the problems of (a) designing utilization measurements of trained artificial intelligence (AI) models and (b) explaining how training data are encoded in AI models based on those measurements. The problems are motivated by the lack of explainability of AI models in security and safety critical applications, such as the use of AI models for classification of traffic signs in self-driving cars. We approach the problems by introducing theoretical underpinnings of AI model utilization measurement and understanding patterns in utilization-based class encodings of traffic signs at the level of computation graphs (AI models), subgraphs, and graph nodes. Conceptually, utilization is defined at each graph node (computation unit) of an AI model based on the number and distribution of unique outputs in the space of all possible outputs (tensor-states). In this work, utilization measurements are extracted from AI models, which include poisoned and clean AI models. In contrast to clean AI models, the poisoned AI models were trained with traffic sign images containing systematic, physically realizable, traffic sign modifications (i.e., triggers) to change a correct class label to another label in a presence of such a trigger. We analyze class encodings of such clean and poisoned AI models, and conclude with implications for trojan injection and detection.

* 45 pages, 29 figures, 7 tables

Via

Access Paper or Ask Questions

Baseline Pruning-Based Approach to Trojan Detection in Neural Networks

Feb 09, 2021

Peter Bajcsy, Michael Majurski

Figure 1 for Baseline Pruning-Based Approach to Trojan Detection in Neural Networks

Figure 2 for Baseline Pruning-Based Approach to Trojan Detection in Neural Networks

Figure 3 for Baseline Pruning-Based Approach to Trojan Detection in Neural Networks

Figure 4 for Baseline Pruning-Based Approach to Trojan Detection in Neural Networks

Abstract:This paper addresses the problem of detecting trojans in neural networks (NNs) by analyzing systematically pruned NN models. Our pruning-based approach consists of three main steps. First, detect any deviations from the reference look-up tables of model file sizes and model graphs. Next, measure the accuracy of a set of systematically pruned NN models following multiple pruning schemas. Finally, classify a NN model as clean or poisoned by applying a mapping between accuracy measurements and NN model labels. This work outlines a theoretical and experimental framework for finding the optimal mapping over a large search space of pruning parameters. Based on our experiments using Round 1 and Round 2 TrojAI Challenge datasets, the approach achieves average classification accuracy of 69.73 % and 82.41% respectively with an average processing time of less than 60 s per model. For both datasets random guessing would produce 50% classification accuracy. Reference model graphs and source code are available from GitHub.

* The funding for all authors was provided by IARPA: IARPA-20001-D2020-2007180011

Via

Access Paper or Ask Questions

Neural Network Calculator for Designing Trojan Detectors

Jun 05, 2020

Peter Bajcsy, Nicholas J. Schaub, Michael Majurski

Figure 1 for Neural Network Calculator for Designing Trojan Detectors

Figure 2 for Neural Network Calculator for Designing Trojan Detectors

Figure 3 for Neural Network Calculator for Designing Trojan Detectors

Figure 4 for Neural Network Calculator for Designing Trojan Detectors

Abstract:This work presents a web-based interactive neural network (NN) calculator and a NN inefficiency measurement that has been investigated for the purpose of detecting trojans embedded in NN models. This NN Calculator is designed on top of TensorFlow Playground with in-memory storage of data and NN coefficients. Its been extended with additional analytical, visualization, and output operations performed on training datasets and NN architectures. The analytical capabilities include a novel measurement of NN inefficiency using modified Kullback-Liebler (KL) divergence applied to histograms of NN model states, as well as a quantification of the sensitivity to variables related to data and NNs. Both NN Calculator and KL divergence are used to devise a trojan detector approach for a variety of trojan embeddings. Experimental results document desirable properties of the KL divergence measurement with respect to NN architectures and dataset perturbations, as well as inferences about embedded trojans.

* 12 pages of main text plus 7 pages of appendices

Via

Access Paper or Ask Questions

Embedding Data within Knowledge Spaces

Feb 04, 2009

James D. Myers, Joe Futrelle, Jeff Gaynor, Joel Plutchak, Peter Bajcsy, Jason Kastner, Kailash Kotwani, Jong Sung Lee, Luigi Marini, Rob Kooper(+4 more)

Figure 1 for Embedding Data within Knowledge Spaces

Abstract:The promise of e-Science will only be realized when data is discoverable, accessible, and comprehensible within distributed teams, across disciplines, and over the long-term--without reliance on out-of-band (non-digital) means. We have developed the open-source Tupelo semantic content management framework and are employing it to manage a wide range of e-Science entities (including data, documents, workflows, people, and projects) and a broad range of metadata (including provenance, social networks, geospatial relationships, temporal relations, and domain descriptions). Tupelo couples the use of global identifiers and resource description framework (RDF) statements with an aggregatable content repository model to provide a unified space for securely managing distributed heterogeneous content and relationships.

* 10 pages with 1 figure. Corrected incorrect transliteration in abstract

Via

Access Paper or Ask Questions