Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Jiyong Jang

URET: Universal Robustness Evaluation Toolkit (for Evasion)

Aug 03, 2023

Kevin Eykholt, Taesung Lee, Douglas Schales, Jiyong Jang, Ian Molloy, Masha Zorin

Figure 1 for URET: Universal Robustness Evaluation Toolkit (for Evasion)

Figure 2 for URET: Universal Robustness Evaluation Toolkit (for Evasion)

Figure 3 for URET: Universal Robustness Evaluation Toolkit (for Evasion)

Figure 4 for URET: Universal Robustness Evaluation Toolkit (for Evasion)

Abstract:Machine learning models are known to be vulnerable to adversarial evasion attacks as illustrated by image classification models. Thoroughly understanding such attacks is critical in order to ensure the safety and robustness of critical AI tasks. However, most evasion attacks are difficult to deploy against a majority of AI systems because they have focused on image domain with only few constraints. An image is composed of homogeneous, numerical, continuous, and independent features, unlike many other input types to AI systems used in practice. Furthermore, some input types include additional semantic and functional constraints that must be observed to generate realistic adversarial inputs. In this work, we propose a new framework to enable the generation of adversarial inputs irrespective of the input type and task domain. Given an input and a set of pre-defined input transformations, our framework discovers a sequence of transformations that result in a semantically correct and functional adversarial input. We demonstrate the generality of our approach on several diverse machine learning tasks with various input representations. We also show the importance of generating adversarial examples as they enable the deployment of mitigation techniques.

* Accepted at USENIX '23

Via

Access Paper or Ask Questions

Evidential Cyber Threat Hunting

Apr 21, 2021

Frederico Araujo, Dhilung Kirat, Xiaokui Shu, Teryl Taylor, Jiyong Jang

Figure 1 for Evidential Cyber Threat Hunting

Figure 2 for Evidential Cyber Threat Hunting

Figure 3 for Evidential Cyber Threat Hunting

Figure 4 for Evidential Cyber Threat Hunting

Abstract:A formal cyber reasoning framework for automating the threat hunting process is described. The new cyber reasoning methodology introduces an operational semantics that operates over three subspaces -- knowledge, hypothesis, and action -- to enable human-machine co-creation of threat hypotheses and protective recommendations. An implementation of this framework shows that the approach is practical and can be used to generalize evidence-based multi-criteria threat investigations.

* In Proceedings of the 2021 SIAM AI/ML for Cybersecurity Workshop (AI4CS)
* 5 pages, SDM AI4CS 2021

Via

Access Paper or Ask Questions

Adaptive Verifiable Training Using Pairwise Class Similarity

Dec 14, 2020

Shiqi Wang, Kevin Eykholt, Taesung Lee, Jiyong Jang, Ian Molloy

Figure 1 for Adaptive Verifiable Training Using Pairwise Class Similarity

Figure 2 for Adaptive Verifiable Training Using Pairwise Class Similarity

Figure 3 for Adaptive Verifiable Training Using Pairwise Class Similarity

Figure 4 for Adaptive Verifiable Training Using Pairwise Class Similarity

Abstract:Verifiable training has shown success in creating neural networks that are provably robust to a given amount of noise. However, despite only enforcing a single robustness criterion, its performance scales poorly with dataset complexity. On CIFAR10, a non-robust LeNet model has a 21.63% error rate, while a model created using verifiable training and a L-infinity robustness criterion of 8/255, has an error rate of 57.10%. Upon examination, we find that when labeling visually similar classes, the model's error rate is as high as 61.65%. We attribute the loss in performance to inter-class similarity. Similar classes (i.e., close in the feature space) increase the difficulty of learning a robust model. While it's desirable to train a robust model for a large robustness region, pairwise class similarities limit the potential gains. Also, consideration must be made regarding the relative cost of mistaking similar classes. In security or safety critical tasks, similar classes are likely to belong to the same group, and thus are equally sensitive. In this work, we propose a new approach that utilizes inter-class similarity to improve the performance of verifiable training and create robust models with respect to multiple adversarial criteria. First, we use agglomerate clustering to group similar classes and assign robustness criteria based on the similarity between clusters. Next, we propose two methods to apply our approach: (1) Inter-Group Robustness Prioritization, which uses a custom loss term to create a single model with multiple robustness guarantees and (2) neural decision trees, which trains multiple sub-classifiers with different robustness guarantees and combines them in a decision tree architecture. On Fashion-MNIST and CIFAR10, our approach improves clean performance by 9.63% and 30.89% respectively. On CIFAR100, our approach improves clean performance by 26.32%.

* Acceped at AAAI21

Via

Access Paper or Ask Questions