John R. Goodall

Forming IDEAS Interactive Data Exploration & Analysis System

Jun 20, 2018

Robert A. Bridges, Maria A. Vincent, Kelly M. T. Huffer, John R. Goodall, Jessie D. Jamieson, Zachary Burch

Abstract: Modern cyber security operations collect an enormous amount of logging and alerting data. While analysts have the ability to query and compute simple statistics and plots from their data, current analytical tools are too simple to admit deep understanding. To detect advanced and novel attacks, analysts turn to manual investigations. While commonplace, current investigations are time-consuming and intuition-based, and they are proving insufficient. Our hypothesis is that arming the analyst with easy-to-use data science tools will increase their work efficiency, provide them with the ability to resolve hypotheses with scientific inquiry of their data, and support their decisions with evidence over intuition. To this end, we present our work to build IDEAS (Interactive Data Exploration and Analysis System). We present three real-world use-cases that drive the system design from the algorithmic capabilities to the user interface. Finally, a modular and scalable software architecture is discussed along with plans for our pilot deployment with a security operation command.

* Workshop on Information Security Workers, USENIX SOUPS 2018
* 4 page short paper on IDEAS System, 4 figures

GraphPrints: Towards a Graph Analytic Method for Network Anomaly Detection

Feb 02, 2016

Christopher R. Harshaw, Robert A. Bridges, Michael D. Iannacone, Joel W. Reed, John R. Goodall

Abstract: This paper introduces a novel graph-analytic approach for detecting anomalies in network flow data called GraphPrints. Building on foundational network-mining techniques, our method represents time slices of traffic as a graph, then counts graphlets -- small induced subgraphs that describe local topology. By performing outlier detection on the sequence of graphlet counts, anomalous intervals of traffic are identified, and furthermore, individual IPs experiencing abnormal behavior are singled out. Initial testing of GraphPrints is performed on real network data with an implanted anomaly. Evaluation shows false positive rates bounded by 2.84% at the time-interval level and 0.05% at the IP level, with 100% true positive rates at both.

* 4 pages submitted to Cyber & Information Security Research Conference 2016, ACM
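
The pipeline in the abstract above (graph per time slice, graphlet counts, outlier detection on the count sequence) can be sketched roughly as follows. This is a simplified illustration, not the authors' implementation: the abstract does not specify the graphlet set or the outlier detector, so this sketch assumes undirected 3-node graphlets only (open wedges and triangles) and swaps in a per-feature median/MAD score. The function names, the `threshold` value, and the toy host names are all hypothetical.

```python
from itertools import combinations
from statistics import median


def count_3node_graphlets(edges):
    """Count induced 3-node graphlets in an undirected graph.

    Returns (open_wedges, triangles). Assumption: only 3-node graphlets
    are counted here; the paper's graphlet set may differ.
    """
    adj = {}
    for u, v in edges:
        if u == v:
            continue  # ignore self-loops
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)

    open_wedges = 0
    triangle_corners = 0
    for nbrs in adj.values():
        for a, b in combinations(nbrs, 2):
            if b in adj[a]:
                triangle_corners += 1  # closed triple, seen once per corner
            else:
                open_wedges += 1       # open triple, seen only at its center
    return open_wedges, triangle_corners // 3


def robust_scores(vectors):
    """Score each vector by its largest per-feature deviation from the median,
    scaled by the median absolute deviation (MAD). This is a stand-in detector."""
    scores = [0.0] * len(vectors)
    for j in range(len(vectors[0])):
        column = [v[j] for v in vectors]
        med = median(column)
        mad = median([abs(x - med) for x in column]) or 1.0  # avoid dividing by 0
        for i, x in enumerate(column):
            scores[i] = max(scores[i], abs(x - med) / mad)
    return scores


def flag_anomalous_slices(slices, threshold=3.0):
    """Return indices of time slices whose graphlet counts look like outliers."""
    vectors = [count_3node_graphlets(edges) for edges in slices]
    return [i for i, s in enumerate(robust_scores(vectors)) if s > threshold]


# Toy data: four ordinary slices and one where host "x" contacts many hosts.
normal = [("a", "b"), ("b", "c"), ("c", "a"), ("c", "d")]
scan = normal + [("x", f"h{i}") for i in range(6)]
slices = [normal, normal, normal, scan, normal]
flagged = flag_anomalous_slices(slices)
print(flagged)  # [3]
```

In the toy data, the scanning host adds 15 open wedges to the fourth slice, so only that slice exceeds the threshold. Restricting the same counts to graphlets that contain a given node would be one way to move from the time-interval level to the IP level.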

Automatic Labeling for Entity Extraction in Cyber Security

Jun 09, 2014

Robert A. Bridges, Corinne L. Jones, Michael D. Iannacone, Kelly M. Testa, John R. Goodall

Abstract: Timely analysis of cyber-security information necessitates automated information extraction from unstructured text. While state-of-the-art extraction methods produce extremely accurate results, they require ample training data, which is generally unavailable for specialized applications, such as detecting security-related entities; moreover, manual annotation of corpora is very costly and often not a viable solution. In response, we develop a very precise method to automatically label text from several data sources by leveraging related, domain-specific, structured data and provide public access to a corpus annotated with cyber-security entities. Next, we implement a Maximum Entropy Model trained with the average perceptron on a portion of our corpus ($\sim$750,000 words) and achieve near-perfect precision, recall, and accuracy, with training times under 17 seconds.

* 10 pages
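
As a rough illustration of the training procedure named in the abstract above, here is a minimal sketch of a generic multiclass averaged perceptron over sparse string features. It is not the authors' model: the label set, the feature names, and the toy examples are hypothetical, and the averaging is done the simple way (summing the weights after every example) rather than with the lazy updates a larger corpus would call for.

```python
from collections import defaultdict


def predict(weights, features, labels):
    """Return the label with the highest summed feature weight.
    Ties go to the label listed first in `labels`."""
    return max(labels, key=lambda lab: sum(weights.get((f, lab), 0.0) for f in features))


def train_averaged_perceptron(examples, labels, epochs=5):
    """Train a perceptron and return the weights averaged over all steps."""
    weights = defaultdict(float)  # (feature, label) -> current weight
    totals = defaultdict(float)   # (feature, label) -> sum of weight over steps
    steps = 0
    for _ in range(epochs):
        for features, truth in examples:
            guess = predict(weights, features, labels)
            if guess != truth:
                # Standard perceptron update: reward the true label,
                # penalize the wrongly predicted one.
                for f in features:
                    weights[(f, truth)] += 1.0
                    weights[(f, guess)] -= 1.0
            # Accumulate the current weights so they can be averaged at the end.
            for key, w in weights.items():
                totals[key] += w
            steps += 1
    return {key: total / steps for key, total in totals.items()}


# Hypothetical labels and token features, for illustration only.
LABELS = ["O", "VENDOR", "PRODUCT"]
EXAMPLES = [
    (["word=adobe", "is_capitalized"], "VENDOR"),
    (["word=reader", "prev=adobe"], "PRODUCT"),
    (["word=the", "lowercase"], "O"),
    (["word=microsoft", "is_capitalized"], "VENDOR"),
    (["word=in", "lowercase"], "O"),
]

model = train_averaged_perceptron(EXAMPLES, LABELS)
print(predict(model, ["word=oracle", "is_capitalized"], LABELS))  # VENDOR
```

Averaging the weights over all steps, instead of keeping only the final weights, damps the effect of late updates, which is the usual reason to prefer the averaged variant.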

PACE: Pattern Accurate Computationally Efficient Bootstrapping for Timely Discovery of Cyber-Security Concepts

Oct 11, 2013

Nikki McNeil, Robert A. Bridges, Michael D. Iannacone, Bogdan Czejdo, Nicolas Perez, John R. Goodall

Abstract: Public disclosure of important security information, such as knowledge of vulnerabilities or exploits, often occurs in blogs, tweets, mailing lists, and other online sources months before proper classification into structured databases. In order to facilitate timely discovery of such knowledge, we propose a novel semi-supervised learning algorithm, PACE, for identifying and classifying relevant entities in text sources. The main contribution of this paper is an enhancement of the traditional bootstrapping method for entity extraction by employing a time-memory trade-off that simultaneously circumvents a costly corpus search while strengthening pattern nomination, which should increase accuracy. An implementation in the cyber-security domain is discussed as well as challenges to Natural Language Processing imposed by the security domain.

* 6 pages, 3 figures, IEEEtran conference format. International Conference on Machine Learning and Applications 2013
