Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Lukas Bajer

Malicious Internet Entity Detection Using Local Graph Inference

Aug 07, 2024

Simon Mandlik, Tomas Pevny, Vaclav Smidl, Lukas Bajer

Abstract:Detection of malicious behavior in a large network is a challenging problem for machine learning in computer security, since it requires a model with high expressive power and scalable inference. Existing solutions struggle to achieve this feat -- current cybersec-tailored approaches are still limited in expressivity, and methods successful in other domains do not scale well for large volumes of data, rendering frequent retraining impossible. This work proposes a new perspective for learning from graph data that is modeling network entity interactions as a large heterogeneous graph. High expressivity of the method is achieved with neural network architecture HMILnet that naturally models this type of data and provides theoretical guarantees. The scalability is achieved by pursuing local graph inference, i.e., classifying individual vertices and their neighborhood as independent samples. Our experiments exhibit improvement over the state-of-the-art Probabilistic Threat Propagation (PTP) algorithm, show a further threefold accuracy improvement when additional data is used, which is not possible with the PTP algorithm, and demonstrate the generalization capabilities of the method to new, previously unseen entities.

* A preprint. Full publication: https://ieeexplore.ieee.org/document/10418120

Via

Access Paper or Ask Questions

Learning Explainable Representations of Malware Behavior

Jun 23, 2021

Paul Prasse, Jan Brabec, Jan Kohout, Martin Kopp, Lukas Bajer, Tobias Scheffer

Figure 1 for Learning Explainable Representations of Malware Behavior

Figure 2 for Learning Explainable Representations of Malware Behavior

Figure 3 for Learning Explainable Representations of Malware Behavior

Figure 4 for Learning Explainable Representations of Malware Behavior

Abstract:We address the problems of identifying malware in network telemetry logs and providing \emph{indicators of compromise} -- comprehensible explanations of behavioral patterns that identify the threat. In our system, an array of specialized detectors abstracts network-flow data into comprehensible \emph{network events} in a first step. We develop a neural network that processes this sequence of events and identifies specific threats, malware families and broad categories of malware. We then use the \emph{integrated-gradients} method to highlight events that jointly constitute the characteristic behavioral pattern of the threat. We compare network architectures based on CNNs, LSTMs, and transformers, and explore the efficacy of unsupervised pre-training experimentally on large-scale telemetry data. We demonstrate how this system detects njRAT and other malware based on behavioral patterns.

* This is a pre-print of an article to appear in Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2021

Via

Access Paper or Ask Questions

Adaptive Generation-Based Evolution Control for Gaussian Process Surrogate Models

Sep 29, 2017

Jakub Repicky, Lukas Bajer, Zbynek Pitra, Martin Holena

Figure 1 for Adaptive Generation-Based Evolution Control for Gaussian Process Surrogate Models

Figure 2 for Adaptive Generation-Based Evolution Control for Gaussian Process Surrogate Models

Figure 3 for Adaptive Generation-Based Evolution Control for Gaussian Process Surrogate Models

Figure 4 for Adaptive Generation-Based Evolution Control for Gaussian Process Surrogate Models

Abstract:The interest in accelerating black-box optimizers has resulted in several surrogate model-assisted version of the Covariance Matrix Adaptation Evolution Strategy, a state-of-the-art continuous black-box optimizer. The version called Surrogate CMA-ES uses Gaussian processes or random forests surrogate models with a generation-based evolution control. This paper presents an adaptive improvement for S-CMA-ES based on a general procedure introduced with the s*ACM-ES algorithm, in which the number of generations using the surrogate model before retraining is adjusted depending on the performance of the last instance of the surrogate. Three algorithms that differ in the measure of the surrogate model's performance are evaluated on the COCO/BBOB framework. The results show a minor improvement on S-CMA-ES with constant model lifelengths, especially when larger lifelengths are considered.

* Updated version. The original article appeared in Proceedings of the 17th ITAT Conference Martin, September 22--26, 2017, published by CreateSpace Independent Publ. Platform and online by CEUR Workshop Proceedings. Abstract: Added credits to the s*ACM-ES algorithm. Section 1: Added references and clarified the motivation. Section 3: Added references

Via

Access Paper or Ask Questions

Model Guided Sampling Optimization for Low-dimensional Problems

Aug 31, 2015

Lukas Bajer, Martin Holena

Figure 1 for Model Guided Sampling Optimization for Low-dimensional Problems

Figure 2 for Model Guided Sampling Optimization for Low-dimensional Problems

Abstract:Optimization of very expensive black-box functions requires utilization of maximum information gathered by the process of optimization. Model Guided Sampling Optimization (MGSO) forms a more robust alternative to Jones' Gaussian-process-based EGO algorithm. Instead of EGO's maximizing expected improvement, the MGSO uses sampling the probability of improvement which is shown to be helpful against trapping in local minima. Further, the MGSO can reach close-to-optimum solutions faster than standard optimization algorithms on low dimensional or smooth problems.

* Bajer, L. & Holena, M. Model Guided Sampling Optimization for Low-dimensional Problems. in ICAART 2015 Proceedings of the International Conference on Agents and Artificial Intelligence, Volume 2 451-456 (SCITEPRESS, Lisbon, 2015)

Via

Access Paper or Ask Questions