Abstract: Cyber threats are constantly evolving. Extracting actionable insights from unstructured Cyber Threat Intelligence (CTI) data is essential to guide cybersecurity decisions. Increasingly, organizations like Microsoft, Trend Micro, and CrowdStrike are using generative AI to facilitate CTI extraction. This paper addresses the challenge of automating the extraction of actionable CTI using advancements in Large Language Models (LLMs) and Knowledge Graphs (KGs). We explore the application of state-of-the-art open-source LLMs, including the Llama 2 series, Mistral 7B Instruct, and Zephyr, for extracting meaningful triples from CTI texts. Our methodology evaluates techniques such as prompt engineering, the guidance framework, and fine-tuning to optimize information extraction and structuring. The extracted data is then utilized to construct a KG, offering a structured and queryable representation of threat intelligence. Experimental results demonstrate the effectiveness of our approach in extracting relevant information, with guidance and fine-tuning showing superior performance over prompt engineering. However, while our methods prove effective in small-scale tests, applying LLMs to large-scale data for KG construction and link prediction presents ongoing challenges.
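To make the extraction step concrete, the sketch below shows one way an open-source instruct-tuned model could be prompted to emit (subject, relation, object) triples from a CTI sentence, with the results loaded into a small graph. The model checkpoint, prompt wording, parsing logic, and use of networkx are illustrative assumptions; they do not reproduce the paper's actual prompt-engineering, guidance, or fine-tuning pipeline.

```python
# Illustrative sketch: prompt an instruct-tuned LLM for subject | relation | object
# triples from a CTI sentence and load them into a small knowledge graph.
# The checkpoint, prompt, and parsing below are assumptions, not the paper's pipeline.
import networkx as nx
from transformers import pipeline

generator = pipeline("text-generation", model="mistralai/Mistral-7B-Instruct-v0.2")

cti_text = "APT29 used spear-phishing emails to deliver the WellMess malware."
prompt = (
    "Extract cyber threat intelligence triples from the text below. "
    "Output one subject | relation | object triple per line.\n\n"
    f"Text: {cti_text}\nTriples:\n"
)

completion = generator(prompt, max_new_tokens=128, return_full_text=False)[0]["generated_text"]

kg = nx.MultiDiGraph()
for line in completion.splitlines():
    parts = [p.strip() for p in line.split("|")]
    if len(parts) == 3 and all(parts):
        subj, rel, obj = parts
        kg.add_edge(subj, obj, relation=rel)  # each extracted triple becomes a labeled edge

print(list(kg.edges(data=True)))
```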
Abstract: Large Language Models (LLMs) have demonstrated potential in cybersecurity applications, but issues such as hallucinations and a lack of truthfulness have lowered confidence in them. Existing benchmarks provide general evaluations but do not sufficiently address the practical and applied aspects of LLM performance in cybersecurity-specific tasks. To address this gap, we introduce SECURE (Security Extraction, Understanding & Reasoning Evaluation), a benchmark designed to assess LLM performance in realistic cybersecurity scenarios. SECURE includes six datasets focused on the Industrial Control System sector to evaluate knowledge extraction, understanding, and reasoning based on industry-standard sources. Our study evaluates seven state-of-the-art models on these tasks, providing insights into their strengths and weaknesses in cybersecurity contexts, and offers recommendations for improving LLM reliability as cyber advisory tools.
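As a rough illustration of the kind of evaluation such a benchmark performs, the snippet below scores a model's exact-match accuracy on a question file. The file name, JSON fields, and the ask_model placeholder are hypothetical; the abstract does not specify SECURE's data format or scoring protocol.

```python
# Rough sketch of a benchmark evaluation loop: ask a model each question and score
# exact-match accuracy. File name, JSON fields, and ask_model() are hypothetical
# placeholders, not SECURE's actual format or protocol.
import json

def ask_model(question: str, choices: list[str]) -> str:
    # Naive baseline that always returns the first choice; swap in a real LLM call.
    return choices[0]

def evaluate(path: str) -> float:
    with open(path) as f:
        examples = [json.loads(line) for line in f]  # one JSON object per line
    correct = sum(
        ask_model(ex["question"], ex["choices"]).strip().lower()
        == ex["answer"].strip().lower()
        for ex in examples
    )
    return correct / len(examples)

# Usage with a hypothetical dataset file:
# print(f"accuracy = {evaluate('secure_ics_extraction.jsonl'):.3f}")
```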
Abstract: Concept drift is a significant challenge for malware detection, as the performance of trained machine learning models degrades over time, rendering them impractical. While prior research in malware concept drift adaptation has primarily focused on active learning, which involves selecting representative samples to update the model, self-training has emerged as a promising approach to mitigate concept drift. Self-training involves retraining the model using pseudo labels to adapt to shifting data distributions. In this research, we propose MORPH, an effective pseudo-label-based concept drift adaptation method specifically designed for neural networks. Through extensive experimental analysis of Android and Windows malware datasets, we demonstrate the efficacy of our approach in mitigating the impact of concept drift. Our method offers the advantage of reducing annotation effort when combined with active learning. Furthermore, our method significantly improves on existing work in automated concept drift adaptation for malware detection.
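A minimal sketch of the underlying self-training idea, assuming a simple confidence-threshold selection rule: pseudo-label an incoming unlabeled data window with the current model, keep only confident predictions, and retrain on the augmented set. This is generic self-training for illustration, not MORPH's specific selection or update procedure, and the threshold value is an assumed hyperparameter.

```python
# Generic self-training sketch for drift adaptation (not MORPH's exact algorithm):
# pseudo-label new unlabeled samples, keep high-confidence predictions, retrain.
import numpy as np
from sklearn.neural_network import MLPClassifier

def self_train_step(model, X_labeled, y_labeled, X_unlabeled, threshold=0.9):
    proba = model.predict_proba(X_unlabeled)
    confidence = proba.max(axis=1)
    pseudo_labels = model.classes_[proba.argmax(axis=1)]
    keep = confidence >= threshold                      # trust only confident predictions
    X_new = np.vstack([X_labeled, X_unlabeled[keep]])
    y_new = np.concatenate([y_labeled, pseudo_labels[keep]])
    model.fit(X_new, y_new)                             # retrain on labeled + pseudo-labeled data
    return model

# Toy usage with random features standing in for malware feature vectors.
rng = np.random.default_rng(0)
X_lab, y_lab = rng.normal(size=(200, 16)), rng.integers(0, 2, 200)
X_unlab = rng.normal(size=(500, 16))                    # a later, unlabeled data window
clf = MLPClassifier(hidden_layer_sizes=(32,), max_iter=300).fit(X_lab, y_lab)
clf = self_train_step(clf, X_lab, y_lab, X_unlab)
```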