Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Damon Woodard

Is the Digital Forensics and Incident Response Pipeline Ready for Text-Based Threats in LLM Era?

Jul 25, 2024

Avanti Bhandarkar, Ronald Wilson, Anushka Swarup, Mengdi Zhu, Damon Woodard

Figure 1 for Is the Digital Forensics and Incident Response Pipeline Ready for Text-Based Threats in LLM Era?

Figure 2 for Is the Digital Forensics and Incident Response Pipeline Ready for Text-Based Threats in LLM Era?

Figure 3 for Is the Digital Forensics and Incident Response Pipeline Ready for Text-Based Threats in LLM Era?

Figure 4 for Is the Digital Forensics and Incident Response Pipeline Ready for Text-Based Threats in LLM Era?

Abstract:In the era of generative AI, the widespread adoption of Neural Text Generators (NTGs) presents new cybersecurity challenges, particularly within the realms of Digital Forensics and Incident Response (DFIR). These challenges primarily involve the detection and attribution of sources behind advanced attacks like spearphishing and disinformation campaigns. As NTGs evolve, the task of distinguishing between human and NTG-authored texts becomes critically complex. This paper rigorously evaluates the DFIR pipeline tailored for text-based security systems, specifically focusing on the challenges of detecting and attributing authorship of NTG-authored texts. By introducing a novel human-NTG co-authorship text attack, termed CS-ACT, our study uncovers significant vulnerabilities in traditional DFIR methodologies, highlighting discrepancies between ideal scenarios and real-world conditions. Utilizing 14 diverse datasets and 43 unique NTGs, up to the latest GPT-4, our research identifies substantial vulnerabilities in the forensic profiling phase, particularly in attributing authorship to NTGs. Our comprehensive evaluation points to factors such as model sophistication and the lack of distinctive style within NTGs as significant contributors for these vulnerabilities. Our findings underscore the necessity for more sophisticated and adaptable strategies, such as incorporating adversarial learning, stylizing NTGs, and implementing hierarchical attribution through the mapping of NTG lineages to enhance source attribution. This sets the stage for future research and the development of more resilient text-based security systems.

* This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

Via

Access Paper or Ask Questions

TraSE: Towards Tackling Authorial Style from a Cognitive Science Perspective

Jun 21, 2022

Ronald Wilson, Avanti Bhandarkar, Damon Woodard

Figure 1 for TraSE: Towards Tackling Authorial Style from a Cognitive Science Perspective

Figure 2 for TraSE: Towards Tackling Authorial Style from a Cognitive Science Perspective

Figure 3 for TraSE: Towards Tackling Authorial Style from a Cognitive Science Perspective

Figure 4 for TraSE: Towards Tackling Authorial Style from a Cognitive Science Perspective

Abstract:Stylistic analysis of text is a key task in research areas ranging from authorship attribution to forensic analysis and personality profiling. The existing approaches for stylistic analysis are plagued by issues like topic influence, lack of discriminability for large number of authors and the requirement for large amounts of diverse data. In this paper, the source of these issues are identified along with the necessity for a cognitive perspective on authorial style in addressing them. A novel feature representation, called Trajectory-based Style Estimation (TraSE), is introduced to support this purpose. Authorship attribution experiments with over 27,000 authors and 1.4 million samples in a cross-domain scenario resulted in 90% attribution accuracy suggesting that the feature representation is immune to such negative influences and an excellent candidate for stylistic analysis. Finally, a qualitative analysis is performed on TraSE using physical human characteristics, like age, to validate its claim on capturing cognitive traits.

Via

Access Paper or Ask Questions