Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Michin Hong

Exposing LLM Vulnerabilities: Adversarial Scam Detection and Performance

Dec 01, 2024

Chen-Wei Chang, Shailik Sarkar, Shutonu Mitra, Qi Zhang, Hossein Salemi, Hemant Purohit, Fengxiu Zhang, Michin Hong, Jin-Hee Cho, Chang-Tien Lu

Figure 1 for Exposing LLM Vulnerabilities: Adversarial Scam Detection and Performance

Figure 2 for Exposing LLM Vulnerabilities: Adversarial Scam Detection and Performance

Figure 3 for Exposing LLM Vulnerabilities: Adversarial Scam Detection and Performance

Figure 4 for Exposing LLM Vulnerabilities: Adversarial Scam Detection and Performance

Abstract:Can we trust Large Language Models (LLMs) to accurately predict scam? This paper investigates the vulnerabilities of LLMs when facing adversarial scam messages for the task of scam detection. We addressed this issue by creating a comprehensive dataset with fine-grained labels of scam messages, including both original and adversarial scam messages. The dataset extended traditional binary classes for the scam detection task into more nuanced scam types. Our analysis showed how adversarial examples took advantage of vulnerabilities of a LLM, leading to high misclassification rate. We evaluated the performance of LLMs on these adversarial scam messages and proposed strategies to improve their robustness.

* 4 pages, 2024 IEEE International Conference on Big Data workshop BigEACPS 2024

Via

Access Paper or Ask Questions