Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Xuan Sheng

Punctuation Matters! Stealthy Backdoor Attack for Language Models

Dec 26, 2023

Xuan Sheng, Zhicheng Li, Zhaoyang Han, Xiangmao Chang, Piji Li

Abstract:Recent studies have pointed out that natural language processing (NLP) models are vulnerable to backdoor attacks. A backdoored model produces normal outputs on the clean samples while performing improperly on the texts with triggers that the adversary injects. However, previous studies on textual backdoor attack pay little attention to stealthiness. Moreover, some attack methods even cause grammatical issues or change the semantic meaning of the original texts. Therefore, they can easily be detected by humans or defense systems. In this paper, we propose a novel stealthy backdoor attack method against textual models, which is called \textbf{PuncAttack}. It leverages combinations of punctuation marks as the trigger and chooses proper locations strategically to replace them. Through extensive experiments, we demonstrate that the proposed method can effectively compromise multiple models in various tasks. Meanwhile, we conduct automatic evaluation and human inspection, which indicate the proposed method possesses good performance of stealthiness without bringing grammatical issues and altering the meaning of sentences.

* NLPCC 2023

Via

Access Paper or Ask Questions

A Survey on Backdoor Attack and Defense in Natural Language Processing

Nov 22, 2022

Xuan Sheng, Zhaoyang Han, Piji Li, Xiangmao Chang

Figure 1 for A Survey on Backdoor Attack and Defense in Natural Language Processing

Figure 2 for A Survey on Backdoor Attack and Defense in Natural Language Processing

Figure 3 for A Survey on Backdoor Attack and Defense in Natural Language Processing

Figure 4 for A Survey on Backdoor Attack and Defense in Natural Language Processing

Abstract:Deep learning is becoming increasingly popular in real-life applications, especially in natural language processing (NLP). Users often choose training outsourcing or adopt third-party data and models due to data and computation resources being limited. In such a situation, training data and models are exposed to the public. As a result, attackers can manipulate the training process to inject some triggers into the model, which is called backdoor attack. Backdoor attack is quite stealthy and difficult to be detected because it has little inferior influence on the model's performance for the clean samples. To get a precise grasp and understanding of this problem, in this paper, we conduct a comprehensive review of backdoor attacks and defenses in the field of NLP. Besides, we summarize benchmark datasets and point out the open issues to design credible systems to defend against backdoor attacks.

* 12 pages, QRS2022

Via

Access Paper or Ask Questions