Picture for Gongshen Liu

Gongshen Liu

Gracefully Filtering Backdoor Samples for Generative Large Language Models without Retraining

Add code
Dec 03, 2024
Viaarxiv icon

NSmark: Null Space Based Black-box Watermarking Defense Framework for Pre-trained Language Models

Add code
Oct 16, 2024
Figure 1 for NSmark: Null Space Based Black-box Watermarking Defense Framework for Pre-trained Language Models
Figure 2 for NSmark: Null Space Based Black-box Watermarking Defense Framework for Pre-trained Language Models
Figure 3 for NSmark: Null Space Based Black-box Watermarking Defense Framework for Pre-trained Language Models
Figure 4 for NSmark: Null Space Based Black-box Watermarking Defense Framework for Pre-trained Language Models
Viaarxiv icon

Flooding Spread of Manipulated Knowledge in LLM-Based Multi-Agent Communities

Add code
Jul 10, 2024
Figure 1 for Flooding Spread of Manipulated Knowledge in LLM-Based Multi-Agent Communities
Figure 2 for Flooding Spread of Manipulated Knowledge in LLM-Based Multi-Agent Communities
Figure 3 for Flooding Spread of Manipulated Knowledge in LLM-Based Multi-Agent Communities
Figure 4 for Flooding Spread of Manipulated Knowledge in LLM-Based Multi-Agent Communities
Viaarxiv icon

TrojanRAG: Retrieval-Augmented Generation Can Be Backdoor Driver in Large Language Models

Add code
May 22, 2024
Viaarxiv icon

MKF-ADS: Multi-Knowledge Fusion Based Self-supervised Anomaly Detection System for Control Area Network

Add code
Mar 15, 2024
Viaarxiv icon

How Large Language Models Encode Context Knowledge? A Layer-Wise Probing Study

Add code
Mar 04, 2024
Viaarxiv icon

Syntactic Ghost: An Imperceptible General-purpose Backdoor Attacks on Pre-trained Language Models

Add code
Feb 29, 2024
Viaarxiv icon

Acquiring Clean Language Models from Backdoor Poisoned Datasets by Downscaling Frequency Space

Add code
Feb 27, 2024
Figure 1 for Acquiring Clean Language Models from Backdoor Poisoned Datasets by Downscaling Frequency Space
Figure 2 for Acquiring Clean Language Models from Backdoor Poisoned Datasets by Downscaling Frequency Space
Figure 3 for Acquiring Clean Language Models from Backdoor Poisoned Datasets by Downscaling Frequency Space
Figure 4 for Acquiring Clean Language Models from Backdoor Poisoned Datasets by Downscaling Frequency Space
Viaarxiv icon

Investigating Multi-Hop Factual Shortcuts in Knowledge Editing of Large Language Models

Add code
Feb 19, 2024
Viaarxiv icon

Improving Non-autoregressive Machine Translation with Error Exposure and Consistency Regularization

Add code
Feb 15, 2024
Viaarxiv icon