Picture for Kaiyuan Zhang

Kaiyuan Zhang

CENSOR: Defense Against Gradient Inversion via Orthogonal Subspace Bayesian Sampling

Add code
Jan 27, 2025
Viaarxiv icon

ProSec: Fortifying Code LLMs with Proactive Security Alignment

Add code
Nov 19, 2024
Viaarxiv icon

UNIT: Backdoor Mitigation via Automated Neural Distribution Tightening

Add code
Jul 16, 2024
Figure 1 for UNIT: Backdoor Mitigation via Automated Neural Distribution Tightening
Figure 2 for UNIT: Backdoor Mitigation via Automated Neural Distribution Tightening
Figure 3 for UNIT: Backdoor Mitigation via Automated Neural Distribution Tightening
Figure 4 for UNIT: Backdoor Mitigation via Automated Neural Distribution Tightening
Viaarxiv icon

Source Code Foundation Models are Transferable Binary Analysis Knowledge Bases

Add code
May 30, 2024
Viaarxiv icon

LOTUS: Evasive and Resilient Backdoor Attacks through Sub-Partitioning

Add code
Mar 25, 2024
Figure 1 for LOTUS: Evasive and Resilient Backdoor Attacks through Sub-Partitioning
Figure 2 for LOTUS: Evasive and Resilient Backdoor Attacks through Sub-Partitioning
Figure 3 for LOTUS: Evasive and Resilient Backdoor Attacks through Sub-Partitioning
Figure 4 for LOTUS: Evasive and Resilient Backdoor Attacks through Sub-Partitioning
Viaarxiv icon

Rapid Optimization for Jailbreaking LLMs via Subconscious Exploitation and Echopraxia

Add code
Feb 08, 2024
Viaarxiv icon

Elijah: Eliminating Backdoors Injected in Diffusion Models via Distribution Shift

Add code
Nov 27, 2023
Viaarxiv icon

GNN4EEG: A Benchmark and Toolkit for Electroencephalography Classification with Graph Neural Network

Add code
Sep 27, 2023
Viaarxiv icon

ParaFuzz: An Interpretability-Driven Technique for Detecting Poisoned Samples in NLP

Add code
Aug 04, 2023
Viaarxiv icon

Detecting Backdoors in Pre-trained Encoders

Add code
Mar 23, 2023
Figure 1 for Detecting Backdoors in Pre-trained Encoders
Figure 2 for Detecting Backdoors in Pre-trained Encoders
Figure 3 for Detecting Backdoors in Pre-trained Encoders
Figure 4 for Detecting Backdoors in Pre-trained Encoders
Viaarxiv icon