Qizhang Li

Deciphering the Chaos: Enhancing Jailbreak Attacks via Adversarial Prompt Translation

Oct 15, 2024

Improved Generation of Adversarial Examples Against Safety-aligned LLMs

May 28, 2024

Towards Evaluating Transfer-based Attacks Systematically, Practically, and Fairly

Nov 02, 2023

DualAug: Exploiting Additional Heavy Augmentation with OOD Data Rejection

Oct 16, 2023

Improving Transferability of Adversarial Examples via Bayesian Attacks

Jul 21, 2023

Improving Adversarial Transferability via Intermediate-level Perturbation Decay

May 09, 2023

Making Substitute Models More Bayesian Can Enhance Transferability of Adversarial Examples

Mar 02, 2023

Adversarial Contrastive Learning via Asymmetric InfoNCE

Jul 18, 2022

Collaborative Adversarial Training

May 23, 2022

An Intermediate-level Attack Framework on The Basis of Linear Regression

Mar 21, 2022