Picture for Xingjun Ma

Xingjun Ma

IDEATOR: Jailbreaking VLMs Using VLMs

Add code
Oct 29, 2024
Viaarxiv icon

BlueSuffix: Reinforced Blue Teaming for Vision-Language Models Against Jailbreak Attacks

Add code
Oct 28, 2024
Viaarxiv icon

Expose Before You Defend: Unifying and Enhancing Backdoor Defenses via Exposed Models

Add code
Oct 25, 2024
Viaarxiv icon

UnSeg: One Universal Unlearnable Example Generator is Enough against All Image Segmentation

Add code
Oct 13, 2024
Viaarxiv icon

AnyAttack: Towards Large-scale Self-supervised Generation of Targeted Adversarial Examples for Vision-Language Models

Add code
Oct 07, 2024
Viaarxiv icon

BackdoorLLM: A Comprehensive Benchmark for Backdoor Attacks on Large Language Models

Add code
Aug 23, 2024
Viaarxiv icon

EnJa: Ensemble Jailbreak on Large Language Models

Add code
Aug 07, 2024
Figure 1 for EnJa: Ensemble Jailbreak on Large Language Models
Figure 2 for EnJa: Ensemble Jailbreak on Large Language Models
Figure 3 for EnJa: Ensemble Jailbreak on Large Language Models
Figure 4 for EnJa: Ensemble Jailbreak on Large Language Models
Viaarxiv icon

AdvQDet: Detecting Query-Based Adversarial Attacks with Adversarial Contrastive Prompt Tuning

Add code
Aug 04, 2024
Viaarxiv icon

Downstream Transfer Attack: Adversarial Attacks on Downstream Models with Pre-trained Vision Transformers

Add code
Aug 03, 2024
Figure 1 for Downstream Transfer Attack: Adversarial Attacks on Downstream Models with Pre-trained Vision Transformers
Figure 2 for Downstream Transfer Attack: Adversarial Attacks on Downstream Models with Pre-trained Vision Transformers
Figure 3 for Downstream Transfer Attack: Adversarial Attacks on Downstream Models with Pre-trained Vision Transformers
Figure 4 for Downstream Transfer Attack: Adversarial Attacks on Downstream Models with Pre-trained Vision Transformers
Viaarxiv icon

Constrained Intrinsic Motivation for Reinforcement Learning

Add code
Jul 12, 2024
Viaarxiv icon