Tianshuo Cong

Jailbreak Attacks and Defenses Against Large Language Models: A Survey

Jul 05, 2024
Via arXiv

On Evaluating The Performance of Watermarked Machine-Generated Texts Under Adversarial Attacks

Jul 05, 2024
Via arXiv

JailbreakEval: An Integrated Toolkit for Evaluating Jailbreak Attempts Against Large Language Models

Jun 13, 2024
Via arXiv

Have You Merged My Model? On The Robustness of Large Language Model IP Protection Methods Against Model Merging

Apr 08, 2024
Via arXiv

FigStep: Jailbreaking Large Vision-language Models via Typographic Visual Prompts

Nov 09, 2023
Via arXiv

SSLGuard: A Watermarking Scheme for Self-supervised Learning Pre-trained Encoders

Jan 27, 2022
Via arXiv