Picture for Yuping Lin

Yuping Lin

Towards Understanding Jailbreak Attacks in LLMs: A Representation Space Analysis

Add code
Jun 16, 2024
Figure 1 for Towards Understanding Jailbreak Attacks in LLMs: A Representation Space Analysis
Figure 2 for Towards Understanding Jailbreak Attacks in LLMs: A Representation Space Analysis
Figure 3 for Towards Understanding Jailbreak Attacks in LLMs: A Representation Space Analysis
Figure 4 for Towards Understanding Jailbreak Attacks in LLMs: A Representation Space Analysis
Viaarxiv icon

FT-Shield: A Watermark Against Unauthorized Fine-tuning in Text-to-Image Diffusion Models

Add code
Oct 03, 2023
Viaarxiv icon

Bandlimiting Neural Networks Against Adversarial Attacks

Add code
May 30, 2019
Figure 1 for Bandlimiting Neural Networks Against Adversarial Attacks
Figure 2 for Bandlimiting Neural Networks Against Adversarial Attacks
Figure 3 for Bandlimiting Neural Networks Against Adversarial Attacks
Figure 4 for Bandlimiting Neural Networks Against Adversarial Attacks
Viaarxiv icon