Picture for Rui Pu

Rui Pu

MirrorGuard: Adaptive Defense Against Jailbreaks via Entropy-Guided Mirror Crafting

Add code
Mar 17, 2025
Viaarxiv icon

Feint and Attack: Attention-Based Strategies for Jailbreaking and Protecting LLMs

Add code
Oct 18, 2024
Figure 1 for Feint and Attack: Attention-Based Strategies for Jailbreaking and Protecting LLMs
Figure 2 for Feint and Attack: Attention-Based Strategies for Jailbreaking and Protecting LLMs
Figure 3 for Feint and Attack: Attention-Based Strategies for Jailbreaking and Protecting LLMs
Figure 4 for Feint and Attack: Attention-Based Strategies for Jailbreaking and Protecting LLMs
Viaarxiv icon