Willow Primack

MultiChallenge: A Realistic Multi-Turn Conversation Evaluation Benchmark Challenging to Frontier LLMs

Jan 29, 2025
Via arXiv

LLM Defenses Are Not Robust to Multi-Turn Human Jailbreaks Yet

Aug 27, 2024
Via arXiv