Picture for Lauren Alvarez

Lauren Alvarez

Prompt Optimization and Evaluation for LLM Automated Red Teaming

Add code
Jul 29, 2025
Viaarxiv icon

Don't Lie to Me: Avoiding Malicious Explanations with STEALTH

Add code
Jan 25, 2023
Figure 1 for Don't Lie to Me: Avoiding Malicious Explanations with STEALTH
Figure 2 for Don't Lie to Me: Avoiding Malicious Explanations with STEALTH
Figure 3 for Don't Lie to Me: Avoiding Malicious Explanations with STEALTH
Figure 4 for Don't Lie to Me: Avoiding Malicious Explanations with STEALTH
Viaarxiv icon