Javier Rando

Poisoning Attacks on LLMs Require a Near-constant Number of Poison Samples

Oct 08, 2025

AutoAdvExBench: Benchmarking autonomous exploitation of adversarial example defenses

Mar 03, 2025

Adversarial ML Problems Are Getting Harder to Solve and to Evaluate

Feb 04, 2025

Measuring Non-Adversarial Reproduction of Training Data in Large Language Models

Nov 15, 2024

Llama Guard 3 Vision: Safeguarding Human-AI Image Understanding Conversations

Nov 15, 2024

Persistent Pre-Training Poisoning of LLMs

Oct 17, 2024

Gradient-based Jailbreak Images for Multimodal Fusion Models

Oct 04, 2024

An Adversarial Perspective on Machine Unlearning for AI Safety

Sep 26, 2024

Dataset and Lessons Learned from the 2024 SaTML LLM Capture-the-Flag Competition

Jun 12, 2024

Competition Report: Finding Universal Jailbreak Backdoors in Aligned LLMs

Apr 22, 2024