Dan Valentine

Colorado School of Mines, Department of Applied Mathematics and Statistics

When Do Universal Image Jailbreaks Transfer Between Vision-Language Models?

Jul 21, 2024
Via arXiv

Debating with More Persuasive LLMs Leads to More Truthful Answers

Feb 15, 2024
Via arXiv

Structured World Representations in Maze-Solving Transformers

Dec 05, 2023
Via arXiv

A Configurable Library for Generating and Manipulating Maze Datasets

Sep 19, 2023
Via arXiv