Picture for Dan Valentine

Dan Valentine

Colorado School of Mines, Department of Applied Mathematics and Statistics

When Do Universal Image Jailbreaks Transfer Between Vision-Language Models?

Add code
Jul 21, 2024
Viaarxiv icon

Debating with More Persuasive LLMs Leads to More Truthful Answers

Add code
Feb 15, 2024
Viaarxiv icon

Structured World Representations in Maze-Solving Transformers

Add code
Dec 05, 2023
Viaarxiv icon

A Configurable Library for Generating and Manipulating Maze Datasets

Add code
Sep 19, 2023
Viaarxiv icon