Picture for John Hughes

John Hughes

University of Leeds, UK

Best-of-N Jailbreaking

Add code
Dec 04, 2024
Viaarxiv icon

Jailbreak Defense in a Narrow Domain: Limitations of Existing Methods and a New Transcript-Classifier Approach

Add code
Dec 03, 2024
Figure 1 for Jailbreak Defense in a Narrow Domain: Limitations of Existing Methods and a New Transcript-Classifier Approach
Figure 2 for Jailbreak Defense in a Narrow Domain: Limitations of Existing Methods and a New Transcript-Classifier Approach
Figure 3 for Jailbreak Defense in a Narrow Domain: Limitations of Existing Methods and a New Transcript-Classifier Approach
Figure 4 for Jailbreak Defense in a Narrow Domain: Limitations of Existing Methods and a New Transcript-Classifier Approach
Viaarxiv icon

Looking Inward: Language Models Can Learn About Themselves by Introspection

Add code
Oct 17, 2024
Viaarxiv icon

When Do Universal Image Jailbreaks Transfer Between Vision-Language Models?

Add code
Jul 21, 2024
Figure 1 for When Do Universal Image Jailbreaks Transfer Between Vision-Language Models?
Figure 2 for When Do Universal Image Jailbreaks Transfer Between Vision-Language Models?
Figure 3 for When Do Universal Image Jailbreaks Transfer Between Vision-Language Models?
Figure 4 for When Do Universal Image Jailbreaks Transfer Between Vision-Language Models?
Viaarxiv icon

Is Model Collapse Inevitable? Breaking the Curse of Recursion by Accumulating Real and Synthetic Data

Add code
Apr 01, 2024
Viaarxiv icon

Debating with More Persuasive LLMs Leads to More Truthful Answers

Add code
Feb 15, 2024
Viaarxiv icon

Hierarchical Quantized Autoencoders

Add code
Feb 19, 2020
Figure 1 for Hierarchical Quantized Autoencoders
Figure 2 for Hierarchical Quantized Autoencoders
Figure 3 for Hierarchical Quantized Autoencoders
Figure 4 for Hierarchical Quantized Autoencoders
Viaarxiv icon

Automatic Extraction of Tagset Mappings from Parallel-Annotated Corpora

Add code
Jun 08, 1995
Figure 1 for Automatic Extraction of Tagset Mappings from Parallel-Annotated Corpora
Figure 2 for Automatic Extraction of Tagset Mappings from Parallel-Annotated Corpora
Figure 3 for Automatic Extraction of Tagset Mappings from Parallel-Annotated Corpora
Viaarxiv icon