Picture for Jason Vega

Jason Vega

Stochastic Monkeys at Play: Random Augmentations Cheaply Break LLM Safety Alignment

Add code
Nov 05, 2024
Figure 1 for Stochastic Monkeys at Play: Random Augmentations Cheaply Break LLM Safety Alignment
Figure 2 for Stochastic Monkeys at Play: Random Augmentations Cheaply Break LLM Safety Alignment
Figure 3 for Stochastic Monkeys at Play: Random Augmentations Cheaply Break LLM Safety Alignment
Figure 4 for Stochastic Monkeys at Play: Random Augmentations Cheaply Break LLM Safety Alignment
Viaarxiv icon

Bypassing the Safety Training of Open-Source LLMs with Priming Attacks

Add code
Dec 19, 2023
Figure 1 for Bypassing the Safety Training of Open-Source LLMs with Priming Attacks
Figure 2 for Bypassing the Safety Training of Open-Source LLMs with Priming Attacks
Figure 3 for Bypassing the Safety Training of Open-Source LLMs with Priming Attacks
Figure 4 for Bypassing the Safety Training of Open-Source LLMs with Priming Attacks
Viaarxiv icon

Neural Representation Learning for Scribal Hands of Linear B

Add code
Jul 14, 2021
Figure 1 for Neural Representation Learning for Scribal Hands of Linear B
Figure 2 for Neural Representation Learning for Scribal Hands of Linear B
Figure 3 for Neural Representation Learning for Scribal Hands of Linear B
Figure 4 for Neural Representation Learning for Scribal Hands of Linear B
Viaarxiv icon