Picture for Eric Pauley

Eric Pauley

On the Robustness Tradeoff in Fine-Tuning

Add code
Mar 19, 2025
Viaarxiv icon

Adversarial Agents: Black-Box Evasion Attacks with Reinforcement Learning

Add code
Mar 03, 2025
Viaarxiv icon

Targeting Alignment: Extracting Safety Classifiers of Aligned LLMs

Add code
Jan 27, 2025
Viaarxiv icon

The Space of Adversarial Strategies

Add code
Sep 09, 2022
Figure 1 for The Space of Adversarial Strategies
Figure 2 for The Space of Adversarial Strategies
Figure 3 for The Space of Adversarial Strategies
Figure 4 for The Space of Adversarial Strategies
Viaarxiv icon

On the Robustness of Domain Constraints

Add code
May 18, 2021
Figure 1 for On the Robustness of Domain Constraints
Figure 2 for On the Robustness of Domain Constraints
Figure 3 for On the Robustness of Domain Constraints
Figure 4 for On the Robustness of Domain Constraints
Viaarxiv icon