Picture for Jean-Charles Noirot Ferrand

Jean-Charles Noirot Ferrand

On the Robustness Tradeoff in Fine-Tuning

Add code
Mar 19, 2025
Viaarxiv icon

Adversarial Agents: Black-Box Evasion Attacks with Reinforcement Learning

Add code
Mar 03, 2025
Viaarxiv icon

Targeting Alignment: Extracting Safety Classifiers of Aligned LLMs

Add code
Jan 27, 2025
Viaarxiv icon

The Efficacy of Transformer-based Adversarial Attacks in Security Domains

Add code
Oct 17, 2023
Viaarxiv icon