Picture for M. H. I. Abdalla

M. H. I. Abdalla

REINFORCE Adversarial Attacks on Large Language Models: An Adaptive, Distributional, and Semantic Objective

Add code
Feb 24, 2025
Viaarxiv icon

Attacking Large Language Models with Projected Gradient Descent

Add code
Feb 14, 2024
Viaarxiv icon