Picture for Ahmed Salem

Ahmed Salem

Microsoft Research

Obliviate: Efficient Unmemorization for Protecting Intellectual Property in Large Language Models

Add code
Feb 20, 2025
Viaarxiv icon

Permissive Information-Flow Analysis for Large Language Models

Add code
Oct 04, 2024
Viaarxiv icon

Vera Verto: Multimodal Hijacking Attack

Add code
Jul 31, 2024
Viaarxiv icon

Breaking Agents: Compromising Autonomous LLM Agents Through Malfunction Amplification

Add code
Jul 30, 2024
Viaarxiv icon

Hey, That's My Model! Introducing Chain & Hash, An LLM Fingerprinting Technique

Add code
Jul 15, 2024
Figure 1 for Hey, That's My Model! Introducing Chain & Hash, An LLM Fingerprinting Technique
Figure 2 for Hey, That's My Model! Introducing Chain & Hash, An LLM Fingerprinting Technique
Figure 3 for Hey, That's My Model! Introducing Chain & Hash, An LLM Fingerprinting Technique
Figure 4 for Hey, That's My Model! Introducing Chain & Hash, An LLM Fingerprinting Technique
Viaarxiv icon

SOS! Soft Prompt Attack Against Open-Source Large Language Models

Add code
Jul 03, 2024
Viaarxiv icon

Dataset and Lessons Learned from the 2024 SaTML LLM Capture-the-Flag Competition

Add code
Jun 12, 2024
Figure 1 for Dataset and Lessons Learned from the 2024 SaTML LLM Capture-the-Flag Competition
Figure 2 for Dataset and Lessons Learned from the 2024 SaTML LLM Capture-the-Flag Competition
Figure 3 for Dataset and Lessons Learned from the 2024 SaTML LLM Capture-the-Flag Competition
Figure 4 for Dataset and Lessons Learned from the 2024 SaTML LLM Capture-the-Flag Competition
Viaarxiv icon

Are you still on track!? Catching LLM Task Drift with Activations

Add code
Jun 02, 2024
Viaarxiv icon

Great, Now Write an Article About That: The Crescendo Multi-Turn LLM Jailbreak Attack

Add code
Apr 02, 2024
Viaarxiv icon

Maatphor: Automated Variant Analysis for Prompt Injection Attacks

Add code
Dec 12, 2023
Viaarxiv icon