Picture for Ahmed Salem

Ahmed Salem

Microsoft Research

Permissive Information-Flow Analysis for Large Language Models

Add code
Oct 04, 2024
Viaarxiv icon

Vera Verto: Multimodal Hijacking Attack

Add code
Jul 31, 2024
Viaarxiv icon

Breaking Agents: Compromising Autonomous LLM Agents Through Malfunction Amplification

Add code
Jul 30, 2024
Viaarxiv icon

Hey, That's My Model! Introducing Chain & Hash, An LLM Fingerprinting Technique

Add code
Jul 15, 2024
Figure 1 for Hey, That's My Model! Introducing Chain & Hash, An LLM Fingerprinting Technique
Figure 2 for Hey, That's My Model! Introducing Chain & Hash, An LLM Fingerprinting Technique
Figure 3 for Hey, That's My Model! Introducing Chain & Hash, An LLM Fingerprinting Technique
Figure 4 for Hey, That's My Model! Introducing Chain & Hash, An LLM Fingerprinting Technique
Viaarxiv icon

SOS! Soft Prompt Attack Against Open-Source Large Language Models

Add code
Jul 03, 2024
Viaarxiv icon

Dataset and Lessons Learned from the 2024 SaTML LLM Capture-the-Flag Competition

Add code
Jun 12, 2024
Viaarxiv icon

Are you still on track!? Catching LLM Task Drift with Activations

Add code
Jun 02, 2024
Viaarxiv icon

Great, Now Write an Article About That: The Crescendo Multi-Turn LLM Jailbreak Attack

Add code
Apr 02, 2024
Viaarxiv icon

Maatphor: Automated Variant Analysis for Prompt Injection Attacks

Add code
Dec 12, 2023
Viaarxiv icon

Rethinking Privacy in Machine Learning Pipelines from an Information Flow Control Perspective

Add code
Nov 27, 2023
Viaarxiv icon