Picture for Florian Tramèr

Florian Tramèr

CaMeLs Can Use Computers Too: System-level Security for Computer Use Agents

Add code
Jan 14, 2026
Viaarxiv icon

Representations of Text and Images Align From Layer One

Add code
Jan 12, 2026
Viaarxiv icon

Apertus: Democratizing Open and Compliant LLMs for Global Language Environments

Add code
Sep 17, 2025
Figure 1 for Apertus: Democratizing Open and Compliant LLMs for Global Language Environments
Figure 2 for Apertus: Democratizing Open and Compliant LLMs for Global Language Environments
Figure 3 for Apertus: Democratizing Open and Compliant LLMs for Global Language Environments
Figure 4 for Apertus: Democratizing Open and Compliant LLMs for Global Language Environments
Viaarxiv icon

Design Patterns for Securing LLM Agents against Prompt Injections

Add code
Jun 11, 2025
Figure 1 for Design Patterns for Securing LLM Agents against Prompt Injections
Figure 2 for Design Patterns for Securing LLM Agents against Prompt Injections
Figure 3 for Design Patterns for Securing LLM Agents against Prompt Injections
Figure 4 for Design Patterns for Securing LLM Agents against Prompt Injections
Viaarxiv icon

Membership Inference Attacks on Sequence Models

Add code
Jun 05, 2025
Viaarxiv icon

RealMath: A Continuous Benchmark for Evaluating Language Models on Research-Level Mathematics

Add code
May 18, 2025
Viaarxiv icon

LLMs unlock new paths to monetizing exploits

Add code
May 16, 2025
Figure 1 for LLMs unlock new paths to monetizing exploits
Figure 2 for LLMs unlock new paths to monetizing exploits
Figure 3 for LLMs unlock new paths to monetizing exploits
Figure 4 for LLMs unlock new paths to monetizing exploits
Viaarxiv icon

The Jailbreak Tax: How Useful are Your Jailbreak Outputs?

Add code
Apr 14, 2025
Figure 1 for The Jailbreak Tax: How Useful are Your Jailbreak Outputs?
Figure 2 for The Jailbreak Tax: How Useful are Your Jailbreak Outputs?
Figure 3 for The Jailbreak Tax: How Useful are Your Jailbreak Outputs?
Figure 4 for The Jailbreak Tax: How Useful are Your Jailbreak Outputs?
Viaarxiv icon

Defeating Prompt Injections by Design

Add code
Mar 24, 2025
Viaarxiv icon

AutoAdvExBench: Benchmarking autonomous exploitation of adversarial example defenses

Add code
Mar 03, 2025
Figure 1 for AutoAdvExBench: Benchmarking autonomous exploitation of adversarial example defenses
Figure 2 for AutoAdvExBench: Benchmarking autonomous exploitation of adversarial example defenses
Figure 3 for AutoAdvExBench: Benchmarking autonomous exploitation of adversarial example defenses
Figure 4 for AutoAdvExBench: Benchmarking autonomous exploitation of adversarial example defenses
Viaarxiv icon