Picture for Christian Schroeder de Witt

Christian Schroeder de Witt

Michael Pokorny

Multi-Agent Security Tax: Trading Off Security and Collaboration Capabilities in Multi-Agent Systems

Add code
Feb 26, 2025
Viaarxiv icon

Fundamental Limitations in Defending LLM Finetuning APIs

Add code
Feb 20, 2025
Viaarxiv icon

Multi-Agent Risks from Advanced AI

Add code
Feb 19, 2025
Viaarxiv icon

PSyDUCK: Training-Free Steganography for Latent Diffusion

Add code
Jan 31, 2025
Figure 1 for PSyDUCK: Training-Free Steganography for Latent Diffusion
Figure 2 for PSyDUCK: Training-Free Steganography for Latent Diffusion
Figure 3 for PSyDUCK: Training-Free Steganography for Latent Diffusion
Figure 4 for PSyDUCK: Training-Free Steganography for Latent Diffusion
Viaarxiv icon

Humanity's Last Exam

Add code
Jan 24, 2025
Viaarxiv icon

MALT: Improving Reasoning with Multi-Agent LLM Training

Add code
Dec 02, 2024
Figure 1 for MALT: Improving Reasoning with Multi-Agent LLM Training
Figure 2 for MALT: Improving Reasoning with Multi-Agent LLM Training
Figure 3 for MALT: Improving Reasoning with Multi-Agent LLM Training
Viaarxiv icon

Delta-Influence: Unlearning Poisons via Influence Functions

Add code
Nov 20, 2024
Figure 1 for Delta-Influence: Unlearning Poisons via Influence Functions
Figure 2 for Delta-Influence: Unlearning Poisons via Influence Functions
Figure 3 for Delta-Influence: Unlearning Poisons via Influence Functions
Figure 4 for Delta-Influence: Unlearning Poisons via Influence Functions
Viaarxiv icon

MAD-Sherlock: Multi-Agent Debates for Out-of-Context Misinformation Detection

Add code
Oct 26, 2024
Figure 1 for MAD-Sherlock: Multi-Agent Debates for Out-of-Context Misinformation Detection
Figure 2 for MAD-Sherlock: Multi-Agent Debates for Out-of-Context Misinformation Detection
Figure 3 for MAD-Sherlock: Multi-Agent Debates for Out-of-Context Misinformation Detection
Figure 4 for MAD-Sherlock: Multi-Agent Debates for Out-of-Context Misinformation Detection
Viaarxiv icon

Efficient Dictionary Learning with Switch Sparse Autoencoders

Add code
Oct 10, 2024
Figure 1 for Efficient Dictionary Learning with Switch Sparse Autoencoders
Figure 2 for Efficient Dictionary Learning with Switch Sparse Autoencoders
Figure 3 for Efficient Dictionary Learning with Switch Sparse Autoencoders
Figure 4 for Efficient Dictionary Learning with Switch Sparse Autoencoders
Viaarxiv icon

Toward Robust Real-World Audio Deepfake Detection: Closing the Explainability Gap

Add code
Oct 09, 2024
Figure 1 for Toward Robust Real-World Audio Deepfake Detection: Closing the Explainability Gap
Figure 2 for Toward Robust Real-World Audio Deepfake Detection: Closing the Explainability Gap
Figure 3 for Toward Robust Real-World Audio Deepfake Detection: Closing the Explainability Gap
Figure 4 for Toward Robust Real-World Audio Deepfake Detection: Closing the Explainability Gap
Viaarxiv icon