
Christian Schroeder de Witt

Michael Pokorny

PSyDUCK: Training-Free Steganography for Latent Diffusion

Jan 31, 2025
Via arXiv

Humanity's Last Exam

Jan 24, 2025
Via arXiv

MALT: Improving Reasoning with Multi-Agent LLM Training

Dec 02, 2024
Via arXiv

Delta-Influence: Unlearning Poisons via Influence Functions

Nov 20, 2024
Via arXiv

MAD-Sherlock: Multi-Agent Debates for Out-of-Context Misinformation Detection

Oct 26, 2024
Via arXiv

Efficient Dictionary Learning with Switch Sparse Autoencoders

Oct 10, 2024
Via arXiv

Toward Robust Real-World Audio Deepfake Detection: Closing the Explainability Gap

Oct 09, 2024
Via arXiv

SAGE: Scalable Ground Truth Evaluations for Large Sparse Autoencoders

Oct 09, 2024
Via arXiv

Hidden in Plain Text: Emergence & Mitigation of Steganographic Collusion in LLMs

Oct 02, 2024
Via arXiv

IDs for AI Systems

Jun 17, 2024
Via arXiv