Picture for Marc Fischer

Marc Fischer

AgentDojo: A Dynamic Environment to Evaluate Attacks and Defenses for LLM Agents

Add code
Jun 19, 2024
Viaarxiv icon

Overcoming the Paradox of Certified Training with Gaussian Smoothing

Add code
Mar 11, 2024
Viaarxiv icon

Evading Data Contamination Detection for Language Models is (too) Easy

Add code
Feb 12, 2024
Viaarxiv icon

Controlled Text Generation via Language Model Arithmetic

Add code
Nov 24, 2023
Viaarxiv icon

Prompt Sketching for Large Language Models

Add code
Nov 08, 2023
Viaarxiv icon

Understanding Certified Training with Interval Bound Propagation

Add code
Jun 17, 2023
Viaarxiv icon

TAPS: Connecting Certified and Adversarial Training

Add code
May 08, 2023
Viaarxiv icon

Efficient Certified Training and Robustness Verification of Neural ODEs

Add code
Mar 09, 2023
Viaarxiv icon

Prompting Is Programming: A Query Language For Large Language Models

Add code
Dec 12, 2022
Viaarxiv icon

Prompt Tuning for Parameter-efficient Medical Image Segmentation

Add code
Nov 16, 2022
Viaarxiv icon