David Wagner

Toxicity Detection for Free

May 29, 2024

Certifiably Robust RAG against Retrieval Corruption

May 24, 2024

Vulnerability Detection with Code Language Models: How Far Are We?

Mar 27, 2024

Generative AI Security: Challenges and Countermeasures

Feb 20, 2024

PAL: Proxy-Guided Black-Box Attack on Large Language Models

Feb 15, 2024

Jatmo: Prompt Injection Defense by Task-Specific Finetuning

Jan 08, 2024

Mark My Words: Analyzing and Evaluating Language Model Watermarks

Dec 07, 2023

Can LLMs Follow Simple Rules?

Nov 06, 2023

Defending Against Transfer Attacks From Public Models

Oct 26, 2023

DiverseVul: A New Vulnerable Source Code Dataset for Deep Learning Based Vulnerability Detection

Apr 01, 2023