
Richard Fang

CVE-Bench: A Benchmark for AI Agents' Ability to Exploit Real-World Web Application Vulnerabilities

Mar 21, 2025
Via arXiv

Voice-Enabled AI Agents can Perform Common Scams

Oct 21, 2024
Via arXiv

Teams of LLM Agents can Exploit Zero-Day Vulnerabilities

Jun 02, 2024
Via arXiv

LLM Agents can Autonomously Exploit One-day Vulnerabilities

Apr 11, 2024
Via arXiv

LLM Agents can Autonomously Hack Websites

Feb 16, 2024
Via arXiv

Removing RLHF Protections in GPT-4 via Fine-Tuning

Nov 10, 2023
Via arXiv