Picture for Jason Benn

Jason Benn

CVE-Bench: A Benchmark for AI Agents' Ability to Exploit Real-World Web Application Vulnerabilities

Add code
Mar 21, 2025
Viaarxiv icon