Picture for Dylan Bowman

Dylan Bowman

CVE-Bench: A Benchmark for AI Agents' Ability to Exploit Real-World Web Application Vulnerabilities

Add code
Mar 21, 2025
Viaarxiv icon

Voice-Enabled AI Agents can Perform Common Scams

Add code
Oct 21, 2024
Viaarxiv icon