Picture for Philip Li

Philip Li

ALERT: Zero-shot LLM Jailbreak Detection via Internal Discrepancy Amplification

Add code
Jan 07, 2026
Viaarxiv icon

CVE-Bench: A Benchmark for AI Agents' Ability to Exploit Real-World Web Application Vulnerabilities

Add code
Mar 21, 2025
Figure 1 for CVE-Bench: A Benchmark for AI Agents' Ability to Exploit Real-World Web Application Vulnerabilities
Figure 2 for CVE-Bench: A Benchmark for AI Agents' Ability to Exploit Real-World Web Application Vulnerabilities
Figure 3 for CVE-Bench: A Benchmark for AI Agents' Ability to Exploit Real-World Web Application Vulnerabilities
Figure 4 for CVE-Bench: A Benchmark for AI Agents' Ability to Exploit Real-World Web Application Vulnerabilities
Viaarxiv icon