Picture for Dawn Song

Dawn Song

University of California, Berkeley

A Framework for Formalizing LLM Agent Security

Add code
Mar 19, 2026
Viaarxiv icon

CUBE: A Standard for Unifying Agent Benchmarks

Add code
Mar 16, 2026
Viaarxiv icon

The Attack and Defense Landscape of Agentic AI: A Comprehensive Survey

Add code
Mar 11, 2026
Viaarxiv icon

Strategy Executability in Mathematical Reasoning: Leveraging Human-Model Differences for Effective Guidance

Add code
Feb 26, 2026
Viaarxiv icon

dLLM: Simple Diffusion Language Modeling

Add code
Feb 26, 2026
Viaarxiv icon

IMMACULATE: A Practical LLM Auditing Framework via Verifiable Computation

Add code
Feb 26, 2026
Viaarxiv icon

OpenSage: Self-programming Agent Generation Engine

Add code
Feb 18, 2026
Viaarxiv icon

Unsafer in Many Turns: Benchmarking and Defending Multi-Turn Safety Risks in Tool-Using Agents

Add code
Feb 13, 2026
Viaarxiv icon

Autonomous Continual Learning of Computer-Use Agents for Environment Adaptation

Add code
Feb 10, 2026
Viaarxiv icon

When Benign Inputs Lead to Severe Harms: Eliciting Unsafe Unintended Behaviors of Computer-Use Agents

Add code
Feb 09, 2026
Viaarxiv icon