Picture for Scott Goodfriend

Scott Goodfriend

Constitutional Classifiers: Defending against Universal Jailbreaks across Thousands of Hours of Red Teaming

Add code
Jan 31, 2025
Viaarxiv icon

A Competition Winning Deep Reinforcement Learning Agent in microRTS

Add code
Feb 12, 2024
Viaarxiv icon