What's in Your "Safe" Data?: Identifying Benign Data that Breaks Safety

Apr 01, 2024

