Weikai Lu

Eraser: Jailbreaking Defense in Large Language Models via Unlearning Harmful Knowledge

Apr 08, 2024
Via arXiv