Fenghua Weng

DELMAN: Dynamic Defense Against Large Language Model Jailbreaking with Model Editing

Feb 17, 2025
Via arXiv