Picture for Shunfan Zheng

Shunfan Zheng

NLSR: Neuron-Level Safety Realignment of Large Language Models Against Harmful Fine-Tuning

Add code
Dec 17, 2024
Viaarxiv icon

ACE-$M^3$: Automatic Capability Evaluator for Multimodal Medical Models

Add code
Dec 16, 2024
Viaarxiv icon

A safety realignment framework via subspace-oriented model fusion for large language models

Add code
May 15, 2024
Viaarxiv icon