Picture for Yihua Zhang

Yihua Zhang

Downgrade to Upgrade: Optimizer Simplification Enhances Robustness in LLM Unlearning

Add code
Oct 02, 2025
Figure 1 for Downgrade to Upgrade: Optimizer Simplification Enhances Robustness in LLM Unlearning
Figure 2 for Downgrade to Upgrade: Optimizer Simplification Enhances Robustness in LLM Unlearning
Figure 3 for Downgrade to Upgrade: Optimizer Simplification Enhances Robustness in LLM Unlearning
Figure 4 for Downgrade to Upgrade: Optimizer Simplification Enhances Robustness in LLM Unlearning
Viaarxiv icon

Reasoning Model Unlearning: Forgetting Traces, Not Just Answers, While Preserving Reasoning Skills

Add code
Jun 15, 2025
Viaarxiv icon

When is Task Vector Provably Effective for Model Editing? A Generalization Analysis of Nonlinear Transformers

Add code
Apr 15, 2025
Viaarxiv icon

Safety Mirage: How Spurious Correlations Undermine VLM Safety Fine-tuning

Add code
Mar 14, 2025
Viaarxiv icon

Towards LLM Unlearning Resilient to Relearning Attacks: A Sharpness-Aware Minimization Perspective and Beyond

Add code
Feb 07, 2025
Viaarxiv icon

Forget Vectors at Play: Universal Input Perturbations Driving Machine Unlearning in Image Classification

Add code
Dec 21, 2024
Viaarxiv icon

UOE: Unlearning One Expert Is Enough For Mixture-of-experts LLMS

Add code
Nov 27, 2024
Figure 1 for UOE: Unlearning One Expert Is Enough For Mixture-of-experts LLMS
Figure 2 for UOE: Unlearning One Expert Is Enough For Mixture-of-experts LLMS
Figure 3 for UOE: Unlearning One Expert Is Enough For Mixture-of-experts LLMS
Figure 4 for UOE: Unlearning One Expert Is Enough For Mixture-of-experts LLMS
Viaarxiv icon

Edit Away and My Face Will not Stay: Personal Biometric Defense against Malicious Generative Editing

Add code
Nov 25, 2024
Figure 1 for Edit Away and My Face Will not Stay: Personal Biometric Defense against Malicious Generative Editing
Figure 2 for Edit Away and My Face Will not Stay: Personal Biometric Defense against Malicious Generative Editing
Figure 3 for Edit Away and My Face Will not Stay: Personal Biometric Defense against Malicious Generative Editing
Figure 4 for Edit Away and My Face Will not Stay: Personal Biometric Defense against Malicious Generative Editing
Viaarxiv icon

WAGLE: Strategic Weight Attribution for Effective and Modular Unlearning in Large Language Models

Add code
Oct 23, 2024
Viaarxiv icon

Pruning then Reweighting: Towards Data-Efficient Training of Diffusion Models

Add code
Sep 27, 2024
Viaarxiv icon