Xinwei Wu

IRCAN: Mitigating Knowledge Conflicts in LLM Generation via Identifying and Reweighting Context-Aware Neurons

Jun 26, 2024
Via arXiv

ConTrans: Weak-to-Strong Alignment Engineering via Concept Transplantation

May 22, 2024
Via arXiv

Exploring Multilingual Human Value Concepts in Large Language Models: Is Value Alignment Consistent, Transferable and Controllable across Languages?

Feb 28, 2024
Via arXiv

DEPN: Detecting and Editing Privacy Neurons in Pretrained Language Models

Oct 31, 2023
Via arXiv

Large Language Model Alignment: A Survey

Sep 26, 2023
Via arXiv

Swing Distillation: A Privacy-Preserving Knowledge Distillation Framework

Dec 16, 2022
Via arXiv

FewFedWeight: Few-shot Federated Learning Framework across Multiple NLP Tasks

Dec 16, 2022
Via arXiv