Nathalie Baracaldo

MAP: Multi-Human-Value Alignment Palette

Oct 24, 2024

WAGLE: Strategic Weight Attribution for Effective and Modular Unlearning in Large Language Models

Oct 23, 2024

Mitigating Forgetting in LLM Supervised Fine-Tuning and Preference Learning

Oct 20, 2024

Turning Generative Models Degenerate: The Power of Data Poisoning Attacks

Jul 18, 2024

Split, Unlearn, Merge: Leveraging Data Attributes for More Effective Unlearning in LLMs

Jun 17, 2024

Rethinking Machine Unlearning for Large Language Models

Feb 15, 2024

Enhancing In-context Learning via Linear Probe Calibration

Jan 22, 2024

FairSISA: Ensemble Post-Processing to Improve Fairness of Unlearning in LLMs

Dec 12, 2023

Forcing Generative Models to Degenerate Ones: The Power of Data Poisoning Attacks

Dec 07, 2023

Privacy-Preserving Federated Learning over Vertically and Horizontally Partitioned Data for Financial Anomaly Detection

Oct 30, 2023