Keltin Grimes

Concept-ROT: Poisoning Concepts in Large Language Models with Model Editing

Dec 17, 2024
Via arXiv

Gone but Not Forgotten: Improved Benchmarks for Machine Unlearning

May 29, 2024
Via arXiv

The SaTML '24 CNN Interpretability Competition: New Innovations for Concept-Level Interpretability

Apr 03, 2024
Via arXiv