Marco Christiani

Concept-ROT: Poisoning Concepts in Large Language Models with Model Editing

Dec 17, 2024
Via arXiv