Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Knowledge in Superposition: Unveiling the Failures of Lifelong Knowledge Editing for Large Language Models

Aug 14, 2024

Chenhui Hu, Pengfei Cao, Yubo Chen, Kang Liu, Jun Zhao

Figure 1 for Knowledge in Superposition: Unveiling the Failures of Lifelong Knowledge Editing for Large Language Models

Figure 2 for Knowledge in Superposition: Unveiling the Failures of Lifelong Knowledge Editing for Large Language Models

Figure 3 for Knowledge in Superposition: Unveiling the Failures of Lifelong Knowledge Editing for Large Language Models

Figure 4 for Knowledge in Superposition: Unveiling the Failures of Lifelong Knowledge Editing for Large Language Models

Share this with someone who'll enjoy it:

Abstract:Knowledge editing aims to update outdated or incorrect knowledge in large language models (LLMs). However, current knowledge editing methods have limited scalability for lifelong editing. This study explores the fundamental reason why knowledge editing fails in lifelong editing. We begin with the closed-form solution derived from linear associative memory, which underpins state-of-the-art knowledge editing methods. We extend the solution from single editing to lifelong editing, and through rigorous mathematical derivation, identify an interference term in the final solution, suggesting that editing knowledge may impact irrelevant knowledge. Further analysis of the interference term reveals a close relationship with superposition between knowledge representations. When knowledge superposition does not exist in language models, the interference term vanishes, allowing for lossless knowledge editing. Experiments across numerous language models reveal that knowledge superposition is universal, exhibiting high kurtosis, zero mean, and heavy-tailed distributions with clear scaling laws. Ultimately, by combining theory and experiments, we demonstrate that knowledge superposition is the fundamental reason for the failure of lifelong editing. Moreover, this is the first study to investigate knowledge editing from the perspective of superposition and provides a comprehensive observation of superposition across numerous real-world language models. Code available at https://github.com/ChenhuiHu/knowledge_in_superposition.

View paper on

Share this with someone who'll enjoy it:

Title:Knowledge in Superposition: Unveiling the Failures of Lifelong Knowledge Editing for Large Language Models

Paper and Code