Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Memoria: Hebbian Memory Architecture for Human-Like Sequential Processing

Oct 04, 2023

Sangjun Park, JinYeong Bak

Share this with someone who'll enjoy it:

Abstract:Transformers have demonstrated their success in various domains and tasks. However, Transformers struggle with long input sequences due to their limited capacity. While one solution is to increase input length, endlessly stretching the length is unrealistic. Furthermore, humans selectively remember and use only relevant information from inputs, unlike Transformers which process all raw data from start to end. We introduce Memoria, a general memory network that applies Hebbian theory which is a major theory explaining human memory formulation to enhance long-term dependencies in neural networks. Memoria stores and retrieves information called engram at multiple memory levels of working memory, short-term memory, and long-term memory, using connection weights that change according to Hebb's rule. Through experiments with popular Transformer-based models like BERT and GPT, we present that Memoria significantly improves the ability to consider long-term dependencies in various tasks. Results show that Memoria outperformed existing methodologies in sorting and language modeling, and long text classification.

* Under review as a conference paper at ICLR 2024. 20 pages, 9 figures, 5 tables

View paper on

Share this with someone who'll enjoy it:

Title:Memoria: Hebbian Memory Architecture for Human-Like Sequential Processing

Paper and Code