
Ryan L. McAvoy

Learning to Remember, Forget and Ignore using Attention Control in Memory

Sep 28, 2018

T. S. Jayram, Younes Bouhadjar, Ryan L. McAvoy, Tomasz Kornuta, Alexis Asseman, Kamil Rocki, Ahmet S. Ozcan

Abstract: Typical neural networks with external memory do not effectively separate capacity for episodic and working memory, as is required for reasoning in humans. Applying knowledge gained from psychological studies, we designed a new model called Differentiable Working Memory (DWM) to specifically emulate human working memory. Because it shows the same functional characteristics as working memory, it robustly learns psychology-inspired tasks and converges faster than comparable state-of-the-art models. Moreover, the DWM model successfully generalizes to sequences two orders of magnitude longer than the ones used in training. Our in-depth analysis shows that the behavior of DWM is interpretable and that it learns to have fine control over memory, allowing it to retain, ignore or forget information based on its relevance.

* 20 pages

Using Multi-task and Transfer Learning to Solve Working Memory Tasks

Sep 28, 2018

T. S. Jayram, Tomasz Kornuta, Ryan L. McAvoy, Ahmet S. Ozcan

Abstract: We propose a new architecture called Memory-Augmented Encoder-Solver (MAES) that enables transfer learning to solve complex working memory tasks adapted from cognitive psychology. It uses dual recurrent neural network controllers, one inside the encoder and one inside the solver, that interface with a shared memory module, and it is completely differentiable. We study different types of encoders in a systematic manner and demonstrate a unique advantage of multi-task learning in obtaining the best possible encoder. We show by extensive experimentation that the trained MAES models achieve task-size generalization, i.e., they are capable of handling sequential inputs 50 times longer than seen during training, with appropriately large memory modules. We demonstrate that MAES far outperforms existing and well-known models such as the LSTM, NTM and DNC on the entire suite of tasks.

* 16 pages
