Title: Using Fast Weights to Attend to the Recent Past

Dec 05, 2016

Jimmy Ba, Geoffrey Hinton, Volodymyr Mnih, Joel Z. Leibo, Catalin Ionescu

Abstract: Until recently, research on artificial neural networks was largely restricted to systems with only two types of variable: neural activities that represent the current or recent input, and weights that learn to capture regularities among inputs, outputs and payoffs. There is no good reason for this restriction. Synapses have dynamics at many different time-scales, and this suggests that artificial neural networks might benefit from variables that change more slowly than activities but much faster than the standard weights. These "fast weights" can be used to store temporary memories of the recent past, and they provide a neurally plausible way of implementing the type of attention to the past that has recently proved very helpful in sequence-to-sequence models. By using fast weights we can avoid the need to store copies of neural activity patterns.
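
The last claim of the abstract can be illustrated with a small numerical sketch. The abstract does not state an update rule, so this example assumes a decayed Hebbian outer-product update, `A <- decay * A + lr * outer(h, h)`. The function names, the hidden size, and the values of `decay` and `lr` are all hypothetical choices made for illustration. Under that assumption, multiplying the fast weight matrix by a query vector gives the same result as attending over explicitly stored past hidden states, each weighted by its decayed dot product with the query.

```python
import random

def outer(u, v):
    """Outer product of two vectors as a list of rows."""
    return [[ui * vj for vj in v] for ui in u]

def matvec(M, v):
    """Matrix-vector product."""
    return [sum(m * x for m, x in zip(row, v)) for row in M]

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def update_fast_weights(A, h, decay, lr):
    """Assumed update rule: A <- decay * A + lr * outer(h, h)."""
    hh = outer(h, h)
    return [[decay * a + lr * b for a, b in zip(row_a, row_b)]
            for row_a, row_b in zip(A, hh)]

def attend_with_stored_states(past, query, decay, lr):
    """Explicit attention over stored states.

    Each past state h is weighted by lr * decay^(age) * (h . query),
    where age is 0 for the most recent state.
    """
    out = [0.0] * len(query)
    last = len(past) - 1
    for tau, h in enumerate(past):
        weight = lr * decay ** (last - tau) * dot(h, query)
        out = [o + weight * hi for o, hi in zip(out, h)]
    return out

random.seed(0)
H = 4                  # hidden size (illustrative)
decay, lr = 0.9, 0.5   # illustrative values, not taken from the paper

# A short history of hidden states and a query vector.
past = [[random.uniform(-1, 1) for _ in range(H)] for _ in range(5)]
query = [random.uniform(-1, 1) for _ in range(H)]

# Fast-weight route: fold every state into one H x H matrix, then discard it.
A = [[0.0] * H for _ in range(H)]
for h in past:
    A = update_fast_weights(A, h, decay, lr)
via_fast_weights = matvec(A, query)

# Stored-state route: keep all past states and attend over them.
via_stored_states = attend_with_stored_states(past, query, decay, lr)

max_gap = max(abs(a - b) for a, b in zip(via_fast_weights, via_stored_states))
print("largest difference between the two routes:", max_gap)
```

The two routes agree up to floating-point rounding, yet the fast-weight route keeps only a single `H x H` matrix regardless of how many past states were seen, whereas the stored-state route keeps every past vector.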

* Added the [Schmidhuber 1993] citation to the last paragraph of the introduction. Fixed a typo in Appendix A.1: the uniform initialization is corrected to 1/\sqrt{H}.
