Title: Regularizing Recurrent Networks - On Injected Noise and Norm-based Methods

Oct 21, 2014

Saahil Ognawala, Justin Bayer

Abstract: Advances in parallel processing have led to a surge in applications of multilayer perceptrons (MLPs) and deep learning over the past decades. Recurrent Neural Networks (RNNs) add representational power to feedforward MLPs by providing a way to treat sequential data. However, RNNs are hard to train with conventional error backpropagation because of the difficulty of relating inputs over many time steps. Regularization approaches from the MLP sphere, such as dropout and noisy weight training, have been insufficiently applied and tested on simple RNNs. Moreover, solutions have been proposed to improve convergence in RNNs, but not enough has been done to improve their ability to remember long-term dependencies. In this study, we aim to empirically evaluate the remembering and generalization ability of RNNs on polyphonic musical datasets. The models are trained with injected noise, random dropout, and norm-based regularizers, and their respective performances are compared to well-initialized plain RNNs and advanced regularization methods like fast-dropout. We conclude with evidence that training with noise does not improve performance, contrary to what a few earlier works in RNN optimization conjectured.
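For readers unfamiliar with the three regularizer families the abstract names, the following is a minimal illustrative sketch in plain Python, not the authors' implementation. It assumes a single-layer tanh RNN, zero-mean Gaussian noise resampled on the weights at every time step, inverted dropout applied to the hidden state, and a squared L2 norm penalty. All function names and parameters (`rnn_forward`, `sigma`, `p_drop`, `lam`) are hypothetical and chosen only for this example.

```python
import math
import random


def matvec(W, x):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * v for w, v in zip(row, x)) for row in W]


def add_gaussian_noise(W, sigma, rng):
    """Return a copy of W with zero-mean Gaussian noise added to each weight."""
    return [[w + rng.gauss(0.0, sigma) for w in row] for row in W]


def dropout(h, p_drop, rng):
    """Inverted dropout: zero each unit with probability p_drop, rescale the rest."""
    if p_drop <= 0.0:
        return list(h)
    keep = 1.0 - p_drop
    return [v / keep if rng.random() < keep else 0.0 for v in h]


def l2_penalty(matrices, lam):
    """Norm-based regularizer: lam times the sum of squared weights."""
    return lam * sum(w * w for W in matrices for row in W for w in row)


def rnn_forward(xs, W_in, W_rec, sigma=0.0, p_drop=0.0, rng=None):
    """Run a tanh RNN over the sequence xs and return the hidden states.

    Assumptions of this sketch: noise is resampled at every time step, and
    the dropped-out hidden state is the one fed back into the recurrence.
    """
    if rng is None:
        rng = random.Random(0)
    h = [0.0] * len(W_rec)
    states = []
    for x in xs:
        Wi = add_gaussian_noise(W_in, sigma, rng) if sigma > 0 else W_in
        Wr = add_gaussian_noise(W_rec, sigma, rng) if sigma > 0 else W_rec
        pre = [a + b for a, b in zip(matvec(Wi, x), matvec(Wr, h))]
        h = dropout([math.tanh(v) for v in pre], p_drop, rng)
        states.append(h)
    return states
```

In a training loop, the value returned by `l2_penalty` would be added to the task loss, while `sigma` and `p_drop` would be set to zero at evaluation time so that predictions are deterministic.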