Srikanth Cherla

Linear-Time Sequence Classification using Restricted Boltzmann Machines

Mar 08, 2018

Son N. Tran, Srikanth Cherla, Artur Garcez, Tillman Weyde

Abstract: Classification of sequence data is the topic of interest for dynamic Bayesian models and Recurrent Neural Networks (RNNs). While the former can explicitly model the temporal dependencies between class variables, the latter have a capability of learning representations. Several attempts have been made to improve performance by combining these two approaches or increasing the processing capability of the hidden units in RNNs. This often results in complex models with a large number of learning parameters. In this paper, a compact model is proposed which offers both representation learning and temporal inference of class variables by rolling Restricted Boltzmann Machines (RBMs) and class variables over time. We address the key issue of intractability in this variant of RBMs by optimising a conditional distribution, instead of a joint distribution. Experiments reported in the paper on melody modelling and optical character recognition show that the proposed model can outperform the state-of-the-art. Also, the experimental results on optical character recognition, part-of-speech tagging and text chunking demonstrate that our model is comparable to recurrent neural networks with complex memory gates while requiring far fewer parameters.

Generalising the Discriminative Restricted Boltzmann Machine

Apr 06, 2016

Srikanth Cherla, Son N Tran, Tillman Weyde, Artur d'Avila Garcez

Abstract: We present a novel theoretical result that generalises the Discriminative Restricted Boltzmann Machine (DRBM). While originally the DRBM was defined assuming the {0, 1}-Bernoulli distribution in each of its hidden units, this result makes it possible to derive cost functions for variants of the DRBM that utilise other distributions, including some that are often encountered in the literature. This is illustrated with the Binomial and {-1, +1}-Bernoulli distributions here. We evaluate these two DRBM variants and compare them with the original one on three benchmark datasets, namely the MNIST and USPS digit classification datasets, and the 20 Newsgroups document classification dataset. Results show that each of the three compared models outperforms the remaining two in one of the three datasets, thus indicating that the proposed theoretical generalisation of the DRBM may be valuable in practice.

* Submitted to ECML 2016 conference track
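
To make the idea in the abstract above concrete, here is a minimal NumPy sketch of the DRBM class posterior p(y | x) for two hidden-unit choices: the original {0, 1}-Bernoulli units and the {-1, +1}-Bernoulli variant. It is an illustration under stated assumptions, not the authors' implementation: the per-unit term is obtained by summing exp(h * a) over the allowed hidden states h, and the function names, parameter names (W, U, c, d) and array shapes are hypothetical choices made for this example. The Binomial variant is left out because its exact form is not given in the abstract.

```python
import numpy as np

def log_hidden_term(a, hidden="bernoulli01"):
    """Log of the sum of exp(h * a) over the hidden states h of one unit.

    Assumption for this sketch: changing the hidden-unit distribution only
    changes this per-unit term.
    """
    if hidden == "bernoulli01":
        # h in {0, 1}: log(1 + exp(a)), i.e. softplus(a)
        return np.logaddexp(0.0, a)
    if hidden == "bernoulli_pm1":
        # h in {-1, +1}: log(exp(-a) + exp(a))
        return np.logaddexp(-a, a)
    raise ValueError(f"unknown hidden unit type: {hidden}")

def drbm_class_probs(x, W, U, c, d, hidden="bernoulli01"):
    """Return p(y | x) for every class y.

    Hypothetical shapes: x (n_visible,), W (n_hidden, n_visible),
    U (n_hidden, n_classes), c (n_hidden,), d (n_classes,).
    """
    # Activation of hidden unit j when the class is y: shape (n_hidden, n_classes)
    a = (c + W @ x)[:, None] + U
    # Unnormalised log-probability of each class
    logits = d + log_hidden_term(a, hidden).sum(axis=0)
    # Subtract the maximum before exponentiating for numerical stability
    logits = logits - logits.max()
    p = np.exp(logits)
    return p / p.sum()

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    n_visible, n_hidden, n_classes = 6, 4, 3
    x = rng.integers(0, 2, size=n_visible).astype(float)
    W = rng.normal(scale=0.1, size=(n_hidden, n_visible))
    U = rng.normal(scale=0.1, size=(n_hidden, n_classes))
    c = np.zeros(n_hidden)
    d = np.zeros(n_classes)
    for kind in ("bernoulli01", "bernoulli_pm1"):
        print(kind, drbm_class_probs(x, W, U, c, d, hidden=kind))
```

Training a classifier on top of this would minimise the negative log of the returned probability for the correct class, so swapping the hidden-unit type changes the cost function only through `log_hidden_term`.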