Abstract: In this paper, we propose a new Recurrent Neural Network (RNN) architecture. The novelty is simple: we use diagonal recurrent matrices instead of full ones. This results in better test likelihood and faster convergence compared to regular full RNNs in most of our experiments. We show the benefits of using diagonal recurrent matrices with the popular LSTM and GRU architectures as well as with the vanilla RNN architecture, on four standard symbolic music datasets.
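To make the core idea concrete, here is a minimal sketch of a vanilla RNN step with a diagonal recurrence, in the spirit of the abstract; the function and variable names are illustrative, not from the paper. A full recurrent matrix `W_hh` (n × n parameters) is replaced by a vector `d` of diagonal entries (n parameters), so the recurrence becomes an elementwise product and each hidden unit only sees its own past state:

```python
import numpy as np

def diagonal_rnn_step(x_t, h_prev, W_xh, d, b):
    """One step of a vanilla RNN whose recurrent matrix is diagonal.

    Instead of a full recurrence W_hh @ h_prev (n*n parameters),
    the diagonal variant uses an elementwise product d * h_prev
    (n parameters).
    """
    return np.tanh(W_xh @ x_t + d * h_prev + b)

# Toy usage: hidden size 4, input size 3.
rng = np.random.default_rng(0)
W_xh = rng.normal(size=(4, 3))
d = rng.normal(size=4)   # diagonal of the recurrent matrix
b = np.zeros(4)
h = np.zeros(4)
for t in range(5):
    h = diagonal_rnn_step(rng.normal(size=3), h, W_xh, d, b)
print(h)
```

The same substitution applies inside LSTM and GRU cells by making each of their recurrent weight matrices diagonal.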
Abstract: In this paper, we develop a parameter estimation method for factorially parametrized models such as the Factorial Gaussian Mixture Model and the Factorial Hidden Markov Model. Our contributions are two-fold. First, we show that the emission matrix of the standard Factorial Model is unidentifiable even if the true assignment matrix is known. Second, we address the issue of identifiability by making a one-component sharing assumption and derive a parameter learning algorithm for this case. Our approach is based on a dictionary learning problem of the form $X = OR$, where the goal is to learn the dictionary $O$ given the data matrix $X$. We argue that, due to the specific structure of the activation matrix $R$ in the shared-component factorial mixture model and an incoherence assumption on the shared component, it is possible to extract the columns of $O$ without alternating between the estimation of $O$ and $R$.
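The following toy sketch illustrates the $X = OR$ setup and why a structured $R$ can expose dictionary columns directly. The construction of $R$ here (shared component always active, plus exactly one other component per sample) is our reading of the abstract, and the recovery step assumes for simplicity that the shared component is already known; the paper's actual algorithm estimates it under an incoherence assumption:

```python
import numpy as np

# Illustrative shared-component factorial mixture: column 0 of O is
# the shared component, and every activation column of R turns on the
# shared component plus exactly one other dictionary column. This is
# an assumption for illustration, not the paper's exact model.
rng = np.random.default_rng(1)
n, K, T = 6, 4, 8                      # data dim, dictionary size, samples
O = rng.normal(size=(n, K))            # dictionary (column 0 is shared)
R = np.zeros((K, T))
R[0, :] = 1.0                          # shared component always active
R[rng.integers(1, K, size=T), np.arange(T)] = 1.0
X = O @ R                              # observed data matrix

# With the shared component known (a simplification), each column of X
# minus it reveals one non-shared dictionary column directly, with no
# alternating updates of O and R.
recovered = X - O[:, [0]]
active = R[1:, 0].argmax() + 1         # non-shared component of sample 0
print(np.allclose(recovered[:, 0], O[:, active]))  # True
```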