Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Paul Merolla

Structured Convolution Matrices for Energy-efficient Deep learning

Jun 08, 2016

Rathinakumar Appuswamy, Tapan Nayak, John Arthur, Steven Esser, Paul Merolla, Jeffrey Mckinstry, Timothy Melano, Myron Flickner, Dharmendra Modha

Figure 1 for Structured Convolution Matrices for Energy-efficient Deep learning

Figure 2 for Structured Convolution Matrices for Energy-efficient Deep learning

Figure 3 for Structured Convolution Matrices for Energy-efficient Deep learning

Figure 4 for Structured Convolution Matrices for Energy-efficient Deep learning

Abstract:We derive a relationship between network representation in energy-efficient neuromorphic architectures and block Toplitz convolutional matrices. Inspired by this connection, we develop deep convolutional networks using a family of structured convolutional matrices and achieve state-of-the-art trade-off between energy efficiency and classification accuracy for well-known image recognition tasks. We also put forward a novel method to train binary convolutional networks by utilising an existing connection between noisy-rectified linear units and binary activations.

Via

Access Paper or Ask Questions

Deep neural networks are robust to weight binarization and other non-linear distortions

Jun 07, 2016

Paul Merolla, Rathinakumar Appuswamy, John Arthur, Steve K. Esser, Dharmendra Modha

Figure 1 for Deep neural networks are robust to weight binarization and other non-linear distortions

Figure 2 for Deep neural networks are robust to weight binarization and other non-linear distortions

Figure 3 for Deep neural networks are robust to weight binarization and other non-linear distortions

Figure 4 for Deep neural networks are robust to weight binarization and other non-linear distortions

Abstract:Recent results show that deep neural networks achieve excellent performance even when, during training, weights are quantized and projected to a binary representation. Here, we show that this is just the tip of the iceberg: these same networks, during testing, also exhibit a remarkable robustness to distortions beyond quantization, including additive and multiplicative noise, and a class of non-linear projections where binarization is just a special case. To quantify this robustness, we show that one such network achieves 11% test error on CIFAR-10 even with 0.68 effective bits per weight. Furthermore, we find that a common training heuristic--namely, projecting quantized weights during backpropagation--can be altered (or even removed) and networks still achieve a base level of robustness during testing. Specifically, training with weight projections other than quantization also works, as does simply clipping the weights, both of which have never been reported before. We confirm our results for CIFAR-10 and ImageNet datasets. Finally, drawing from these ideas, we propose a stochastic projection rule that leads to a new state of the art network with 7.64% test error on CIFAR-10 using no data augmentation.

Via

Access Paper or Ask Questions

TrueHappiness: Neuromorphic Emotion Recognition on TrueNorth

Jan 16, 2016

Peter U. Diehl, Bruno U. Pedroni, Andrew Cassidy, Paul Merolla, Emre Neftci, Guido Zarrella

Figure 1 for TrueHappiness: Neuromorphic Emotion Recognition on TrueNorth

Figure 2 for TrueHappiness: Neuromorphic Emotion Recognition on TrueNorth

Figure 3 for TrueHappiness: Neuromorphic Emotion Recognition on TrueNorth

Figure 4 for TrueHappiness: Neuromorphic Emotion Recognition on TrueNorth

Abstract:We present an approach to constructing a neuromorphic device that responds to language input by producing neuron spikes in proportion to the strength of the appropriate positive or negative emotional response. Specifically, we perform a fine-grained sentiment analysis task with implementations on two different systems: one using conventional spiking neural network (SNN) simulators and the other one using IBM's Neurosynaptic System TrueNorth. Input words are projected into a high-dimensional semantic space and processed through a fully-connected neural network (FCNN) containing rectified linear units trained via backpropagation. After training, this FCNN is converted to a SNN by substituting the ReLUs with integrate-and-fire neurons. We show that there is practically no performance loss due to conversion to a spiking network on a sentiment analysis test set, i.e. correlations between predictions and human annotations differ by less than 0.02 comparing the original DNN and its spiking equivalent. Additionally, we show that the SNN generated with this technique can be mapped to existing neuromorphic hardware -- in our case, the TrueNorth chip. Mapping to the chip involves 4-bit synaptic weight discretization and adjustment of the neuron thresholds. The resulting end-to-end system can take a user input, i.e. a word in a vocabulary of over 300,000 words, and estimate its sentiment on TrueNorth with a power consumption of approximately 50 uW.

Via

Access Paper or Ask Questions

Gibbs Sampling with Low-Power Spiking Digital Neurons

Mar 27, 2015

Srinjoy Das, Bruno Umbria Pedroni, Paul Merolla, John Arthur, Andrew S. Cassidy, Bryan L. Jackson, Dharmendra Modha, Gert Cauwenberghs, Ken Kreutz-Delgado

Figure 1 for Gibbs Sampling with Low-Power Spiking Digital Neurons

Figure 2 for Gibbs Sampling with Low-Power Spiking Digital Neurons

Figure 3 for Gibbs Sampling with Low-Power Spiking Digital Neurons

Figure 4 for Gibbs Sampling with Low-Power Spiking Digital Neurons

Abstract:Restricted Boltzmann Machines and Deep Belief Networks have been successfully used in a wide variety of applications including image classification and speech recognition. Inference and learning in these algorithms uses a Markov Chain Monte Carlo procedure called Gibbs sampling. A sigmoidal function forms the kernel of this sampler which can be realized from the firing statistics of noisy integrate-and-fire neurons on a neuromorphic VLSI substrate. This paper demonstrates such an implementation on an array of digital spiking neurons with stochastic leak and threshold properties for inference tasks and presents some key performance metrics for such a hardware-based sampler in both the generative and discriminative contexts.

* Accepted at ISCAS 2015

Via

Access Paper or Ask Questions

The thermodynamic temperature of a rhythmic spiking network

Sep 28, 2010

Paul Merolla, Tristan Ursell, John Arthur

Figure 1 for The thermodynamic temperature of a rhythmic spiking network

Figure 2 for The thermodynamic temperature of a rhythmic spiking network

Figure 3 for The thermodynamic temperature of a rhythmic spiking network

Figure 4 for The thermodynamic temperature of a rhythmic spiking network

Abstract:Artificial neural networks built from two-state neurons are powerful computational substrates, whose computational ability is well understood by analogy with statistical mechanics. In this work, we introduce similar analogies in the context of spiking neurons in a fixed time window, where excitatory and inhibitory inputs drawn from a Poisson distribution play the role of temperature. For single neurons with a "bandgap" between their inputs and the spike threshold, this temperature allows for stochastic spiking. By imposing a global inhibitory rhythm over the fixed time windows, we connect neurons into a network that exhibits synchronous, clock-like updating akin to neural networks. We implement a single-layer Boltzmann machine without learning to demonstrate our model.

Via

Access Paper or Ask Questions