Abstract: We derive a relationship between network representation in energy-efficient neuromorphic architectures and block Toeplitz convolutional matrices. Inspired by this connection, we develop deep convolutional networks using a family of structured convolutional matrices and achieve a state-of-the-art trade-off between energy efficiency and classification accuracy on well-known image recognition tasks. We also put forward a novel method to train binary convolutional networks by utilising an existing connection between noisy rectified linear units and binary activations.
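The link between convolution and Toeplitz structure can be made concrete in one dimension. The sketch below is my own illustration, not the paper's construction: it builds the Toeplitz matrix of a 1D kernel with NumPy/SciPy and checks that its product with an input vector reproduces the full convolution; applying the same idea to 2D convolutions yields the doubly block Toeplitz matrices referenced in the abstract.

```python
import numpy as np
from scipy.linalg import toeplitz

def conv_as_toeplitz(kernel, n_in):
    """Return the (n_in + len(kernel) - 1) x n_in Toeplitz matrix T
    such that T @ x equals the full 1D convolution of kernel with x."""
    k = np.asarray(kernel, dtype=float)
    n_out = n_in + len(k) - 1
    first_col = np.concatenate([k, np.zeros(n_out - len(k))])
    first_row = np.concatenate([k[:1], np.zeros(n_in - 1)])
    return toeplitz(first_col, first_row)

x = np.array([1.0, 2.0, 3.0, 4.0])
h = np.array([1.0, -1.0, 0.5])
T = conv_as_toeplitz(h, len(x))
assert np.allclose(T @ x, np.convolve(h, x))  # matrix-vector product == convolution
```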
Abstract: Recent results show that deep neural networks achieve excellent performance even when, during training, weights are quantized and projected to a binary representation. Here, we show that this is just the tip of the iceberg: the same networks, during testing, also exhibit remarkable robustness to distortions beyond quantization, including additive and multiplicative noise and a class of non-linear projections of which binarization is just a special case. To quantify this robustness, we show that one such network achieves 11% test error on CIFAR-10 even with 0.68 effective bits per weight. Furthermore, we find that a common training heuristic, namely projecting quantized weights during backpropagation, can be altered or even removed while networks still achieve a base level of robustness during testing. Specifically, training with weight projections other than quantization also works, as does simply clipping the weights; neither of these has been reported before. We confirm our results on the CIFAR-10 and ImageNet datasets. Finally, drawing on these ideas, we propose a stochastic projection rule that leads to a new state-of-the-art network with 7.64% test error on CIFAR-10 using no data augmentation.
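As a rough illustration of the kind of weight projection discussed here, the sketch below implements deterministic and stochastic sign binarization of a clipped, real-valued weight tensor in NumPy. The linear probability rule and the clipping range are assumptions made for illustration, not the paper's exact stochastic projection rule.

```python
import numpy as np

def binarize_deterministic(w):
    # sign projection: every weight becomes +1 or -1
    return np.where(w >= 0.0, 1.0, -1.0)

def binarize_stochastic(w, rng):
    # probability of projecting to +1 grows linearly with the clipped weight value
    p = np.clip((w + 1.0) / 2.0, 0.0, 1.0)
    return np.where(rng.random(w.shape) < p, 1.0, -1.0)

rng = np.random.default_rng(0)
w_real = np.clip(rng.normal(scale=0.5, size=(4, 4)), -1.0, 1.0)  # clipped "shadow" weights
w_bin = binarize_stochastic(w_real, rng)                         # binary weights for the forward pass
```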
Abstract: Restricted Boltzmann Machines and Deep Belief Networks have been successfully used in a wide variety of applications, including image classification and speech recognition. Inference and learning in these algorithms use a Markov Chain Monte Carlo procedure called Gibbs sampling. A sigmoidal function forms the kernel of this sampler, which can be realized from the firing statistics of noisy integrate-and-fire neurons on a neuromorphic VLSI substrate. This paper demonstrates such an implementation on an array of digital spiking neurons with stochastic leak and threshold properties for inference tasks, and presents key performance metrics for such a hardware-based sampler in both the generative and discriminative contexts.
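For reference, the sigmoidal kernel being approximated in hardware is the one that appears in standard block Gibbs sampling of a Restricted Boltzmann Machine. The sketch below shows a single visible-to-hidden-to-visible update in NumPy; the Bernoulli sampling step is what the noisy integrate-and-fire neurons realize on the VLSI substrate. The layer sizes and parameter names are illustrative assumptions, not those of the hardware implementation.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def gibbs_step(v, W, b_h, b_v, rng):
    """One visible -> hidden -> visible block Gibbs update of an RBM."""
    p_h = sigmoid(v @ W + b_h)                 # sigmoidal kernel of the sampler
    h = (rng.random(p_h.shape) < p_h) * 1.0    # Bernoulli sample (the noisy-neuron step)
    p_v = sigmoid(h @ W.T + b_v)
    v_new = (rng.random(p_v.shape) < p_v) * 1.0
    return v_new, h

rng = np.random.default_rng(0)
W = rng.normal(scale=0.1, size=(784, 128))     # visible x hidden weights (illustrative sizes)
v = (rng.random(784) < 0.5) * 1.0              # random binary visible state
v, h = gibbs_step(v, W, np.zeros(128), np.zeros(784), rng)
```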
Abstract: Artificial neural networks built from two-state neurons are powerful computational substrates, whose computational ability is well understood by analogy with statistical mechanics. In this work, we introduce similar analogies in the context of spiking neurons operating within a fixed time window, where excitatory and inhibitory inputs drawn from a Poisson distribution play the role of temperature. For single neurons with a "bandgap" between their inputs and the spike threshold, this temperature allows for stochastic spiking. By imposing a global inhibitory rhythm over the fixed time windows, we connect neurons into a network that exhibits synchronous, clock-like updating akin to artificial neural networks. We implement a single-layer Boltzmann machine without learning to demonstrate our model.
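A toy simulation helps make the "Poisson inputs as temperature" picture concrete. The sketch below is a simplification I introduce, not the paper's neuron model: it estimates the probability that Poisson excitatory and inhibitory counts within one time window close a fixed bandgap to threshold, which traces out a sigmoid-like activation curve as the gap varies.

```python
import numpy as np

def spike_probability(bandgap, rate_exc, rate_inh, n_windows=100000, seed=0):
    """Monte Carlo estimate of P(spike) in one time window: the neuron fires
    if the net Poisson input (excitatory minus inhibitory counts) closes the
    deterministic gap to threshold."""
    rng = np.random.default_rng(seed)
    exc = rng.poisson(rate_exc, size=n_windows)
    inh = rng.poisson(rate_inh, size=n_windows)
    return np.mean(exc - inh >= bandgap)

# Sweeping the bandgap traces out a sigmoid-like curve whose steepness is set
# by the Poisson rates, i.e. the effective "temperature".
for gap in (-10, -5, 0, 5, 10):
    print(gap, spike_probability(gap, rate_exc=20.0, rate_inh=20.0))
```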