Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Diandian Chen

A Highly Parallel FPGA Implementation of Sparse Neural Network Training

Oct 11, 2018

Sourya Dey, Diandian Chen, Zongyang Li, Souvik Kundu, Kuan-Wen Huang, Keith M. Chugg, Peter A. Beerel

Figure 1 for A Highly Parallel FPGA Implementation of Sparse Neural Network Training

Figure 2 for A Highly Parallel FPGA Implementation of Sparse Neural Network Training

Figure 3 for A Highly Parallel FPGA Implementation of Sparse Neural Network Training

Figure 4 for A Highly Parallel FPGA Implementation of Sparse Neural Network Training

Abstract:We demonstrate an FPGA implementation of a parallel and reconfigurable architecture for sparse neural networks, capable of on-chip training and inference. The network connectivity uses pre-determined, structured sparsity to significantly reduce complexity by lowering memory and computational requirements. The architecture uses a notion of edge-processing, leading to efficient pipelining and parallelization. Moreover, the device can be reconfigured to trade off resource utilization with training time to fit networks and datasets of varying sizes. The combined effects of complexity reduction and easy reconfigurability enable significantly greater exploration of network hyperparameters and structures on-chip. As proof of concept, we show implementation results on an Artix-7 FPGA.

* An abridged version of this work was accepted as a short paper (4 pages) at ReConFig: 2018 International Conference on Reconfigurable Computing and FPGAs. This is the full version of this work

Via

Access Paper or Ask Questions