Alireza Khadem

CoDR: Computation and Data Reuse Aware CNN Accelerator

Apr 20, 2021

Alireza Khadem, Haojie Ye, Trevor Mudge

Abstract: Computation and data reuse are critical for resource-limited Convolutional Neural Network (CNN) accelerators. This paper presents Universal Computation Reuse, which exploits weight sparsity, repetition, and similarity simultaneously in a convolutional layer. Moreover, CoDR reduces the cost of weight memory accesses with a customized Run-Length Encoding scheme, and reduces the number of memory accesses to intermediate results with an input- and output-stationary dataflow. Compared to two recent compressed CNN accelerators with the same area of 2.85 mm^2, CoDR decreases SRAM accesses by 5.08x and 7.99x, and consumes 3.76x and 6.84x less energy.
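
To make the idea of run-length encoding sparse weights concrete, here is a minimal sketch of a generic zero-run encoder. It is not CoDR's customized scheme, whose details the abstract does not give; the function names and the `(zero_run, value)` pair layout are assumptions chosen for illustration.

```python
from typing import List, Tuple


def rle_encode_zeros(weights: List[int]) -> List[Tuple[int, int]]:
    """Encode weights as (zero_run_length, nonzero_value) pairs.

    Hypothetical layout: each pair stores how many zeros precede a
    nonzero weight. Trailing zeros are stored as (run_length, 0).
    """
    encoded: List[Tuple[int, int]] = []
    run = 0
    for w in weights:
        if w == 0:
            run += 1
        else:
            encoded.append((run, w))
            run = 0
    if run:
        encoded.append((run, 0))  # marker pair for trailing zeros
    return encoded


def rle_decode_zeros(encoded: List[Tuple[int, int]]) -> List[int]:
    """Invert rle_encode_zeros."""
    weights: List[int] = []
    for run, value in encoded:
        weights.extend([0] * run)
        if value != 0:
            weights.append(value)
    return weights


weights = [0, 0, 3, 0, 5, 0, 0, 0]
encoded = rle_encode_zeros(weights)
restored = rle_decode_zeros(encoded)
```

In this sketch, eight weights shrink to three pairs, so a sparse filter needs fewer memory words to fetch; the actual savings in hardware depend on the bit widths chosen for the run and value fields.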

Design Challenges of Neural Network Acceleration Using Stochastic Computing

Jun 08, 2020

Alireza Khadem

Abstract: The enormous and ever-increasing complexity of state-of-the-art neural networks (NNs) has impeded the deployment of deep learning on resource-limited devices such as those of the Internet of Things (IoT). Stochastic computing exploits the inherent tolerance of NNs to approximation to reduce their energy and area footprint, two critical requirements of small embedded devices suitable for the IoT. This report evaluates and compares two recently proposed stochastic-based NN designs, referred to as BISC (Binary Interfaced Stochastic Computing) by Sim and Lee, 2017, and ESL (Extended Stochastic Logic) by Canals et al., 2016. Using analysis and simulation, we compare three distinct implementations of these designs in terms of performance, power consumption, area, and accuracy. We also discuss the overall challenges faced in adopting stochastic computing for building NNs. We find that BISC outperforms the other architectures when executing the LeNet-5 NN model applied to the MNIST digit recognition dataset. Our analysis and simulation experiments indicate that this architecture is around 50X faster, occupies 5.7X and 2.9X less area, and consumes 7.8X and 1.8X less power than the two ESL architectures.
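
As background on why stochastic computing can shrink arithmetic hardware, the sketch below shows the textbook unipolar encoding, in which a value in [0, 1] is the fraction of 1s in a random bitstream and multiplication reduces to a bitwise AND. It is not the BISC or ESL design evaluated in the report; the function names, stream length, and seed are assumptions for illustration.

```python
import random
from typing import List


def to_stream(p: float, length: int, rng: random.Random) -> List[int]:
    """Encode p in [0, 1] as a bitstream where each bit is 1 with probability p."""
    return [1 if rng.random() < p else 0 for _ in range(length)]


def stream_value(bits: List[int]) -> float:
    """Decode a unipolar stream: the fraction of 1s estimates the value."""
    return sum(bits) / len(bits)


def sc_multiply(a: List[int], b: List[int]) -> List[int]:
    """Multiply two independent unipolar streams with a bitwise AND."""
    return [x & y for x, y in zip(a, b)]


rng = random.Random(0)  # fixed seed so the sketch is repeatable
a = to_stream(0.5, 4096, rng)
b = to_stream(0.25, 4096, rng)
estimate = stream_value(sc_multiply(a, b))  # approximates 0.5 * 0.25
```

The result is only an estimate of 0.125: accuracy improves with longer streams, which is one source of the trade-off between latency and precision in stochastic designs.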
