This paper presents a generic convolutional neural network accelerator (CNNA) for a system-on-chip (SoC) design. The goal was to accelerate inference of different deep learning networks on an embedded SoC platform. The presented CNNA has a scalable architecture that uses high-level synthesis (HLS) and SystemC to implement the hardware accelerator. It can accelerate any CNN exported from Python and supports a combination of convolutional, max-pooling, and fully connected layers. A training method using fixed-point quantized weights is also proposed. The CNNA is template-based, enabling it to scale to different targets of the Xilinx ZYNQ platform. This approach enables design space exploration, making it possible to evaluate several configurations of the CNNA during C- and RTL-simulation and to fit it to the desired platform and model. The convolutional neural network VGG16 was used to test the solution on a Xilinx Ultra96 board. Training with an auto-scaled fixed-point Q2.14 format achieved high accuracy compared to a similar floating-point model. The accelerator performed inference in 2.0 s with an average power consumption of 2.63 W, corresponding to a power efficiency of 6.0 GOPS/W for the CNN accelerator.
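To make the weight format concrete, the following is a minimal sketch of Q2.14 quantization, not the authors' implementation: it assumes the common convention of two integer bits (including sign) and 14 fractional bits in a 16-bit word, giving a representable range of [-2, 2 - 2^-14]; the function names and the use of NumPy are illustrative assumptions.

```python
import numpy as np

FRAC_BITS = 14            # assumed Q2.14: 2 integer bits (incl. sign) + 14 fractional bits
SCALE = 1 << FRAC_BITS    # 2**14
QMIN, QMAX = -(1 << 15), (1 << 15) - 1  # 16-bit signed integer range

def quantize_q2_14(w):
    """Round floats to the nearest Q2.14 value, saturating at the
    representable range [-2.0, 2.0 - 2**-14]."""
    return np.clip(np.round(w * SCALE), QMIN, QMAX).astype(np.int16)

def dequantize_q2_14(q):
    """Map Q2.14 integers back to floats, e.g. for quantization-aware training."""
    return q.astype(np.float32) / SCALE

# Usage: round-trip error stays within half an LSB (2**-15)
w = np.array([0.5, -1.25, 1.9999], dtype=np.float32)
print(dequantize_q2_14(quantize_q2_14(w)))
```

An auto-scaled variant, as referenced in the abstract, would additionally choose a per-layer scale so that the weight distribution fills the fixed-point range; the details of that scaling are described in the paper itself.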