Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Zhuopin Xu

Reduce Computational Complexity for Convolutional Layers by Skipping Zeros

Jul 12, 2023

Zhiyi Zhang, Pengfei Zhang, Zhuopin Xu, Qi Wang

Figure 1 for Reduce Computational Complexity for Convolutional Layers by Skipping Zeros

Figure 2 for Reduce Computational Complexity for Convolutional Layers by Skipping Zeros

Figure 3 for Reduce Computational Complexity for Convolutional Layers by Skipping Zeros

Figure 4 for Reduce Computational Complexity for Convolutional Layers by Skipping Zeros

Abstract:Deep neural networks rely on parallel processors for acceleration. To design operators for them, it requires not only good algorithm to reduce complexity, but also sufficient utilization of hardwares. Convolutional layers mainly contain 3 kinds of operators: convolution in forward propagation, deconvolution and dilated-convolution in backward propagation. When executing these operators, 0s are always added to tensors, causing redundant calculations. This paper gives C-K-S algorithm (ConvV2, KS-deconv, Sk-dilated), which skips these 0s in two ways: trim the filters to exclude padded 0s; transform sparse tensors to dense tensors, to avoid inserted 0s in deconvolution and dilated-convolution. In contrast to regular convolution, deconvolution is hard to accelerate due to its complicacy. This paper provides high-performance GPU implementations of C-K-S, and verifies their effectiveness with comparison to PyTorch. According to the experiments, C-K-S has advantages over PyTorch in certain cases, especially in deconvolution on small feature-maps. Further enhancement of C-K-S can be done by making full optimizations oriented at specific GPU architectures.

* To download the code of Dragon-Alpha and experimental datas, please go to https://github.com/GilgameshXYZ123/Dragon-Alpha

Via

Access Paper or Ask Questions