Title: Efficient Inference of CNNs via Channel Pruning

Aug 08, 2019

Boyu Zhang, Azadeh Davoodi, Yu Hen Hu

Abstract: The deployment of Convolutional Neural Networks (CNNs) on resource-constrained platforms such as mobile devices and embedded systems has been greatly hindered by their high implementation cost, which has motivated a lot of research interest in compressing and accelerating trained CNN models. Among the various techniques proposed in the literature, structured pruning, especially channel pruning, has gained a lot of attention because 1) it performs well in reducing memory, computation, and energy; and 2) it is friendly to existing hardware and software libraries. In this paper, we investigate the intermediate results of convolutional layers and present a novel pivoted QR factorization based channel pruning technique that can prune any specified number of input channels of any layer. We also explore more pruning opportunities in ResNet-like architectures by applying two tweaks to our technique. Experimental results on VGG-16 and ResNet-50 models with the ImageNet ILSVRC 2012 dataset show 4.29X and 2.84X computation reduction, respectively, while sacrificing only about 1.40% top-5 accuracy. Compared to many prior works, the pruned models produced by our technique require up to 47.7% less computation while still achieving higher accuracy.
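
To make the idea of pivoted-QR channel selection concrete, here is a minimal sketch. It is not the authors' procedure, which the abstract does not spell out: it assumes each input channel of a layer is summarized by one flattened response vector, and it uses a greedy column-pivoted Gram-Schmidt pass (the same pivot rule as a column-pivoted QR factorization) to choose which channels to keep. The function name `select_channels`, the `responses` layout, and the toy data are all hypothetical.

```python
import math


def select_channels(responses, keep):
    """Return indices of up to `keep` channels chosen by column pivoting.

    `responses[j]` is the flattened response vector of channel j
    (an assumed layout for illustration, not taken from the paper).
    """
    residual = [[float(x) for x in col] for col in responses]
    remaining = list(range(len(residual)))
    kept = []
    for _ in range(min(keep, len(remaining))):
        # Pivot rule: take the channel with the largest residual norm,
        # i.e. the one least explained by the channels kept so far.
        norms = {j: sum(x * x for x in residual[j]) for j in remaining}
        pivot = max(remaining, key=norms.get)
        if norms[pivot] == 0.0:
            break  # every remaining channel is spanned by the kept ones
        kept.append(pivot)
        remaining.remove(pivot)
        scale = math.sqrt(norms[pivot])
        q = [x / scale for x in residual[pivot]]
        # Remove the pivot direction from every channel still in play.
        for j in remaining:
            dot = sum(a * b for a, b in zip(q, residual[j]))
            residual[j] = [a - dot * b for a, b in zip(residual[j], q)]
    return kept


# Toy data: channel 0 is a scaled copy of channel 1, so it is redundant.
responses = [[1, 0, 0], [2, 0, 0], [0, 1, 0], [0, 0, 0.5]]
kept = select_channels(responses, keep=2)
pruned = [j for j in range(len(responses)) if j not in kept]
print("keep:", kept, "prune:", pruned)
```

Because the number of channels to keep is an argument, the same routine can prune any requested number of input channels. In practice a library routine such as SciPy's `scipy.linalg.qr` with `pivoting=True` would usually replace the hand-written loop, and the kept channels' weights would typically be adjusted afterwards to compensate for the removed ones.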