Abstract: This paper aims to accelerate the test-time computation of convolutional neural networks (CNNs), especially very deep CNNs that have substantially impacted the computer vision community. Unlike previous methods that are designed for approximating linear filters or linear responses, our method takes the nonlinear units into account. We minimize the reconstruction error of the nonlinear responses, subject to a low-rank constraint that helps to reduce the complexity of filters, and we develop an effective solution to the resulting constrained nonlinear optimization problem without the need for stochastic gradient descent (SGD). More importantly, while previous methods mainly focus on optimizing one or two layers, our nonlinear method enables an asymmetric reconstruction that reduces the rapidly accumulated error when multiple (e.g., >=10) layers are approximated. For the widely used very deep VGG-16 model, our method achieves a whole-model speedup of 4x with merely a 0.3% increase in top-5 error on ImageNet classification. Our 4x accelerated VGG-16 model also shows graceful accuracy degradation for object detection when plugged into the Fast R-CNN detector.
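To make the core optimization concrete, the following is a minimal NumPy sketch (not the authors' implementation) of nonlinear low-rank response reconstruction, assuming a ReLU nonlinearity: responses W x are approximated by M x with rank(M) <= d', minimizing ||relu(WX) - relu(MX)||^2 via an auxiliary-variable alternation rather than SGD. The names (W, X, d_prime, lam) are illustrative assumptions, and the rank-constrained regression step is relaxed to least squares followed by a truncated SVD, where the paper solves a subproblem of this form exactly via a generalized SVD.

```python
import numpy as np

def relu(a):
    return np.maximum(a, 0.0)

def z_step(r_y, b, lam):
    """Per-element closed-form minimizer of (r_y - relu(z))^2 + lam*(z - b)^2."""
    # Candidate 1: z >= 0, where relu(z) = z.
    z_pos = np.maximum((r_y + lam * b) / (1.0 + lam), 0.0)
    f_pos = (r_y - z_pos) ** 2 + lam * (z_pos - b) ** 2
    # Candidate 2: z <= 0, where relu(z) = 0.
    z_neg = np.minimum(b, 0.0)
    f_neg = r_y ** 2 + lam * (z_neg - b) ** 2
    return np.where(f_pos <= f_neg, z_pos, z_neg)

def m_step(Z, X, d_prime, eps=1e-6):
    """Approximate argmin_M ||Z - M X||_F^2 s.t. rank(M) <= d_prime.
    Simplification: unconstrained least squares, then truncated SVD."""
    M = Z @ X.T @ np.linalg.inv(X @ X.T + eps * np.eye(X.shape[0]))
    U, s, Vt = np.linalg.svd(M, full_matrices=False)
    return (U[:, :d_prime] * s[:d_prime]) @ Vt[:d_prime]

def low_rank_relu_approx(W, X, d_prime, lam=1.0, n_iter=20):
    """Find low-rank M so that relu(M @ X) approximates relu(W @ X)."""
    r_y = relu(W @ X)                  # target nonlinear responses
    M = W.copy()
    for _ in range(n_iter):
        Z = z_step(r_y, M @ X, lam)    # auxiliary responses (closed form)
        M = m_step(Z, X, d_prime)      # low-rank filter update
    return M

# Usage: approximate a 64x128 filter bank at rank 16 on sampled inputs.
rng = np.random.default_rng(0)
W = rng.standard_normal((64, 128))
X = rng.standard_normal((128, 1000))
M = low_rank_relu_approx(W, X, d_prime=16)
err = np.linalg.norm(relu(W @ X) - relu(M @ X)) / np.linalg.norm(relu(W @ X))
print(f"relative reconstruction error: {err:.3f}")
```

The alternation can avoid SGD because ReLU is piecewise linear: with M fixed, each auxiliary variable has a per-element closed-form minimizer, and with the auxiliary variables fixed, the filter update reduces to a rank-constrained linear regression.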