Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Chun Quan

FALCON: Fast and Lightweight Convolution for Compressing and Accelerating CNN

Sep 25, 2019

Chun Quan, Jun-Gi Jang, Hyun Dong Lee, U Kang

Figure 1 for FALCON: Fast and Lightweight Convolution for Compressing and Accelerating CNN

Figure 2 for FALCON: Fast and Lightweight Convolution for Compressing and Accelerating CNN

Figure 3 for FALCON: Fast and Lightweight Convolution for Compressing and Accelerating CNN

Figure 4 for FALCON: Fast and Lightweight Convolution for Compressing and Accelerating CNN

Abstract:How can we efficiently compress Convolutional Neural Networks (CNN) while retaining their accuracy on classification tasks? A promising direction is based on depthwise separable convolution which replaces a standard convolution with a depthwise convolution and a pointwise convolution. However, previous works based on depthwise separable convolution are limited since 1) they are mostly heuristic approaches without a precise understanding of their relations to standard convolution, and 2) their accuracies do not match that of the standard convolution. In this paper, we propose FALCON, an accurate and lightweight method for compressing CNN. FALCON is derived by interpreting existing convolution methods based on depthwise separable convolution using EHP, our proposed mathematical formulation to approximate the standard convolution kernel. Such interpretation leads to developing a generalized version rank-k FALCON which further improves the accuracy while sacrificing a bit of compression and computation reduction rates. In addition, we propose FALCON-branch by fitting FALCON into the previous state-of-the-art convolution unit ShuffleUnitV2 which gives even better accuracy. Experiments show that FALCON and FALCON-branch outperform 1) existing methods based on depthwise separable convolution and 2) standard CNN models by up to 8x compression and 8x computation reduction while ensuring similar accuracy. We also demonstrate that rank-k FALCON provides even better accuracy than standard convolution in many cases, while using a smaller number of parameters and floating-point operations.

Via

Access Paper or Ask Questions