Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:SYQ: Learning Symmetric Quantization For Efficient Deep Neural Networks

Jul 01, 2018

Julian Faraone, Nicholas Fraser, Michaela Blott, Philip H. W. Leong

Figure 1 for SYQ: Learning Symmetric Quantization For Efficient Deep Neural Networks

Figure 2 for SYQ: Learning Symmetric Quantization For Efficient Deep Neural Networks

Figure 3 for SYQ: Learning Symmetric Quantization For Efficient Deep Neural Networks

Figure 4 for SYQ: Learning Symmetric Quantization For Efficient Deep Neural Networks

Share this with someone who'll enjoy it:

Abstract:Inference for state-of-the-art deep neural networks is computationally expensive, making them difficult to deploy on constrained hardware environments. An efficient way to reduce this complexity is to quantize the weight parameters and/or activations during training by approximating their distributions with a limited entry codebook. For very low-precisions, such as binary or ternary networks with 1-8-bit activations, the information loss from quantization leads to significant accuracy degradation due to large gradient mismatches between the forward and backward functions. In this paper, we introduce a quantization method to reduce this loss by learning a symmetric codebook for particular weight subgroups. These subgroups are determined based on their locality in the weight matrix, such that the hardware simplicity of the low-precision representations is preserved. Empirically, we show that symmetric quantization can substantially improve accuracy for networks with extremely low-precision weights and activations. We also demonstrate that this representation imposes minimal or no hardware implications to more coarse-grained approaches. Source code is available at https://www.github.com/julianfaraone/SYQ.

* Published as a conference paper at the 2018 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

View paper on

Share this with someone who'll enjoy it:

Title:SYQ: Learning Symmetric Quantization For Efficient Deep Neural Networks

Paper and Code