Title: Integer-Only Neural Network Quantization Scheme Based on Shift-Batch-Normalization

May 28, 2021

Qingyu Guo, Yuan Wang, Xiaoxin Cui

Abstract: Neural networks are widely used in many areas, but their high computational complexity makes them hard to run on devices with limited resources. To address this problem, quantization methods are used to reduce model size and computation cost, making it possible to use neural networks on embedded platforms or mobile devices. In this paper, an integer-only quantization scheme is introduced. The scheme uses a single layer that combines shift-based batch normalization and uniform quantization to implement 4-bit integer-only inference. Because it avoids the large-integer multiplication used in previous integer-only quantization methods, the scheme can achieve good power and latency efficiency, and it is especially suitable for deployment on co-designed hardware platforms. Tests show that the scheme works very well on easy tasks; on harder tasks, the performance loss is tolerable given its inference efficiency. Our work is available on GitHub: https://github.com/hguq/IntegerNet.
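To make the idea of a fused shift-based batch normalization and uniform quantization layer more concrete, here is a minimal sketch in plain Python. It is not taken from the paper or the IntegerNet repository. The function names `nearest_shift` and `shift_bn_quantize`, the per-channel integer `bias`, the round-to-nearest step, and the unsigned 4-bit output range are all assumptions made for illustration. The only point it shows is that once the batch-normalization scale is restricted to a power of two, inference needs only an integer add, a bit shift, and a clamp.

```python
import math


def nearest_shift(scale):
    """Hypothetical helper: approximate a positive float scale by 2**(-shift).

    Used offline (before inference) to turn a batch-norm scale such as
    gamma / sigma into an integer shift amount.
    """
    return -round(math.log2(scale))


def shift_bn_quantize(acc, bias, shift, bits=4):
    """Hypothetical fused layer: integer bias add, power-of-two scaling, clamp.

    acc   : list of integer accumulator values (e.g. convolution outputs)
    bias  : integer offset (assumed to absorb the batch-norm mean and beta)
    shift : non-negative right-shift amount, i.e. scaling by 2**(-shift)
    bits  : output bit width; the result is clamped to [0, 2**bits - 1]
    """
    qmax = (1 << bits) - 1
    out = []
    for a in acc:
        v = a + bias
        if shift > 0:
            # Add half of the divisor before shifting to round to nearest.
            # Python's >> on negative ints is an arithmetic (flooring) shift.
            v = (v + (1 << (shift - 1))) >> shift
        # Clamp to the unsigned range; the lower bound acts like a ReLU.
        out.append(max(0, min(qmax, v)))
    return out


if __name__ == "__main__":
    s = nearest_shift(0.125)  # 0.125 == 2**-3, so s == 3
    print(shift_bn_quantize([-40, 0, 37, 100, 500], bias=4, shift=s))
```

With these assumed inputs, negative pre-activations clamp to 0 and large ones saturate at 15, so every output fits in 4 bits and the inference path contains no multiplication.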
