Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:iRNN: Integer-only Recurrent Neural Network

Sep 20, 2021

Eyyüb Sari, Vanessa Courville, Vahid Partovi Nia

Figure 1 for iRNN: Integer-only Recurrent Neural Network

Figure 2 for iRNN: Integer-only Recurrent Neural Network

Figure 3 for iRNN: Integer-only Recurrent Neural Network

Figure 4 for iRNN: Integer-only Recurrent Neural Network

Share this with someone who'll enjoy it:

Abstract:Recurrent neural networks (RNN) are used in many real-world text and speech applications. They include complex modules such as recurrence, exponential-based activation, gate interaction, unfoldable normalization, bi-directional dependence, and attention. The interaction between these elements prevents running them on integer-only operations without a significant performance drop. Deploying RNNs that include layer normalization and attention on integer-only arithmetic is still an open problem. We present a quantization-aware training method for obtaining a highly accurate integer-only recurrent neural network (iRNN). Our approach supports layer normalization, attention, and an adaptive piecewise linear approximation of activations, to serve a wide range of RNNs on various applications. The proposed method is proven to work on RNN-based language models and automatic speech recognition. Our iRNN maintains similar performance as its full-precision counterpart, their deployment on smartphones improves the runtime performance by $2\times$, and reduces the model size by $4\times$.

View paper on

Share this with someone who'll enjoy it:

Title:iRNN: Integer-only Recurrent Neural Network

Paper and Code