Orthogonal recurrent neural networks (ORNNs) are an appealing option for learning tasks involving time series with long-term dependencies, thanks to their simplicity and computational stability. However, these networks often require a substantial number of parameters to perform well, which can be prohibitive in power-constrained environments, such as compact devices. One approach to address this issue is neural network quantization. The construction of quantized ORNNs remains an open problem, acknowledged for its inherent instability. In this paper, we explore the quantization of the recurrent and input weight matrices in ORNNs, leading to Quantized approximately Orthogonal RNNs (QORNNs). We investigate one post-training quantization (PTQ) strategy and three quantization-aware training (QAT) algorithms that incorporate orthogonal constraints and quantized weights. Empirical results demonstrate the advantages of employing QAT over PTQ. The most efficient model achieves results similar to state-of-the-art full-precision ORNNs and LSTMs on a variety of standard benchmarks, even with 3-bit quantization.
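For readers unfamiliar with quantization-aware training, the sketch below illustrates the generic ingredient shared by such schemes: a low-bit quantizer applied to the recurrent and input weight matrices during the forward pass, with gradients passed through via a straight-through estimator. This is a minimal PyTorch illustration under those assumptions, not the specific QORNN algorithms studied in the paper; the function name `quantize_ste` and the per-tensor scaling choice are hypothetical.

```python
# Illustrative only: generic 3-bit symmetric uniform quantization with a
# straight-through estimator (STE), applied to a weight matrix at each
# forward pass. Not the paper's specific PTQ/QAT algorithms.
import torch

def quantize_ste(w: torch.Tensor, bits: int = 3) -> torch.Tensor:
    """Symmetric uniform quantizer; gradients bypass rounding via STE."""
    n_levels = 2 ** (bits - 1) - 1             # e.g. 3 signed levels per side for 3 bits
    scale = w.detach().abs().max() / n_levels  # per-tensor scale (an assumption)
    w_q = torch.clamp(torch.round(w / scale), -n_levels, n_levels) * scale
    # Straight-through estimator: forward uses w_q, backward sees the identity.
    return w + (w_q - w).detach()

# Hypothetical use inside a recurrent cell's forward pass:
#   h_next = torch.tanh(x @ quantize_ste(W_in).T + h @ quantize_ste(W_rec).T + b)
```

In a QAT setting, a quantizer of this kind would be applied during training so that the loss is computed with quantized weights, whereas PTQ applies it only after full-precision training; the orthogonality constraints on the recurrent matrix are handled separately by the algorithms described in the paper.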