Abstract: Existing neural networks are memory-consuming and computationally intensive, which makes deploying them in resource-constrained environments challenging. However, various methods exist to improve their efficiency. Two such methods are quantization, a well-known approach to network compression, and re-parametrization, an emerging technique designed to improve model performance. Although both techniques have been studied individually, there has been limited research on their simultaneous application. To address this gap, we propose a novel approach called RepQ, which applies quantization to re-parametrized networks. Our method is based on the insight that the test-stage weights of an arbitrary re-parametrized layer can be expressed as a differentiable function of the trainable parameters. We enable quantization-aware training by applying quantization on top of this function. RepQ generalizes well to various re-parametrized models and outperforms the baseline LSQ quantization scheme in all experiments.
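To make the core idea concrete, below is a minimal PyTorch sketch, not the authors' implementation: the deploy-time weight of a re-parametrized block is computed as a differentiable function of its branch parameters, and a fake quantizer with a straight-through estimator is applied on top of that merged weight so gradients reach the branches. The names (`RepBranchConv`, `fake_quant`), the simple 3x3 + 1x1 branch structure, and the omission of per-branch BatchNorm are simplifying assumptions.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


def fake_quant(w, num_bits=8):
    """Symmetric uniform fake quantization with a straight-through estimator."""
    qmax = 2 ** (num_bits - 1) - 1
    scale = w.abs().max().clamp(min=1e-8) / qmax
    w_q = torch.clamp(torch.round(w / scale), -qmax - 1, qmax) * scale
    # Forward pass uses the quantized weight; backward pass sees the identity.
    return w + (w_q - w).detach()


class RepBranchConv(nn.Module):
    """Illustrative 3x3 + 1x1 re-parametrized conv; quantization is applied to the merged weight."""

    def __init__(self, channels, num_bits=8):
        super().__init__()
        self.w3 = nn.Parameter(torch.randn(channels, channels, 3, 3) * 0.1)
        self.w1 = nn.Parameter(torch.randn(channels, channels, 1, 1) * 0.1)
        self.num_bits = num_bits

    def merged_weight(self):
        # Differentiable merge: pad the 1x1 kernel to 3x3 and sum the branches.
        return self.w3 + F.pad(self.w1, [1, 1, 1, 1])

    def forward(self, x):
        w = fake_quant(self.merged_weight(), self.num_bits)
        return F.conv2d(x, w, padding=1)


x = torch.randn(2, 16, 32, 32)
y = RepBranchConv(16)(x)
print(y.shape)  # torch.Size([2, 16, 32, 32])
```

Because the quantizer acts on the merged weight rather than on each branch separately, the quantized training graph matches what is actually deployed after re-parametrization.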
Abstract: There is a constant need for high-performing and computationally efficient neural network models for image super-resolution (SR), which are often deployed on low-capacity devices. One way to obtain such models is to compress existing architectures, e.g., by quantization. Another option is neural architecture search (NAS), which discovers new efficient solutions. We propose a novel quantization-aware NAS procedure over a specifically designed SR search space. Our approach performs NAS to find quantization-friendly SR models. The search relies on adding quantization noise to parameters and activations instead of quantizing parameters directly. Our QuantNAS finds architectures with a better PSNR/BitOps trade-off than uniform or mixed-precision quantization of fixed architectures. Additionally, our search-against-noise procedure is up to 30% faster than directly quantizing weights.
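As a rough illustration of the noise-based proxy (a sketch under assumptions, not the QuantNAS code), the helper below perturbs a tensor with uniform noise whose magnitude matches the step of a b-bit uniform quantizer, so the forward pass mimics quantization while staying differentiable with respect to the original weights; the name `quant_noise` and the max-based step estimate are hypothetical choices.

```python
import torch


def quant_noise(w, num_bits=8):
    """Perturb w with uniform noise matching the step of a b-bit uniform quantizer."""
    qmax = 2 ** (num_bits - 1) - 1
    step = w.abs().max().clamp(min=1e-8) / qmax   # quantization step size
    noise = (torch.rand_like(w) - 0.5) * step     # U(-step/2, step/2)
    return w + noise.detach()                     # noise carries no gradient


w = torch.randn(64, 64, 3, 3, requires_grad=True)
w_noisy = quant_noise(w, num_bits=4)
loss = w_noisy.pow(2).mean()
loss.backward()  # gradients flow to w, unaffected by the added noise
```

Replacing rounding with additive noise avoids the non-differentiable quantization step during search, which is consistent with the reported speed-up over directly quantizing weights.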