Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Valeriy Berezovskiy

Weight Averaging Improves Knowledge Distillation under Domain Shift

Sep 20, 2023

Valeriy Berezovskiy, Nikita Morozov

Abstract:Knowledge distillation (KD) is a powerful model compression technique broadly used in practical deep learning applications. It is focused on training a small student network to mimic a larger teacher network. While it is widely known that KD can offer an improvement to student generalization in i.i.d setting, its performance under domain shift, i.e. the performance of student networks on data from domains unseen during training, has received little attention in the literature. In this paper we make a step towards bridging the research fields of knowledge distillation and domain generalization. We show that weight averaging techniques proposed in domain generalization literature, such as SWAD and SMA, also improve the performance of knowledge distillation under domain shift. In addition, we propose a simplistic weight averaging strategy that does not require evaluation on validation data during training and show that it performs on par with SWAD and SMA when applied to KD. We name our final distillation approach Weight-Averaged Knowledge Distillation (WAKD).

* ICCV 2023 Workshop on Out-of-Distribution Generalization in Computer Vision (OOD-CV)

Via

Access Paper or Ask Questions

Image quality prediction using synthetic and natural codebooks: comparative results

Dec 21, 2022

Maxim Koroteev, Kirill Aistov, Valeriy Berezovskiy, Pavel Frolov

Figure 1 for Image quality prediction using synthetic and natural codebooks: comparative results

Figure 2 for Image quality prediction using synthetic and natural codebooks: comparative results

Figure 3 for Image quality prediction using synthetic and natural codebooks: comparative results

Figure 4 for Image quality prediction using synthetic and natural codebooks: comparative results

Abstract:We investigate a model for image/video quality assessment based on building a set of codevectors representing in a sense some basic properties of images, similar to well-known CORNIA model. We analyze the codebook building method and propose some modifications for it. Also the algorithm is investigated from the point of inference time reduction. Both natural and synthetic images are used for building codebooks and some analysis of synthetic images used for codebooks is provided. It is demonstrated the results on quality assessment may be improves with the use if synthetic images for codebook construction. We also demonstrate regimes of the algorithm in which real time execution on CPU is possible for sufficiently high correlations with mean opinion score (MOS). Various pooling strategies are considered as well as the problem of metric sensitivity to bitrate.

* 18 pages, 8 figures

Via

Access Paper or Ask Questions