Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Philippe Bordes

Revisiting the Sample Adaptive Offset post-filter of VVC with Neural-Networks

Jul 11, 2022

Philippe Bordes, Franck Galpin, Thierry Dumas, Pavel Nikitin

Figure 1 for Revisiting the Sample Adaptive Offset post-filter of VVC with Neural-Networks

Figure 2 for Revisiting the Sample Adaptive Offset post-filter of VVC with Neural-Networks

Figure 3 for Revisiting the Sample Adaptive Offset post-filter of VVC with Neural-Networks

Figure 4 for Revisiting the Sample Adaptive Offset post-filter of VVC with Neural-Networks

Abstract:The Sample Adaptive Offset (SAO) filter has been introduced in HEVC to reduce general coding and banding artefacts in the reconstructed pictures, in complement to the De-Blocking Filter (DBF) which reduces artifacts at block boundaries specifically. The new video compression standard Versatile Video Coding (VVC) reduces the BD-rate by about 36% at the same reconstruction quality compared to HEVC. It implements an additional new in-loop Adaptive Loop Filter (ALF) on top of the DBF and the SAO filter, the latter remaining unchanged compared to HEVC. However, the relative performance of SAO in VVC has been lowered significantly. In this paper, it is proposed to revisit the SAO filter using Neural Networks (NN). The general principles of the SAO are kept, but the a-priori classification of SAO is replaced with a set of neural networks that determine which reconstructed samples should be corrected and in which proportion. Similarly to the original SAO, some parameters are determined at the encoder side and encoded per CTU. The average BD-rate gain of the proposed SAO improves VVC by at least 2.3% in Random Access while the overall complexity is kept relatively small compared to other NN-based methods.

* PCS 2021

Via

Access Paper or Ask Questions

Neural Network based Inter bi-prediction Blending

Jan 26, 2022

Franck Galpin, Philippe Bordes, Thierry Dumas, Pavel Nikitin, Fabrice Le Leannec

Figure 1 for Neural Network based Inter bi-prediction Blending

Figure 2 for Neural Network based Inter bi-prediction Blending

Figure 3 for Neural Network based Inter bi-prediction Blending

Figure 4 for Neural Network based Inter bi-prediction Blending

Abstract:This paper presents a learning-based method to improve bi-prediction in video coding. In conventional video coding solutions, the motion compensation of blocks from already decoded reference pictures stands out as the principal tool used to predict the current frame. Especially, the bi-prediction, in which a block is obtained by averaging two different motion-compensated prediction blocks, significantly improves the final temporal prediction accuracy. In this context, we introduce a simple neural network that further improves the blending operation. A complexity balance, both in terms of network size and encoder mode selection, is carried out. Extensive tests on top of the recently standardized VVC codec are performed and show a BD-rate improvement of -1.4% in random access configuration for a network size of fewer than 10k parameters. We also propose a simple CPU-based implementation and direct network quantization to assess the complexity/gains tradeoff in a conventional codec framework.

* VCIP 2021

Via

Access Paper or Ask Questions

Combined neural network-based intra prediction and transform selection

Aug 18, 2021

Thierry Dumas, Franck Galpin, Philippe Bordes

Figure 1 for Combined neural network-based intra prediction and transform selection

Figure 2 for Combined neural network-based intra prediction and transform selection

Figure 3 for Combined neural network-based intra prediction and transform selection

Figure 4 for Combined neural network-based intra prediction and transform selection

Abstract:The interactions between different tools added successively to a block-based video codec are critical to its rate-distortion efficiency. In particular, when deep neural network-based intra prediction modes are inserted into a block-based video codec, as the neural network-based prediction function cannot be easily characterized, the adaptation of the transform selection process to the new modes can hardly be performed manually. That is why this paper presents a combined neural network-based intra prediction and transform selection for a block-based video codec. When putting a single neural network-based intra prediction mode and the learned prediction of the selected LFNST pair index into VTM-8.0, -3.71%, -3.17%, and -3.37% of mean BD-rate reduction in all-intra is obtained.

* Picture Coding Symposium 2021

Via

Access Paper or Ask Questions

Iterative training of neural networks for intra prediction

Mar 15, 2020

Thierry Dumas, Franck Galpin, Philippe Bordes

Figure 1 for Iterative training of neural networks for intra prediction

Figure 2 for Iterative training of neural networks for intra prediction

Figure 3 for Iterative training of neural networks for intra prediction

Figure 4 for Iterative training of neural networks for intra prediction

Abstract:This paper presents an iterative training of neural networks for intra prediction in a block-based image and video codec. First, the neural networks are trained on blocks arising from the codec partitioning of images, each paired with its context. Then, iteratively, blocks are collected from the partitioning of images via the codec including the neural networks trained at the previous iteration, each paired with its context, and the neural networks are retrained on the new pairs. Thanks to this training, the neural networks can learn intra prediction functions that both stand out from those already in the initial codec and boost the codec in terms of rate-distortion. Moreover, the iterative process allows the design of training data cleansings essential for the neural network training. When the iteratively trained neural networks are put into H.265 (HM-16.15), -4.2% of mean dB-rate reduction is obtained, that is -1.8% above the state-of-the-art. By moving them into H.266 (VTM-5.0), the mean dB-rate reduction reaches -1.9%.

* 13 pages, 16 figures

Via

Access Paper or Ask Questions